PATENT APPLICATION 

Attorney Docket No. 21402-258 (Cura-558) 

PROTEINS AND NUCLEIC ACIDS ENCODING SAME 

FIELD OF THE INVENTION 

The invention generally relates to nucleic acids and polypeptides encoded thereby. 

BACKGROUND OF THE INVENTION 

The invention generally relates to nucleic acids and polypeptides encoded therefrom. 
More specifically, the invention relates to nucleic acids encoding cytoplasmic, nuclear, 
membrane bound, and secreted polypeptides, as well as vectors, host cells, antibodies, and 
recombinant methods for producing these nucleic acids and polypeptides. 

SUMMARY OF THE INVENTION 

The invention is based in part upon the discovery of nucleic acid sequences encoding 
novel polypeptides. The novel nucleic acids and polypeptides are referred to herein as NOVX, 
or NOV 1-99 nucleic acids and polypeptides. These nucleic acids and polypeptides, as well as 
derivatives, homologs, analogs and fragments thereof, will hereinafter be collectively 
designated as "NOVX" nucleic acid or polypeptide sequences. 

In one aspect, the invention provides an isolated NOVX nucleic acid molecule 
encoding a NOVX polypeptide that includes a nucleic acid sequence that has identity to the 
nucleic acids disclosed in SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162, . In 
some embodiments, the NOVX nucleic acid molecule will hybridize under stringent 
conditions to a nucleic acid sequence complementary to a nucleic acid molecule that includes 
a protein-coding sequence of a NOVX nucleic acid sequence. The invention also includes an 
isolated nucleic acid that encodes a NOVX polypeptide, or a fragment, homolog, analog or 
derivative thereof. For example, the nucleic acid can encode a polypeptide at least 80% 
identical to a polypeptide comprising the amino acid sequences of SEQ ID NOS:2n, wherein n 
is an integer between 1 and 162. The nucleic acid can be, for example, a genomic DNA 
fragment or a cDNA molecule that includes the nucleic acid sequence of any of SEQ ED 
NOS:2n-l, wherein n is an integer between 1 and 162. 

Also included in the invention is an oligonucleotide, e.g., an oligonucleotide which 

includes at least 6 contiguous nucleotides of a NOVX nucleic acid (e.g., SEQ ID NOS:2n-l, 

wherein n is an integer between 1 and 162) or a complement of said oligonucleotide. 
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Also included in the invention are substantially purified NOVX polypeptides (SEQ TD 
NOS:2n, wherein n is an integer between 1 and 162). In certain embodiments, the NOVX 
polypeptides include an amino acid sequence that is substantially identical to the amino acid 
sequence of a human NOVX polypeptide. 
5 The invention also features antibodies that immunoselectively bind to NOVX 

polypeptides, or fragments, homologs, analogs or derivatives thereof. 

In another aspect, the invention includes pharmaceutical compositions that include 
therapeutically- or prophylactically-effective amounts of a therapeutic and a pharmaceutically- 
acceptable carrier. The therapeutic can be, e.g., a NOVX nucleic acid, a NOVX polypeptide, 
10 or an antibody specific for a NOVX polypeptide. In a further aspect, the invention includes, in 
one or more containers, a therapeutically- or prophylactically-effective amount of this 
pharmaceutical composition. 

In a further aspect, the invention includes a method of producing a polypeptide by 
culturing a cell that includes a NOVX nucleic acid, under conditions allowing for expression 
1 5 of the NOVX polypeptide encoded by the DNA. If desired, the NOVX polypeptide can then 
be recovered. 

In another aspect, the invention includes a method of detecting the presence of a 
NOVX polypeptide in a sample. In the method, a sample is contacted with a compound that 
selectively binds to the polypeptide under conditions allowing for formation of a complex 
20 between the polypeptide and the compound. The complex is detected, if present, thereby 
identifying the NOVX polypeptide within the sample. 

The invention also includes methods to identify specific cell or tissue types based on 
their expression of a NOVX. 

Also included in the invention is a method of detecting the presence of a NOVX 
25 nucleic acid molecule in a sample by contacting the sample with a NOVX nucleic acid probe 
or primer, and detecting whether the nucleic acid probe or primer bound to a NOVX nucleic 
acid molecule in the sample. 

In a further aspect, the invention provides a method for modulating the activity of a 
NOVX polypeptide by contacting a cell sample that includes the NOVX polypeptide with a 
30 compound that binds to the NOVX polypeptide in an amount sufficient to modulate the 

activity of said polypeptide. The compound can be, e.g., a small molecule, such as a nucleic 
acid, peptide, polypeptide, peptidomimetic, carbohydrate, lipid or other organic (carbon 
containing) or inorganic molecule, as further described herein. 
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Also within the scope of the invention is the use of a therapeutic in the manufacture of 
a medicament for treating or preventing various disorders or syndromes described below. 

The therapeutic can be, e.g., a NOVX nucleic acid, a NOVX polypeptide, or a NOVX- 
specific antibody, or biologically-active derivatives or fragments thereof. 

For example, the compositions of the present invention will have efficacy for treatment 
of patients suffering from the diseases and disorders disclosed above and/or other pathologies 
and disorders of the like. The polypeptides can be used as immunogens to produce antibodies 
specific for the invention, and as vaccines. They can also be used to screen for potential 
agonist and antagonist compounds. For example, a cDNA encoding NOVX may be useful in 
gene therapy, and NOVX may be useful when administered to a subject in need thereof. By 
way of non-limiting example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from the diseases and disorders disclosed above and/or other 
pathologies and disorders of the like. 

The invention further includes a method for screening for a modulator of disorders or 
syndromes including, e.g., the diseases and disorders disclosed above and/or other pathologies 
and disorders of the like. The method includes contacting a test compound with a NOVX 
polypeptide and determining if the test compound binds to said NOVX polypeptide. Binding 
of the test compound to the NOVX polypeptide indicates the test compound is a modulator of 
activity, or of latency or predisposition to the aforementioned disorders or syndromes. 

Also within the scope of the invention is a method for screening for a modulator of 
activity, or of latency or predisposition to disorders or syndromes including, e.g., the diseases 
and disorders disclosed above and/or other pathologies and disorders of the like by 
administering a test compound to a test animal at increased risk for the aforementioned 
disorders or syndromes. The test animal expresses a recombinant polypeptide encoded by a 
NOVX nucleic acid. Expression or activity of NOVX polypeptide is then measured in the test 
animal, as is expression or activity of the protein in a control animal which recombinantly- 
expresses NOVX polypeptide and is not at increased risk for the disorder or syndrome. Next, 
the expression of NOVX polypeptide in both the test animal and the control animal is 
compared. A change in the activity of NOVX polypeptide in the test animal relative to the 
control animal indicates the test compound is a modulator of latency of the disorder or 
syndrome. 

In yet another aspect, the invention includes a method for determining the presence of 
or predisposition to a disease associated with altered levels of a NOVX polypeptide, a NOVX 
nucleic acid, or both, in a subject (e.g., a human subject). The method includes measuring the 



amount of the NOVX polypeptide in a test sample from the subject and comparing the amount 
of the polypeptide in the test sample to the amount of the NOVX polypeptide present in a 
control sample. An alteration in the level of the NOVX polypeptide in the test sample as 
compared to the control sample indicates the presence of or predisposition to a disease in the 
subject. Preferably, the predisposition includes, e.g., the diseases and disorders disclosed 
above and/or other pathologies and disorders of the like. Also, the expression levels of the new 
polypeptides of the invention can be used in a method to screen for various cancers as well as 
to determine the stage of cancers. 

In a further aspect, the invention includes a method of treating or preventing a 
pathological condition associated with a disorder in a mammal by administering to the subject 
a NOVX polypeptide, a NOVX nucleic acid, or a NOVX-specific antibody to a subject (e.g., a 
human subject), in an amount sufficient to alleviate or prevent the pathological condition. In 
preferred embodiments, the disorder, includes, e.g., the diseases and disorders disclosed above 
and/or other pathologies and disorders of the like. 

In yet another aspect, the invention can be used in a method to identity the cellular 
receptors and downstream effectors of the invention by any one of a number of techniques 
commonly employed in the art. These include but are not limited to the two-hybrid system, 
affinity purification, co-precipitation with antibodies or other specific-interacting molecules. 

Unless otherwise defined, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although methods and materials similar or equivalent to those described herein can 
be used in the practice or testing of the present invention, suitable methods and materials are 
described below. All publications, patent applications, patents, and other references 
mentioned herein are incorporated by reference in their entirety. In the case of conflict, the 
present specification, including definitions, will control. In addition, the materials, methods, 
and examples are illustrative only and not intended to be limiting. 

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides novel nucleotides and polypeptides encoded thereby. 
Included in the invention are the novel nucleic acid sequences and their encoded polypeptides. 
The sequences are collectively referred to herein as "NOVX nucleic acids" or "NOVX 
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polynucleotides" and the corresponding encoded polypeptides are referred to as "NOVX 
polypeptides" or "NOVX proteins." Unless indicated otherwise, "NOVX" is meant to refer to 
any of the novel sequences disclosed herein. Table A provides a summary of the NOVX 
nucleic acids and their encoded polypeptides. 

TABLE A. Sequences and Corresponding SEQ ID Numbers 



NOVX 
Assignment 


Internal Identification 


SEQ ID 

NO 
(nucleic 
acid) 


SEQ ID NO 
(polypeptide) 


Hnmnlnov 


la 


CG56592-01 


l 


2 


Claudin 6- like 


lb 


CG56586- 01 


3 


4 


Claudin-3-like 


lc 


CG56592-03 


5 


6 


Claudin- 6- like 


Id 


CG56592-02 


7 


8 


Claudin 6- like 


2 


CG56596-01 


9 


10 


Protein Serine Kinase -like 


3a 


CG56594-01 


11 


12 


Claudin-19-like 


3b 


CG56594-02 


13 


14 


Claudin- 19 -like 


3c 


CG57576-01 


15 


16 


Claudin-19-like 


4a 


CG5658 9-01 


17 


18 


Claudin- 6- like 


4b 


CGS 6 R fl 9 - m 

wV-T -J V _J O ^ Ul 


1 9 


20 


Claudin- 6 - like 


4C 


CG56589-02 


21 


22 


Claudin- 6 -like 


5a 


CG56635-01 


23 


24 


Monocarboxylate transporter 
(MCT3 ) -like 


5b 


CG56635-02 


25 


26 


Monocarboxylate transporter 
3 -like 


5c 


CG56635-03 


27 


28 


Monocarboxylate transporter 
3 - like 


5d 


CG56635- 04 


2 9 


3 0 


Monocarboxylate transporter 
3-like 


5e 


CG56635-05 


3 1 


3 2 


Monocarboxylate transporter 
3-like 


6 


CG5 66 74-01 


33 


34 


Nitri lass- l— like 


7a 


CG56613-01 


35 


36 


Cleavage Signal-1 Protein- 
Like 


7b 


CG56613-02 


37 


38 


Cleavage Signal-1 Protein- 
Like 


7c 


CG56613-03 


39 


40 


Cleavage Signal-1 Protein- 
Like 


7d 


174307820 


41 


42 


Cleavage Signal-1 Protein- 
Like 


7e 


167474749 


323 


324 


Cleavage Signal-1 Protein- 
Like 


8 


153472451 


43 


44 


Matriptase -like 


9a 


CG56554-01 


45 


46 


Neuropeptide Y/Peptide YY 
receptor -like 


9b 


CG56554-02 


47 


48 


Neuropeptide Y/Peptide YY 
receptor -like 


10 


CG55964-01 


49 


50 


G- Protein Coupled Receptor- 
like 


11 


CG55966-01 


51 


52 


G- Protein Coupled Receptor- 
like 


12 


CG56003-01 


53 


54 


Neuromodulin- like 


13a 


CG56021-01 


55 


56 


G-Protein Coupled Receptor- 
like 


13b 


CG56021-02 


57 


58 


G- Protein Coupled Receptor- 
like 


14 


CG56023-01 


59 


60 


G- Protein Coupled Receptor- 
like 


15a 


CG56065-01 


61 


62 


G- Protein Coupled Receptor- 
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like 


15b 


CG56065-02 


63 


64 


G-Protein Coupled Receptor- 
like 


16a 


CG56067-01 


65 


66 


G-Protein Coupled Receptor- 
like 


16b 


CG56753-02 


67 


68 


G- Protein Coupled Receptor- 
like 


17a 


CG56657-01 


69 


70 


G-Protein Coupled Receptor- 
like 


17b 


CG56657-02 


71 


72 


G-Protein Coupled Receptor- 
like 


17c 


CG56659-01 


73 


74 


G-Protein Coupled Receptor- 
like 


17d 


CG56659_02 


75 


76 


G-Protein Coupled Receptor- 
like 


18a 


CG56663-01 


77 


78 


G-Protein Coupled Receptor- 
like 


18b 


CG56663-02 


79 


80 


G-Protein Coupled Receptor- 
like 


19a 


CG56665-01 


81 


82 


G-Protein Coupled Receptor- 
like 


19b 


CG56665-02 


83 


84 


G-Protein Coupled Receptor- 
like 


20 


CG56667-01 


85 


86 


G-Protein Coupled Receptor- 
like 


21a 


CG56639-01 


87 


88 


Adrenal Secretory Serine 
Protease -Like 


21b 


CG56639-02 


89 


90 


Adrenal Secretory Serine 
Protease -Like 


22a 


CG56643-01 


91 


92 


Adrenal Secretory Serine 
Protease -Like 


22b 


CG5664 3 -02 


93 


94 


Adrenal Secretory Serine 

Prnhps Qp-Tii k<=» 


22c 


CG5 664 3-03 


95 


96 


Adrenal Secretory Serine 
Protease -Like 


23a 


CG56647-02 


97 


98 


Serine Protease DESCl-like 


23b 


CG56647-03 


99 


100 


Serine Protease DESCl-like 


23c 


CG5664 7-01 


101 


102 


Serine Protease DESCl-like 


24a 


CG56155-01 


103 


104 


Parchorin-like 


24b 


CG5 615 5-02 


105 


106 


Parchorin- 1 ike 


25 


CG56457-01 


107 


108 


Protein Phosphatase-like 


26a 


CG56461-01 


109 


110 


GAGE- 7- like 


26b 


CG56461-02 


111 ■ 


112 


GAGE - 7 - 1 ike 


27a 


CG56645-01 


113 


114 


Sodium-Glucose 
Cotransporter- like 


27b 


CG5664 5- 02 


115 


116 


Sodium— Glucose 
Cotransporter- 1 ike 


27c 


191828203 


117 


118 


Sodium- Glucose 
Cotransporter- like 


28 


CG56185-01 


119 


120 


MYD-l-like 


29a 


CG56187-01 


121 


122 


CRAL- TRIO- like 


2 9b 


CG56187-03 


123 


124 


CRAL- TRIO- like 


29c 


CG56189-01 


125 


126 


CRAL- TRIO- like 


30 


CG56191-01 


127 


128 


Ryudocan- like 


31 


CG56392-01 


129 


130 


Sulfur-rich Keratin-like 


32 


CG56686-01 


131 


132 


DNMT1 associated protein- 1 
(DMAP) 


33 


CG56688-01 


133 


134 


Notchl-like 


34 


CG56715-01 


135 


136 


Olfactory Receptor- like 


35 


CG56718-01 


137 


138 


Olfactory Receptor-like 


36a 


CG56729-01 


139 


140 


Cadherin 11-like 


36b 


CG56729-02 


141 


142 


Cadherin 11-like 


37 


CG56733-01 


143 


144 


Ten-M2-like 


38 


CG56737-01 


145 


146 


Activin Beta C Chain-like 


39a 


CG56737-02 


147 


148 


Activin Beta C Chain-like 
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39b 


CG56637-03 


149 


150 


Inhibin Beta E Chain- like 


40 


CG56097-01 


151 


152 


UDP Glycosyltransf erase- 
like 


41a 


CG56680-01 


153 


154 


Sodium/ Hydrogen Exchanger 
4 -like 


41b 


CG56680-02 


155 


156 


Sodium/Hydrogen Exchanger 
4 -like 


42a 


CG56682-01 


157 


158 


Kupffer Cell Receptor- like 


42b 


CG56682-02 


159 


160 


Kupffer Cell Receptor- like 


42c 


CG56682-03 


161 


162 


Kupffer Cell Receptor- like 


42d 


CG56682-04 


163 


164 


Kupffer Cell Receptor- like 


43 


CG56690-01 


165 


166 


P2Y Purinoceptor -like 


44 


CG56692-01 


167 


168 


G Protein Coupled Receptor- 
like 


45 


CG56694-01 


169 


170 


Mas Proto-Oncogene- like 


A CZ 25 


v_ljb b o y b - U ± 


JL / JL 


JL /<£ 


Mas Proto-Oncogene - 1 xke 


4 6b 


CG56696-02 


173 


174 


Mas-Related G Protein- 
Coupled Receptor- like 


46c 


CG56702-01 


175 


176 


Mas Proto-Oncogene- like 


46d 


CG56698-01 


177 


178 


Mas Proto-Oncogene- like 


47 


CG56700-01 


179 


180 


Peptidyl- Prolyl Cis-Trans 
Isomerase-like 


48a 


CG56743-01 


181 


182 


Phospholipase C Deltas- 
like 


4 8b 


CG56743-02 


183 


184 


Phospholipase C Delta-4- 
like 


4 9 


v — w J D / J 27 w _L 


185 


186 


T ,£=>i iVa t - v "i ana. /L Om o rx 5» — 

JjC LLJyU L 1 _L — 1 1 0*± L L L K7Z y CX 

Hydroxylase- like 


5 0a 




18 7 


188 


Methyltransf erase 2- like 


5 Ob 


CG^ 6 7 71 - 09 


18 9 


190 


Methyltransf erase 2- like 


51 


CGS 6 7 5 9 - 01 


191 


192 




52 


CG56731-01 


193 


194 


H326-like 


53 


CG56745- 01 


195 


196 


Uracil 

Phosphor lbosyl trans f erase - 
Like 


54a 


CG56773-01 


197 


198 


Protein Phosphatase 2C-like 


54b 


V— ^3 -JO / / O L/^i 


19 9 


2 0 0 


O "v*/— \ +~ £^ n O Vi o c? Y"*\ V\ a h a c a O It a 

±r JL \J XII rllUoUlldLao— JL -Livd 


55 


^.uD O O U D \J JL 


2 01 


2 02 


Sulfotransf erase 3 -like 


56a 


CG56816-01 


203 


204 


N-Hydroxyaryl amine 
Sulfotransf erase -like 


56b - 


CG56816-02 


205 


206 


N - Hydroxy a ry 1 am i ne 
Sulfotransf erase -like 


57 


CG56829-01 


207 


208 


Testis Specific Serine 
Kinase - 3-1 ike 


5 8a 


rrm fin r - n 1 


2 09 


210 


n^ri .Tiin (~> t- -j on "R^-t-^i _ C. _ "I -i lr c» 
VJCip ULUltl — LUil J3 1: L- ct JLXJ\.t; 


5 8b 


CG5 6 3 1^-02 


211 


212 


fnnnpyi n — 1 H Vrc^ 


59 


CG56633-01 


213 


214 


Translation Initiation 
Factor 5-like 


60a 


CG56894-01 


215 


216 


Lynxl-like 


6 0b 


CG5 6 8 94-02 


217 


218 


Lynxl-like 


61 


CG56453-01 


219 


220 


Adli can -like 


62 


CG56781-01 


221 


222 


Neuropsin Precursor- 1 ike 


63 


CG53054-02 


223 


224 


Wnt-14 Precursor- like 


64 


CG56884-01 


225 


226 


Dipeptidyl peptidase- like 


65a 


CG56651-01 


227 


228 


Protein phosphatase- like 


65b 


CG56651-02 


229 


230 


Protein phosphatase- 1 ike 


66 


CG56313-01 


231 


232 


Olfactory receptor- like 


67 


CG56571-01 


233 


234 


Olfactory Receptor-Like 
Protein OLF3-like 


68 


CG56844-01 


235 


236 


Endoglin (CD105 antigen) - 
like 



7 




69a 


CG56950- 01 


237 


238 


Interleukin l Epsilon-like 


6 9b 


CG56136-02 


239 


240 


Interleukin 1 Epsilon-like 


7 Oa 




241 


242 


OS - 9 - like 


7 0b 


CG56878-04 


243 


244 


OS-9-like 


71 


CG56906- 01 


245 


246 


Sodium/Hydrogen Exchanger 
6-like 


72 


CG56910- 01 


247 


248 


Ubiquit in- Specific 
Protease- like 


73 


CG56822- 01 


249 


250 


Sulf otransf erase -like 


74 


CG56775-01 


251 


252 


Dual Specificity 
Phosphatase - 1 ike 


75 


CG56783-01 


253 


254 


Dual Specificity 
Phosphatase- like 


76a 


CG56789-01 


255 


256 


Dual Specificity 
Phosphatase- like 


76b 


CG56789-02 


257 


258 


Dual Specificity 
Phosphatase- 1 ike 


77 


CG56804-01 


259 


260 


Dual Specificity 
Phosphatase- like 


78 


CG56810-01 


261 


262 


Dual Specificity 
Phosphatase- like 


79 


CG56862-01 


263 


264 


Dual Speci f icity 
Phosphatase- like 


80 


CG56882-01 


265 


266 


Dual Specificity 
Phosphatase- like 


81a 


CG56283-01 


267 


268 


Beta-1,3- 

Galactosyl transferase -like 


81b 


CG56283-02 


269 


270 


Beta-1, 3- 

Galac tosyl trans f erase -like 


82 


CG56983-01 


271 


272 


Peptide YY-like 


83 


CG56890-01 


273 


274 


G Protein- Coupled Receptor 
Kinase GRK7-like 


84 


CG56912-01 


275 


276 


Phospholipase C delta 1- 
like 


85 


CG56955-01 


277 


278 


GTPase-Act ivating Protein- 
like 


86 


CG56957-01 


279 


280 


GTPase-Activating Protein- 
like 


87a 


CG56886-01 


281 


282 


Rho-GTPase -Activating 
Protein- like 


8 7b 


CG56886-02 


283 


284 


Rho-GTPase -Activating 
Protein- like 


88 


CG56394-01 


285 


286 


Glycerol - 3 - Phosphate 
Dehydrogenase - 1 ike 


89 


CG56396-01 


287 


288 


Glycerol - 3 - Phosphate 
Dehydrogenase - 1 ike 


90 


CG56888-01 


289 


290 


Serine/ Threonine -Protein 
Kinase PAK 2-like 


91 


CG56779-01 


291 


292 


D-Dop a chrome Tautomerase- 
like 


92 


CG56904- 01 


293 


294 


Secreted leucine-rich 

-y^ t~\ q ^% t T "D O ^ 1 -J \r 

X. Cpcd U- ^IjIvKJ "ilA.C 


93 


CG5 62 77- 01 


295 


296 


Inosine-5 * -Monophosphate 
Dehydrogenase - 1 ike 


94 


CG56281-01 


297 


298 


Male-Specific Lethal 3-Like 
1-like 


95 


CG56975-01 


299 


300 


Cysteine Conjugate Beta- 
Lyase-like 


96a 


CG56918-01 


301 


302 


Monocar boxy late 
transporter- like 


96b 


CG56918-02 


303 


304 


Monocarboxylate 
transporter- like 


96c 


CG56918-03 


305 


306 


Sugar Transporter- like 


97a 


CG57070-01 


307 


308 


Carboxypeptidase Al-like 


97b 


CG57070-02 


309 


310 


Carboxypeptidase Al-like 


97c 


CG57070-03 


311 


312 


Carboxypeptidase Al-like 
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97d 


CG57070-04 


313 


314 


Carboxypeptidase Al-like 


97e 


CG57070-05 


315 


316 


Carboxypeptidase Al-like 


97f 


CG57070-06 


317 


318 


Carboxypeptidase Al-like 


98 


CG56939-01 


319 


320 


Agrin- 1 ike 


99 


CG57010-01 


321 


322 


SNC73-like 



NOVX nucleic acids and their encoded polypeptides are useful in a variety of 
applications and contexts. The various NOVX nucleic acids and polypeptides according to the 
invention are useful as novel members of the protein families according to the presence of 
5 domains and sequence relatedness to previously described proteins. Additionally, NOVX 
nucleic acids and polypeptides can also be used to identify proteins that are members of the 
family to which the NOVX polypeptides belong. 

NOV1, NOV3, and NOV4 are homologous to a Claudin-like family of proteins. Thus, 
the NOV1, NOV3, and NOV4 nucleic acids, polypeptides, antibodies and related compounds 
10 according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions. 

NOV2 is homologous to the Protein Serine Kinase-like family of proteins. Thus 
NOV2 nucleic acids, polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
1 5 pathologies or conditions. 

NOV5 is homologous to a family of Monocarboxylate transporter-like proteins. Thus, 
the NOV5 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 
20 NOV6 is homologous to the nitrilase-l-like family of proteins. Thus, NOV6 nucleic 

acids, polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions. 

NOV7 is homologous to the Cleavage Signal-l-like family of proteins. Thus NOV7 
25 nucleic acids, polypeptides, antibodies and related compounds according to the invention will 
be useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions. 

NOV8 is homologous to the Matripase-like family of proteins. Thus NOV8 nucleic 
acids, polypeptides, antibodies and related compounds according to the invention will be 
30 useful in therapeutic and diagnostic applications implicated in, various pathologies or 
conditions. 
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NOV9 is homologous to members of the Neuropeptide Y/Peptide YY receptor-like 
family of proteins. Thus, the NOV9 nucleic acids, polypeptides, antibodies and related 
compounds according to the invention will be useful in therapeutic and diagnostic applications 
implicated in various pathologies or conditions. 

NOVslO through 20, , NOV43, NOV44, and NOV83 are homologous to the G-Protein 
Coupled Receptor-like family of proteins. Thus, these nucleic acids and polypeptides, 
antibodies and related compounds according to the invention will be useful in therapeutic and 
diagnostic applications implicated in various pathologies or conditions. 

NOV21 and NOV22 are homologous to the Adrenal; secretory serine protease like 
growth factor binding protein-like family of proteins. Thus, NOV21 and NOV22 nucleic acids 
and polypeptides, antibodies and related compounds according to the invention will be useful 
in therapeutic and diagnostic applications implicated in various pathologies or conditions. 

NOV23 is homologous to the Serine Protease DESC-l-like family of proteins. Thus, 
NOV23 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in various pathologies or conditions. 

NOV24 is homologous to the Parchorin-like family of proteins. Thus, NOV24 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
disorders. 

NOV25 is homologous to theProtein Phosphatase-like family of proteins. Thus, 
NOV25 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions. 

NOV26 is homologous to the GAGE7-like family of proteins. Thus, NOV26 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies/disorders. 

NOV27 is homologous to the Sodium-Glucose Cotransporter-like family of proteins. 
Thus, NOV27 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV28 is homologous to the MYD-l-like family of proteins. Thus, NOV28 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

10 




NOV29 is homologous to the CRAL-TRIO-Iike family of proteins. Thus, NOV27 
nucleic acids and polypeptides, antibodies and related compounds according to the invention 
will be useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

5 NOV30 is homologous to the Ryudocan-like family of proteins. Thus, NOV30 nucleic 

acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

NOV31 is homologous to the Sulfur-rich Keratin-like family of proteins. Thus, 
10 NOV31 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV32 is homologous to the DMNT1 associated protein-like family of proteins. 
Thus, NOV32 nucleic acids and polypeptides, antibodies and related compounds according to 
15 the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV33 is homologous to the Notch 1 -like family of proteins. Thus, NOV33 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
20 conditions 

NOV34, NOV35, NOV51, NOV66, and NOV67 are homologous to the Olfactory 
Receptor-like family of proteins. Thus, NOV34, NOV35, NOV51, NOV66, and NOV67 
nucleic acids and polypeptides, antibodies and related compounds according to the invention 
will be useful in therapeutic and diagnostic applications implicated in various pathologies or 
25 conditions 

NOV36 is homologous to the Cadherin 1 1 -like family of proteins. Thus, NOV36 
nucleic acids and polypeptides, antibodies and related compounds according to the invention 
will be useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

30 NOV37 is homologous to the Ten-M2-like family of proteins. Thus, NOV33 nucleic 

acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 
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NOV38 and NOV39 are homologous to the Activin/Inhibin-like family of proteins. 
Thus, NOV38 and NOV39 nucleic acids and polypeptides, antibodies and related compounds 
according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 
5 NOV40 is homologous to the UDP Glycosyltransferase-like family of proteins. Thus, 

NOV40 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV41 is homologous to the Sodium/Hydrogen Exchanger 4-like family of proteins. 
10 Thus, NOV41 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV42 is homologous to the Kupffer Cell Receptor-like family of proteins. Thus, 
NOV42 nucleic acids and polypeptides, antibodies and related compounds according to the 
15 invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV45 and NOV46 is homologous to the Mas Proto-Oncogene-like family of 
proteins. Thus, NOV45 and NOV46 nucleic acids and polypeptides, antibodies and related 
compounds according to the invention will be useful in therapeutic and diagnostic applications 
20 implicated in various pathologies or conditions 

NOV47 is homologous to the Peptidyl -Prolyl Cis-Trans Isomerase-like family of 
proteins. Thus, NOV47 nucleic acids and polypeptides, antibodies and related compounds 
according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 
25 NOV48 is homologous to the Phospholipase C Delta-4-like family of proteins. Thus, 

NOV48 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV49 is homologous to the Leukotriene-B4 Omega Hydroxylase-like family of 
30 proteins. Thus, NOV49 nucleic acids and polypeptides, antibodies and related compounds 

according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 

NOV50 is homologous to the Protein Arginine N-Methyltransferase 2-like family of 
proteins. Thus, NOV50 nucleic acids and polypeptides, antibodies and related compounds 
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according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 

NOV52 is homologous to the H326-like family of proteins. Thus, NOV52 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
5 useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

NOV53 is homologous to the Uracil Phosphoribosyltransferase-like family of proteins. 
Thus, NOV53 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
10 pathologies or conditions 

NOV54 is homologous to the Protein Phosphatase 2C-like family of proteins. Thus, 
NOV54 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 
15 NOV55 is homologous to the Heparan Sulfate 6-Sulfotransferase 3-like family of 

proteins. Thus, NOV55 nucleic acids and polypeptides, antibodies and related compounds 
according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 

NOV56 is homologous to the N-Hydroxyarylamine Sulfotransferase 3-like family of 
20 proteins. Thus, NOV52 nucleic acids and polypeptides, antibodies and related compounds 

according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 

NOV57 is homologous to the Testis Specific Serine Kinase-3-like family of proteins. 
Thus, NOV57 nucleic acids and polypeptides, antibodies and related compounds according to 
25 the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV58 is homologous to the Gap Junction Beta-5-like family of proteins. Thus, 
NOV58 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
30 pathologies or conditions 

NOV59 is homologous to the Translation Initiation Factor 5-like family of proteins. 
Thus, NOV59 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

13 




NOV60 is homologous to the Lynxl-like family of proteins. Thus, NOV60 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

5 NOV61 is homologous to the Adlican-like family of proteins. Thus, NOV61 nucleic 

acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

NOV62 is homologous to the Neuropsin Precursor-like family of proteins. Thus, 
10 NOV62 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV63 is homologous to the Wnt-14-like family of proteins. Thus, NOV63 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
15 useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

NOV64 is homologous to the Dipeptidyl peptidase-like family of proteins. Thus, 
NOV64 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
20 pathologies or conditions 

NOV65 is homologous to the Protein phosphatase-like family of proteins. Thus, 
NOV65 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

25 NOV68 is homologous to the Endoglin (CD105 antigen)-like family of proteins. Thus, 

NOV68 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV69 is homologous to the Interleukin 1 Epsilom-like family of proteins. Thus, 

30 NOV69 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV70 is homologous to the OS-9-like family of proteins. Thus, NOV70 nucleic 
acids and polypeptides, antibodies and related compounds according to the invention will be 
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useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

NOV71 is homologous to the Sodium/Hydrogen Exchanger 6-1 ike family of proteins. 
Thus, NOV71 nucleic acids and polypeptides, antibodies and related compounds according to 
5 the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV72 is homologous to the Ubiquitin Specific Protease-like family of proteins. 
Thus, NOV72 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
1 0 pathologies or conditions 

NOV73 is homologous to the Sulfotransferase-like family of proteins. Thus, NOV73 
nucleic acids and polypeptides, antibodies and related compounds according to the invention 
will be useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

1 5 NOV74, NOV75, NOV76, NOV77, NOV78, NOV79, and NOV80 are homologous to 

the Dual Specificity Phosphatase-like family of proteins. Thus, NOV74, NOV75, NOV76, 
NOV77, NOV78, NOV79, and NOV80 nucleic acids and polypeptides, antibodies and related 
compounds according to the invention will be useful in therapeutic and diagnostic applications 
implicated in various pathologies or conditions 

20 NOV81 is homologous to the Beta-1, 3-Galactosyltransferase-like family of proteins. 

Thus, NOV81 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV82 is homologous to the Peptide YY-like family of proteins. Thus, NOV82 

25 nucleic acids and polypeptides, antibodies and related compounds according to the invention 
will be useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

NOV84 is homologous to the Phospholipase C delta 1-like family of proteins. Thus, 
NOV84 nucleic acids and polypeptides, antibodies and related compounds according to the 
30 invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV85, NOIV86, and NOV87 are homologous to the GTPase-Activating Protein-like 
family of proteins. Thus, NOV85, NOIV86, and NOV87 nucleic acids and polypeptides, 
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antibodies and related compounds according to the invention will be useful in therapeutic and 
diagnostic applications implicated in various pathologies or conditions 

NOV88 and NOV89 are homologous to the Glyceroil-3-Phosphate Dehydrogenase- 
like family of proteins. Thus, NOV88 and NOV89 nucleic acids and polypeptides, antibodies 
5 and related compounds according to the invention will be useful in therapeutic and diagnostic 
applications implicated in various pathologies or conditions 

NOV90 is homologous to the Serine/Threonine-Protein Kinase PAK 2-like family of 
proteins. Thus, NOV90 nucleic acids and polypeptides, antibodies and related compounds 
according to the invention will be useful in therapeutic and diagnostic applications implicated 
10 in various pathologies or conditions 

NOV91 is homologous to the D-Dopachrome Tautomerase family of proteins. Thus, 
NOV91 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 
1 5 m NOV92 is homologous to the Secreted leucine-rich repeat (LRR)-like family of 

proteins. Thus, NOV92 nucleic acids and polypeptides, antibodies and related compounds 
according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 

NOV93 is homologous to the Inosine-5' -Monophosphate Dehydrogenase-like family 
20 of proteins. Thus, NOV93 nucleic acids and polypeptides, antibodies and related compounds 
according to the invention will be useful in therapeutic and diagnostic applications implicated 
in various pathologies or conditions 

NOV94 is homologous to the Male-Specific Lethal 3-like family of proteins. Thus, 
NOV94 nucleic acids and polypeptides, antibodies and related compounds according to the 
25 invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 

NOV95 is homologous to the Cysteine Conjugate Beta Lyase-like family of proteins. 
Thus, NOV95 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
30 pathologies or conditions 

NOV96 is homologous to the Monocarboxylate transporter-like family of proteins. 
Thus, NOV96 nucleic acids and polypeptides, antibodies and related compounds according to 
the invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 
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NOV97 is homologous to the Carboxypeptidase Al-like family of proteins. Thus, 
NOV97 nucleic acids and polypeptides, antibodies and related compounds according to the 
invention will be useful in therapeutic and diagnostic applications implicated in various 
pathologies or conditions 
5 NOV98 is homologous to the Agrin-like family of proteins. Thus, NOV98 nucleic 

acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

NOV99 is homologous to the SNC73-like family of proteins. Thus, NOV99 nucleic 
10 acids and polypeptides, antibodies and related compounds according to the invention will be 
useful in therapeutic and diagnostic applications implicated in various pathologies or 
conditions 

The NOVX nucleic acids and polypeptides can also be used to screen for molecules, 
which inhibit or enhance NOVX activity or function. Specifically, the nucleic acids and 
15 polypeptides according to the invention may be used as targets for the identification of small 
molecules that modulate or inhibit, e.g., neurogenesis, cell differentiation, cell proliferation, . 
hematopoiesis, wound healing and angiogenesis. 

Additional utilities for the NOVX nucleic acids and polypeptides according to the 
invention are disclosed herein. 

20 NOV1 

NOV 1 includes three novel human 1 Claudin-like proteins disclosed below. The 
disclosed sequences have been named NOVla, NOVlb, NOVlc, NOVld, NOVle, NOVlf, 
andNOVlg. 

25 NOVla 

A disclosed NOVla nucleic acid of 687 nucleotides (also referred to as CG56592-02) 
encoding a novel human Claudin 6-like protein is shown in Table 1 A An open reading frame 
was identified beginning with an ATG initiation codon at nucleotides 6-8 and ending with a 
TAG termination codon at nucleotides 678-680. The start and stop codons are in bold letters 

30 in Table 1 A, and the 5' and 3' untranslated regions are underlined. 
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Table 1A. NOVla nucleotide sequence (SEQ ID NO:l). 

TGACT ATGGCCTGGAGTTTCCGTGCAAAAGTCCAGCTCGGGGGGCTACTTCTCTCCCTCCTTGGCTGGGTCT 
GC T C C TGTGTT AC C AC CAT C CTG C C C C AGTGG AAGAC TC T T AATCTGG AACTGAACG AG ATGG AG AC CTGG A 
TCATGGGGATTTGGGAGGTCTGCGTGGATCGAGAGGAAGTCGCCACTGTGTGCAAGGCCTTTGAATCCTTCT 
TGTCTCTGCCCCAGGAGCTCCAGGTAGCCCGCATCCTCATGGTAGCCTCCCATGGGCTGGGCCTATTGGGGC 
TTTTGCTCTGCAGCTTTGGGTCTGAATGCTTCCAGTTTCACAGGATCAGATGGGTATTCAAGAGGCGGCTTG 
GTCTCCTGGGAAGGACTTTGGAGGCATCCGCTTCAGCCACTACCCTCCTTCCAGTCTCCTGGGTGGCCCATG 
C C AC AAT CCAAGAC T T CTGGGATGACAG CAT C CCTGAC AT CATAC CCT CGGTGGGAGT T TGGAGGTG C C CTC 
TACTTGGGCTGGGCTGCTGGTATTTTCCTGGCTCTTGGTGGGCTACTCCTCATCTTCTCGGCCTGCCTGGGA 
AAAGAAGATGTGCCTTTTCCTTTGATGGCTGGTCCCACAGTCCCCCTATCCTGTGCTCCAGTGGAGGAGTCA 
GATGGCTCCTTCCACCTCATGCTAAGACCTAGGAACCTG 



In a search of public sequence databases, the NOVla nucleic acid sequence, located on 
chromsome 12 has 337 of 534 bases (63%) identical to a gb : GENB ANK- 
ID:HSA249735|acc:AJ249735.1 mRNA from Homo sapiens (CLDN6 gene for claudin-6). 

In all BLAST alignments herein, the "E-value" or "Expect" value is a numeric 
indication of the probability that the aligned sequences could have achieved their similarity to 
the BLAST query sequence by chance alone, within the database that was searched. For 
example, the probability that the subject ("Sbjct") retrieved from the NOVla BLAST analysis, 
e.g., Homo sapiens CLDN6 gene for claudin-6, matched the Query NOVla sequence purely 
by chance is 1.4e~ 15 . The Expect value (E) is a parameter that describes the number of hits one 
can "expect" to see just by chance when searching a database of a particular size. It decreases 
exponentially with the Score (S) that is assigned to a match between two sequences. 
Essentially, the E value describes the random background noise that exists for matches 
between sequences. 

The Expect value is used as a convenient way to create a significance threshold for 
reporting results. The default value used for blasting is typically set to 0.0001. In BLAST 2.0, 
the Expect value is also used instead of the P value (probability) to report the significance of 
matches. For example, an E value of one assigned to a hit can be interpreted as meaning that 
in a database of the current size one might expect to see one match with a similar score simply 
by chance. An E value of zero means that one would not expect to see any matches with a 
similar score simply by chance. See, e.g., 

http://www.ncbi.nlm.nih.gov/Education/BLASTinfo/. Occasionally, a string of X's or N's 
will result from a BLAST search. This is a result of automatic filtering of the query for low- 
complexity sequence that is performed to prevent artifactual hits. The filter substitutes any 
low-complexity sequence that it finds with the letter "N" in nucleotide sequence (e.g., 
"NNNNNNNNNNNNN") or the letter "X" in protein sequences (e.g., "XXXXXXXXX"). 
Low-complexity regions can result in high scores that reflect compositional bias rather than 
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significant position-by-position alignment. (Wootton and Federhen, Methods Enzymol 
266:554-571, 1996). 

The disclosed NOV la polypeptide (SEQ ID NO:2) encoded by SEQ ID NO:l has 229 
amino acid residues and is presented in Table IB using the one-letter amino acid code. Signal 
5 P, Psort and/or Hydropathy results predict that NOVla has no signal peptide and is likely to be 
localized the plasma membrane with a certainty of 0.6400. Alternatively, NOVla also may 
localize to the Golgi body with acertainty of 0.4600, the endoplasmic reticulum (membrane) 
with a certainty of 0.3700 or in the endoplasmic reticulum (lumen) with a certainty of 0.1000. 
The most likely cleavage site for a NOVla peptide is between amino acids 24 and 25, at: 
10 VCS-CV. 



I^WSFRAKVQLGGLLLSLLGWVCSCVTTILPQWKTLNLELNEMETWIMGIWEVCVDREEVATVCKAFESFLS 
LPQELQVARILMVASHGLGLLGLLLCSFGSECFQFHRIRWVFKRRLGLLGRTLEASASATTLLPVSWVAHAT 
IQDFWDDS I PDI I PRWEFGGALYLGWAAGI FLALGGLLLI FSACLGKEDVPFPLMAGPTVPLSCAPVEESDG 
SFHLMLRPRNLVI 



A search of sequence databases reveals that the NOVla amino acid sequence has 81 of 
207 amino acid residues (39%) identical to, and 111 of 207 amino acid residues (53%) similar 



15 to, the 219 amino acid residue ptnr:SWISSPROT-ACC:Q9Z262 protein from Mus musculus 
(Mouse) (Claudin-6) (E = 2.7e' 27 ). 

NOVla is predicted to be expressed in Bone Marrow, Brain, Liver, Placenta, and 

Lung. 



A disclosed NOVlb nucleic acid of 687 nucleotides (also referred to as CG56586-01) 
encoding a human Claudin-3-like protein is shown in Table 1C. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 6-8 and ending with a TAG 
codon at nucleotides 678-680. Putative untranslated regions upstream from the initiation 
25 codon, and downstream from the termination codon, if any, are underlined in Table 1C. The 
start and stop codons are in bold letters. 



Table IB. Encoded NOVla protein sequence (SEQ ID NO:2). 
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NOVlb 
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Table 1C. NOVlb nucleotide sequence (SEQ ID NO:3). 

TGACTATGGCCTGGAGTTTCCGTGCAAAAGTCCAGCTCGGGGGGCTACTTCTCTCCCTCCTTGGCTGGGTCT 
GCTCCTGTGTTACCACCATCCTGCCCCAGTGGAAGACTCTTAATCTGGAACTGAACGAGATGGAGACCTGGA 
TCATGGGGATTTGGGAGGTCTGCGTGGATCGAGAGGAAGTCGCCACTGTGTGCAAGGCCTTTGAATCCTTCT 
TGTCTCTGCCCCAGGAGCTCCAGGTAGCCCGCATCCTCATGGTAGCCTCCCATGGGCTGGGCCTATTGGGGC 
TTTTGCTCTGCAGCTTTGGGTCTGAATGCTTCCAGTTTCACAGGATCAGATGGGTATTCAAGAGGCGGCTTG 
GTCTCCTGGGAAGGACTTTGGAGGCATCCGCTTCAGCCACTACCCTCCTTCCAGTCTCCTGGGTGGCCCATG 
CCACAATCCAAGACTTCTGGGATGACAGCATCCCTGACATCATACCCTCGGTGGGAGTTTGGAGGTGCCCTC 
TACTTGGGCTGGGCTGCTGGTATTTTCCTGGCTCTTGGTGGGCTACTCCTCATCTTCTCGGCCTGCCTGGGA 
AAAGAAGATGTGCCTTTTCCTTTGATGGCTGGTCCCACAGTCCCCCTATCCTGTGCTCCAGTGGAGGAGTCA 
GATGGCTCCTTCCACCTCATGCTAAGACCTAGGAACCTG 



In a search of public sequence databases, the NOVlb nucleic acid sequence, located on 
chromsome 1 1 is 338 of 534 bases (63%) identical to a gb : GENB ANK- 

ID:HSA249735|acc:AJ249735. 1 mRNA from Homo sapiens (CLDN6 gene for claudin-6) (E - 
5 2.8e 16 ). 

The disclosed NOVlb polypeptide (SEQ ID NO:4) encoded by SEQ ID NO:3 has 224 
amino acid residues and is presented in Table ID using the one-letter amino acid code. Signal 
P, Psort and/or Hydropathy results predict that NOVlb has a signal peptide and is likely to be 
localized in the plasma membrane with a certainty of 0.4600. Alternatively, NOVlb may also 
10 localize to the microbody (peroxisome) with acertainty of 0.3200, the endoplasmic reticulum 
(membrane) with a certainty of 0.1000 or in the endoplasmic reticulum (lumen) with a 
certainty of 0.1000. The most likely cleavage site for a NOVlb peptide is between amino acids 
24 and 25, at: VCS-CV. 



Table ID, Encoded NOVlb protein sequence (SEQ ID NO:4). 

MAWSFRAKVQLGGLLLSLLGVFVCSCVTTI^ 

LPQELiQVARI LMVASHGHjGLLGIjIjLCS FGSECFQFHRI RWVFKRRLGLLGRTLEASAS ATTLLPVSWVAHAT 
I QDFWDDS I PD 1 1 PS VGVWRCPLLGLGCW YF PGS WWATPHLLGL PGKRRCAFS FDGWSHS P P I LCS SGGVRW 
LLPPHAKT 

15 ■ 

A search of sequence databases reveals that the NOVlb amino acid sequence has 50 of 
149 amino acid residues (33%) identical to, and 83 of 149 amino acid residues (55%) similar 
to, the 219 amino acid residue ptnr:SWISSPROT-ACC:Q63400 protein from Rattus 
norvegicus (Rat) (Claudin-3 (Ventral Prostate. 1 Protein) (RVP1)) (E = 0.0). 
20 NOVlb is predicted to be expressed in Bone Marrow, Brain, Liver, Placenta, and 

Lung. 

NOVlc 

A disclosed NOVlc nucleic acid of 642 nucleotides (also referred to as CG56592-03) 

encoding a novel Claudin-6-like protein is shown in Table IE. An open reading frame was 

20 
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identified beginning with a ATG initiation codon at nucleotides 6-8 and ending with a TAG 
codon at nucleotides 609-61 1 . The start and stop codons are in bold letters, and the 5' and 3' 
untranslated regions are underlined. 

Table IE. NOVlc Nucleotide Sequence (SEQ ID NO:5) 

TGACT ATGGCCTGGAGTTTCCGTGCAAAAGTCCAGCTCGGGGGGCTACTTCTCTCCCTCCTTGGCTGGGTC 
TGCTCCTGTGTTACCACCATCCTGCCCCAGTGGAAGACTCTTAATCTGGAACTGAACGAGATGGAGACCTG 
GATCATGGGGATTTGGGAGGTCTGCGTGGATCGAGAGGAAGTCGCCACTGTGTGCAAGGCCTTTGAATCCT 
TCTTGTCTCTGCCCCAGGAGCTCCAGTTTCACAGGATCAGATGGGTATTCAAGAGGCGGCTTGGTCTCCTG 
GGAAGGACTTTGGAGGCATCCGCTTCAGCCACTACCCTCCTTCCAGTCTCCTGGGTGGCCCATGCCACAAT 
CCAAGACTTCTGGGATGACAGCATCCCTGACATCATACCTCGGTGGGAGTTTGGAGGTGCCCTCTACTTGG 
GCTGGGCTGCTGGTATTTTCCTGGCTCTTGGTGGGCTACTCCTCATCTTCTCGGCCTGCCTGGGAAAAGAA 
GATGTGCCTTTTCCTTTGATGGCTGGTCCCACAGTCCCCCTATCCTGTGCTCCAGTGGAGGAGTCAGATGG 
CTCCTTCCACCTCATGCTAAGACCTAGGAACCTGGTCATCTA GGACTGGCTTCTGCCAAGGATCTCTGGAA 
TAA 



5 The disclosed NOVlc nucleic acid sequence maps to chromosome 12 and has 144 of 

220 bases (65%) identical to a gb:GENBANK-ID:HSA249735|acc:AJ249735.1 mRNA from 
Homo sapiens (CLDN6 gene for claudin-6) (E = 0.0). 

A disclosed NOVlc protein (SEQ ID NO:6) encoded by SEQ ID NO:5 has 201 amino 
acid residues, and is presented using the one-letter code in Table IF. Signal P, Psort and/or 

10 Hydropathy results predict that NOVlc does have a signal peptide, and is likely to be localized 
to the plasma membrane with a certainty of 0.4600. In other embodiments NOVlc is also 
likely to be localized to the microbody (peroxisome) with a certainty of 0.2651, to 
endoplasmic reticulum (membrane) with a certainty of 0.1000, or to the endoplasmic 
reticulum (lumen) with a certainty of 0.1000. The most likely cleavage site for NOV lc is 

1 5 between positions 24 and 25, (VCS-CV). 



Table IF. Encoded NOVlc protein sequence (SEQ ID NO:6). 

MAWSFRAKVQLGGLLLSLLGWVCSCVTTILPQWKTLNLELNEMETWIMGIWEVCVDREEVATVCKAFESFL 
SLPQELQFHRIRWVFKRRLGLLGRTLEASASATTLLPVSWVAHATIQDFWDDSIPDIIPRWEFGGALYLGW 
AAGIFLALGGLLLIFSACLGKEDVPFPLMAGPTVPLSCAPVEESDGSFHLMLRPRNLVI 

The disclosed NOVlc amino acid has 55 of 94 amino acid residues (58%) identical to, 
and 62 of 94 amino acid residues (65%) similar to, the 220 amino acid residue 
20 ptnr:SPTREMBL-ACC:Q9D7U6 protein from Mus musculus (Mouse) (22 1 0404A22RIK 
Protein) (E^S.le" 47 ). 

In addition, NOVlc is predicted to be expressed in Bone Marrow, Brain, Liver, 
Placenta, and Lung. 

NOVld 
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A disclosed NOVld nucleic acid of 726 nucleotides (also referred to as CG56592-02) 
encoding a novel Claudin 6-like protein is shown in Table 1G. An open reading frame was 
identified beginning with an ATG codon at nucleotides 6-8 and ending with a TAG codon at 
nucleotides 693-695. The start and stop codons are in bold letters and the 5' and 3' 
untranslated regions are underlined in Table 1G. 



Table 1G. NOVld nucleotide sequence (SEQ ID NO:7). 

TGACTATGGCCTGGAGTTTCCGTGCAAAAGTCCAGCTCGGGGGGCTACTTCTCTCCCTCCTTGGCTGGGTCT 
GTTCCTGTGTTACCACCATCCTGCCCCAGTGGAAGACTCTTAATCTGGAACTGAACGAGATGGAGACCTGGA 
TCATGGGGATTTGGGAGGTCTGCGTGGATCGAGAGGAAGTCGCCACTGTGTGCAAGGCCTTTGAATCCTTCT 
TGTCTCTGCCCCAGGAGCTCCAGGTAGCCCGCATCCTCATGGTAGCCTCCCATGGGCTGGGCCTATTGGGGC 
TTTTGCTCTGCAGCTTTGGGTCTGAATGCTTCCAGTTTCACAGGATCAGATGGGTATTCAAGAGGCGGCTTG 
GTCTCCTGGGAAGGACTTTGGAGGCATCCGCTTCAGCCACTACCCTCCTTCCAGTCTCCTGGGTGGCCCATG 
CCACAATCCAAGACTTCTGGGATGAC AGCATCCCTGACATCATAC CT CGGTGGGAGTTTGGAGGTGCCCTCT 
ACTTGGGCTGGGCTGCTGGTATTTTCCTGGCTCTTGGTGGGCTACTCCTCATCTTCTCGGCCTGCCTGGGAA 
AAGAAGATGTGCCTTTTCCTTTGATGGCTGGTCCCACAGTCCCCCTATCCTGTGCTCCAGTGGAGGAGTCAG 
ATGGCTCCTTCCACCTCATGCTAAGACCTAGGAACCTGGTCATCTAG GACTGGCTTCTGCCAAGGATCTCTG 
GAATAA 



In a search of public sequence databases, the NOVld nucleic acid sequence, located on 
chromsome 12 has 336 of 534 bases (62%) identical to a gb :GENB ANK- 

ID:HSA249735|acc:AJ249735.1 mRNA from Homo sapiens (CLDN6 gene for claudin-6) (E = 
6.5e 16 ). 

The disclosed NOVld polypeptide (SEQ ID NO:8) encoded by SEQ ID NO:7 has 229 
amino acid residues and is presented in Table 1H using the one-letter amino acid code. Signal 
P, Psort and/or Hydropathy results predict that NOVld has no signal peptide and is likely to be 
localized the the plasma membrane with a certainty of 0.6400. Alternatively, NOVld may also 
localize to the Golgi body with acertainty of 0.4600, the endoplasmic reticulum (membrane) 
with a certainty of 0.3700 or in the endoplasmic reticulum (lumen) with a certainty of 0.1000. 
The most likely cleavage site for a NOVld peptide is between amino acids 24 and 25, at: 
VCS-CV. 



Table 1H. Encoded NOVld protein sequence (SEQ ID NO:8). 

MAWS FRAKVQLGGLLLSLiLGWVCS CVTT I LPQWKTLNLELNEMETW I MG I WE VCVDREE VATVCKAFES FLS 
LPQELQVARI LMVASHGLGLLGLLLCS FGSECFQFHRI RWVFKRRLGLLGRTLEAS AS ATTLLPVS WVAHAT 
I QDFWDDS I PDI I PRWE FGGAL YLGWAAG I FLALGGLLLI FSACLGKEDVPFPLMAGPTVPLSCAPVEESDG 
SFHLMLRPRNLVI 

A search of sequence databases reveals that the NOVld amino acid sequence has 81 of 
207 amino acid residues (39%) identical to, and 111 of 207 amino acid residues (53%) similar 
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to, the 219 amino acid residue ptnr:SWISSPROT-ACC:Q9Z262 protein from Mus musculus 
(Mouse) (Claudin-6) (E - 2.8e" 27 ). 

Expression information was derived from the tissue sources of the sequences that were 
included in the derivation of NOVld. The sequence is predicted to be expressed in Bone 
5 Marrow, Brain, Liver, Placenta, and Lung. 

Homologies to any of the above NOV1 proteins will be shared by the other NOV1 
proteins insofar as they are homologous to each other as shown below. Any reference to 
NOV1 is assumed to refer to all four of the NOV1 proteins in general, unless otherwise noted. 

The disclosed NOV la polypeptide has homology to the amino acid sequences shown 
1 0 in the BLASTP data listed in Table II. 



Table 11. BLAST results for NO Via 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Po 
sitives 
(%) 


Expect 


gi | 17458947 | ref |XP__ 
061964. l| 
(XM_061964) 


similar to 
putative (H. 
sapiens) [Homo 
sapiens] 


229 


229/229 
(100%) 


229/229 
(100%) 


e-109 


>gi | 17437506 | ref | XP 
_068031 . 1 | 
(XM_068031) 


similar to 
putative (H. 
sapiens) [Homo 
sapiens] 


220 


99/172 
(57%) 


125/172 
(72%) 


4e-50 


gi | 17437504 | ref |XP_ 
06 8 03 0.1 | 
(XM068030) 


similar to 
putative (H. 
sapiens) [Homo 
sapiens] 


220 


99/172 
(57%) 


126/172 
(72%) 


4e-43 


gi | 12 843 24 8 | db j | BAB 
25914. l| (AK008821) 


PMP- 

22 /EMP/MP2 0 /Claud 
in family- 
containing 
protein-data 
source : Pf am, 
source 
key :PF00822, 
evidence : ISS-puta 
tive [Mus 
musculus] 


220 


104/188 
(55%) 


131/188 
(69%) 


3e-40 


gi | 7710002 | ref |NP_0 
57 8 83. l| 
<NM 016674) 


claudin 1 [Mus 
musculus] 


211 


67/194 
(34%) 


99/194 
(50%) 


2e-20 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 1 J. In the ClustalW alignment of the NOV1 proteins, as 
15 well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
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can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 1J. ClustalW Analysis of NOV1 



Novel NOVla 

Novel NOVlb 

Novel NOVlc 

Novel NOV Id 



(SEQ ID NO: 2) 
(SEQ ID NO: 4) 
(SEQ ID NO: 6) 
(SEQ ID NO: 8) 



5 1) 
2) 
3) 
4) 

5) gi | 17458947 |ref | XP_061964 . 1 | (XM_061964) similar to putative (H. sapiens) [Homo 
10 sapiens] (SEQ ID NO: 1383) 

6) gi | 17437506 | ref | XP_068031 . 1 | (XM_068O31) similar to putative (H. sapiens) [Homo 
sapiens] {SEQ ID NO: 13 84) 

7) gi | 17437504 | ref | XP_068 03 0 . 1 | (XM_068030) similar to putative (H. sapiens) [Homo 
sapiens] (SEQ ID NO: 32 5) 

15 8) gi | 12 84 3 24 8 | db j | BAB2 5 914 . 1 | (AK008821) PMP-22/EMP/MP20/Claudin family containing 

protein-data source :Pf am, source key:PF00822, evidence : ISS-putative [Mus musculus] 
(SEQ ID NO:326) 

9) gi | 7710002 | ref | NP_057883 . 1 | (NM_016674) claudinl [Mus musculus] (SEQ ID NO: 3 27) 



20 



25 



30 



NOVla 
NOVlb 
NOVlc 
NOVld 

gi | 17458947 | 
gi | 17437506 j 
gi | 17437504 j 
gi | 12843248 | 
gi | 7710002 | 




35 



40 



NOVla 


58 


EEV 


ATV 


CK 


NOVlb 


58 


EEV 


ATV 


CK 


NOVlc 


58 


EEV 


ATV 


CK 


NOVld 


58 


EEV 


ATV 


CK 


gi| 17458947 | 


58 


EEV 


ATV 


CK 


gi|l7437506 | 


58 


EEV 


GMQ 


CK 


gi j 17437504 j 


58 


EEV 


GMQ 


CK 


gi j 12843248 | 


58 


EEV 


GRQ 


CK 


gi|7710002 | 


58 


STGQIQ 


CK 




110 

jSlBFpFHRIBl 

F^FHRlf" 
^FHRlIl 

E 

jsfe^ 

"igesQrdl 

.EjlGESORDL 
L^LGET^EGL 
ScLEDDE! 



120 



FQFHRIRVi 

iF^FHRlj^W 



117 
117 
117 
117 
117 
117 
117 
117 
117 



45 



50 



55 



60 



130 



NOVla 
NOVlb 
NOVlc 
NOVld 
9i 

gi 
gi 
gi 
gi 



17458947 | 
17437506 | 
17437504 j 
12843248 | 
7710002 | 



NOVla 
NOVlb 
NOVlc 
NOVld 

gi | 17458947 | 
gijl7437506| 




178 
178 
178 
178 
178 
178 



190 



200 



220 



..-.|....|....|....|....| 
SWWATPHLLGLPGKRRCAFSFDGWSHS| 
I FSggLGKEDgPFPLMgGPT , 

IF ll 

r F3™lgked|pfplmBgp' 

IsshapjJasghySvaqtq 



LGGfflLL 

lggSll 
lggHll 

LGGfflLL 

lggHll 



SlGKEDJ|PFPLmBgPT\ 

3lgkedvpfplmBgpt\ 



new 



210 

pilcssggvrw: 

scapveesEgs 

,S CAP VEE SWGS Fp 

scapveesSSgs 
scapveesBgs 




230 

IPPHAKT- 224 

PRgLVtq 22 9 

PrSlv| 22 9 

PR*Lvi 22 9 
229 

KH 22 0 
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gi I 17437504 I 178 Fj^c WlN CA^SSHAPIiALGHY^VAOMQ TQCP|jlEDGTAgPQ^ 220 

gi|l2843248| 178 BHc|fflHc|HwS PAPAAS SH YgVAGPR !3 H QQi3 E iiSQ A ii PE § 220 

gi|7710002| 177 B88 vflBsCSCPRKTTSYPTPRPYPKPTfg SSGKDY-g- 211 

5 

The claudins are a family of integral membrane proteins that are major components of 
tight junction (TJ) strands. When claudins are introduced into cells that lack tight junctions, 
networks of strands and grooves form at cell-cell contact sites that closely resemble native 
tight junctions. There are at least 17 members of this family in mammals. Claudin family 

10 members share -38% amino acid identity, and are predicted to have four transmembrane (TM) 
domains, which is reminiscent of occludin, although they share no sequence similarity with it. 
Multiple sequence alignment reveals their sequences to be fairly well conserved in the first 
and fourth putative TM domains, and in the first and second extracellular loops, but they 
diverge in the second and third TM domains. Although the sequences of their C-terminal 

1 5 cytoplasmic domains vary, the known family members share a common motif of -Y-V. This 
has been postulated as a possible binding motif for PDZ domains of other tight junction- 
associated peripheral membrane proteins, such as ZO-1. 

The disclosed NOV1 nucleic acid of the invention encoding a Human Claudin -like 
protein includes the nucleic acid whose sequence is provided in Table 1A ,1C, IE, 1G, or a 

20 fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 1 A, 1C, IE, or 1G while 
still encoding a protein that maintains its Human Claudin-like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 

25 that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 

30 stability of the modified nucleic acid, such that they may be used, for example, as antisense 

binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 37 percent of the bases may be so changed. 

The disclosed NOV1 protein of the invention includes the Human Claudin-like protein 
whose sequence is provided in Table IB, ID, IF, or 1H. The invention also includes a mutant 

35 or variant protein any of whose residues may be changed from the corresponding residue 
shown in Table IB, ID, IF, or 1H while still encoding a protein that maintains its Human 
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Claudin-like activities and physiological functions, or a functional fragment thereof. In the 
mutant or variant protein, up to about 66 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 



(NOV1) is a member of a "Human Claudin family". Therefore, the NOV1 nucleic acids and 
proteins identified here may be useful in potential therapeutic applications implicated in (but 
not limited to) various pathologies and disorders as indicated below. The potential therapeutic 
applications for this invention include, but are not limited to: protein therapeutic, small 
10 molecule drug target, antibody target (therapeutic, diagnostic, drug target ing/cytotoxic 

antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 
research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing 
(but not limited to) those defined here. 



15 therapeutic applications implicated in cancer including but not limited to various pathologies 
and disorders as indicated below. For example, a cDNA encoding the Human Claudin-like 
protein (NOV1) may be useful in gene therapy, and the Human Claudin -like protein (NOV1) 
may be useful when administered to a subject in need thereof. By way of nonlimiting 
example, the compositions of the present invention will have efficacy for treatment of patients 

20 suffering from Von Hippel-Lindau (VHL) syndrome, Cirrhosis, Transplantation, Hemophilia, 
hypercoagulation, Idiopathic thrombocytopenic purpura, autoimmume disease, allergies, 
immunodeficiencies, transplantation, Graft vesus host, Alzheimer's disease, Stroke, Tuberous 
sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, Cerebral palsy, Epilepsy, 
Lesch-Nyhan syndrome, Multiple sclerosis, Ataxia-telangiectasia, Leukodystrophies, 

25 Behavioral disorders, Addiction, Anxiety, Pain, Neuroprotection, Systemic lupus 

erythematosus , Autoimmune disease, Asthma, Emphysema, Scleroderma, allergy, and 
Cancer, or other pathologies or conditions. The NOV1 nucleic acid encoding the Human 
Claudin-like protein of the invention, or fragments thereof, may further be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein are to be 

30 assessed. 

NOV1 nucleic acids and polypeptides are further useful in the generation of antibodies 
that bind immuno-specifically to the novel NOV1 substances for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 
art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" 
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The above disclosed information suggests that this Human Claudin-like protein 



The NOV1 nucleic acids and proteins of the invention are useful in potential 
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section below. The disclosed NOV1 proteins have multiple hydrophilic regions, each of 
which can be used as an immunogen. These novel proteins can be used in assay systems for 
functional analysis of various human disorders, which will help in understanding of pathology 
of the disease and development of new drug targets for various disorders. 

NOV2 

A disclosed NOV2 nucleic acid of 1361 nucleotides (also referred to as CG56596-01) 
encoding a novel Protein Serine Kinase-like protein is shown in Table 2A. An open reading 
frame was identified beginning with an ATG initiation codon at nucleotides 20-22 and ending 
with a TAA codon at nucleotides 1268-1270. A The start and stop codons are in bold letters 
in Table 2A. 

Table 2A. NOV2 nucleotide sequence (SEQ ID NO:9). 

~CGGCGGGCGTGTTGCGGGT ATGGGGTGCGGCGCCAGCAGGAAGGTGGTCCCGGGGCCACCAAAAATTCTTGT 
AAT AGAATTGG CAT CC AAAGTGGAAC C C AG AAATGGAAC AAAG AATGAT C T CT AT AAAT TTTTTT AT TAT AC 
TTTAAGTTCTACTCCTCCCTGCCCTCTGCCACTCCCCTCACTACCCCAGTGCCCCCTCCCTCCTTGCCCTGG 
GCCCGAGGCGGCGGCCCAGGCGGCGCAGAGGATACAGGTGGCTCGCTTCCGAGCCAAGTTCGACCCCCGGGT 
CCTTGCCAGATATGACATCAAAGCTCTTATTGGGACAGGCAGTTTCAGCAGGGTTGTCAGGGTAGAGCAGAA 
GACCACCAAGAAACCTTTTGCAATAAAAGTGATGGAAACCAGAGAGAGGGAAGGTAGAGAAGCGTGCGTGTC 
TGAGCTGAGCGTCCTGCGGCGGGTTAGCCATCGTTACATTGTCCAGCTCATGGAGATCTTTGAGACTGAGGA 
TCAAGTTTACATGGTAATGGAGCTGGCTACCGGAGGGGAGCTCTTTGATCGACTCATTGCTCAGGGATCCTT 
TACAGAGCGGGATGCCGTCAGGATCCTCCAGATGGTTGCTGATGGGATTAGGTATTTGCATGCGCTGCAGAT 
AACTCATAGGAATCTAAAGCCTGAAAACCTCTTATACTATCATCCAGGTGAAGAGTCGAAAATTTTAATTAC 
AGATTTTGGTTTGGCATACTCCGGGAAAAAAAGTGGTGACTGGACAATGAAGACACTCTGTGGGACCCCAGA 
GTACATAGCTCCTGAGGTTTTGCTAAGGAAGCCTTATACCAGTGCAGTGGACATGTGGGCTCTTGGTGTGAT 
CACATATGCTTTACTTAGCGGATTCCTGCCTTTTGATGATGAAAGCCAGACAAGGCTTTACAGGAAGATTCT 
G AAAGGC AAAT AT AATT AT AC AGGAGAGC CTTGGC CAAGCATT T C CC AC TT GGCG AAGG ACT TT ATAGAC AA 
ACTACTGATTTTGGAGGCTGGTCATCGCATGTCAGCTGGCCAGGCCCTGGACCATCCCTGGGTGATCACCAT 
GGCTGCAGGGTCTTCCATGAAGAATCTCCAGAGGGCCATATCCCGAAACCTCATGCAGAGGGCCTCTCCCCA 
CTCTCAGAGTCCTGGATCTGCACAGTCTTCTAAGTCACATTATTCTCACAAATCCAGGCATATGTGGAGCAA 
GAGAAACTTAAGGATAGTAGAATCGCCACTGTCTGCGCTTTTGTAA GCAGATGACCTCTAAAACTATTTTTG 
CCTATTTTAGGACCATTTCATCATGATTAGGGCACCCTCAAGCTCCAAAGACACGGGACTCCATG 



The disclosed NOV2 nucleic acid sequence, localized to the q21 .3-22 region of 
chromsome 18, has 685 of 997 bases (68%) identical to a gb:GENBANK- 
ID:HSA272212|acc:AJ272212.1 mRNA from Homo sapiens (mRNA for protein serine kinase 
(PSKH1 gene)) (E = 6.1e" 85 ). 

A NOV2 polypeptide (SEQ ID NO: 10) encoded by SEQ ID NO:9 has 416 amino acid 
residues and is presented using the one-letter code in Table 2B. Signal P, Psort and/or 
Hydropathy results predict that NOV2 contains no signal peptide and is likely to be localized 
to the endoplasmic reticulum (membrane) with a certainty of 0.5500. Alternatively, NOV2 
may also localize to the lysosome (lumen) with a certainty of 0.2403, the plasma membrane 
with a certainty of 0.1900, or the microbody (peroxisome) with a certainty of 0.1 1 1 1 . 
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Table 2B. Encoded NOV2 protein sequence (SEQ ID NO: 10). 

MGCGASRKWPGPPKILVIELASKVEPRNGTKNDLYKFFYYTLSSTPPCPLPLPSLPQCPLPPCPGPEAAAQ 
AAQRIQVARFRAKFDPRVLARYDIKALIGTGSFSRWRVEQKTTKKPFAIKVMETREREGREACVSELSVLR 
RVSHRYIVQLMEIFETEDQVYMVMELATGGELFDRLIAQGSFTERDAVRILQMVADGIRYLHALQITHRNLK 
PENIiLYYHPGEESKILITDFGLAYSGKKSGDWTMKTLCGTPEYIAPEVLLRKPYTSAVDMWALGVITYALLS 
GFLPFDDESQTRLYRKILKGKYNYTGEPWPSISHLAKDFIDKLLILEAGHRMSAGQALDHPWVITMAAGSSM 
KNLQRAISRNLMQRASPHSQSPGSAQSSKSHYSHKSRHMWSKRNLRIVESPLSALL 

The disclosed NOV2 amino acid sequence has 267 of 412 amino acid residues (64%) 
identical to, and 332 of 412 amino acid residues (80%) similar to, the 424 amino acid residue 
ptnr:SPTREMBL-ACC:Q9N Y 1 9 protein from Homo sapiens (Human) (Protein Serine 
5 Kinase) (E = 1.1 e 138 ). 

NOV2 is predicted to be expressed in Kidney, Lymph node, Pancreas, Salivary Glands, 
Brain, and Placenta because of the expression pattern of (GENBANK-ID: gb:GENBANK- 
ID:HSA272212jacc:AJ272212.1) a closely related Homo sapiens mRNA for protein serine 
kinase (PSKH 1 gene) homolog. 
10 In addition, the sequence is predicted to be expressed in keratinocytes because of the 

expression pattern of (GENBANK-ID: gb:GENBANK-ID:HSPI1371 l|acc:AJ00 1696.2) a 
closely related Homo sapiens mRNA for hurpin, clone R7-1 .1 homolog. 

NOV2 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 2C. 

15 



Table 2C. BLAST results for NOV2 


Gene Index/ 
Identifier 


Protein/ 
Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 14 9164 55 | ref | NP_ 
149117 . 1 | 
(NM_03 3126) 


serine/threo 
nine kinase 
PSKH2 [Homo 
sapiens] 


385 


369/416 
(88%) 


372/416 
(88%) 


0 . 0 


gi | 17 530179 | gb | AAL4 
0735.1) (AF416988) 


protein 
serine 

kinase/lucif 
erase fusion 
protein 


975 


257/391 
(65%) 


318/391 
(80%) 


e-149 


gi | 14 77 6113 | ref | XP_ 
043047 . 1 | 
(XM_043047) 


hypothetical 
protein 
XP_043047 
[Homo 
sapiens] 


424 


257/391 
(65%) 


318/391 
(80%) 


e-145 


gi | 15963448 } gb | AAL1 
1033. l| (AF236367) 


protein 
serine 

kinase Pskhl 
[Mus 

musculus] 


424 


254/386 
(65%) 


311/386 
(79%) 


e-144 
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gi | 2136035 |pir | | 138 


protein- 


319 


209/320 


258/320 


e-115 


138 


serine 
kinase (EC 
2.7.1.-) 
PSK-Hl - 
human 
(fragment) 




(65%) 


(80%) 





The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 2D. 



5 Table 2D. ClustalW Analysis of NOV2 

1) NOV2 (SEQ ID NO: 10) 

6) gi | 14916455 | ref |NP_149117 . 1 | (NM_033126) serine/threonine kinase PSKH2 [Homo 
sapiens] (SEQ ID NO: 328) 

7) gi | 17530179 |gb | AAL40735 . 1 | (AF416988) protein serine kinase/lucif erase fusion 
10 protein (SEQ ID NO:329) 

8) gi | 14776113 | ref |XP_043047 . 1 | (XM_043047) hypothetical protein XP043047 [Homo 
sapiens] (SEQ ID NO: 330) 

9) gi | 15963448 |gb | AAL11033 . 1 | (AF236367) protein serine kinase Pskhl [Mus musculus] 
(SEQ ID NO: 331) 

15 10) gi | 2136035 |pir | | 138138 protein- serine kinase (EC 2.7.1.-) PSK-Hl - human 

(fragment) (SEQ ID NO: 332) 



20 



25 



30 



35 



40 



45 



50 



55 



NOV2 

gi | 14916455 | 
gi | 17530179 | 
gi | 14776113 | 
gi j 15963448 | 
gi | 2136035 | 



NOV2 

gi | 14916455 | 
gi j 17530179 | 
gi I 14776113 j 
gi | 15963448 | 
gi j 2136035 | 



NOV2 

gi | 14916455 
gi | 17530179 
gi | 14776113 
gi j 15963448 
gi j 2136035 | 



NOV2 

gi | 14916455 | 
gi j 17530179 | 
gi j 14776113 | 
gi|15963448| 
gi j 2136035 | 



NOV2 
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gi | 14916455 | 
gi|l7530179| 
gi j 14776113 j 
gi |l5963448 j 
gi|2136035| 




10 



15 



20 



25 



30 



NOV2 

gi | 14916455 | 
gi j 17530179 j 
gi 1 14776113 j 
gi|l5963448| 
gi|2136035 | 



I 



310 



320 



330 



340 



295 
264 
299 
299 
299 
299 



D^^TRLYRJ|IL|GKYigY|jGEPWPSaS|LAKDFIDgLL 
D^^TRLYRglL|GKY^Y§GEPWPSys|LAKDFIDlSLL 

gepwpsSsSlakdfidIll 
gepwps|sIlakdfid|ll 
ddnrtrlyrqilrgkysy3gepwpsSsSlakdfidsll 



DDNRTRLYRQ I LRGK YS YgjGE PWPS 
DDNRTRLYRQ I LRGKYS y| GE PWP S 




350 
I - • 



360 



RMgAgQAL 
RM|A QAL 
RMifcA QAL 
RMMA QAL 
RMfifA SQAL 



DDNRTRLYRQ I LRGKYS YgjGE 




NOV2 


355 


S 5 

S s 


SSMKNL 


gi| 14916455 | 


324 


SSMKNL 


gi 1 17530179 j 


359 


SSMKNL 


gi| 14776113 j 


'359 


SSMKNL 


gi j 15963448 j 


359 


Sis 


SSMKNL 


gi|2136035| 


319 







RAS 


PHS 


QS 


PG 


SAQS 


RAS 


PHS 


QS 


PG 


SAQS 


RAS 


SRC 


QS 


TK 


SAQS 


RAS 


SRC 


QS 


TK 


SAQS 


RAS 


SRC 


QS 


TK 


sIqs 




NOV2 

gi | 14916455 | 
gi 1 17530179 j 
gi j 14776113 j 
gi | 15963448 | 
gi | 2136035 | 



I 



430 



440 



450 



460 



470 



480 
•-I 



415 LL 416 

384 LL 385 

419 QQQYNGAMEDAKNIKKGPAPFYPLEDGTAGEQLHKAMKRYALVPGTIAFTDAHIEVDITY 478 

419 QQQYNG 424 

419 QQQYNG 424 

319 319 



35 



40 



NOV2 

gi 1 14916455 | 
gi|l7530179| 
gi [14776113 | 
gi j 15963448 | 
gi|2136035 | 



416 



490 



500 
..|.. 



510 



520 
..|.. 



530 



540 
■•I 



416 



385 385 

4 7 9 AEYFEMSVRLAEAMKRYGLNTNHRI WCSENSLQFFMPVLGALFIGVAVAPANDI YNERE 53 8 



424 
424 
319 



424 
424 
319 



45 



50 



55 



60 



65 



70 



NOV2 


416 


gi| 14916455 


385 


gi | 17530179 


539 


gi j 14776113 


424 


gi | 15963448 


424 


gi|2136035| 


319 


NOV2 


416 


gi | 14916455 


385 


gi j 17530179 


599 


gi | 14776113 


424 


gi j 15963448 


424 


gi|2136035 | 


319 


NOV2 


416 


gi |14916455 


385 


gi| 17530179 


659 


gij 14776113 


424 


gi j 15963448 


424 


gi|2136035 | 


319 



550 
..|.. 



560 



570 



580 



590 
..|.. 



600 
■-I 



416 
385 
598 
424 
424 
319 



610 



620 



630 
..|.. 



640 



650 



660 



416 

385 
658 
424 
424 
319 



670 



680 
..|.. 



690 



700 



710 
..|.. 



720 



416 
385 
718 
424 
424 
319 



730 



740 



750 

30 



760 



770 



780 



ITS 7 



10 



15 



20 



25 



30 



35 



40 



NOV2 

gi | 14916455 | 
gi|l7530179| 
gi | 14776113 j 
gi j 15963448 j 
gi |2136035 | 



NOV2 

gi | 14916455 | 
gi | 17530179 j 
gi | 14776113 j 
gi 1 15963448 | 
gi j 2136035 | 



NOV2 

gi | 14916455 | 
gi | 17530179 | 
gi | 14776113 | 
gi | 15963448 | 
gi|2136035 | 



NOV2 

gi | 14916455 | 
gi | 17530179 | 
gi j 14776113 j 
gi|l5963448| 
gi [2136035 | 



NOV2 

gi | 14916455 | 
gi|l7530179| 
gi j 14776113 | 
gi | 15963448 j 
gi |2136035 | 



.--.|-...|....|....|....|....|....|....|....|....|....|....| 

416 416 

385 385 

719 FFAKSTLIDKYDLSNLHEIASGGAPLSKEVGEAVAKRFHLPGIRQGYGLTETTSAILITP 778 

424 424 

424 424 

319 319 

790 800 810 , 820 830 840 

....|....|....|....|....|....|....|....|....|....|....|....| 

416 416 

385 385 

779 EGDDKPGAVGKWPFFEAKWDLDTGKTLGVNQRGELCVRGPMIMSGYVNNPEATNALID 838 

424 424 

424 424 

319 319 

850 860 870 880 890 900 

--..|....|....|....|....|....|...-|....|....|....|....|....| 
41 6 416 

385 385 

83 9 KDGWLHSGDI AYWDEDEHFFI VDRLKSLIKYKGYQVAPAELES ILLQHPNI FDAGVAGLP 8 98 

424 424 

424 424 

319 319 

910 920 930 940 950 960 

.-..|....|....|....|..-.|---.|-...|....|....|....|....|....| 

416 416 

385 385 

8 99 DDDAGELPAAWVLEHGKTMTEKEIVDYVASQVTTAKKLRGGWFVDEVPKGLTGKLDAR 958 

424 424 

424 424 

319 319 

970 

....|....|....|.. 

416 416 

385 385 

959 KIREILIKAKKGGKIAV 975 

424 424 

424 424 

319 319 
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The presence of identifiable domains in NOV2, as well as all other NOVX proteins, 
was determined by searches using software algorithms such as PROSITE, DOMAIN, Blocks, 
Pfam, ProDomain, and Prints, and then determining the Interpro number by crossing the 

50 domain match (or numbers) using the Interpro website (http:www.ebi.ac.uk/ interpro). 

DOMAIN results for NOV2 as disclosed in Tables 2E-2G, were collected from the Conserved 
Domain Database (CDD) with Reverse Position Specific BLAST analyses. This BLAST 
analysis software samples domains found in the Smart and Pfam collections. For Table 2K 
and all successive DOMAIN sequence alignments, fully conserved single residues are 

55 indicated by black shading or by the sign (|) and "strong" semi-conserved residues are 

indicated by grey shading or by the sign (+). The "strong" group of conserved amino acid 
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residues may be any one of the following groups of amino acids: STA, NEQK, NHQK, 
NDEQ, QHRK, MILV, MTLF, HY, FYW. 

Tables 2E-G lists the domain description from DOMAIN analysis results against 
NOV2. This indicates that the NOV2 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 2E Domain Analysis of NOV2 

gnl | Smart | smart 0022 0 , S_TKc , Serine/Threonine protein kinases, 
catalytic domain; Phosphotransferases. Serine or threonine- specif ic 
kinase subfamily. (SEQ ID NO: 79 9) 
CD-Length = 256 residues, 100.0% aligned 

Score = 261 bits (668) , Expect = 4e-71 



10 



15 



20 



25 



NOV 2 : 


94 


Sbjct : 


1 


NOV 2 : 


152 


Sbjct : 


61 


NOV 2 : 


212 


Sbjct : 


121 


NOV 2 : 


272 


Sbjct : 


177 


NOV 2 : 


332 


Sbjct : 


237 



YDIKALIGTGSFSRVVRVEQKTTKKPFAIKVMETRE- - REGREACVSELSVLRRVSHRYI 151 

I++ ++I l + l +1 III I I II++ + ++ 1 1 + 1+ + I+++ I I 

YEIiLEVLGKGAFGKVYLARDKKTGKLVAIKVIKKEKLKKKKRERILREIKILKKLDHPNI 6 0 
VQLMEIFETEDQVYMVMELATGGELFDRLIAQGSFTERDAVRILQMVADGIRYLHALQIT 211 

l + l ++II +I++I+III ll+lll I +1 +1 +1 + + + 111+ I 

VKLYDVFEDDDKLYLVMEYCEGGDLFDLLKKRGRLSEDEARFYARQILSALEYLHSQGII 12 0 
HRNLKPENLLYYHPGEESKILITDFGLAYSGKKSGDWTMKTLCGTPEYIAPEVLLRKPYT 2 71 

ll+lllll+l I + + Mill II + I 1 1 1 1 l + l I 1 1 1 1 I I 

HRDLKPENILLDSDGH VKLADFGLA- KQLDSGGTLLTTFVGTPEYMAPEVLLGKGYG 17 6 

SAVDMWALGVITYALLSGFLPFDDESQTRLYRKILKGKYNYTGEPWPSISHLAKDFIDKL 331 

lll+l+llll I ll+l M + I I + I II III I II 

KAVDIWSLGVILYELLTGKPPFPGDDQLLALFKKIGKPPPPFPPPEWKISPEAKDLIKKL 23 6 

LI LEAGHRMS AGQALDHPWV 3 51 

1+ + I++I +11+11+ 
LVKDPEKRLTAEEALEHPFF 25 6 



Table 2F Domain Analysis of NOV2 

gnl | Pf am | pfam0006 9, pkinase, Protein kinase domain {SEQ ID NO: 800) 
CD-Length = 256 residues, 100.0% aligned 
Score = 230 bits (586) , Expect = le-61 
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NOV 2 : 


94 


Sbjct : 


1 


NOV 2 : 


153 


Sbjct : 


61 


NOV 2 : 


212 


Sbj Ct : 


121 


NOV 2 : 


272 


Sbjct : 


178 


NOV 2 : 


332 


Sbjct : 


237 



YDIKALIGTGSFSRWRVEQKTTKKPFAIKVMETREREGREACV- SELSVLRRVSHRYIV 152 

| ++ + |+|+l +1 + + I I + II I+++ I ++ 1+ +111+11 M 

YELGEKLGSGAFGKVYKGKHKDTGEIVAIKILKKRSLSEKKKRFLREIQILRRLSHPNIV 6 0 

QLMEI FETEDQVYMVMELATGGELFDRLI AQGS - FTERDAVRILQMVADGIRYLHALQIT 211 

+ 1+ +|| +1 +I + MI ll + lll I I +I++I +1 + 1+ 111+ I 

RLLGVFEEDDHLYLVMEYMEGGDLFDYLRRNGLLLSEKEAKKIALQILRGLEYLHSRGI V 12 0 
HRNLKPENLLYYHPGEESKILITDFGLAYSGKKSGDWTMKTLCGTPEYIAPEVLLRKPYT 271 

ll + lllll + l I + I Mill +1 +1 IMM + IMM + 1 + 

HRDLKPENILLDENGT VKIADFGLARKLESSSYEKLTTFVGTPEYMAPEVLEGRGYS 177 

SAVDMWALGVITYALLSGFLPFDDESQTRLYRKILKGKYNYTGEPWPSISHLAKDFIDKL 331 

I ll + l + MM I ll + l III +1 + 1+ I II I I 

SKVDVWSLGVILYELLTGKLPFPGIDPLEELFRIKERPR-LRLPLPPNCSEELKDLIKKC 23 6 

LI LEAGHRMS AGQALDHPWV 3 51 

I + 1*1+1+111 

LNKDPEKRPTAKEILNHPWF 25S 
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Table 2G Domain Analysis of NOV2 

gnl | Smart | smart002 19 , TyrKc , Tyrosine kinase, catalytic domain; 
Phosphotransferases. Tyrosine-specif ic kinase subfamily. (SEQ ID 
NO: 801) 

CD-Length = 258 residues, 83.7% aligned 
Score = 117 bits (292) , Expect = 2e-27 



10 



15 



NOV 2 : 


100 


Sb j Ct : 


7 


NOV 2 : 


156 


Sbjct : 


67 


NOV 2 : 


214 


Sbjct : 


127 


NOV 2 : 


273 


Sbjct : 


184 



+ 11 + 11+ + + l + l + I +1+1 ++I++ I 1 1 + 1 + 

LGEGAFGEVYKGTLKGKGGVEVEVAVKTLKEDASEQQIEEFLREARLMRKLDHPNIVKLL 66 



II I 



III I 



I I I 



I I I I 



+ I + 



+ I 



1+ II 



+ +I I I I 



II 



1 + 1+ I 1+ + + + I 1+ I 



Protein phosphorylation is a fundamental process for the regulation of cellular 
functions. The coordinated action of both protein kinases and phosphatases controls the levels 

20 of phosphorylation and, hence, the activity of specific target proteins. One of the predominant 
roles of protein phosphorylation is in signal transduction, where extracellular signals are 
amplified and propagated by a cascade of protein phosphorylation and dephosphorylation 
events. Eukaryotic protein kinases are enzymes that belong to a very extensive family of 
proteins which share a conserved catalytic core common with both serine/threonine and 

25 tyrosine protein kinases. There are a number of conserved regions in the catalytic domain of 
protein kinases. In the N-terminal extremity of the catalytic domain there is a glycine-rich 
stretch of residues in the vicinity of a lysine residue, which has been shown to be involved in 
ATP binding. In the central part of the catalytic domain there is a conserved aspartic acid 
residue which is important for the catalytic activity of the enzyme. 

30 The disclosed NOV2 nucleic acid of the invention encoding a Protein Serine Kinase- 

like protein includes the nucleic acid whose sequence is provided in Tables 2A or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Tables 2A while still encoding a protein 
that maintains its Protein Serine Kinase -like activities and physiological functions, or a 

35 fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to those just described, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 

nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
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chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
5 binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 32 percent of the bases may be so changed. 

The disclosed NOV2 protein of the invention includes the Protein Serine Kinase -like 
protein whose sequence is provided in Tables 2B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 

10 in Table 2B while still encoding a protein that maintains its Protein Serine Kinase-like 

activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 35 percent of the residues may be so changed. 

The NOV2 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Diabetes, Von Hippel-Lindau (VHL) syndrome , 

15 Pancreatitis, Obesity, Lymphedema , Allergies, Alzheimer's disease, Stroke, Tuberous 

sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, Cerebral palsy, Epilepsy, 
Lesch-Nyhan syndrome, Multiple sclerosis, Ataxia-telangiectasia, Leukodystrophies, 
Behavioral disorders, Addiction, Anxiety, Pain, Neuroprotection, Diabetes, Autoimmune 
disease, Renal artery stenosis, Interstitial nephritis, Glomerulonephritis, Polycystic kidney 

20 disease, Systemic lupus erythematosus, Renal tubular acidosis, IgA nephropathy, and/or other 
pathologies and disorders. 

NOV2 nucleic acids and polypeptides are further useful in the generation of antibodies 
that bind immunospeciflcally to the novel substances of the invention for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 

25 art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" 
section below. The disclosed NOV2 protein has multiple hydrophilic regions, each of which 
can be used as an immunogen. These novel proteins can be used in assay systems for 
functional analysis of various human disorders, which are useful in understanding of 
pathology of the disease and development of new drug targets for various disorders. 
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NOV3 



NOV3 includes three novel human 1 Claudin-like proteins disclosed below. The 
disclosed sequences have been named NOV3a, NOV3b, and NOV3c. 
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NOV3a 

A disclosed NOV3a nucleic acid of 695 nucleotides (designated CuraGen Acc. No. 
CG56594-01) encoding a novel Claudin-19-like protein is shown in Table 3A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 53-55 and 
ending with a TGA codon at nucleotides 662-664. A putative untranslated region downstream 
from the termination codon is underlined in Table 3 A, and the start and stop codons are in 
bold letters. 



Table 3 A. NOV3a Nucleotide Sequence (SEQ ID NO: 11) 



GCACCCTGGCCCAGCTCTGAGTCCTGGGACCCTCGGTCCTCTCTCCTGGGCC ATGGCCAACTCAGGCCTC 
CAGCTCCTGGGCTACTTCTTGGCCCTGGGTGGCTGGGTGGGCATCATTGCTAGCACAGCCCTGCCACAGT 
GGAAGCAGTCTTCCTACGCAGGCGACGCCATCATCACTGCCGTGGGCCTCTATGAAGGGCTCTGGATGTC 
CTG CGC CT C C C AG AG C AC T GGG C AAGTGC AGTGC AAGC T CT AC G ACT CG CTGC T CGC C CTGG ACGGT AGG 
CCCCAGGCCGCGCGGGCCCTGATGGTGGTGGCCGTGCTCCTGGGCTTCGTGGCCATGGTCCTCAGCGTAG 
TTGGCATGAAGTGTACGCGGGTGGGAGACAGCAACCCCATTGCCAAGGGCCGTGTTGCCATCGCCGGGGG 
AGCCCTCTTCATCCTGGCAGGCCTCTGCACTTTGACTGCTGTCTCGTGGTATGCCACCCTGGTGACCCAG 
GAGTTCTTCAACCCAGAATTTGGCCCAGCCCTGTTCGTGGGCTGGGCCTCAGCTGGCCTGGCCGTGCTGG 
GCGGCTCCTTCCTCTGCTGCACATGCCCGGAGCCAGAGAGACCCAACAGCAGCCCACAGCCCTATCGGCC 
TGGACCCTCTGCTGCTGCCCGAGAGTACGTCTGAGCTCCGCCTGCCCTGGCCAGCCCCCCACCCA 
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15 



20 



The nucleic acid sequence, localized to chromosome 1, has 402 of 482 bases (83%) 
identical to a gb:GENBANK-ID:AF249889|acc:AF249889.1 mRNA from Mus musculus 
(claudin-19 mRNA, partial cds) (E = Lie 67 ). 

A NOV3a polypeptide (SEQ ID NO: 12) encoded by SEQ ID NO:l 1 is 203 amino acid 
residues and is presented using the one letter code in Table 3B. Signal P, Psort and/or 
Hydropathy results predict that NOV3a has no signal peptide and is likely to be localized at 
the endoplasmic reticulum (membrane) with a certainty of 0.6850. Alternatively, NOV3a may 
also localize to the plasma membrane with a certainty of 0.6400, the Golgi body with a 
certainty of 0.4600, or the endoplasmic reticulum (lumen) with a certainty of 0.1000. The most 
likely cleavage site for NOV3a is between positions 23 and 24: IIA-ST. 



Table 3B. NOV3a protein sequence (SEQ ID NO:12) 



MANSGLQLLGYFLALGGWVGIIASTALPQWKQSSYAGDAIITAVGLYEGLWMSCASQSTGQVQCKLYDSLLALD 
GR PQAARALM WAVLLGFVAM VX.S WGMKCTRVGDSNP I AKGRVA I AGGAL F I LAGLCTLTAVS WYATL VTQE F 
FNPEFGPALFVGWASAGLAVLGGSFLCCTCPEPERPNSSPQPYRPGPSAAAREYV 
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The full amino acid sequence of the protein of the invention was found to have 174 of 
193 amino acid residues (90%) identical to, and 178 of 193 amino acid residues (92%) similar 
to, the 193 amino acid residue ptnnTREMBLNE W-ACC:AAF98323 protein from Mus 
musculus (Mouse) (CLAUDIN-19) (E = 5.7e' 89 ). 
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NOV3a is predicted to be expressed in at least the Spinal cord. 
NOV3b 

A disclosed NOV3b nucleic acid of 695 nucleotides (also referred to as CG56594-01) 
encoding a novel Claudin-19-Iike protein is shown in Table 3C. An open reading frame was 
5 identified beginning with an ATG initiation codon at nucleotides 53-55 and ending with a 

TGA termination codon at nucleotides 662-664. The start and stop codons are in bold letters 
in Table 3C, and the 5' and 3' untranslated regions are underlined. 



Table 3C. NOV3b nucleotide sequence (SEQ ID NO: 13). 

GCACCCTGGCCCAGCTCTGAGTCCTGGGACCCTCGGTCCTCTCTCCTGGGCC ATGGCCAACTCAGGCCTCCA 
GCTCCTGGGCTACTTCTTGGCCCTGGGTGGCTGGGTGGGCATCATTGCTAGCACAGCCCTGCCACAGTGGAA 
GCAGTCTTCCTACGCAGGCGACGCCATCATCACTGCCGTGGGCCTCTATGAAGGGCTCTGGATGTCCTGCGC 
CTCCCAGAGCACTGGGCAAGTGCAGTGCAAGCTCTACGACTCGCTGCTCGCCCTGGACGGTAGGCCCCAGGC 
CGCGCGGGCCCTGATGGTGGTGGCCGTGCTCCTGGGCTTCGTGGCCATGGTCCTCAGCGTAGTTGGCATGAA 
GTGTACGCGGGTGGGAGACAGCAACCCCATTGCCAAGGGCCGTGTTGCCATCGCCGGGGGAGCCCTCTTCAT 
CCTGGCAGGCCTCTGCACTTTGACTGCTGTCTCGTGGTATGCCACCCTGGTGACCCAGGAGTTCTTCAACCC 
AGAATTTGGCCCAGCCCTGTTCGTGGGCTGGGCCTCAGCTGGCCTGGCCGTGCTGGGCGGCTCCTTCCTCTG 
CTGCACATGCCCGGAGCCAGAGAGACCCAACAGCAGCCCACAGCCCTATCGGCCTGGACCCTCTGCTGCTGC 
CCGAGAGTACGTCTGAGCTCCGCCTGCCCTGGCCAGCCCCCCACCCA 



10 In a search of public sequence databases, the NOV3b nucleic acid sequence, located on 

chromsome 1 has 402 of 482 bases (83%) identical to a gb:GENBANK- 

ID:AF249889|acc:AF249889.1 mRNA from Mus musculus (claudin-19 mRNA, partial cds) (E 
= l.le 67 ). 

The disclosed NOV3b polypeptide (SEQ ID NO: 14) encoded by SEQ ID NO: 13 has 
15 203 amino acid residues and is presented in Table 3D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV3b has a signal peptide and is 
likely to be localized the endoplasmic reticulum (membrane) with a certainty of 0.6850. 
Alternatively, NOV3b may also localize to the plasma membrane with acertainty of 0.6400, 
the Golgi body with a certainty of 0.4600 or in the endoplasmic reticulum (lumen) with a 
20 certainty of 0.1000. The most likely cleavage site for NOV3b is between positions 23 and 24: 
IIA-ST. 



Table 3D. Encoded NOV3b protein sequence (SEQ ID NO: 14). 

MANSGLQLLGYFLALGGWVGI IASTALPQWKQSSYAGDAI ITAVGLYEGLWMSCASQSTGQVQCKLYDSLLA 
LDGRPQAARALMWAVLLGFVAMVLSVVGMKCTRVGDSNPIAKGRVAIAGGALFILAGLCTLTAVSWYATLV 
TQEFFNPEFGPALFVGWASAGLAVL.GGSFLCCTCPEPERPNSSPQPYRPGPSAAAREYV 

A search of sequence databases reveals that the NOV3b amino acid sequence has 174 
25 of 193 amino acid residues (90%) identical to, and 178 of 193 amino acid residues (92%) 
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similar to, the 193 amino acid residue ptnr:TREMBLNEW-ACC:AAF98323 protein from Mus 
musculus (Mouse) (Claudin-19) (E = 5.7e 89 ). 

NOV3b is predicted to be expressed in at least the Spinal cord. 

NOV3c 

5 A disclosed NOV3c nucleic acid of 690 nucleotides (also referred to as CG57576-01) 

encoding a novel Claudin 19-like protein is shown in Table 3E. An open reading frame was 
identified beginning with an ATG codon at nucleotides 51-53 and ending with a TGA codon at 
nucleotides 684-686. The start and stop codons are in bold letters and the 5' and 3' 
untranslated regions are underlined in Table 31. Because the start codon is not a traditional 
10 initiation codon, NOV3c could be a partial reading frame. NOV3c could extend further in the 
5' direction. 



Table 3E. NOV3c nucleotide sequence (SEQ ID NO: 15). 



ACCCTGGCCCAGCTCTGAGTCCTQGGACCCTCGGTCCTCTCTCCTGGGCC ATGGCCAACTCAGGCCTCCAGC 
TCCTGGGCTACTTCTTGGCCCTGGGTGGCTGGGTGGGCATCATTGCTAGCACAGCCCTGCCACAGTGGAAGC 
AGTCTTCCTACGCAGGCGACGCCATCATCACTGCCGTGGGCCTCTATGAAGGGCTCTGGATGTCCTGCGCCT 
CCCAGAGCACTGGGCAAGTGCAGTGCAAGCTCTACGACTCGCTGCTCGCCCTGGACGGTCACATCCAATCAG 
CGCGGGCCCTGATGGTGGTGGCCGTGCTCCTGGGCTTCGTGGCCATGGTCCTCAGCGTAGTTGGCATGAAGT 
GTACGCGGGTGGGAGACAGCAACCCCATTGCCAAGGGCCGTGTTGCCATCGCCGGGGGAGCCCTCTTCATCC 
TGGCAGGCCTCTGCACTTTGACTGCTGTCTCGTGGTATGCCACCCTGGTGACCCAGGAGTTCTTCAACCCAA 
GCACACCTGTCAATGCCAGGTATGAATTTGGCCCAGCCCTGTTCGTGGGCTGGGCCTCAGCTGGCCTGGCCG 
TGCTGGGCGGCTCCTTCCTCTGCTGCACATGCCCGGAGCCAGAGAGACCCAACAGCAGCCCACAGCCCTATC 
GGCCTGGACCCTCTGCTGCTGCCCGAGAGTACGTCTGAGCTC 



In a search of public sequence databases, the NOV3c nucleic acid sequence, located on 
15 chromsome 1 has 445 of 671 bases (66%) identical to a gb:GENBANK- 

ID:HSA01 1497|acc:AJ01 1497.1 mRNA from Homo sapiens (mRNA for Claudin-7) (E = 5.3e" 

The disclosed NOV3c polypeptide (SEQ ID NO: 16) encoded by SEQ ID NO: 15 has 
211 amino acid residues and is presented in Table 3F using the one-letter amino acid code. 

20 Signal P, Psort and/or Hydropathy results predict that NOV3c has no signal peptide and is 
likely to be localized the the endoplasmic reticulum (membrane) with a certainty of 0.6850. 
Alternatively, NOV3c may also localize to the plasma membrane with acertainty of 0.6400, 
the Golgi body with a certainty of 0.4600 or in the endoplasmic reticulum (lumen) with a 
certainty of 0.1000. The most likely cleavage site for a NOV3c peptide is between amino acids 

25 23 and 24, at: IIA-ST. 
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Table 3F. Encoded NOV3c protein sequence (SEQ ID NO: 16). 

MANSGLQLLGYFLALGGWVGIIASTALPQWKQSSYAGDAIITAVGLYEGLWMSCASQSTGQVQCKLYDSLLA 
LDGHIQSARALMVVAVLLGFVAMVLSVVGMKCTRVGDSNPIAKGRVAIAGGALFILAGLCTLTAVSWYATLV 
TQEFFNPSTPVNARYEFGPALFVGWASAGLAVLGGSFLCCTCPEPERPNSSPQPYRPGPSAAAREYV 



A search of sequence databases reveals that the NOV3c amino acid sequence has 121 
of 21 1 amino acid residues (57%) identical to, and 159 of 21 1 amino acid residues (75%) 
similar to, the 211 amino acid residue ptnr:SWISSNEW-ACC:095832 protein from Homo 
5 sapiens (Human) (Claudin- 1 (Senescence-Associated Epithelial Membrane Protein)) (E = 
9.6e" 66 ). 

NOV3c is predicted to be expressed in at least Spinal cord. 

NOV3a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 3G. 

10 



Table 3G. BLAST results for NOV3a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 97894 76 | gb | AAF9 8 
323. l| (AF249889) 


claudin-19 [Mus 
musculus] 


193 


174/193 
(90%) 


178/193 
(92%) 


le-84 


gi | 17489134 | ref | XP 

060892.1] 

(XM_060892) 


similar to 
claudin-19 (H. 
sapiens) [Homo 
sapiens] 


309 


126/137 
(91%) 


127/137 
(91%) 


3e-59 


gi | 12 6544 55 | gb | AAHO 
1055 . 1 | AAH01055 
(BC001055) 


claudin 7 [Homo 
sapiens] 


211 


112/211 
(53%) 


149/211 
(70%) 


2e-55 


gi | 10835008 | ref | NP 
0012 98. l| 
<NM_001307) 


claudin 7 ; 
Clostridium 
perf ringens 
enterotoxin 
receptor- like 2; 
claudin 9 [Homo 
sapiens] 


211 


111/211 
(52%) 


148/211 
(69%) 


7e-55 


gi|7710002|ref |NP 0 
57883. l| 
(NM 016674) 


claudin 1 [Mus 
musculus] 


211 


112/212 
(52%) 


149/212 
(69%) 


8e-55 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 3H. 



Table 3H ClustalW Analysis of NOV3 

1) NOV3a (SEQ ID NO: 12) 

2) NOV3b (SEQ ID NO : 14 ) 

3) NOV3 C (SEQ ID NO: 16) 

4) gi|9789476|gb|AAF98323.l| (AF249889) claudin-19 [Mus musculus] (SEQ ID NO: 33 3) 

5) gi | 17489134 |ref | XP_060892 . 1 | (XM_060892) similar to claudin-19 (H . sapiens) 
[Homo sapiens] (SEQ ID NO: 33 4) 

6) gi| 12654455 |gb | AAH01055.1 | AAH01055 (BC001055) claudin 7 {Homo sapiens] (SEQ ID 
NO:335) 

7) gi| 10835008 | ref |NP_001298.1 | (NM_0013 07 ) claudin 7; Clostridium perfringens 
enterotoxin receptor- like 2; claudin 9 [Homo sapiens] (SEQ ID NO:336) 

38 



15 

20 



8) gi | 7710002 | ref |NP_057883 . 1 | (NM_016674) claudin 1 [Mus musculus] (SEQ ID NO:327) 
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NOV3A 
NOV3b 
NOV3c 

gi | 9789476 | 
gi | 17489134 | 
gi | 12654455 | 
gi | 10835008 | 
gi|7710002| 
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NOV3A 
NOV3b 
NOV3c 

gi | 9789476 | 
gi | 17489134 | 
gi j 12654455 j 
gi | 10835008 | 
gi |7710002 | 



NOV3A 
NOV3b 
NOV3c 

gi | 9789476 | 
gi j 17489134 | 
gi | 12654455 | 
gi 1 10835008 j 
gi j 7710002 | 



NOV3A 
NOV3b 
NOV3c 



gi 
gi 
gi 
gi 



9789476 | 
17489134 | 
12654455 j 
10835008 j 
7710002 | 



NOV3A 
NOV3b 
NOV3C 

gi 
gi 
gi 
gi 
gi 



9789476 | 
17489134 | 
12654455 | 
10835008 | 
7710002 | 



NOV3A 
NOV3b 
NOV3c 

gi | 9789476 | 
gi j 17489134 | 
gi j 12654455 | 
gi | 10835008 j 
gi |7710002 | 
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Table 31 lists the domain description from DOMAIN analysis results against NOV3. 
This indicates that the NOV3 sequence has properties similar to those of other proteins known 
to contain this domain. 



Table 31 Domain Analysis of NOV3 

gnl | Pfam|pfam00822, PMP2 2_Claudin / PMP-22/EMP/MP20/Claudin family 
(SEQ ID NO: 802) 

CD-Length = 162 residues, 99.4% aligned 
Score = 80.5 bits (197), Expect = 9e-l7 



NOV 3 : 5 GLQLLGYFLALGGWVG- I IASTALPQWKQSSYAGDAI ITAVGLYEGLWMSCASQS-TGQV 62 

+ Ml- + + II + +1 III I I I I Ml +| +|| |||+ 

Sbjct: 2 LVLLLGFIVSHIAWVILLFVATITDQWKVSRYVGAAA S AGL WRNCTTQS CTGQ I 55 

10 °° ■ 

NOV 3 : 63 QCKLYDSLLALDGRPQAAi^Z,MWAV£^ 12 2 

11+ 11+ I I +1 I I+ + + + +I | + + + + + + || +| 
Sbjct: 56 SCKV LELNDALQAVQALMILSI ILGIISLIVFFFQLFTMRKGGRFKLA 103 

15 NOV 3 : 12 3 GAiFJLAGLCTLTAVSWYATLVTQEFFNP EFGPALFVGWASAGLAVLGGSFL 174 

I +I+++I III I I + + +1 I I | | + +| | + | | +| | 

Sbjct: 104 G 1 1 FL VS GL C VL VGAS I Y T S R I ATD FGN P FT PNRK Y S FG Y S F I LG WVAF AL AF I GG VL Y 162 

The claudins are a family of integral membrane proteins that are major components of 

20 tight junction (TJ) strands. When claudins are introduced into cells that lack tight junctions, 
networks of strands and grooves form at cell-cell contact sites that closely resemble native 
tight junctions. There are at least 17 members of this family in mammals. Claudin family 
members share -38% amino acid identity, and are predicted to have four transmembrane (TM) 
domains, which is reminiscent of occludin, although they share no sequence similarity with it. 

25 Multiple sequence alignment reveals their sequences to be fairly well conserved in the first 
and fourth putative TM domains, and in the first and second extracellular loops, but they 
diverge in the second and third TM domains. Although the sequences of their C-terminal 
cytoplasmic domains vary, the known family members share a common motif of -Y-V. This 
has been postulated as a possible binding motif for PDZ domains of other tight junction - 

30 associated peripheral membrane proteins, such as ZO-1 . 

The disclosed NOV3 nucleic acid of the invention encoding a Claudin- 19 -like protein 
includes the nucleic acid whose sequence is provided in Table 3A or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 3A while still encoding a protein that maintains 

35 its Claudin- 19 -like activities and physiological functions, or a fragment of such a nucleic acid. 
The invention further includes nucleic acids whose sequences are complementary to those just 
described, including nucleic acid fragments that are complementary to any of the nucleic acids 
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just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
5 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 17 percent of the 
bases may be so changed. 

The disclosed NOV3 protein of the invention includes the Claudin-19 -like protein 

10 whose sequence is provided in Table 3B. The invention also includes a mutant or variant 

protein any of whose residues may be changed from the corresponding residue shown in Table 
3B while still encoding a protein that maintains its Claudin-19 -like activities and 
physiological functions, or a functional fragment thereof. In the mutant or variant protein, up 
to about 48 percent of the residues may be so changed. 

15 The protein similarity information, expression pattern, and map location for the 

Claudin-19-like protein and nucleic acid (NOV3) disclosed herein suggest that this NOV3 
protein may have important structural and/or physiological functions characteristic of the 
Claudin-19 family. Therefore, the NOV3 nucleic acids and proteins of the invention are 
useful in potential diagnostic and therapeutic applications. These include serving as a specific 

20 or selective nucleic acid or protein diagnostic and/or prognostic marker, wherein the presence 
or amount of the nucleic acid or the protein are to be assessed, as well as potential therapeutic 
applications such as the following: (i) a protein therapeutic, (ii) a small molecule drug target, 
(iii) an antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) a 
nucleic acid useful in gene therapy (gene delivery/gene ablation), and (v) a composition 

25 promoting tissue regeneration in vitro and in vivo. 

The NOV3 nucleic acids and proteins of the invention are useful in potential diagnostic 
and therapeutic applications implicated in various diseases and disorders described below. For 
example, the compositions of the present invention will have efficacy for treatment of patients 
suffering from Von Hippel-Lindau (VHL) syndrome, Cirrhosis, Transplantation, Hemophilia, 

30 hypercoagulation, Idiopathic thrombocytopenic purpura, autoimmume disease, allergies, 

immunodeficiencies, transplantation, Graft vesus host, Alzheimer's disease, Stroke, Tuberous 
sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, Cerebral palsy, Epilepsy, 
Lesch-Nyhan syndrome, Multiple sclerosis, Ataxia-telangiectasia, Leukodystrophies, 
Behavioral disorders, Addiction, Anxiety, Pain, Neuroprotection, Systemic lupus 




erythematosus , Autoimmune disease, Asthma, Emphysema, Scleroderma, allergy, and 
Cancer, and/or other pathologies. The NOV3 nucleic acids, or fragments thereof, may further 
be useful in diagnostic applications, wherein the presence or amount of the nucleic acid or the 
protein are to be assessed. 
5 NOV3 nucleic acids and polypeptides are further useful in the generation of antibodies 

that bind immunospecifically to the novel substances of the invention for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 
art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" 
section below. These novel proteins can be used in assay systems for functional analysis of 
10 various human disorders, which will help in understanding of pathology of the disease and 
development of new drug targets for various disorders. 

NOV4 

NOV4 includes three novel human 1 Claudin-like proteins disclosed below. The 
15 disclosed sequences have been named NOV4a, NOV4b, and NOV4c. 
NOV4a 

A disclosed NOV4a nucleic acid of 694 nucleotides (also referred to as CG56589-01) 
encoding a novel Claudin-6-like protein is shown in Table 4A. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 11-13 and ending with a 
20 TAA codon at nucleotides 671-673. Putative untranslated regions upstream from the initiation 
codon and downstream from the termination codon are underlined in Table 4A, and the start 
and stop codons are in bold letters. 



Table 4A. NOV4a Nucleotide Sequence (SEQ ID NO:17) 

ACCTGTCGCAA TGGCTTTAATCTTTAGAACAGCAATGCAATCTGTTGGACTTTTACTATCTC 
TCCTGGGATGGATTTTATCCATTATTACAACTTATTTGCCACACTGGAAGAACCTCAACCTG 
GACTT AAATG AAATGG AAAACTGG AC C ATGGG ACT CTGGC AAAC C TGTGT C AT CC AAG AGGA 
AGTGGGGATGCAATGCAAGGACTTTGACTCCTTCCTGGCTTTGCCTGCTGAACTCAGGGTCT 
CCAGGATCTTAATGTTTCTGTCAAATGGGCTGGGATTTCTGGGCCTGCTGGTCTCTGGGTTT 
GGCCTGGACTGTTTGAGAATTGGAGAGAGTCAGAGAGATCTCAAGAGGCGACTGCTCATTCT 
GGGAGGAATTCTGTCCTGGGCCTCGGGAATCACAGCCCTGGTTCCCGTCTCTTGGGTTGCCC 
ACAAGACGGTTCAGGAGTTCTGGGATGAGAACGTCCCAGACTTTGTCCCCAGGTGGGAGTTT 
GGGGAGGCCCTGTTTCTGGGCTGGTTTGCTGGACTTTCTCTTCTGCTAGGAGGGTGTCTGCT 
C AACTGCG C AG C C TGC T C C AG CC ACG CTC C CC T AGCT TTGGG C C ACT ATGC AGTGG CG CAAA 
TGCAAACTCAGTGTCCCTACCTGGAAGATGGGACAGCAGATCCTCAAGTGTAA GACTCCGAC 
AAGGCCAGAGAT 
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The NOV4a nucleic acid was identified on chromosome 4 and has 330 of 556 bases 
(59%) identical to a gb:GENBANK-ID:AF134160|acc:AF134160.1 mRNA from Homo 
sapiens (claudin-1 (CLDN1) mRNA, complete cds) (E - 2.9e' 9 ). 

A disclosed NOV4a polypeptide (SEQ ID NO: 1 8) encoded by SEQ ID NO: 1 7 is 220 
5 amino acid residues and is presented using the one-letter code in Table 4B. Signal P, Psort 
and/or Hydropathy results predict that NOV4a has no signal peptide and is likely to be 
localized in the plasma membrane with a certainty of 0.6400. Alternatively, NOV4a may also 
localize to the Golgi body with acertainty of 0.4600, the endoplasmic reticulum (membrane) 
with a certainty of 0.3700, or the enoplasmic reticulum (lumen) with a certainty of 0.1000. The 
10 most likely cleavage site for NOV4a is between positions 24 and 25: ILS-II. 



Table 4B. Encoded NOV4a protein sequence (SEQ ID NO: 18) 

MALIFRTAMQSVGLLLSLLGWILSI ITTYLPHWK 

LPAELRVSRILMFLSNGLGFLGLLVSGFGLDCLRIGESQRDLKRRLLILGGILSWASGITALVPVSWVAHKT 
VQEFWDENVPDFVPRWEFGEALFLGWFAGLSLLLGGCLLNCAACSSHAPLALGHYAVAQMQTQCPYLEDGTA 
DPQV 

The disclosed NOV4a amino acid sequence has 84 of 204 amino acid residues (41%) 
identical to, and 1 19 of 204 amino acid residues (58%) similar to, the 219 amino acid residue 
1 5 ptnr:S WISSPROT-ACC:Q9Z262 protein from Mus musculus (Mouse) (Claudin-6) (E = 1 . 1 e 

32 ). 

NOV4a is predicted to be expressed in at least Brain. This information was derived by 
determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or RACE 
20 sources. 

In addition, the sequence is predicted to be expressed in Adrenal Gland/Suprarenal 
gland, Brain, Bronchus, Brown adipose, Cervix, Colon, Coronary Artery, Epidermis, Gall 
Bladder, Heart, Hippocampus, Islets of Langerhans, Kidney, Liver, Lung, Lung Pleura, 
Mammary gland/Breast, Oesophagus, Ovary, Oviduct/Uterine Tube/Fallopian tube, Parotid 
25 Salivary glands, Peripheral Blood, Placenta, Prostate, Proximal Convoluted Tubule, 
Respiratory Bronchiole, Skin, Stomach, Substantia Nigra, Thymus, Thyroid, Trachea, 
Umbilical Vein, Uterus, and Vulva. 

NOV4b 

A disclosed NOV4b nucleic acid of 694 nucleotides (also referred to as CG56589-01) 
30 encoding a novel Claudin-6-like protein is shown in Table 4C. An open reading frame was 

identified beginning with an ATG codon at nucleotides 11-13 and ending with a TAA codon at 
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nucleotides 671-673. The start and stop codons are in bold letters and the 5' and 3' 
untranslated regions are underlined in Table 4C. Because the start codon is not a traditional 
initiation codon, NOV4b could be a partial reading frame. NOV4b could extend further in the 
5' direction. 



Table 4C. NOV4b nucleotide sequence (SEQ ID NO:19). 

ACCTGTCGCAA TGGCTTTAATCTTTAGAACAGCAATGCAATCTGTTGGACTTTTACTATCTCTCCTGGGATG 
GATTTTATCCATTATTACAACTTATTTGCCACACTGGAAGAACCTCAACCTGGACTTAAATGAAATGGAAAA 
CTGGACCATGGGACTCTGGCAAACCTGTGTCATCCAAGAGGAAGTGGGGATGCAATGCAAGGACTTTGACTC 
CTTCCTGGCTTTGCCTGCTGAACTCAGGGTCTCCAGGATCTTAATGTTTCTGTCAAATGGGCTGGGATTTCT 
GGGCCTGCTGGTCTCTGGGTTTGGCCTGGACTGTTTGAGAATTGGAGAGAGTCAGAGAGATCTCAAGAGGCG 
ACTGCTCATTCTGGGAGGAATTCTGTCCTGGGCCTCGGGAATCACAGCCCTGGTTCCCGTCTCTTGGGTTGC 
CCACAAGACGGTTCAGGAGTTCTGGGATGAGAACGTCCCAGACTTTGTCCCCAGGTGGGAGTTTGGGGAGGC 
CCTGTTTCTGGGCTGGTTTGCTGGACTTTCTCTTCTGCTAGGAGGGTGTCTGCTCAACTGCGCAGCCTGCTC 
CAGCCACGCTCCCCTAGCTTTGGGCCACTATGCAGTGGCGCAAATGCAAACTCAGTGTCCCTACCTGGAAGA 
TGGGAC AGCAGAT CC T CAAGTGT AAGAC T C CG AC AAGGCC AGAGAT 



In a search of public sequence databases, the NOV4b nucleic acid sequence, located on 
chromsome 4 has 330 of 556 bases (59%) identical to a gb : GENB ANK- 
K):AF134160|acc:AF134160.1 mRNA from Homo sapiens (claudin-1 (CLDNl)mRNA, 
complete cds) (E = 2.9e" 09 ). 

The disclosed NOV4b polypeptide (SEQ ID NO:20) encoded by SEQ ID NO: 19 has 
220 amino acid residues and is presented in Table 4D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV4b has no signal peptide and is 
likely to be localized the the plasma membrane with a certainty of 0.6400. Alternatively, 
NOV4b may also localize to the Golgi body with acertainty of 0.4600, the endoplasmic 
reticulum (membrane) with a certainty of 0.3700 or in the endoplasmic reticulum (lumen) with 
a certainty of 0.1000. The most likely cleavage site for a NOV4b peptide is between amino 
acids 24 and 25, at: ILS-II. 



Table 4D. Encoded NOV4b protein sequence (SEQ ID NO:20). 

MALIFRTAMQSVGLLLSLLGWILSIITTYLPHWKNLNLDLNEMENWTMGLWQTCVIQEEVGMQCKDFDSFLA 
LPAELRVSRILMFLSNGLGFLGLLVSGFGLDCLRIGESQRDLKRRLLILGGILSWASGITALVPVSWVAHKT 
VQEFWDENVPDFVPRWEFGEALFLGWFAGLSLLLGGCLLNCAACSSHAPLALGHYAVAQMQTQCPYLEDGTA 
DPQV 

A search of sequence databases reveals that the NOV4b amino acid sequence has 84 of 

204 amino acid residues (41%) identical to, and 1 19 of 204 amino acid residues (58%) similar 

to, the 219 amino acid residue ptnr:SWISSPROT-ACC:Q9Z262 protein from Mus musculus 

(Mouse) (Claudin-6) (E = 1 . 1 e 32 ). 

NOV4b is predicted to be expressed in at least Brain. 
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In addition, NOV4b is predicted to be expressed in Adrenal Gland/Suprarenal gland, 
Brain, Bronchus, Brown adipose, Cervix, Colon, Coronary Artery, Epidermis, Gall Bladder, 
Heart, Hippocampus, Islets of Langerhans, Kidney, Liver, Lung, Lung Pleura, Mammary 
gland/Breast, Oesophagus, Ovary, Oviduct/Uterine Tube/Fallopian tube, Parotid Salivary 
glands, Peripheral Blood, Placenta, Prostate, Proximal Convoluted Tubule, Respiratory 
Bronchiole, Skin, Stomach, Substantia Nigra, Thymus, Thyroid, Trachea, Umbilical Vein, 
Uterus, and Vulva. 

NOV4c 

A disclosed NOV4c nucleic acid of 694 nucleotides (also referred to as CG565 89-02) 
encoding a novel Claudin 6-like protein is shown in Table 4E. An open reading frame was 
identified beginning with an ATG codon at nucleotides 11-13 and ending with a TAA codon at 
nucleotides 671-673. The start and stop codons are in bold letters and the 5' and 3' 
untranslated regions are underlined in Table 4E. 



Table 4E. NOV4c nucleotide sequence (SEQ ID NO:21). 

~ACCTGTCGCAATGGCTTTAATCTTTAAAACAGCAATGCAATCTGTTGGACTTTTGCTATCTTTCCTGGGATG 
GATTTTATCCATTATTACAACTTATTTGCCACACTGGAAGAACCTCAACCTGGACTTAAATGAAATGGAAAA 
CTGGACCATGGGACTCTGGCAAACCTGTGTCATCCAAGAGGAAGTGGGGATGCAATGCAAGGACTTTGACTC 
CTTCCTGGCTTTGCCTGCTGAACTCAGGGTCTCCAGGATCTTAATGTTTCTGTCAAATGGGCTGGGATTTCT 
GGGCCTGCTGGTCTCTGGGTTTGGCCTGGACTGTTTGAGAATTGGAGAGAGTCAGAGAGATCTCAAGAGGCG 
ACTGCTCATTCTGGGAGGAATTCTGTCCTGGGCCTCGGGAATCACGGCCCTGGTTCCCGTCTCTTGGGTTGC 
CCACAAGACGGTTCAGGAGTTCTGGGATGAGAACGTCCCAGACTTTGTCCCCAGGTGGGAGTTTGGGGAGGC 
CCTGTTTCTGGGCTGGCTTGCTGGACTTTCTCTTCTGCTAGGAGGGTGTCTGCTCAACTGCGCAGCCTGCTC 
CAGCCACGCTCCCCTAGCTTTGGGCCACTATGCAGTGGCGCAAATGCAAACTCACTGTCCCTACCTGGAAGA 
TGGGACAG C AGATC CTCAAGTGT A AGACT C CGAC AAGGC C AGAGAT 

In a search of public sequence databases, the NOV4c nucleic acid sequence, located on 
chromsome 4 has 331 of 556 bases (59%) identical to a gb : GENB ANK- 
ID:AF134160|acc:AF134160.1 mRNA from Homo sapiens (claudin-1 (CLDN1) mRNA, 
complete cds) (E = 3.2e~ 9 ). 

The disclosed NOV4c polypeptide (SEQ ID NO:22) encoded by SEQ ID NO:21 has 
220 amino acid residues and is presented in Table 4F using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV4c has no signal peptide and is 
likely to be localized the the plasma membrane with a certainty of 0.6400. Alternatively, 
NOV4c may also localize to the Golgi body with acertainty of 0.4600, the endoplasmic 
reticulum (membrane) with a certainty of 0.3700 or in the endoplasmic reticulum (lumen) with 
a certainty of 0.1000. The most likely cleavage site for a NOV4c peptide is between amino 
acids 24 and 25, at: ILS-II. 
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Table 4F. Encoded NOV4c protein sequence (SEQ ID NO:22). 

MAL I FKTAMQS VGLLLS FLGWI LS 1 1 TTYLPHWKNLNLDLNEMENWTMGLWQTCV I QEE VGMQCKDFDS FLA 
LPAELRVSRILMFLSNGLGFLGLLVSGFGLDCLRIGESQRDLKRRLLILGGILSWASGITALVPVSWVAHKT 
VQEFWDENVPDFVPRWEFGEALFLGWLAGLSLLLGGCLLNCAACSSHAPLALGHYAVAQMQTHCPYLEDGTA 
DPQV 



A search of sequence databases reveals that the NOV4c amino acid sequence has 83 of 
204 amino acid residues (40%) identical to, and 118 of 204 amino acid residues (57%) similar 
to, the 219 amino acid residue ptnr:SWISSPROT-ACC:Q9Z262 protein from Mus musculus 
5 (Mouse) (Claudin-6) (E - 9.6e" 66 ). 

The sequence is predicted to be expressed in the following tissues : Adrenal 
Gland/Suprarenal gland, Brain, Bronchus, Brown adipose, Cervix, Colon, Coronary Artery, 
Epidermis, Gall Bladder, Heart, Hippocampus, Islets of Langerhans, Kidney, Liver, Lung, 
Lung Pleura, Mammary gland/Breast, Oesophagus, Ovary, Oviduct/Uterine Tube/Fallopian 
10 tube, Parotid Salivary glands, Peripheral Blood, Placenta, Prostate, Proximal Convoluted 

Tubule, Respiratory Bronchiole, Skin, Stomach, Substantia Nigra, Thymus, Thyroid, Trachea, 
Umbilical Vein, Uterus, and Vulva. 

NOV4 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 4G. 
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Table 4G. BLAST results for NOV4 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 17437504 | ref |XP_ 
068030 . 1 | 
(XM_068030) 


similar to 
putative (H . 
sapiens) [Homo 
sapiens] 


220 


220/220 
(100%) 


220/220 
(100%) 


e-105 


gi | 17437506 | ref | XP_ 
068031 . 1 | 
<XM_068031) 


similar to 
putative (H. 
sapiens) [Homo 
sapiens] 


220 


192/212 
(90%) 


198/212 
(92%) 


9e-96 


gi | 12 843248 | db j | BAB 
25914. l| (AK008821) 


PMP- 

2 2/EMP/MP2 0/Claud 
in family- 
containing 
protein-data 
source : Pf am, 
source 
key :PF00822 , 
evidence : ISS~puta 
tive [Mus 
musculus] 


220 


158/220 
(71%) 


182/220 
(81%) 


3e-70 


gi | 174 5 8 94 7 | ref |XP_ 
061964 . 1 | 
(XM_061964) 


similar to 
putative (H. 
sapiens) [Homo 
sapiens] 


229 


108/188 
(57%) 


137/188 
(72%) 


2e-45 


gi | 7710002 | ref |NP_0 
57883 . 1 | 
(NM 016674) 


claudin 1 [Mus 
musculus] 


211 


72/181 
(39%) 


105/181 
(57%) 


le-27 
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The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 4H. 

Table 4H Clustal W Sequence Alignment 
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1) NOV4a (SEQ ID NO: 18) 

2) NOV4b (SEQ ID NO: 20) 

3) NOV4C (SEQ ID NO: 22) 

4) gi | 17437504 | ref |XP_068030 . 1 | (XM_068030) similar to putative (H. sapiens) [Homo 
sapiens] (SEQ ID NO: 325) 

5) gi | 17437506 | ref | XP_068031 . 1 | (XM_068031) similar to putative (H. sapiens) [Homo 
Sapiens] (SEQ ID NO: 324) 

6) gi | 12843248 | dbj | BAB25914 . 1 | (AK008821) PMP- 22/EMP/MP2 0 /Claudin family containing 
protein-data source :Pf am, source key:PF00822, evidence : ISS-putative [Mus musculus] 

(SEQ ID NO;326) 

7) gi | 17458947 | ref |XP_061964 . 1 | (XM_061964) similar to putative (H. sapiens) [Homo 
sapiens] (SEQ ID NO: 323) 

8) gi | 7710002 | ref |NP_057883 . 1 | (NM_016674) claudin 1 [Mus musculus] (SEQ ID NO: 32 7) 



NOV4a 
NOV4b 



NOV4c 

gi | 17437504 | 
gi j 17437506 j 
gi | 12843248 | 
gi j 17458947 j 
gi |7710002 | 



NOV4a 
NOV4b 
NOV4c 

gi 1 17437504 | 
gi | 17437506 | 
gi 1 12843248 j 
gi j 17458947 j 
gi |7710002 | 



NOV4a 
NOV4b 
NOV4c 

gi 1 17437504 | 
gi j 17437506 j 
gi 1 12843248 j 
gi j 17458947 j 
gi |7710002 | 



NOV4a 
NOV4b 
NOV4c 




150 



180 



gi 
gi 
gi 
gi 
gi 



17437504 | 
17437506 | 
12843248 | 
17458947 j 
7710002 I 
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Table 41 lists the domain description from DOMAIN analysis results against NOV4. 
This indicates that the NOV4 sequence has properties similar to those of other proteins known 
to contain this domain. 



Table 41 Domain Analysis of NOV4 

gnl | Pfam|pf am00822 , PMP22_Claudin / PMP-22/EMP/MP20/Claudin family. 
(SEQ ID NO: 802) 

CD-Length - 162 residues, 67.3% aligned 
Score = 35.0 bits (79), Expect = 0.004 



NOV 4: 49 GI> WQT CVI QE E VGM - Q CKD FD S FL AL P AE LRVS R I LMFLSNGLGFLGLLVSGFGLDCIjR I 107 

Ml + II I II I 1+ + I I I I I I + 1 + 1 II +| 

Sbjct: 41 GLWRNCTTQSCTGQISCKVL ELNDALQAVQALM I L S 1 1 LG 1 1 S L I VFFFQL FTMRK 96 

NOV 4 : 10 8 GESQRDLKJ?J?LLJI,GGJLSWASGITALVPVSWVAHKTVQEFWDENVPDFVPRWEFGEALF 167 

I I 11+ 11+ III + +| | ++ || + 

Sbjct : 97 GGR FKLAG 1 1 FL VSGL C VL VGAS I Y T SRI ATD F - - GN PFT PNRK Y S FG YS F I 146 

NOV 4: 168 LGW 170 
III 

Sbjct: 14 7 LGW 14 9 

The claudins are a family of integral membrane proteins that are major components of 
tight junction (TJ) strands. When claudins are introduced into cells that lack tight junctions, 
networks of strands and grooves form at cell-cell contact sites that closely resemble native 
tight junctions. There are at least 17 members of this family in mammals. Claudin family 
members share -38% amino acid identity, and are predicted to have four transmembrane (TM) 
domains, which is reminiscent of occludin, although they share no sequence similarity with it. 
Multiple sequence alignment reveals their sequences to be fairly well conserved in the first 
and fourth putative TM domains, and in the first and second extracellular loops, but they 
diverge in the second and third TM domains. Although the sequences of their C-terminai 
cytoplasmic domains vary, the known family members share a common motif of -Y-V. This 
has been postulated as a possible binding motif for PDZ domains of other tight junction- 
associated peripheral membrane proteins, such as ZO-1 . 

The disclosed NOV4 nucleic acid of the invention encoding a Claudin-6 -like protein 
includes the nucleic acid whose sequence is provided in Table 4A or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 4A while still encoding a protein that maintains 
its Claudin-6 -like activities and physiological functions, or a fragment of such a nucleic acid. 
The invention further includes nucleic acids whose sequences are complementary to those just 
described, including nucleic acid fragments that are complementary to any of the nucleic acids 
just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 

48 



*l G If!i "? l .5: 



complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
5 used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 41 percent of the 
bases may be so changed. 

The disclosed NOV4 protein of the invention includes the Claudin-6 -like protein 
whose sequence is provided in Table 4B. The invention also includes a mutant or variant 

10 protein any of whose residues may be changed from the corresponding residue shown in Table 
4B while still encoding a protein that maintains its Claudin-6-like activities and physiological 
functions, or a functional fragment thereof. In the mutant or variant protein, up to about 61 
percent of the residues may be so changed. 

The NOV4 nucleic acids and proteins of the invention are useful in potential 

15 therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome, Cirrhosis, 
Transplantation, Hemophilia, hypercoagulation, Idiopathic thrombocytopenic purpura, 
autoimmume disease, allergies, immunodeficiencies, transplantation, Graft vesus host, 
Alzheimer's disease, Stroke, Tuberous sclerosis, hypercalceimia, Parkinson's disease, 
Huntington's disease, Cerebral palsy, Epilepsy, Lesch-Nyhan syndrome, Multiple sclerosis, 

20 Ataxia-telangiectasia, Leukodystrophies, Behavioral disorders, Addiction, Anxiety, Pain, 

Neuroprotection, Systemic lupus erythematosus , Autoimmune disease, Asthma, Emphysema, 
Scleroderma, allergy, and Cancer, and/or other pathologies and disorders of the like. The 
NOV4 nucleic acid, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. 

25 NOV4 nucleic acids and polypeptides are further useful in the generation of antibodies 

that bind immunospecifically to the novel substances of the invention for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 
art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" 
section below. For example the disclosed NOV4 protein have multiple hydrophilic regions, 

30 each of which can be used as an immunogen. This novel protein also has value in 

development of powerful assay system for functional analysis of various human disorders, 
which will help in understanding of pathology of the disease and development of new drug 
targets for various disorders. 
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NOV5 

NOV5 includes three novel Monocarboxylate transporter (MCT3)-like proteins 
disclosed below. The disclosed sequences have been named NOV5a, NOV5b, NOV5c, 
NOV5d, and NOV5e. 
5 NOVSa 

A disclosed NOV5a nucleic acid of 1502 nucleotides (also referred to as CG56635-01) 
encoding a novel Monocarboxylate transporter (MCT3)-like protein is shown in Table 5a. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 24- 
26 and ending with a TGA codon at nucleotides 1365-1367. The start and stop codons are in 
1 0 bold letters in Table 5 A. 



Table 5A. NOVSa Nucleotide Sequence (SEQ ID NO:23) 

GTTTCCCCACCCCGCAGACGGCGATGACCCCCCAGCCCGCCGGACCCCCGGATGGGGGCTGGGGCTGGGT 
GGTGGCGGCCGCAGCCTTCGCGATAAACGGGCTGTCCTACGGGCTGCTGCGCTCGCTGGGCCTTGCCTTC 
CCTGACCTTGCCGAGCACTTTGACCGAAGCGCCCAGGACACTGCGTGGATCAGCGCCCTGGCCTTGGCCG 
TGCAGCAGGCAGCCAGCCCCGTGGGCAGCGCCCTGAGCACGCGCTGGGGGGCCCGCCCCGTGGTGATGGT 
TGGGGGCGTCCTCGCCTCGCTGGGCTTCGTCTTCTCGGCTTTCGCCAGCGATCTGCTGCATCTCTACCTC 
GGCCTGGGCCTCCTCGCTGGCTTTGGTTGGGCCCTGGTGTTCGCCCCCGCCCTAGGCACCCTCTCGCGTT 
ACTTCTCCCGCCGTCGAGTCTTGGCGGTGGGGCTGGCGCTCACCGGCAACGGGGCCTCCTCGCTGCTCCT 
GGCGCCCGCCTTGCAGCTTCTTCTCGATACTTTCGGCTGGCGGGGCGCTCTGCTCCTCCTCGGCGCGATC 
ACCCTCCACCTCACCCCCTGTGGCGCCCTGCTGCTACCCCTGGTCCTTCCTGGAGACCCCCCAGCCCCAC 
CGCGTAGTCCCCTAGCTGCCCTCGGCCAGAGTCTGTTCACACGCCGGGCCTTCTCAATCTTTGCTCTAGG 
CACAGCCCTGGTTGGGGGCGGGTACTTCGTTCCTTACGTGCACTTGGCTCCCCACGCTTTAGACCGGGGC 
CTGGGGGGATACGGAGCAGCGCTGGTGGTGGCCGTGGCTGCGATGGGGGATGCGGGCGCCCGGCTGGTCT 
GCGGGTGGCTGGCAGACCAAGGCTGGGTGCCCCTCCCGCGGCTCCTGGCCGTATTCGGGGCTCTGACTGG 
GCTGGGGCTGTGGGTGGTGGGGCTGGTGCCCGTGGTGGGCGGCGAAGAGAGCTGGGGGGGTCCCCTGCTG 
GCCGCGGCTGTGGCCTATGGGCTGAGCGCGGGGAGTTACGCCCCGCTGGTTTTCGGTGTACTCCCCGGGC 
TGGTGGGCGTCGGAGGTGTGGTGCAGGCCACAGGGCTGGTGATGATGCTGATGAGCCTCGGGGGGCTCCT 
GGGCCCTCCCCTGTCAGGCTTCCTAAGGGATGAGACAGGAGACTTCACCGCCTCTTTCCTCCTGTCTGGT 
TCTTTGATCCTCTCCGGCAGCTTCATCTACATAGGGTTGCCCAGGGCGCTGCCCTCCTGTGGTCCAGCCT 
CCCCTCCAGCCACGCCTCCCCCAGAGACGGGGGAGCTGCTTCCCGCTCCCCAGGCAGTCTTGCTGTCCCC 
AGGAGGCCCTGGCTCCACTCTGGACACCACTTGTTGA TTATTTTCTTGTTTGAGCCCCTCCCCCAATAAA 
GAATT TT T AT CGGGTTT T C CTG AAAC CTC C AACTGTT C AC C AAT C T AGG AC C C TG AAAAT AT T CT AC AT A 
AG AC AGC C AG AAAGG CTGGTT C AAAGG AAC AG 



The disclosed NOV5a nucleic acid sequence , located on chromosome 17, has 672 of 
1 110 bases (60%) identical to a gb:GENBANK-ID:AF132610|acc:AF132610.1 mRNA from 
15 Homo sapiens (monocarboxylate transporter MCT3 mRNA, complete cds) (E = 1 .6e~ 29 ). 

A disclosed NOV5a polypeptide (SEQ ID NO:24) encoded by SEQ ID NO:23 is 447 
amino acid residues and is presented using the one-letter amino acid code in Table 5B. Signal 
P, Psort and/or Hydropathy results predict that NOV5a contains no signal peptide and is likely 
to be localized in the endoplasmic reticulum (membrane) with a certainty of 0.6850. 
20 Alternatively, NOV5a is also likely to be localized to the plasma membrane with a certainty of 
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0.6400, to the Golgi body with a certainty of 0.4600, or to the endoplasmic reticulum (lumen) 
with a certainty of 0. 1 000 



Table 5B. Encoded NOV5a protein sequence (SEQ ID NO:24). 

MTPQPAGPPDGGWGWWAAAAFAINGLSYGLLRSLGLAFPDLAEHFDRSAQDTAWISALALAVQQAASPVGSALS 
TRWGARP WM VGGVLAS L.GFVFS AFAS DLLHL YLGLGLLAGFGWALVFAPALGTL S RYFS RRRVLAVGLALTGNG 
ASSLLLAPALQLLLDTFGWRGALLLLGAITLHLTPCGALLLPLVLPGDPPAPPRSPLAALGQSLFTRRAFSIFAL 
GTALVGGGYFVPYVHLAPHALDRGLGGYGAALWAVAAMGDAGARLVCGWLADQGWVPLPRLLAVFGALTGLGLW 
WGLVPWGGEESWGGPLIJU^VAYGLSAGSYAPLVFGVLPGIiVGVGGWQATGLVMMLMSLGGLLGPPLSGFLR 
DETGDFTASFLLSGSLILSGSFI YIGLPRALPSCGPASPPATPPPETGELLPAPQAVLIjSPGGPGSTLDTTC 

5 The disclosed NOV5a amino acid sequence has 96 of 198 amino acid residues (48%) 

identical to, and 122 of 198 amino acid residues (61%) similar to, the 504 amino acid residue 
ptnr:SPTREMBL-ACC: 095907 protein from Homo sapiens (Human) (DJ1039K5.2 (Similar 
To Monocarboxylate Transporter (MCT3))) (E = 1 .2e~ 67 ). 

NOV5a is predicted to be expressed in at least Adrenal gland, bone marrow, brain - 
10 amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, retina, salivary gland, skeletal 
muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, uterus. 

NOV5b 

15 A disclosed NOV5b nucleic acid of 61 1 nucleotides (also referred to as CG56635-02) 

encoding a novel Monocarboxylate transporter 3-like protein is shown in Table 5C. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 6-8 and 
ending with a TGA codon at nucleotides 500-502. The start and stop codons are in bold letters 
in Table 5B. 



Table 5C. NOV5b Nucleotide Sequence (SEQ ID NO:25) 

ACGGCGATGACCCCCCAGCCCGCCGGACCCCCGGATGGGGGCTGGGGCTGGGTGGTGGCGGCCGCAGCCT 
TCGCGATAAACGGGCTGTCCTACGGGCTGCTGCGCTCGCTGGGCCTTGCCTTCCCTGACCTTGCCGAGCA 
CTTTGACCGAAGCGCCCAGGACACTGCGTGGATCAGCGCCCTGGCCCTGGCCGTGCAGCAGGCAGCCAGC 
CCCGTGGGCAGCGCCCTGAGCACGCGCTGGGGGGCCCGCCCCGTGGTGATGGTTGGGGGCGTCCTCGCCT 
CGCTGGGCTTCGTCTTCTCGGCTTTCGCCAGCGATCTGCTGCATCTCTACCTCGGCCTGGGCCTCCTCGC 
TGGCTTCCTAAGGGATGAGACAGGAGACTTCACCGCCTCTTTCCTCCTGTCTGGTTCTTTGATCCTCTCC 
GGCAGCTTCATCTACATAGGGTTGCCCAGGGCGCTGCCCTCCTGTGGTCCAGCCTCCCCTCCAGCCACGC 
CTCCCCCAGAGACGGGGGAGCTGCTTCCCGCTCCCCAGGCAGTCTTGCTGTCCCCAGGAGGCCCTGGCTC 
CACTCTGGACACCACTTGTTGATTATTTTCTTGTTTGAGCCCCTCCCCCAC 



The disclosed NOV5b nucleic acid sequence , located on chromosome 17, has 323 of 
520 bases (62%) identical to a gb:GENBANK-lD:AF132610|acc:AF132610.1 mRNA from 
Homo sapiens (monocarboxylate transporter MCT3 mRNA, complete cds) (E = 3.2e" 18 ). 
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A disclosed NOV5b polypeptide (SEQ ID NO:26) encoded by SEQ ID NO:25 is 191 
amino acid residues and is presented using the one-letter amino acid code in Table 5D. Signal 
P, Psort and/or Hydropathy results predict that NOV5b contains no signal peptide and is likely 
to be localized in the endoplasmic reticulum (membrane) with a certainty of 0.9325. 
5 Alternatively, NOV5b is also likely to be localized to the plasma membrane with a certainty of 
0.4960, to the microbody (peroxisome) with a certainty of 0.3200, or to the Golgi body with a 
certainty of 0.1900 The most likely cleavage site for NOV5b is between positions 38 and 39: 
GLA-FP. 



Table 5D. Encoded NOV5b protein sequence (SEQ ID NO:26). 

MTPQPAGPPDGGWGWVVAAAAFAINGLSYGLLRSLGIxAFPDIxA-EHFDRSAQDTAWISAIjAIjAVQQAASPVGSALS 
TRWGARPVVMVGGVLASLGFVFSAFASDLLHLYLGLGLLAGFLRDETGDFTASFLLSGSLILSGSFIYIGLPRAL 
PSCGPASPPATPPPETGELLPAPQAVLLSPGGPGSTLDTTC 

.10 * ~ — 

The disclosed NOV5b amino acid sequence has 53 of 1 10 amino acid residues (48%) 
identical to, and 72 of 1 10 amino acid residues (65%) similar to, the 504 amino acid residue 
ptnr:SPTREMBL-ACC:Q9UBE2 protein from Homo sapiens (Human) (Monocarboxylate 
Transporter MCT3) (E = 2.9e" 28 ). 
15 NOV5b is predicted to be expressed in at least the following tissues: adrenal gland, 

bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
20 trachea and uterus. 

NOV5c 

A disclosed NOV5c nucleic acid of 704 nucleotides (also referred to as CG56635-03) 
encoding a novel Monocarboxylate transporter 3 -like protein is shown in Table 5E. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 28-30 and 
25 ending with a TGA codon at nucleotides 673-675. The start and stop codons are in bold letters 
in Table 5E. 



Table 5E. NOV5c Nucleotide Sequence (SEQ ID NO:27) 

CGAGCAGCCAGAGGCTGGATCTCAGGG ATGCCAGCTCCCCAGCGGAAGCACAGGCGTGGAGGCTTCTCTC 
ACAGATGTTTCCCCACCCCGCAGACGGCGATGACCCCCCAGCCCGCCGGACCCCCGGATGGGGGCTGGGG 
CTGGGTGGTGGCGGCCGCAGCCTTCGCGATAAACGGGCTGTCCTACGGGCTGCTGCGCTCGCTGGGCCTT 
GCCTTCCCTGACCTTGCCGAGCACTTTGACCGAAGCGCCCAGGACACTGCGTGGATCAGCGCCCTGGCCC 
TGGCCGTGCAGCAGGCAGCCAGCCCCGTGGGCAGCGCCCTGAGCACGCGCTGGGGGGCCCGCCCCGTGGT 
GATGGTTGGGGGCGTCCTCGCCTCGCTGGGCTTCGTCTTCTCGGCTTTCGCCAGCGATCTGCTGCATCTC 
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TACCTCGGCCTGGGCCTCCTCGCTGGCTTCCTAAGGGATGAGACAGGAGACTTCACCGCCTCTTTCCTCC 
TGTCTGGTTCTTTGATCCTCTCCGGCAGCTTCATCTACATAGGGTTGCCCAGGGCGCTGCCCTCCTGTGG 
TCCAGCCTCCCCTCCAGCCACGCCTCCCCCAGAGACGGGGGAGCTGCTTCCCGCTCCCCAGGCAGTCTTG 
CTGTCCCCAGGAGGCCCTGGCTCCACTCTGGACACCACTTGTTGA TTATTTTCTTGTTTGAGCCCCTCCC 
CCAC 



The disclosed NOV5c nucleic acid sequence , located on chromosome 17, has 340 of 
547 bases (62%) identical to a gb:GENBANK-ID:AF0191 1 l|acc:AF0191 1 1.2 mRNA from 
Mus musculus (monocarboxylate transporter 3 (MCT3) mRNA, complete cds) (E = 2.4e~ 15 ). 
5 A disclosed NOV5c polypeptide (SEQ ID NO:28) encoded by SEQ ID NO:27 is 215 

amino acid residues and is presented using the one-letter amino acid code in Table 5F. Signal 
P, Psort and/or Hydropathy results predict that NOV5c contains no signal peptide and is likely 
to be localized in the endoplasmic reticulum (membrane) with a certainty of 0.8500. 
Alternatively, NOV5c is also likely to be localized to the microbody (peroxisome) with a 
1 0 certainty of 0.6400, to the plasma membrane with a certainty of 0.4400, or to the nucleus with 
a certainty of 0.3000 



Table 5R Encoded NOV5c protein sequence (SEQ ID NO:28). 

MPAPQRKHRRGGFSHRCFPTPQTAMTPQPAGPPDGGWGWWAAAAFAINGLSYGLLRSLGLAFPDLAEHFDRSAQ 
DTAWISALALAVQQAASPVGSALSTRWGARPWMVGGVLASLGFVFSAFASDLLHLYLGLGLLAGFLRDETGDFT 
ASFLLSGSLILSGSFIYIGLPRALPSCGPASPPATPPPETGELLPAPQAVLLSPGGPGSTLDTTC 

The disclosed NOV5c amino acid sequence has 53 of 1 10 amino acid residues (48%) 
1 5 identical to, and 72 of 1 10 amino acid residues (65%) similar to, the 504 amino acid residue 
ptnr:SPTREMBL-ACC:Q9UBE2 protein from Homo sapiens (Human) (Monocarboxylate 
Transporter MCT3) (E = 2.9e' 28 ). 

NOV5c is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
20 nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. . 

NOV5d 

25 A disclosed NOV5d nucleic acid of 1513 nucleotides (also referred to as CG56635-04) 

encoding a novel Monocarboxylate transporter 3-like protein is shown in Table 5G. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 28-30 and 
ending with a TGA codon at nucleotides 1444-1446. The start and stop codons are in bold 
letters in Table 5G. 



Table 5G. NOV5d Nucleotide Sequence (SEQ ID NO:29) 

CGAGCAGCCAGAGGCTGGATCTCAGGG ATOCCAGCTCCCCAGCGGAAGCACAGGCGTGGAGGCTTCTCTC 
ACAGATGTTTCCCCACCCCGCAGACGGCGATGACCCCCCAGCCCGCCGGACCCCCGGATGGGGGCTGGGG 
CTGGGTGGTGGCGGCCGCAGCCTTCGCGATAAACGGGCTGTCCTACGGGCTGCTGCGCTCGCTGGGCCTT 
GCCTTCCCTGACCTTGCCGAGCACTTTGACCGAAGCGCCCAGGACACTGCGTGGATCAGCGCCCTGGCCC 
TGGCCGTGCAGCAGGCAGCCAGTCCCGTGGGCAGCGCCCTGAGCACGCGCTGGGGGGCCCGCCCCGTGGT 
GATGGTTGGGGGCGTCCTCGCCTCGCTGGGCTTCGTCTTCTCGGCTTTCGCCAGCGATCTGCTGCATCTC 
TACCTCGGCCTGGGCCTCCTCGCTGGTTTTGGTTGGGCCCTGGTGTTCGCCCCCGCCCTAGGCACCCTCT 
CGCGTTACTTCTCCCGCCGTCGAGTCTTGGCGGTGGGGCTGGCGCTCACCGGCAACGGGGCCTCCTCGCT 
GCTCCTGGCGCCCGCCTTGCAGCTTCTTCTCGATACTTTCGGCTGGCGGGGCGCTCTGCTCCTCCTCGGC 
GCGATCACCCTCCACCTCACCCCCTGTGGCGCCCTGCTGCTACCCCTGGTCCTTCCTGGAGACCCCCCAG 
CCCCACCGCGTAGTCCCCTAGCTGCCCTCGGCCTGAGTCTGTTCACACGCCGGGCCTTCTCAATCTTTGC 
T C T AGGC AC AGC C C TGGT TGGGGG CGGGT AC TT C GTT CC TT ACGT GC ACT TGG CT CCC C AC GCTTT AGAC 
CGGGGCCTGGGGGGATACGGAGCAGCGCTGGTGGTGGCCGTGGCTGCGATGGGGGATGCGGGCGCCCGGC 
TGGTCTGCGGGTGGCTGGCAGACCAAGGCTGGGTGCCCCTCCCGCGGCTGCTGGCCGTATTCGGGGCTCT 
GACTGGGCTGGGGCTGTGGGTGGTGGGGCTGGTGCCCGTGGTGGGCGGCGAAGAGAGCTGGGGGGGTCCC 
CTGCTGGCCGCGGCTGTGGCCTATGGGCTGAGCGCGGGGAGTTACGCCCCGCTGGTTTTCGGTGTACTCC 
CCGGGCTGGTGGGCGTCGGAGGTGTGGTGCAGGCCACAGGGCTGGTGATGATGCTGATGAGCCTCGGGGG 
GCTCCTGGGCCCTCCCCTGTCAGGTAAGTTCCTAAGGGATGAGACAGGAGACTTCACCGCCTCTTTCCTC 
CTGTCTGGTTCTTTGATCCTCTCCGGCAGCTTCATCTACATAGGGTTGCCCAGGGCGCTGCCCTCCTGTG 
GTCCAGCCTCCCCTCCAGCCACGCCTCCCCCAGAGACGGGGGAGCTGCTTCCCGCTCCCCAGGCAGTCTT 
GCTGTCCCCAGGAGGCCCTGGCTCCACTCTGGACACCACTTGTTG ATTATTTTCTTGTTTGAGCCCCTCC 
CC C AATAAAG AATTT TT ATCGGGTT T T C C TG AAAC C T C C AAC T 



The disclosed NOV5d nucleic acid sequence , located on chromosome 17, has 567 of 
940 bases (60%) identical to a gb:GENBANK-ID:HSU81800|acc:U8 1800.1 mRNA from 
Homo sapiens (monocarboxylate transporter (MCT3) mRNA, complete cds) (E = 6.5e~ 30 ). 

A disclosed NOV5d polypeptide (SEQ ID NO:30) encoded by SEQ ID NO:29 is 472 
amino acid residues and is presented using the one-letter amino acid code in Table 5H. Signal 
P, Psort and/or Hydropathy results predict that NOV5d contains no signal peptide and is likely 
to be localized in the plasma membrane with a certainty of 0.6000. Alternatively, NOV5d is 
also likely to be localized to the Golgi body with a certainty of 0.4000, to the endoplasmic 
reticulum (membrane) with a certainty of 0.3000, or to the microbody (peroxisome) with a 
certainty of 0.3000 



Table 5H. Encoded NOV5d protein sequence (SEQ ID NO:30). 

MPAPQRKHRRGGFSHRCFPTPQTAMTPQPAGPPDGGWGWWAAAAFAINGLSYGLLRSLGLAFPDLAEHFDRSAQ 
DTAWISALALAVQQAASPVGSALSTRWGARPVVMVGGVLASLGFVFSAFASDLLHLYLGLGLLAGFGWALVFAPA 
LGTLSRYFSRRRVLAVGLALTGNGASSLLLAPALQLLLDTFGWRGALLLLGAITLHLTPCGALLLPLVLPGDPPA 
PPRSPLAALGLSLFTRRAFSIFALGTALVGGGYFVPYVHLAPHALDR 

ADQGWVPLPRLLAVFGALTGLGLWWGLVPWGGEESWGGPLLAAAVAYGLSAGSYAPLVFGVLPGLVGVGGWQ 
ATGLVMMLMSLGGLLGPPLSGKFLRDETGDFTASFLLSGSLILSGSFIYIGLPRALPSCGPASPPATPPPETGEL 
LPAPQAVLLSPGGPGSTLDTTC __ 



The disclosed NOV5d amino acid sequence has 96 of 198 amino acid residues (48%) 
identical to, and 122 of 198 amino acid residues (61%) similar to, the 504 amino acid residue 
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ptnr:SPTREMBL-ACC:O95907 protein from Homo sapiens (Human) (DJ1039K5.2 (Similar 
To Monocarboxylate Transporter (MCT3))) (E = 7.9e" 68 ). 

NOV5d is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
5 nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. . 
NOV5e 

10 A disclosed NOV5e nucleic acid of 465 nucleotides (also referred to as CG56635-05) 

encoding a novel Monocarboxylate transporter 3-like protein is shown in Table 51. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 7-9 and 
ending with a TGA codon at nucleotides 436-438. The start and stop codons are in bold letters 
in Table 51., and the 5' and 3' untranslated regions, if any, are underlined. 

15 

Table 51. NOV5e Nucleotide Sequence (SEQ ID NO:31) 

~ACGGCGATGACCCCCCAGCCCGCCGGACCCCCGGATGGGGGCTGGGGCTGGGTGGTGGCGGCCGCAGCCT 
TCGCGATAAACGGGCTGTCCTACGGGCTGCTGCGCTCGCTGGGCCTTGCCTTCCCTGTCCTTGCCGAGCA 
CTTTGACCGAAGCGCCCAGGACACTGCGTGGATCAGCGCCCTGGCCCTGGCCGTGCAGCAGGCAGCCAGC 
TTCCT AAGGG ATG AG AC AGG AG ACTT C AC CG CCT CTT TC C T C CTGTCTGGTT CTTTGATC C T C T C CGGC A 
GCTTCATCTACATAGGGTTGCCCAGGGCGCTGCCCTCCTGTGGTCCAGCCTCCCCTCCAGCCACGCCTCC 
CCCAGAGACGGGGGAGCTGCTTCCCGCTCCCCAGGCAGTCTTGCTGTCCCCAGGAGGCCCTGGCTCCACT 
CTGG AC AC C ACTTGT TG A TT ATT TTC TTGT T TG AGCC C C T C C C C C 



The disclosed NOV5e nucleic acid sequence , located on chromosome 17, has 351 of 
434 bases (80%) identical to a gb:GENBANK-ID:AX083362|acc:AX083362.1 mRNA from 
Homo sapiens (Sequence 54 from Patent WO01 12660) (E = 1.6e" 53 ). 

20 A disclosed NOV5e polypeptide (SEQ ID NO:32) encoded by SEQ ID NO:3 1 is 143 

amino acid residues and is presented using the one-letter amino acid code in Table 5 J. Signal 
P, Psort and/or Hydropathy results predict that NOV5e contains no signal peptide and is likely 
to be localized extracellularly with a certainty of 0.5040. Alternatively, NOV5e is also likely 
to be localized to the endoplasmic reticulum (membrane) with a certainty of 0.1000, to the 

25 endoplasmic reticulum (lumen) with a certainty of 0.1000, or to the lysosome (lumen) with a 
certainty of 0.1000. The most likely cleavage site for NOV5e is between positions 43 and 44: 
VLA-EH. 
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Table 5J. Encoded NOV5e protein sequence (SEQ ID NO:32). 

MTPQPAGPPDGGWGWWAAAAFAINGLSYGLLRSLGLAFPVLAEHFDRSAQDTAWISALALAVQQAASFLRDETG 
DFTASFLLSGSLILSGSFIYIGLPRALPSCGPASPPATPPPETGELLPAPQAVLLSPGGPGSTLDTTC 

The disclosed NOV5e amino acid sequence has 67 of 68 amino acid residues (98%) 
identical to, and 67 of 68 amino acid residues (98%) similar to, the 375 amino acid residue 
ptnr:REMTREMBL-ACC:CAC33285 protein from Homo sapiens (Human) (Sequence 54 
5 from Patent WO01 12660) (E - 2.9e 31 ). 

NOV5e is predicted to be expressed in at least Mammalian Tissue, Parathyroid Gland, 
Mammary gland/Breast, Prostate. . 

NOV5a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 5K. 

10 , 



Table 5K. BLAST results for NOV5a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 7670446 | dbj | BAA 9 
5074. l| (AB041591) 


unnamed protein 
product [Mus 
musculus] 


290 


252/288 
(87%) 


263/288 
(90%) 


le-86 


gi | 17491104 |ref |XP_ 
064368 . 1 | 
(XM_064368) 


similar to solute 
carrier family 16 
(monocarboxyl ic 
acid 

transporters) , 
member 8 {H. 
sapiens) [Homo 
sapiens] 


427 


196/398 j 
(49%) 


257/398 
(64%) 


6e-74 


gi|2497855|sp|Q6334 
4 | MOT2_RAT 


MONOCARBOXYLATE 
TRANSPORTER 2 
(MCT 2) 


489 


142/420 
(33%) 


220/420 
(51%) 


6e-53 


gi | 143 216 7 |gb| AAB04 
023. l| (U62316) 


monocarboxylate 
transporter 2 
[Rattus 
norvegicus] 


489 


143/420 
(34%) 


220/420 
(52%) 


6e-53 


gi | 6755536 | ref |NP_0 
35521. l| 
(NM_011391) 


solute carrier 
family 16 
(monocarboxyl ic 
acid 

transporters) , 
member 7 [Mus 
musculus] 


484 


142/421 
(33%) 


221/421 
(51%) 


2e-52 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 5J. 

Table 5J Information for the ClustalW proteins 

15 1) NOV5a (SEQ ID NO: 24) 

2) NOV5b (SEQ ID NO: 26) 

3) NOV5C (SEQ ID NO: 28) 

4) NOV5d (SEQ ID NO : 3 0 ) 

5) NOV5e (SEQ ID NO: 32) 

20 6) gi | 7670446 | dbj | BAA95 074 . 1 | (AB041591) unnamed protein product [Mus musculus] (SEQ 
ID NO: 337) 
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10 



15 



20 



7) gi | 17491104 | ref |XP_064368 . 1 | (XM_064368) similar to solute carrier family 16 
(monocarboxylic acid transporters) , member 8 <H. sapiens) [Homo sapiens] (SEQ ID 
NO:338) 

8) gi | 2497855 | sp|Q63344 |MOT2_RAT MONOCARBOXYLATE TRANSPORTER 2 (MCT 2) (SEQ ID 
NO : 3 3 9 ) 

9) gi | 1432167 |gb | AAB04023 . 1 1 (U62316) monocarboxylate transporter 2 [Rattus 
norvegicus] (SEQ ID NO: 34 0) 

10) . gi | 67 5553 6 | ref | NP_03 5521 . 1 | (NM_011391) solute carrier family 16 
(monocarboxylic acid transporters), member 7 [Mus musculus] (SEQ ID NO:341) 



NOV5a 
NOV5b 
NOVSc 
NOV5d 
NOVSe 

gi |7670446 | 
gi j 17491104 | 
gi j 2497855 | 
gi | 1432167 j 
gij 6755536 | 



10 20 30 40 50 60 

....|....|....|....|....|....|....|-...|.-..|--..|....|^.-| 

1 1 

1 1 

! 1 

! 1 

1 1 

1 1 

1 MARRT EPPDGGWGWXVWLSAFFQSALVFGVLRSFGVFFVEFVAAFEE 4 8 

1 MPSESSVKATAAPPPFPLPPDGGWGWVVVCAS-FISIGFSYAFPKAVTVFFNDIKDIFKT 5 9 

1 MPSESSVKATAAPPPFPLPPDGGWGWVWCAS - FISIGFSYAFPKAVTVFFNDIKDIFKT 5 9 

1 MPSEP S--APLPQPLPPDGGWGWWVCAS-FISIGFSYAFPKAVTVFFKDIQEIFNT 54 
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45 



NOV5a 
NOV5b 
NOVSc 
NOV5d 
NOV5e 

gi |7670446 | 
gi j 17491104 | 
gi|2497855 | 
gij 1432167 j 
gi|6755536 | 



1 
1 
1 
1 
1 
1 

49 
60 
60 
55 



70 
.|.. 



80 



90 
■I-- 



100 



110 



-MPAPQRKHRRGgFSHRCrapTPQTAMT 2 6 

-hpapqrkhrrgHfshrcqptpqtaIIt 26 




MVGGVLTSL 

QAARVSWIASIGIAVQQFGSPVGSALSTKFGPRPMVMTGGILAAL 
TSSQIAWI SS IMLAVMYAGGPI SSVLVNNYGSRPVjlil VGGLLCC^ 
TSSQI AWI SS IMLAVMYAGGP I SS VLVNNYGSRPjjjVI VGGLLCCTj 
TSSQIAWISSIMLAVMYAGGPISSVLVNNYGSRPpVIVGGLLCCI 



130 



140 



NOV5a 


3 


PQPAGPPDGj 


NOV5b 


3 


pqpagppdg! 


NOV5c 


27 


pqpagppdg! 


NOV5d 


27 


PQPAGPPDGj 


NOVSe 


3 


PQPAGPPDG^ 


gi | 7670446 | 


25 


LGLGLLAGS 


gi | 17491104 | 


109 


LSIGLLSGS 


gi | 2497855 | 


120 


LTVGFIGGL 


gij 1432167 j 


120 


LTVGFIGGlS 


gij 6755536 j 


115 


LTVGFIGGLg 



rF§: 



150 

....|. 

Jg - lSygl^r^. 

jG-L:YGLpR^ n 
fG-L: YGLgjRSL 
|g - L : YGL|jRS< 
b-L5YGLpRBJ 
_ JLGTLSRYF : RRR^LA\ 
|PTLAct§C YFgRRRS L^T 

iTiHgkyfyrkrpLj J 
.tiHgkyfyrkrplS 
.tiqgkyfyrrrpl] 



160 





H T 2 

RSLLHLY 24 
.TSLTHLY 108 
jSSSVIELjY 
SSSVIE|lY 



119 
119 



ILASgSNSVIEIjY 114 



180 



54 
54 
78 
78 
54 



lEHgDRSAQDTg 

IehHdrsaqdtB 
IehSdrsaqptB 
IehBdrsaqptE 

!EH§DRSAQ|Jrg_ 

PALQFLLDTFG 84 

1TGVGI1 S S FT F^P F§Q WL>L SHY^ 168 

.GSPVFLSTTOPFNQFLFNSYG 17 9 

.GSPVFLSTTOPFNQFLFNSYG 17 9 

,GSPVFLST™PFNOYLFNNYG 174 
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190 



200 



210 



220 



230 



NOV5a 
NOVSb 
NOVSc 
NOV5d 
NOVSe 

gi |7670446 | 
gi | 17491104 | 
gi j 2497855 | 
gi | 1432167 j 
gij 6755536 j 



NOVSa 
NOV5b 
NOVSc 
NOV5d 
NOVSe 




-STRWG 

- STRWG 

-STRWG 

-STRWG 




240 
•-I 



Irplals - - 

PPSLA-- 
fCLpRPVGPSP 
'LMRPVGPSP 




GDjSLAPPRTP 
ED§--aE§GP 

|iSKS 

;sks 



|CLMRPVGPSPNTK|jsKS 



2 S RQDgS tkrl s k VSTAE kinr 
srqdBstkrlskvstaekinr 
Itlkkaskvstaqkvnr 



92 

92 

116 

116 

68 

125 

207 

239 

239 

234 




250 260 
"P 7 Sj AFASD D LH R 



280 



290 



300 



Y 

{s|aFAS Djjd LHjJ Y[ 

asdiIlhiJ- - 

ASdSlhU Yi 




270 

IlSg FGW ALVFAPALGTL S R YFS RRR VLAVGLALT 147 



FGWALVFAPALGTLSR YFS RRR VLAVGLALT 



115 
139 
171 
68 
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gi 


|7670446 | 


125 - 


gi 


j 17491104 | 


207 - 


gi 


j 2497855 | 


240 F 


gi 


[1432167 j 


240 F 


gi 


j 6755536 j 


235 F 






/ 



NOV5a 
NOV5b 
NOV5c 
NOV5d 
NOV5e 

gi |7670446 | 
gi | 17491104 | 
gi|2497855 | 
gi | 1432167 | 
gi|6755536| 



NOVSa 
NOVSb 
NOV5c 
NOV5d 
NOV5e 

gi |7670446 | 
gi | 17491104 | 
gi [2497855 | 
gi j 1432167 j 
gi|6755536| 



5LGX^krrafs^fai]gtalii£gg5 

JtS -yLHHGPFLRYTyALTLpjNTB 

|GL|pHRGFL|YLSGNVvjlFffi 
|SL§MHRG Fl|yL S GNV I Jf^ 



iTFVPYVHLG- - 
fFIPYLHLV-- 
4F0P 1 1 FLAP - 
4FgPIIFLAP- 

E'fBpiiflaq- 



158 
239 
274 
274 
269 



I 



310 
I 



I 



320 
I 



330 



I 



340 
I 



I 



350 



I 



360 
- - I 



NOVSa 


148 


NOVSb 


115 


NOV5c 


139 


NOV5d 


172 


NOVSe 


68 


gi 


7670446 | 


158 


gi 


17491104 | 


239 


gi 


2497855 | 


274 


gi 


1432167 j 


274 


gi 


6755536 j 


269 



14 8 GNGASSLLLAPALQLLLDTFGWRGALLLLGAITLHLTPCGALLLPLVLPGDPPAPPRSPD 207 
115 

139 



68 

158 

239 

274 

274 

269 



420 



370 380 390 400 410 

....|....|....|....|....|..--|--..|.-..|....|.-.-|.... 
2 08 AALGQSLFTRRAFSIFALGTALVGGGYFVPYVHLAPHALDRGLGGYGAALVVAVAAMGDA 2 67 



115 



115 



139 139 

2 32 AALGLSLFTRRAFSIFALGTALVGGGYFVPYVHLAPHALDRGLGGYGAALVVAVAAMGDA 291 



68 

158 

239 

274 

274 

269 



68 

158 

239 

274 

274 

269 



430 440 450 460 470 480 

....|....|....|....|...-|....|....|-...|-...|....|.-.-|....| 
2 68 GARL VCGWL AD QG W VP L P RLL AVFG ALTGLGL WWGL VP WGGE E S WGG P L L AAAV AYGL 3 2 7 



115 



115 



139 139 

2 92 GARLVCGWLADQGWVPLP RLL AVFGALTGLGLWVVGLVPVVGGEESWGGPLLAAAV AYGL 351 



68 

158 

239 

274 

274 

269 



68 

158 

239 

274 

274 

269 



NOV5a 


328 


NOV5b 


115 


NOVSc 


139 


NOV5d 


352 


NOV5e 


68 


gi 


7670446 | 


158 


gi 


17491104 | 


239 


gi 


2497855 | 


274 


gi 


1432167 j 


274 


gi 


6755536 j 


269 



490 500 510 520 

....|....|....|....|....|....|....|....|....| 

sag s y ap l v fg vl p gl vg vgg vvq atgl vmmlm s lggllg-pplsgf 

g: 

GF 



SAGSYAPLVFGVLPGLVGVGGWQATGLVMMLMSLGGLLGPPLSGKF Irdet 




S RD M T 

DQG1 

-ahlqdl|wdplp! 

- - YAKDKGVDggNgA 

- - YAKDKGVDfflYNpl 
- - YAKHIGVDBYNfeA 
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NOV5a 
NOVSb 
NOVSc 
NOV5d 
NOV5e 

gi|7670446 | 
gi j 17491104 | 
gi|2497855| 
gi 1 1432167 j 
gi|6755536| 




jSL 

17 5 p?VyA^GDACA] 
2 5 6 
290 
290 
285 




km 



fi. 

fB Mg 

- - r - - « - ^9ADoqBS p L 

WAgSDLVGR§VSG^2SGDAVPGP§T 
vmaftdmfarpSvg lAan - TSpI ™ 

VMAFTDMFARPgVGLnAN- TSL'I 
VMAF I DM FARpHvGlHaN - Tsl I 



570 580 590 600 

|..^|. I.... 

- - TGE 42 5 

-TGE 169 

- -TGE 193 

-TGE 450 

— TGE 121 

;gGWGAP 234 

- VSLgLFgVAQAPTA 3 0 8 

- ICHLLC SLAHSYTA 344 

jlQYLFSVglMFTG ICHLLC SLAHSYTA 3 44 

JlQYLFSsHl I FTG ICHLLCgLATTYSAgj 3 3 9 
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NOV5a 
NOV5b 
NOV5c 
NOV5d 
NOV5e 

gi|7670446 | 
gi | 17491104 | 
gi|2497855| 
gi 1 1432167 j 
gij 6755536 j 



NOV5a 
NOV5b 
NOV5c 
NOV5d 
NOV5e 

gi|7670446 | 
gij 17491104 | 
gi | 2497855 | 
gi | 1432167 j 
gi | 6755536 j 



NOV5a 
NOV5b 
NOVSc 
NOV5d 
NOV5e 

gi | 7670446 | 
gi | 17491104 | 
gij 2497855 | 
gij 1432167 j 
gij 6755536 j 



610 

pSpqBvl 
pBpqHvl 
pHpqSvl 
pBpqHvl 

PgPQgVL 

.gByGj-jj^-. 

v^ygf^s 

IFFGJGF 

iffgIg: 

^VYWFFGjjjG] 
670 



620 



630 





SYAPLVFGVLPGLA 

PLAFSVLPELlg 
|S I SSLLFECLMDQ^ 
ISSLLFECLMDQ^ 
ISSLLFECLMDI 1 



640 
I 



650 
I 



660 
I 



gSTpDTHc 44 7 



191 

215 

472 

143 

JLVMMLMSLGGLLiGPPLSGKG 290 

:RRIYCGLGLLQMIESIGGLLGPPLSGYLRDVT 3 6 8 

^SRFSSAVGLVTIVECCPVLFGPPLAGKLLDIT 4 04 

3RFSSAVGLVTIVECCPVLPGPPLAGKLLDIT 4 04 

VTRFSSAVGLTTI VECCPVLFGPPLAGKLLDIT 3 99 



680 



690 



700 



710 



720 



447 
191 
215 
472 



191 

215 
472 



143 

290 



143 

290 

369 GNYTASFWAGAFLLSGSGILLTLPHFFC FSTTTSG 404 

4 05 GQYKYLYIASGIWLSSGIYLLICNAINYRLLEKERKREKARRKKSASQASKEMEALSRS 4 64 

4 05 GQYKYLYIASGI WLSSGIYLLICNAINYRLLEKERKREKARRKKSASQAS KEMEALSRS 4 64 

4 00 GEYKYLYIASGTWLVSGTYLLIGNAINYRLLDKERKREKAKKKKSASHASREMEALNRS 459 



730 
..|.. 



740 
..|.. 



447 
191 
215 
472 
143 
290 



447 
191 
215 
472 
143 
290 



405 PQDLVTEALDTKVPLPKEGLEED — 427 

4 65 KQDDVT VKVSNTHNP P SDRDKE SSI 489 

4 65 KQDDVTVKVSNTHNPPSDRDKESSI 4 89 

460 KQDEVTVKASNAHNPPSDRDKESNI 4 84 



Monocarboxylates such as lactate and pyruvate play a central role in cellular 
metabolism and metabolic communication between tissues. Essential to these roles is their 
rapid transport across the plasma membrane, which is catalysed by a recently identified family 
of proton-linked monocarboxylate transporters (MCTs). Nine MCT-related sequences have so 
far been identified in mammals, each having a different tissue distribution, whereas six related 
proteins can be recognized in Caenorhabditis elegans and 4 in Saccharomyces cerevisiae. 
Direct demonstration of proton-linked lactate and pyruvate transport has been demonstrated 
for mammalian MCT1-MCT4, but only for MCT1 and MCT2 have detailed analyses of 
substrate and inhibitor kinetics been described following heterologous expression in Xenopus 
oocytes. MCT1 is ubiquitously expressed, but is especially prominent in heart and red muscle, 
where it is up-regulated in response to increased work, suggesting a special role in lactic acid 
oxidation. By contrast, MCT4 is most evident in white muscle and other cells with a high 
glycolytic rate, such as tumour cells and white blood cells, suggesting it is expressed where 
lactic acid efflux predominates. MCT2 has a ten-fold higher affinity for substrates than MCT1 
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and MCT4 and is found in cells where rapid uptake at low substrate concentrations may be 
required, including the proximal kidney tubules, neurons and sperm tails. MCT3 is uniquely 
expressed in the retinal pigment epithelium. MCT1 and MCT4 have been shown to interact 
specifically with OX-47 (CD147), a member of the immunoglobulin superfamily with a single 
5 transmembrane helix. This interaction appears to assist MCT expression at the cell surface 

The disclosed NOV5 nucleic acid of the invention encoding a Monocarboxylate 
transporter (MCT3)-like protein includes the nucleic acid whose sequence is provided in Table 
5A, 5C, 5E, 5G, 51 or a fragment thereof. The invention also includes a mutant or variant 
nucleic acid any of whose bases may be changed from the corresponding base shown in Table 

10 5 A, 5C, 5E, 5G, or 51 while still encoding a protein that maintains its Monocarboxylate 

transporter (MCT3)-like activities and physiological functions, or a fragment of such a nucleic 
acid. The invention further includes nucleic acids whose sequences are complementary to 
those just described, including nucleic acid fragments that are complementary to any of the 
nucleic acids just described. The invention additionally includes nucleic acids or nucleic acid 

15 fragments, or complements thereto, whose structures include chemical modifications. Such 
modifications include, by way of nonlimiting example, modified bases, and nucleic acids 
whose sugar phosphate backbones are modified or derivatized. These modifications are 
carried out at least in part to enhance the chemical stability of the modified nucleic acid, such 
that they may be used, for example, as antisense binding nucleic acids in therapeutic 

20 applications in a subject. In the mutant or variant nucleic acids, and their complements, up to 
about 40 percent of the bases may be so changed. 

The disclosed NOV5 protein of the invention includes the Monocarboxylate 
transporter (MCT3)-like protein whose sequence is provided in Table 5B, 5D, 5F, 5H, or 5 J. 
The invention also includes a mutant or variant protein any of whose residues may be changed 

25 from the corresponding residue shown in Table 5B, 5D, 5F, 5H, or 5J while still encoding a 
protein that maintains its Monocarboxylate transporter (MCT3)-like activities and 
physiological functions, or a functional fragment thereof. In the mutant or variant protein, up 
to about 67 percent of the residues may be so changed. 

NOV5 nucleic acid and polypeptide show homology to the Monocarboxylate 

30 transporter (MCT3) familyof proteins. Accordingly, to the NOV5 nucleic acid and polypeptide 
may function as members of this family. The NOV5 nucleic acids and proteins identified here 
may be useful in potential therapeutic applications implicated in (but not limited to) various 
pathologies and disorders as indicated below. The potential therapeutic applications for this 
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invention include, but are not limited to: protein therapeutic, small molecule drug target, 
antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or 
prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue 
regeneration in vivo and in vitro of all tissues and cell types composing (but not limited to) 
5 those defined here. 

The nucleic acids and proteins of NOV5 are useful in metabolic disorders such as salla 
disease, infantile sialic acid storage disease, symptomatic deficiency in lactate transport, 
subnormal erythrocyte lactate transport, muscle injuries, cystinosis, streptozotocin-induced 
diabetes, hypoxia, cardiac arrest or stroke, neuronal disorders, retinal angiogenesis, and/or 

10 other pathologies and disorders. 

NOV5 nucleic acids and polypeptides are further useful in the generation of antibodies 
that bind immunospeciflcaHy to the novel substances of the invention for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 
art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies'" 

15 section below. For example the disclosed NOV5 protein have multiple hydrophilic regions, 
each of which can be used as an immunogen. This novel protein also has value in 
development of powerful assay system for functional analysis of various human disorders, 
which will help in understanding of pathology of the disease and development of new drug 
targets for various disorders. 



A disclosed NOV6 nucleic acid of 1336 nucleotides (also referred to CG56674-01) 
encoding a novel Nitrilase-l-like protein is shown in Table 6A. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 77-79 and ending with a 
25 TAA codon at nucleotides 1058-1060. In Table 6A, the 5' and 3' untranslated regions are 
underlined and the start and stop codons are in bold letters. 
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Table 6A. NOV6 Nucleotide Sequence (SEQ ID NO:33) 



10 



GCCCACTCGCTGCGGCCTATCTGGCTCCAGACCGCCCTCCGGATCGGACCCTGCGAATGGTTTTGGCTATA 
TCTTCATGCTGGGCTTCATCACCAGGCCTCCTCACAGATTCCTGTCCCTTCTGTGTCCTGGACTCCGGATA 
CCTCAACTCTCTGGGGAAGGTGCTCAGCCCAGGCCCAGAGCCATGGCTATCTCCTCTTCCTCCTGCGAACT 
GCCCCTGGTGGCTGTGTGCCAGGTAACATCGACGCCAGACAAGCAACAGAACTTTAAAACATGTGCTGAGC 
TGGTTCGAGAGGCTGCCAGACTGGGTGCCTGCCTGGCTTTCCTGCCTGAGGCATTTGACTTCATTGCACGG 
GACCCTGCAGAGACGCTACACCTGTCTGAACCACTGGGTGGGAAACTTTTGGAAGAATACACCCAGCTTGC 
C AGGG AATGT GGACT CTGGCTGT C CT TGGGTGGTTT C C ATGAGCGTGGCC AAG AC TGGG AGC AG AC T C AG A 
AAATCTACAATTGTCACGTGCTGCTGAACAGCAAAGGGGCAGTAGTGGCCATTTACAGGAAGACACATCTG 
TGTGACGTAGAGATTCCAGGGCAGGGGCCTATGTGTGAAAGCAACTCTACCATGCCTGGGCCCAGTCTTGA 
GTCACCTGTCAGCACACCAGCAGGCAAGATTGGTCTAGCTGTCTGCTATGACATGCGGTTCCCTGAACTCT 
CTCTGGCATTGGCTCAAGCTGGAACAGAGATACTTACCTATCCTTCAGCTTTTGGATCCATTACAGGCCCA 
GCCCACTGGGAGGTGTTGCTGCGGGCCCGTGCTATCGAAACCCAGTGCTATGTAGTGGCAGCAGCACAGTG 
TGGACGCCACCATGAGAAGAGAGCAAGTTATGGCCACAGCATGGTGGTAGACCCCTGGGGAACAGTGGTGG 
CCCGCTGCTCTGAGGGGCCAGGCCTCTGCCTTGCCCGAATAGACCTCAACTATCTGCGACAGTTGCGCCGA 
CACCTGCCTGTGTTCCAGCACCGCAGGCCTGACCTCTATGGCAATCTGGGTCACCCACTGTCTTAAGACTT 
GACTTCTGTGAGTTTAGACCTGCCCCTCCCACCCCCACCCTGCCACTATGAGCTAGTGCTCATGTGACTTG 
GAGGCAGGATCCAGGCACAGCTCCCCTCACTTGGAGAACCTTGACTCTCTTGATGGAACACAGATGGGCTG 
CTTGGGAAAGAAACTTTCACCTGAGCTTCACCTGAGGTCAGACTGCAGTTTCAGAAAGGTGGAATTTTATA 
TAGTCATTGTTTATTTCATGGAAACTGAAGTTCTGCTGAGGGCTGAGCACCTTCCCCA 



The disclosed NOV6 nucleic acid sequence, localized to the pi 4.2 region of 
chromosome 3, has 1319 of 1329 bases (99%) identical to a gb:GENBANK- 
ID:AF069987|acc:AF069987.1 mRNA from Homo sapiens (nitrilase 1 (NIT1) mRNA, 
complete cds) (E = 3.1e" 290 ). 

A disclosed NOV6 polypeptide (SEQ ID NO:34) encoded by SEQ ID NO:33 is 327 
amino acid residues and is presented using the one-letter amino acid code in Table 6B. Signal 
P, Psort and/or Hydropathy results predict that NOV6 has a signal peptide and is likely to be 
localized in the cytoplasm with a certainty of 0.4500. Alternatively, NOV6 is also likely to be 
localized to the microbody (peroxisome) with a certainty of 0.3000, to the lysosome (lumen) 
with a certainty of 0.2021, or to the mitochondrial matrix space with a certainty of 0.1000. The 
most likely cleavage site for NOV6 is between positions 27 and 28: LSG-EG 



MLGFITRPPHRFLSLLCPGLRIPQLSGEGAQPRPRAMAISSSSCELPLVAVCQVTSTPDKQQNFKTCAELV 
REAARLGACLAFLPEAFDFIARDPAETLHLSEPLGGKLLEEYTQLARECGLWLSLGGFHERGQDWEQTQKI 
YNCHVL3LNSKGAWAIYRKTHLCDVEIPGQGPMCESNSTMPGPSLESPVSTPAGKIGLAVCYDMRFPELSL 
ALAQAGTEILTYPSAFGSITGPAHWEVLLRARAIETQCYWAAAQCGRHHEKRASYGHSMWDPWGTWAR 
CSEGPGLCLARIDLNYLRQLRRHLPVFQHRRPDLYGNLGHPLS 



The disclosed NOV6 amino acid sequence has 322 of 327 amino acid residues (98%) 
identical to, and 322 of 327 amino acid residues (98%) similar to, the 327 amino acid residue 
ptnr:SPTREMBL-ACC:O76091 protein from Homo sapiens (Human) (Nitrilase Homolog 1) 



Table 6B. Encoded NOV6 protein sequence (SEQ ID NO:34). 
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NOV6 also has homology to the amino acid sequence shown in the BLASTP data 
listed in Table 6C. 



Table 6C. BLAST results for NOV6 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 5031947 | ref |NP 00 
5591.1) (NM_005600) 


nitrilase 1 [Homo 
sapiens] 


327 


322/327 
(98%) 


322/327 
(98%) 


0.0 


gi | 3242980 | gb | AAC4 01 
84. 1| (AF069985) 


nitrilase homolog 
1 [Mus musculus] 


323 


272/327 
(83%) 


298/327 
(90%) 


e-154 


gi | 6 754856 | ref |NP_03 
6179. 1| (NM_012049) 


nitrilase 1 [Mus 
musculus] 


323 


272/327 
(83%) 


297/327 
(90%) 


e-153 


gi | 18 204 913 | gb | AAH21 
634 . 1 | AAH21634 
(BC021634) 


Unknown (protein 
for MGC:13825) 
[Mus musculus] 


323 


271/327 
(82%) 


296/327 
(89%) 


e-153 


gi | 12836591 | dbj | BAB 2 
3723 . 1 | (AK004988) 


data source :MGD , 
source 

key:MGI :1350916, 
evidence : ISS~nitr 
ilase l~putative 
[Mus musculus] 


290 


251/288 
(87%) 


272/288 
(94%) 


e-145 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 6D. 



Table 6D. Information for the ClustalW proteins 

1) NOV6 (SEQ ID NO:34) 

2) gi | 5031947 | ref |NP_005591 . 1 | (NM_005600) nitrilase 1 [Homo sapiens] (SEQ ID 
NO:342) 

3) gi | 3242980 jgb | AAC40184 . 1 | (AF069985) nitrilase homolog 1 [Mus musculus] (SEQ ID 
NO:343) 

4) gi | 6754856 |ref | NP_036 179 . 1 | (NM_012049) nitrilase 1 [Mus musculus] (SEQ ID 
NO: 344) 

5) gi | 18204913 |gb|AAH21634 . 1 | AAH21634 (BC021634) Unknown (protein f or MGC : 13 825 ) 
[Mus musculus] (SEQ ID NO: 345) 

6) gi|l283659l|dbj |BAB23723.1| (AK004988) data source:MGD, source key : MGI : 13 50916 , 
evidence : ISS-nitrilase 1 -putative [Mus musculus] (SEQ ID NO: 346) 



NOV6 

gi | 5031947 | 
gi j 3242980 j 
gi | 6754856 j 
gi | 18204913 | 
gi | 1283659l| 




NOV6 

gi | 5031947 | 
gi | 3242980 | 
gi j 6754856 | 
gi j 18204913 | 
gi|l283659l| 



61 
61 
57 
57 
57 
24 



70 
J. . 



80 



90 



100 



110 



120 



YMQLAREC 
YMQLAREC 



QSNFKTCAELVgEAARLGACLAFLPEAFDFIARgPAETLgLSEPLgGSLL! 
Q^NFKTCAELVgEAARLGACLAFLPEAFDFIARgPAETLgjLSEPLg 
QENFKTCAELVQEAARLGACLAFLPEAFDFIARNPAETLLLSEPLNGDLLGQYSQLAREC 
QENFKTCAELVQEAARLGACLAFLPEAFDF I ARNPAETLLLSEPLNGDLLGQYSQLAREC 
QENFKTCAELVQEAARLGACLAFLPEAFDFIARNPAETLLLSEPLNGDLLGQYSQLAREC 
QENFKTCAELVQEAARLGACLAFLPEAFDFI ARNPAETLLLSEPLNGDLLGQYSQLAREC 



120 1 

120 

116 

116 

116 
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130 140 150 160 170 180 



NOV6 121 

gi | 5031947 | 121 

gi|3242980| 117 

gi|6754856| 117 



G^LSLGGFHERGQDWEQnQKIYNCHVLLNSKGgVVAnYRKTHLCDVE I PGQGPMKESNg 
gSwLSLGGFHERGQDWEq||QKI YNCHVLLNSKGgVVAyYRKTHLCDVEI PGQGPMgESNg 
GIWLSLGGFHERGQDWEQNQKI YNCHVLLNSKGSWASYRKTHLCDVEI PGQGPMRESNY 
GIWLSLGGFHERGQDWEQNQKI YNCHVLLNSKGS WAS YRKTHLCDVE I PGQGPMRESNY 
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GIWLSLGGFHERGQDWEQNQKIYNCHVLLNSKGSVVASYRKTHLCDVEI PGQGPMRESNY 
GIWLSLGGFHERGQDWEQNQKIYNCHVLLNSKGSWASYRKTHLCDVE I PGQGPMRESNY 





gi | 18204913 | 


117 




gi|l283659l| 


84 


5 








NOV6 


181 




gi| 5031947 | 


181 




gi | 3242980 | 


177 


10 


gi | 6754856 | 


177 


gi j 18204913 | 


177 




gi j 12836591 | 


144 


15 


NOV6 


241 




gi | 5031947 | 


241 




gi | 3242980 | 


237 




gi | 6754856 | 


237 




gi|!8204913 | 


237 


20 


gi j 12836591 | 


204 



190 



200 




250 



I 



260 



270 



I 



I 



280 
. . I . . 



290 



300 

1 



LLRARAIEjlQCYVgAAAQCGRHHEraRASYGHSMVVDPWGTWARCSEGPGLCLARIDL 
LLRARAIEjyQCYVgAAAQCGRHHEgRASYGHSMWDPWGTWARCSEGPGLCLARIDL; 
LLRARAIESQCYVIAAAQCGRHHETRASYGHSMWDPWGTWARCSEGPGLCLARIDLHF 
LLRARAIESQCYVIAAAQCGRHHETRASYGHSMWDPWGTWARCSEGPGLCLARIDLHF 
LLRARAIESQCYVIAAAQCGRHHETRASYGHSMWDPWGTWARCSEGPGLCLARIDLHF 
LLRARAIESQCYVIAAAQCGRHHETRASYGHSMWDPWGTWARCSEGPGLCLARIDLHF 



25 



30 



NOV6 


301 


gi|5031947| 


301 


gi j 3242980 | 


297 


gi | 6754856 | 


297 


gij 18204913 | 


297 


gi j 12836591 1 


264 



310 
. - I . . 



I 



320 



I 



LgQgRgHLPVFQHRRPDLYGBLGHPLS 

l!|q3r1§hlpvfqhrrpdlyg||lghpls 
lqqmrqhlpvfqhrrpdlygslghpls 
lqqmrqhl p vfqhrrpdlygs lghpls 
lqqmrqhlpvfqhrrpdlygslghpls 
lqqmrqhlpvfqhrrpdlygslghpls 



327 
327 
323 
323 
323 
290 



Tables 6E list the domain description from DOMAIN analysis results against NOV6. 
This indicates that the NOV6 sequence has properties similar to those of other proteins known 
to contain this domain. 

35 

Table 6E. Domain Analysis of NOV6 

gnl | Pf am | pf am00795 , CN_hydrolase, Carbon-nitrogen hydrolase. This 
family contains hydrolases that break carbon-nitrogen bonds. The 
family includes: Nitrilase EC: 3. 5. 5.1 f Aliphatic amidase EC: 3. 5. 1.4, 
Biotidinase EC: 3. 5. 1.12, Beta-ureidopropionase EC: 3. 5. 1.6. (SEQ ID 
NO : 8 0 3 ) 

CD-Length = 267 residues, 100.0% aligned 
Score = 273 bits (698), Expect = le-74 



40 



45 



50 



NOV 6: 


51 


Sbj Ct : 


1 


NOV 6: 


107 


Sbjct : 


61 


NOV 6 : 


167 


Sbjct : 


116 


NOV 6 : 


224 


Sbj Ct : 


166 


NOV 6: 


279 



VCQVTSTP-DKQQNFKTCAELVREAARLGACLAFLPEAFDFI ARDPAETLHLSEPLG 10 6 

I I I I - II- III- II I I I I I - II -I - 

AVQAEPVPEDLAANLQKAEELIEEAAKAGAELWFPEAFIPGYPYCKSDAEYYENAEAID 6 0 

GKLLEEYTQLARECGLWLSLGGFHERGQDWEQTQKIYNCHVLLNSKGAWAI YRKTHLCD 16 6 
I- - -III- I- - II |+ | + || || ++ | ++ Ml || 

GEETQFLSRLARKNGIVIVLGVSEREGEG KL YNT A VL I D PDG KL I GKY RK I HL FT 115 

V EI PGQGPMCESNSTMPGPSLES PVSTPAGKIGLAVCYDMRFPELSLALAQAGTEIL 22 3 

M I I II 11-11 +III+IIIII+ III I III 

DPERKVYGEG GGSGFP VFDTPVGKLGLLI CYDI RFPELARALALKGAE I L 16 5 

TYPSAFGSITGPAHWEVLLRARAIETQCYWAAAQCGRHHEKRA SYGHSMWDPW .27 8 

-inn ii -iii-i Mini ii-i ii i i - iiiii-ii 

AWPSAFGRKTGDSHWELLARARAIENQCFVAAANQVGTEEDLDLFDLGEFYGHSMI IDPD 22 5 



GTWA-RCSEGPGLCLARIDLNYLRQLRRHLPVFQHRRPDLY 

64 
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I 1 + 1 I II +1 1 1 1+ + + 1+ + Illllll 

Sbjct: 22 6 GKVLAAPAEEEEGLIIADIDLSRIAEARQKMDFLGHRRPDLY 2 67 

5 The tumor suppressor gene FHIT encompasses the common human chromosomal 

fragile site at 3pl4.2 and numerous cancer cell biallelic deletions. In human and mouse, the 
nitrilase homologs and Fhit are encoded by two different genes: FHIT and NIT1, localized on 
chromosomes 3 and 1 in human, and 14 and 1 in mouse, respectively. 

Bacterial and plant nitrilases are enzymes that cleave nitriles and organic amides to the 

10 corresponding carboxylic acids plus ammonia. The NIT1 gene is expressed as alternatively 

spliced transcripts. The major NIT1 transcript encodes a deduced 327-amino acid protein that 
shares 90% amino acid sequence identity with mouse Nitl, 58% identity with the nitrilase 
domain of C. elegans NitFhit, and 53% identity with the nitrilase domain of Drosophila 
NitFhit. The NIT1 gene spans approximately 3.2 kb and contains 7 exons. Northern blot 

15 analysis detected NIT1 transcripts of approximately 1.4 and 2.4 kb in all adult tissues 

examined, namely heart, brain, lung, liver, pancreas, kidney, skeletal muscle, and placenta. An 
approximately 1.2-kb NIT1 transcript was found in skeletal muscle and heart. 

The loss of Fhit expression in several common human cancers is well documented. 
The disclosed NOV6 nucleic acid of the invention encoding a Nitrilase-l-like protein 

20 includes the nucleic acid whose sequence is provided in Table 6A or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 6A while still encoding a protein that maintains 
its Nitrilase-l-like activities and physiological functions, or a fragment of such a nucleic acid. 
The invention further includes nucleic acids whose sequences are complementary to those just 

25 described, including nucleic acid fragments that are complementary to any of the nucleic acids 
just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 

30 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1 percent of the 
bases may be so changed. 

The disclosed NOV6 protein of the invention includes the Nitrilase- 1 -like protein 

35 whose sequence is provided in Table 6B. The invention also includes a mutant or variant 

protein any of whose residues may be changed from the corresponding residue shown in Table 
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6B while still encoding a protein that maintains its Nitrilase-l-like activities and physiological 
functions, or a functional fragment thereof. In the mutant or variant protein, up to about 1 8 
percent of the residues may be so changed. 

The protein homology information, expression pattern, and map location for the 
5 Nitrilase-l-like protein and nucleic acid (NOV6) disclosed herein suggest that NOV6 may 
have important structural and/or physiological functions characteristic of the Nitrilase-l-like 
family. Therefore, the NOV6 nucleic acids and proteins of the invention are useful in 
potential diagnostic and therapeutic applications. These include serving as a specific or 
selective nucleic acid or protein diagnostic and/or prognostic marker, wherein the presence or 

10 amount of the nucleic acid or the protein are to be assessed, as well as potential therapeutic 
applications such as the following: (i) a protein therapeutic, (ii) a small molecule drug target, 
(iii) an antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) a 
nucleic acid useful in gene therapy (gene delivery/gene ablation), and (v) a composition 
promoting tissue regeneration in vitro and in vivo. 

15 The NOV6 nucleic acids and proteins of the invention are useful in potential diagnostic 

and therapeutic applications implicated in various diseases and disorders described below 
and/or other pathologies. For example, the compositions of the present invention will have 
efficacy for treatment of patients suffering from cancer, muscle conditions, disorders and 
diseases, longevity, and/or other pathologies/disorders. The NOV6 nucleic acid, or fragments 

20 thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 

NOV6 nucleic acids and polypeptides are further useful in the generation of antibodies 
that bind immunospecificaily to the novel substances of the invention for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 

25 art, using prediction from hydrophobic ity charts, as described in the "Anti-NOVX Antibodies" 
section below. For example the disclosed NOV6 protein have multiple hydrophilic regions, 
each of which can be used as an immunogen. This novel protein also has value in 
development of powerful assay system for functional analysis of various human disorders, 
which will help in understanding of pathology of the disease and development of new drug 

30 targets for various disorders. 
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NOV7 

NOV7 includes three novel cleavage signal-1 protein-like proteins disclosed below. 
The disclosed sequences have been named NOV7a, NOV7b, NOV7c, and NOV7d. 

NOV7a 

5 A disclosed NOV7a nucleic acid of 1 822 nucleotides (also referred to as CG566 13-01) 

encoding a novel cleavage signal-1 protein-like protein is shown in Table 7A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 98-100 
and ending with a TAA codon at nucleotides 839-841. A putative untranslated region 
upstream from the initiation codon is underlined in Table 7A. The start and stop codons are in 
10 bold letters. 



Table 7A. NOV7a nucleotide sequence (SEQ ID NO:35). 

GGGGCTGACGCAGCATTGCCAATTCTAAATCCATCATTTGACTGAGGAGGAGAGGTTTGAAGTTGATCAGCT 
CCAGGGTTTGAGAAATTCAGTCCGAA TGGAACTTCAGGACCTGGAACTGCAGCTGGAGGAGCGCCTGCTGGG 
CCTGGAGGAGCAGCTTCGTGCTGTGCGCATGCCTTCACCCTTCCGCTCCTCCGCACTCATGGGAATGTGTGG 
CAGTAGAAGCACTGATAACTTGTCATGCCCTTCTCCATTGAATGTAATGGAACCAGTCACTGAACTGATGCA 
GGAGCAGTCATACCTGAAGTCTGAATTGGGCCTGGGACTTGGAGAAATGGGATTTGAAATTCCTCCTGGAGA 
AAGCTCAGAATCTGTTTTTTCCAAGCAACGATCAGAATCATCTTCTATATGTTCTGGTCCCTCTCATGCTAA 
CAGAAGAACTGGAGTACCTTCTACTGCCTCAGTGGGCAAATCCAAAACCCCATTAGTGGCAAGGAAGAAAGT 
GTTCCGAGCATCGGTGGCTCTAACGCCAACAGCTCCTTCTAGAACAGGCTCTGTGCAGACACCTCCAGATTT 
GGAAAGTTCTGAGGAAGTTGATGCAGCTGAAGGAGCCCCAGAAGTTGTAGGACCTAAATCTGAAGTGGAAGA 
AGGGCATGGAAAACTCCCATCAATGCCAGCTGCTGAGGAAATGCATAAAAATGTGGAGCAAGATGAGTTGCA 
GCAAGTCATACGGGAGATTAAAGAGTCTATTGTTGGGGAAATCAGACGGGAAATTGTAAGTGGACTTTTGGC 
AGCAGTATCTTCAAGTAAAGCGTCTAATTCTAAGCAAGATTATCATTA AACAGAAATTATAGGTTGGCATGG 
ATCCTATTAGCTGTGTAATACTGGAATTATCAATGATATGCACTGGTGGAGGTGTTATTTGTGCTTTAGAAG 
ATACTTGCTGTTGAGCTGGGCTACTGTATACAGTGTACAATGTGTATTTCTTCAACCATATATTTTAAAAAG 
ACGTACATAGAAACTTAGGCACTTTGCTATTTCTTTTCTAAACTATCAAAAACTCTAGCAGTTTGAAAAGCC 
TAATATTTATTTGTATGTCAATATTTTTCATTTGATTCCCTATTAGAATTAATTTTAAAACTTGAAGACTTC 
CAGACTTATCCAACTTATAAATAACATATTTCTTCAGACTAACATCTTAAAACACTGACCTCTATGAGGTAT 
TTACTGTGCAATAACTGATTCATTTTTTTCAGAGCTTGAAGCATCCAATGATTTTTCCCTCCACTGCTGTTA 
ATTAATGTCACTTCCAAGAAGAAAAACTGTTCTGTTGTAAAAAATATAATTGCTCTTAATTCTTGGGGAGGT 
TACTAATAGCAGTAGGATAGAATTTTATGAGGTTACCTACAACTACTTAATGTACTTACACTGTAAGCCTTG 
TTGCTTTACCCAAGACAAATGTAATTTTATCATTGCTTATGTAGTATTTTTCTTTTGGAAATGTGCCTTATG 
TTAAACACTATGTACTTTTACTTTTTGCATTGTCCAGACTTCTTTATTAGATGGAGATGTTTCTTTTTCTGT 
CTTCTAGACTAAATAGAGTATCATCCAAATAATGGGGCCTATGACTTGAATGAATAGAAATGAATAAGCTGG 
TGTTTGTTTTTTCAAAATGGAAGTAATTTAGATTTGTTCTCCTCATACATAAAATGATTTTAGTTCAGTTTT 
AACCAGTGAAAACTTTGTTTTTATGAAAAAAAAGGAAAATGGTTTCCCATTTGGTTTTATATGTGTTAAATA 
AATGTGTAAAGTAACCACCCCC " " " " 



The disclosed NOV7a nucleic acid sequence, localized to chromosome 2, has 1822 of 
1828 bases (99%) identical to a gb:GENBANK-lD:HUMCSlPA|acc:M61 199.1 mRNA from 
15 Homo sapiens (Human cleavage signal 1 protein mRNA, complete cds) (E = 0.0). 

The disclosed NOV7a polypeptide (SEQ ID NO:36) encoded by SEQ ID NO:35 has 

247 amino acid residues and is presented in Table 7B using the one-letter amino acid code. 

Signal P, Psort and/or Hydropathy results predict that NOV7a has a signal peptide and is likely 

to be localized to the cytoplasm with a certainty of 0.6500. Alternatively, NOV7A may also 
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localize to the mitochondrial matrix space with a certainty of 0.1000, or the lysosome (lumen) 
with a certainty of 0.1000. 



Table 7B. Encoded NOV7a protein sequence (SEQ ID NO:36). 

MELQDLELQLEERLLGLEEQLRAVRMPSPFRSSALMGMCGSRSTDNLSCPSPLNVMEPVTELMQEQSYLKSE 
LGLGLGEMGFEIPPGESSESVFSKQRSESSSICSGPSHANRRTGVPSTASVGKSKTPLVARKKVFRASVALT 
PTAPSRTGSVQTPPDLESSEEVDAAEGAPEVVGPKSEVEEGHGKLPSMPAAEEMHKNVEQDELQQVIREIKE 
S I VGE I RRE I VSGLIjAAVS S S KASNS KQD YH 

5 A search of sequence databases reveals that the NOV7a amino acid sequence has 247 

of 249 amino acid residues (99%) identical to, and 247 of 249 amino acid residues (99%) 
similar to, the 249 amino acid residue ptnr:SWISSPROT-ACC:P28290 protein from Homo 
sapiens (Human) (Sperm-Specific Antigen 2 (Cleavage Signal- 1 Protein) (CS-1)) (E = 6.1e~ 

10 NOV7a is predicted to be expressed in at least adrenal gland, bone marrow, brain - 

amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, uterus, Aorta, Ascending 

15 Colon, Bone, Cervix, Cochlea, Colon, Dermis, Gall Bladder, Hypothalamus, Islets of 
Langerhans, Liver, Lung, Lymphoid tissue, Ovary, Parathyroid Gland, Parotid Salivary 
glands, Pineal Gland, Retina, Right Cerebellum, Skin, Tonsils, Umbilical Vein, Vein, Whole 
Organism. . 

20 NOV7b 

In the present invention, the target sequence identified previously, NOV7a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 

25 examined, walking inward from the respective termini toward the coding sequence, until a 

suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, or by translated homology of the predicted exons to closely 

30 related human sequences sequences from other species. These primers were then employed in 
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PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
5 gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting ampl icons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 

10 another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated Accession Number NOV7b (6 aminoacid 
different from NOV7a) and NOV7c (2 aminoacid different from NOV7a). 

A disclosed NOV7b nucleic acid of 806 nucleotides (also referred to as CG566 13-02) 

15 encoding a novel cleavage signal-1 protein-like protein is shown in Table 7C. An open 

reading frame was identified beginning with an ATG initiation codon at nucleotides 21-23 and 
ending with a TAA codon at nucleotides 762-764. A putative untranslated region upstream 
from the initiation codon is underlined in Table 7C. The start and stop codons are in bold 
letters, and the 5' and 3' untranslated regions, if any, are underlined. 

20 

Table 7C. NOV7b nucleotide sequence (SEQ ID NO:37). 

" gtttgagaaattcagtccgaa tggaacttcaggacctggaactgcagctggaggagcgcctgctgggcctgg 
aggagcagcttcgtgctgtgcgcatgccttcacccttccgctcctccgcactcatgggaatgtgtggcagta 
gaagcgctgataacttgtcatgcccttctccattgaatgtaatggaaccagtcactgaactgatgcaggagc 

AGTCATACCTGAAGTCTGAATTGGGCCTGGGACTTGGAGAAATGGGATTTGAAATTCCTCCTGGAGAAAGCT 
CAGAATCTGTTTTTTCCCAAGCAACATCAGAATCATCTTCTGTATGTTCTGGTCCCTCTCATGCTAACAGAA 
GAACTGGAGTACCTTCTACTGTCTCAGTGGGCAAATCCAAAACCCCATTAGTGGCAAGGAAGAAAGTGTTCC 
GAGCATCGGTGGCTCTAACGCCAACAGCTCCTTCTAGAACAGGCTCTGTGCAGACACCTCCAGATTTGGAAA 
GTTCTGAGGAAGTTGATGCAGCTGAAGGAGCCCCAGAAGTTGTAGGACCTAAATCTGAAGTGGAAGAAGGGC 
ATGGAAAACTCCCATCAATGCCAGCTGTTGAGGAAATGCATAAAAATGTGGAGCAAGATGAGTTGCAGCAAG 
TCATACGGGAGATTAAAGAGTCTATTGTTGGGGAAATCAGACGGGAAATTGTAAGTGGACTTTTGGCAGCAG 
TATCTTCAAGTAAAGCGTCTAATTCTAAGCAAGATTATCATTA AACAGAAATTATAGGTTGGCATGGATCCT 
ATTAGCTGTGTAAT 



The disclosed NOV7b nucleic acid sequence, localized to chromosome 2, has 801 of 

812 bases (98%) identical to a gb:GENBANK-ID:HUMCSlPA|acc:M61 199.1 mRNA from 

Homo sapiens (Human cleavage signal 1 protein mRNA, complete cds) (E = 7.6e~ 171 ). 

25 The disclosed NOV7b polypeptide (SEQ ID NO:38) encoded by SEQ ID NO:37 has 

247 amino acid residues and is presented in Table 7D using the one-letter amino acid code. 

Signal P, Psort and/or Hydropathy results predict that NOV7b has no signal peptide and is 
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likely to be localized to the cytoplasm with a certainty of 0.6500. Alternatively, NOV7b may 
also localize to the mitochondrial matrix space with a certainty of 0.1000, or the lysosome 
(lumen) with a certainty of 0.1 000. 



A search of sequence databases reveals that the NOV7b amino acid sequence has 240 
of 249 amino acid residues (96%) identical to, and 242 of 249 amino acid residues (97%) 
similar to, the 249 amino acid residue ptnr:SWISSNEW-ACC:P28290 protein from Homo 
sapiens (Human) (Sperm-Specific Antigen 2 (Cleavage Signal- 1 Protein) (CS-1)) (E = 9.7e~ 
10 121 ). 

NOV7b is predicted to be expressed in at least adrenal gland, bone marrow, brain - 
amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
15 small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, uterus, Aorta, Ascending 
Colon, Bone, Cervix, Cochlea, Colon, Dermis, Gall Bladder, Hypothalamus, Islets of 
Langerhans, Liver, Lung, Lymphoid tissue, Ovary, Parathyroid Gland, Parotid Salivary 
glands, Pineal Gland, Retina, Right Cerebellum, Skin, Tonsils, Umbilical Vein, Vein, Whole 
Organism. . 



A disclosed NOV7c nucleic acid of 806 nucleotides (also referred to as CG566 13-03) 
encoding a novel cleavage signal- 1 protein-like protein is shown in Table 7E. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 21-23 and 
25 ending with a TAA codon at nucleotides 762-764. A putative untranslated region upstream 
from the initiation codon is underlined in Table 7E. The start and stop codons are in bold 
letters, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 7D. Encoded NOV7b protein sequence (SEQ ID NO:38). 



MELQDLELQLEERLLGLEEQLRAVRMPSPFRSSALMGMCGSRSADNLSCPSPLNVMEPVTELMQEQSYLKSE 
LGLGLGEMGFEIPPGESSESVFSQATSESSSVCSGPSHANRRTGVPSTVSVGKSKTPLVARKKVFRASVALT 
PTAPSRTGSVQTPPDLESSEEVBAAEGAPEWGPKSEVEEGHGKLPSMPAVEEMHKNVEQDELQQVIREIKE 
S I VGE I RRE I VSGLLAAVS S S KASNS KQDYH 



5 



20 



NOV7c 
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Table 7E. NOV7c nucleotide sequence (SEQ ID NO:39). 



GTTTGAGAAATTCAGTCCGA ATGGAACTTCAGGACCTGGAACTGCAGCTGGAGGAGCGCCTGCTGGGCCTGG 
AGGAGCAGCTTCGTGCTGTGCGCATGCCTTCACCCTTCCGCTCCTCCGCACTCATGGGAATGTGTGGCAGTA 
GAAGCGCTGATAACTTGTCATGCCCTTCTCCATTGAATGTAATGGAACCAGTCACTGAACTGATGCAGGAGC 
AGTCATACCTGAAGTCTGAATTGGGCCTGGGACTTGGAGAAATGGGATTTGAAATTCCTCCTGGAGAAAGCT 
CAGAATCTGTTTTTTCCCAAGCAACATCAGAATCATCTTCTGTATGTTCTGGTCCCTCTCATGCTAACAGAA 
GAACTGGAGTACCTTCTACTGCCTCAGTGGGCAAATCCAAAACCCCATTAGTGGCAAGGAAGAAAGTGTTCC 
GAGCATCGGTGGCTCTAACGCCAACAGCTCCTTCTAGAACAGGCTCTGTGCAGACACCTCCAGATTTGGAAA 
GTTCTGAGGAAGTTGATGCAGCTGAAGGAGCCCCAGAAGTTGTAGGACCTAAATCTGAAGTGGAAGAAGGGC 
ATGGAAAACTCCCATCAATGCCAGCTGCTGAGGAAATGCATAAAAATGTGGAGCAAGATGAGTTGCAGCAAG 
TCATACGGGAGATTAAAGAGTCTATTGTTGGGGAAATCAGACGGGAAATTGTAAGTGGACTTTTGGCAGCAG 
T AT CT T C AAGTAAAGCGT C TAAT T CTAAG C AAGATTAT C ATT A AAC AGAAAT T ATAGGT TGG CATGG AT C CT 
ATTAGCTGTGTAAT 



The disclosed NOV7c nucleic acid sequence, localized to chromosome 2, has 803 of 
812 bases (98%) identical to a gb:GENBANK-ID:HUMCSlPA|acc:M61 199.1 mRNA from 
Homo sapiens (Human cleavage signal 1 protein mRNA, complete cds) (E = 1.2e~ 171 ). 

The disclosed NOV7c polypeptide (SEQ ID NO:40) encoded by SEQ ID NO:39 has 
247 amino acid residues and is presented in Table 7F using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV7c has no signal peptide and is 
likely to be localized to the cytoplasm with a certainty of 0.6500. Alternatively, NOV7f may 
also localize to the mitochondrial matrix space with a certainty of 0.1000, or the lysosome 
(lumen) with a certainty of 0.1000. 



Table 7F. Encoded NOV7c protein sequence (SEQ ID NO:40). 

MELQDLELQLEERLLGLEEQLRAVRMPSPFRSSALMGMCGSRSADNLSCPSPLNVMEPVTELMQEQSYLKSE 
LGLGLGEMGFEIPPGESSESVFSQATSESSSVCSGPSHANRRTGVPSTASVGKSKTPLVARKKVFRASVALT 
PTAPSRTGSVQTPPDLESSEEVDAAEGAPEWGPKSEVEEGHGKI J PSMPAAEE^4HKNVEQDELQQVIREIKE 
S I VGE I RRE I VS GLLAAVS S S KASNS KQD YH 



A search of sequence databases reveals that the NOV7c amino acid sequence has 242 
of 249 amino acid residues (97%) identical to, and 244 of 249 amino acid residues (97%) 
similar to, the 249 amino acid residue ptnr:SWISSNEW-ACC:P28290 protein from Homo 
sapiens (Human) (Sperm-Specific Antigen 2 (Cleavage Signal-1 Protein) (CS-1)) (E = 1.4e" 



NOV7c is predicted to be expressed in at least adrenal gland, bone marrow, brain - 
amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, uterus, Aorta, Ascending 
Colon, Bone, Cervix, Cochlea, Colon, Dermis, Gall Bladder, Hypothalamus, Islets of 
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Langerhans, Liver, Lung, Lymphoid tissue, Ovary, Parathyroid Gland, Parotid Salivary 
glands, Pineal Gland, Retina, Right Cerebellum, Skin, Tonsils, Umbilical Vein, Vein, Whole 
Organism. . 
NOV7d 

A disclosed NOV7d nucleic acid of 705 nucleotides (also referred to as 174307820) 
encoding a novel cleavage signal- 1 protein-like protein is shown in Table 7G. An open 
reading frame was identified beginning with an AG A initiation codon at nucleotides 1-3 and 
ending with nucleotides 703-705. The start codon is in bold letters, and the 5' and 3' 
untranslated regions, if any, are underlined. Because the start codon is not a traditional 
initiation codon, and there is no stop codon, NOV7d could be a partial open reading frame 
extending further in the 5' and 3' directions. 



Table 7G. NOV7d nucleotide sequence (SEQ ID NO:41). 

AGATCTCCCACCATGGAACTTCAGGACCTGGAACTGCAGCTGGAGGAGCGCCTGCTGGGCCTGGAGGAGCAG 
CTTCGTGCTGTGCGCATGCCTTCACCCTTCCGCTCCTCCGCACTCATGGGAATGTGTGGCAGTAGAAGCGCT 
GATAACTTGTCATGCCCTTCTCCATTGAATGTAATGGAACCAGTCACTGAACTGATGCAGGAGCAGTCATAC 
CTGAAGTCTGAATTGGGCCTGGGACTTGGAGAAATGGGATTTGAAATTCCTCCTGGAGAAAGCTCAGAATCT 
GTTTTTTCCCAAGCAACATCAGAATCATCTTCTGTATGTTCTGGTCCCTCTCATGCTAACAGAAGAACTGGG 
GTACCTTCTACTGCCTCAGTGGGCAAATCCAAAACCCCATTAGTGGCAAGGAAGAAAGTGTTCCGAGCATCG 
GTGGCTCTAACGCCAACAGCTCCTTCTAGAACAGGCTCTGTGCAGACACCTCCAGATTTGGAAAGTTCTGAG 
G AAGTTG ATGCAGCTGAAGGAGC C CCAGAAGTTGT AGG AC CT AAAT CTG AAGTGGAAG AAGGGCATGGAAAA 
CTCCCATCAATGCCAGCTGCTGAGGAAATGCATAAAAATGTGGAGCAAGATGAGTTGCAGCAAGTCATACGG 
GAGATTAAAGAGTCTATTGTTGGGGAAATCAGACGGGAAATTGTAAGTGGACTCGAG 

The disclosed NOV7d polypeptide (SEQ ID NO:42) encoded by SEQ ID NO:41 has 
235 amino acid residues and is presented in Table 7H using the one-letter amino acid code. 

Table 7H. Encoded NOV7d protein sequence (SEQ ID NO:42). 

RSPTMELQDLELQLEERLLGLEEQLRAVRMPSPFRSSALMGMCGSRSADNLSCPSPL^a^MEPVTELMQEQSY 
LKSELGLGLGEMGFEIPPGESSESVFSQATSESSSVCSGPSHANRRTGVPSTASVGKSKTPLVARKKVFRAS 
VAIiT P T AP SRTGS VQT P PDLES S E E VDAAEGAP E WGP KS EVE E GHG KL P SM P AAE EMH KNV EQDE LQQ V I R 
EI KES I VGE I RRE I VSGLE 

NOV7e 

A disclosed NOV7e nucleic acid of 759 nucleotides (also referred to as 174307820) 
encoding a novel cleavage signal- 1 protein-like protein is shown in Table 71. An open reading 
frame was identified beginning with an AG A initiation codon at nucleotides 1-3 and ending 
with nucleotides 757-759. The start codon is in bold letters, and the 5' and 3' untranslated 
regions, if any, are underlined. Because the start codon is not a traditional initiation codon, and 
there is no stop codon, NOV7e could be a partial open reading frame extending further in the 
5' and 3' directions. 
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Table 7L NOV7e nucleotide sequence (SEQ ID NO:323). 

AGATCTCCCACCATGGAACTTCAGGACCTGGAACTGCAGCTGGAGGAGCGCCTGCTGGGCCTGGAGGAGCAG 
CTTCGTGCTGTGCGCATGCCTTCACCCTTCCGCTCCTCCGCACTCATGGGAATGTGTGGCAGTAGAAGCGCT 
GATAACTTGTCATGCCCTTCTCCATTGAATGTAATGGAACCAGTCACTGAACTGATGCAGGAGCAGTCATAC 
CTGAAGTCTGAATTGGGCCTGGGACTTGGAGAAATGGGATTTGAAATTCCTCCTGGAGAAAGCTCAGAATCT 
GTTTTTTCCCAAGCAACATCAGAATCATCTTCTGTATGTTCTGGTCCCTCTCATGCTAACAGAAGAACTGGA 
GTACCTTCTACTGCCTCAGTGGGCAAATCCAAAACCCCATTAGTGGCAAGGAAGAAAGTGTTCCGAGCATCG 
GTGGCTCTAACGCCAACAGCTCCTTCTAGAACAGGCTCTGTGCAGACACCTCCAGATTTGGAAAGTTCTGAG 
GAAGT TG ATG C AG CTG AAGG AGCC C C AG AAGTTGT AGG ACCTAAATC TGAAGTGGAAGAAGGGCATGGAAAA 
CT CC CAT CAATG C C AG CTG CTGAGG AAATGC AT AAAAATGTGG AGCAAGATG AGTTGCAGCAAGTCATACGG 
GAGATTAAAGAGTCTATTGTTGGGGAAATCAGACGGGAAATTGTAAGTGGACTTTTGGCAGCAGTATCTTCA 
AGTAAAG CGTCTAAT T CTAAGCAAG AT TAT CAT CT CGAG 



The disclosed NOV7e polypeptide (SEQ ID NO:324) encoded by SEQ ID NO:323 has 
253 amino acid residues and is presented in Table 7J using the one-letter amino acid code. 



Table 7J. Encoded NOV7e protein sequence (SEQ ID NO:324). 

RSPTMELQDLELQLEERLLGLEEQLRAVRMPSPFRSSALMGMCGSRSADNLSCPSPLNVMEPVTELMQEQSY 
LKSELGLGLGEMGFEIPPGESSESVFSQATSESSSVCSGPSHANRRTGVPSTASVGKSKTPLVARKKVFRAS 
VALTPTAPSRTGSVQTPPDLESSEEVDAAEGAPEWGPKSEVEEGHGKLPSMPAAEEMHKNVEQDELQQVIR 
EIKESI VGE I RRE I VSGLLAAVSSS KASNS KQDYHLE ' 



5 NOV7a also has homology to the amino acid sequence shown in the BLASTP data 

listed in Table 7K. 



Table 7K. BLAST results for NOV7a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 15620913 | dbj | BAB 6 
7820. 1| (AB067514) 


KIAA1927 protein 
[Homo sapiens] 


772 


242/247 
(97%) 


244/247 
(97%) 


e-109 


gi | 16159686 | ref |XP_0 
57458 .1 | (XM__057458) 


sperm specific 
antigen 2 [Homo 
sapiens] 


727 


242/247 
(97% 


244/247 
(97%) 


e-108 


gi | 15277922 |gb| AAH12 
947. 1 | AAH12947 
(BC012947) 


Unknown (protein 
for MGC:21202) 
[Homo sapiens] 


267 


242/247 
(97%) 


244/247 
(97%) 


e-102 


gi | 5803179 | ref |NP_00 
6742. 1| (NM_006751) 


sperm specific 
antigen 2; 
KIAA1927 protein 
[Homo sapiens] 


249 


247/249 
(99%) 


247/249 
(99%) 


e-102 


gi| 18017599 | ref |NP_5 
42125 .1 | (NM_080558) 


sperm specific 
antigen 2 [Mus 
musculus] 


264 


197/248 
(79%) 


212/248 
(85%) 


9e-81 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
10 in Table 7L. 



Table 7L. Information for the ClustalW proteins 

1) NOV7a (SEQ ID NO: 36) 

2) NOV7b (SEQ ID N0:38) 

3) NOV 7c (SEQ ID NO: 40) 
15 4) NOV7d (SEQ ID NO: 42) 

5) NOV7e (SEQ ID NO: 324) 

5) gi | 15620913 | dbj |BAB67820.l| (AB067514) KIAA1927 protein [Homo sapiens] (SEQ ID 
NO:347) 
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6) gi | 16159686 | ref |XP_057458 . 1 | (XM_057458) sperm specific antigen 2 [Homo sapiens] 
(SEQ ID NO: 348) 

7) gi | 15277922 | gb | AAH12947 , 1 | AAH12947 (BC012947) Unknown (protein for MGC:21202) 
[Homo sapiens] (SEQ ID NO: 349) 

8) gi | 5803179 | ref |NP_006742 . 1 | (NM 006751) sperm specific antigen 2; KIAA1927 
protein [Homo sapiens] (SEQ ID NO: 350) 

9) gi | 18017599 | ref |NP_542125 . 1 | (NM_080558) sperm specific antigen 2 [Mus musculus] 
(SEQ ID NO:351) 



10 20 30 40 50 60 

....|....|....|....|....|....|....|....|--..|....|....|..-.| 

NOV7a 1 1 

NOV7b 1 ' 1 

NOV7c 1 1 

NOV7d 1 1 

NOV7e 1 1 

gi | 15620913 | 1 YSHIFLFIDHLLRTASQHSDSSGFAEDSTDCLSLNHLQVQESLQAMGSSADSCDSETTVT 60 

gi 1 16159686 | 1 MGSSADSCDSETTVT 15 

gi | 15277922 | 1 1 

gi | 5803179 | 1 1 

gi | 18017599 | 1 1 

70 80 90 100 110 120 

....|....|....|....|....|....|....|-...|-...|....|....|-...| 

NOV7a 1 1 

NOV7b 1 1 

NOV7C 1 1 

NOV7d 1 1 

NOV7e 1 1 

gi 1 15620913 | 61 SLGEDLATPTAQDQPYFNESEEESLVPLQKGLEK7A7AAV/ADKRKSGSQDFPQCNTIENTGT 120 
gi | 16159686 | 16 SLGEDLATPTAQDQPYFNESEEESLVPLQKGLEKAAAVADKRKSGSQDFPQCNTIENTGT 75 

gi | 15277922 | 1 1 

gi|5803179| 1 1 

gi j 18017599 | 1 1 

130 140 150 160 170 180 

....|....|....|....|....|....|....|....|....|....|....|....| 

NOV7a 1 1 

NOV7b 1 1 

NOV7c 1 1 

NOV7d 1 1 

NOV7e 1 1 

gi | 15620913 | 121 KQSTCSPGDHI IE ITEVEEDLFPAETVELLREASAESDVGKSSESEFTQYTTHHILKSLA 180 

gi | 16159686 | 76 KQSTCSPGDHI IE ITEVEEDLFPAETVELLREASAESDVGKSSESEFTQYTTHHILKSLA 135 

gi|l5277922| 1 1 

gi|5803179| 1 1 

gi 1 18017599 1 1 1 

190 200 210 220 230 240 

....|....|....|....|....|....|....|....|....|....|....|-.-.| 

NOV7a 1 1 

NOV7b 1 1 

NOV7c 1 1 

NOV7d 1 1 

NOV7e 1 1 

gi 1 15620913 | 181 SIEAKCSDMSSENTTGPPSSMDRVNTALQRAQMKVCSLSNQRMGRSLLKSKDLLKQRYLF 240 
gi | 16159686 | 136 SIEAKCSDMSSENTTGPPSSMDRVNTALQRAQMKVCSLSNQRMGRSLLKSKDLLKQRYIiF 195 

gi | 15277922 | 1 1 

gi j 5803179 | 1 1 

gi j 18017599 1 1 1 

250 260 270 280 290 300 

....|....|....|....|....|....|....|..-.|..-.|....|-...|-...| 

NOV7a 1 1 

NOV7b 1 1 

NOV7c 1 1 

NOV7d 1 1 

NOV7e 1 1 
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n i 



gi | 15620913 | 
gi | 16159686 | 
gi | 15277922 j 
gi I 5803179 | 
gi | 18017599 | 



241 AKAGY PLRRSQSLPTTLLS PVRWS S VNVRLS PGKETRCS PPS FTYKYTPEEEQELEKRV 300 
19 6 AKAGY PLRRSQSLPTTLLSPVRVVSS VNVRLS PGKETRCS PPS FTYKYTPEEEQELEKRV 255 



10 



15 



20 



NOV7a 
NOV7b 
N0V7c 
NOV7d 
NOV7© 

gi|l5620913| 
gi|16159686| 
gi|l5277922| 
gi|5803179| 
gij 18017599 | 



310 
|....|.. 



320 



330 



340 



350 



360 
I I 



3 01 MEHDGQSLVKSTIFISPSSVKKEEAPQSEAPRVEECHHGRTPTCSRLAPPPMSQSTCSLH 3 60 
2 56 MEHDGQSLVKSTIFISPSSVKKEEAPQSEAPRVEECHHGRTPTCSRLAPPPMSQSTCSLH 315 



370 



380 



390 



400 



410 



420 



25 



30 



NOV7a 
NOV7b 
NOV7c 
NOV7d 
NOV7e 

gi | 15620913 | 
gi j 16159686 | 
gi |15277922 | 
gi |5803179| 
gi j 18017599 | 



3 61 SIHSEWQERPLCEHTRTLSTHSVPNISGATCSAFASPFGCPYSHRHATYPYRVCSVNPPS 42 0 
3 16 SIHSEWQERPLCEHTRTLSTHSVPNISGATCSAFASPFGCPYSHRHATYPYRVCSVNPPS 375 



35 



40 



45 



50 



55 



60 



65 



70 



NOV7a 
NOV7b 
NOV7c 
NOV7d 
NOV7e 

gi|l5620913| 
gi|l6159686| 
gi | 15277922 j 
gi | 5803179 | 
gi|18017599| 



430 



450 



470 



421 AIEMQLRRVLHDIRNSLQNLSQYPMMRGPDPAAAPYSTQKSSVLPLYENTFQELQVMRRS 4 80 

3 76 AIEMQLRRVLHDIRNSLQNLSQYPMMRGPDPAAAPYSTQKSSVLPLYENTFQELQVMRRS 43 5 

X 1 

X ! 

1 1 



490 



500 



510 



520 



530 



540 

A 



NOV7a 


1 


NOV7b 


1 


NOV7c 


1 


NOV7d 


1 


N0V7e 


1 


gi|15620913| 


481 


gi|l6159686| 


436 


gi |15277922 j 


1 


gi|5803179| 


1 


gi|l8017599| 


1 


NOV7a 


16 


NOV7b 


16 


NOV7c 


16 


N0V7d 


16 


NOV7e 


16 


gi|l5620913 | 


541 


gi|l6159686| 


496 


gi |15277922 | 


36 


gi|5803179| 


16 


gi|l8017599| 


36 





MELQDLELQLEERLL 
MELQDLELQLEERLL 
MELQDLELQLEERLL 
MELQDLELQLEERLL 
MELQDLELQLEERLL 
MELQDLELQLEERLL 
MELQDLELQLEERLL 
MELQDLELQLEERLL 
MELQDLELQLEERLL 
MELQDLELQLEERLL 










LNL FRTQMMDL EL AMLRQQTM VYHHMT EE E RFE VD QL QGLRNS VR 
LNLFRTQMMDLELAMLRQQTMVYHHMTEEERFEVDQLQGLRNSVR 
MTEEERFEVDQLQGLRNSVR 


MTEEDRYEVDQLQGLRNSVR 



15 

15 

15 

15 

15 

540 

495 

35 

15 

35 



550 



560 



570 



580 



590 
. - I I 



600 



gleeqlravrmpspfrssalmgmcgsrsmdnlscpsplnvmepvtelmqeqsylkselgl 
gleeqlravrmpspfrssalmgmcgsrsHdnlscpsplnvmepvtelmqeqsylkselgl 
gleeqlravrmpspfrssalmgmcgsrsBdnlscpsplnvmepvtelmqeqsylkselgl 
gleeqlravrmpspfrssalmgmcgsrsBdnlscpsplnvmepvtelmqeqsylkselgl, 
gleeqlravrmpspfrssalmgmcgsrsBdnlscpsplnvmepvtelmqeqsylkselgl 
gleeqlravrmpspfrssalmgmcgsrsHdnlscpsplnvmepvtelmqeqsylkselgl' 
gleeqlravrmpspfrssalmgmcgsrsBdnlscpsplnvmepvtelmqeqsylkselgl ! 
gleeqlravrmpspfrssalmgmcgsrsHdnlscpsplnvmepvtelmqeqsylkselgl 
gleeqlravrmpspfrssalmgmcgsrsBdnlscpsplnvmepvtelmqeqsylkselgl 
glseqlravrepspfr[3sali3gmcgsrsffldnlscpsplnvmepvtelfseqsylkselgl 



75 



NOV7a 


76 


N0V7b 


76 


NOV7c 


76 


NOV7d 


76 


N0V7e 


76 


gi|15620913 | 


601 


gi|16159686| 


556 


gi ( 15277922 j 


96 


gi j 5803179 | 


76 


gi | 18017599 | 


96 


NOV7a 


135 


NOV7b 


135 


NOV7c 


135 


NOV7d 


135 


NOV7e 


135 


gi | 15620913 | 


660 


gi j 16159686 j 


615 


gi 1 15277922 j 


155 


gi j 5803179 | 


135 


gi | 18017599] 


152 


NOV7a 


193 


NOV7b 


193 


NOV7c 


193 


NOV7d 


193 


NOV7e 


193 


gi| 15620913 | 


718 


gi j 16159686 j 


673 


gi| 15277922| 


213 


gi j 5803179 | 


195 


gi|18017599| 


210 



I 



610 
. . I . . 



620 

Be 



630 



640 



650 



660 



I 



glgemgfeippgessesvfs 
glgemgfeippgessesvfsfj 
glgemgfeippgessesvfs*; 
glgemgfeippgessesvfs^ 
glgemgfe i ppgessesvfs^ 
glgemgfeippgessesvfs^ 
glgemgfe ippgessesvfs^ 
glgemgfeippgessesvfs< 
glgemgfeippgessesvfs? 
glg1mS3eippgessesvfs< 



sesssycsgpshanrrt, 
sesssycsgpshanrrt 
ppgessesvfsqatsesssvcsgpshanrrt 
ppgessesvfsqatsesssvcsgpshanrrt' 
ppgessesvfsqatsesssvcsgpshanrrt! 
ppgessesvfsqatsesssvcsgpshanrrti 

PPGESSESVFSQATSESSSVCSGPSHANRRT, 

ppgessesvfs qat sesssvcsgpshanrrt, 
ppgessesvfs j35s sesssbcsgpshanrrt 
ppgessesvfsqatsesssvcsSpshRnr 




670 



680 



690 



700 



710 



720 



kvfrasvaltptapsrtgsvqtppdlesseevdaaegapewgpkse 
kvfrasvaltptapsrtgsvqtppdlesseevdaaegapewgpkse' 

KVFRASVALTPTAPSRTGSVQTPPDLESSEEVDAAEGAPEWGPKSEi 
KVFRASVALTPTAPSRTGSVQTPPDLESSEEVDAAEGAPEWGPKSE! 
KVFRASVALTPTAPSRTGSVQTPPDLESSEEVDAAEGAPEWGPKSEi 
KVFRASVALTPTAPSRTGSVQTPPDLESSEEVDAAEGAPEWGPKSEj 
KVFRASVALTPTAPSRTGSVQTPPDLESSEEVDAAEGAPEWGPKSE i 
KVFRASVALTPTAPSRTGSVQTPPDLESSEEVDAAEGAPEWGPKSE 
KVFRASVALTPTAPSRTGSVQTPPDLESSEE VDAA EGAPEWGPKSE 1 
KS8FRASVALTPTAPSRTGSVQTPPDLESSEE^HgAEi5lASgVVG^S! 



730 



740 



jAGGggEgSP^ 

750 760 



PAAEEMHKNVEQDELQQVI RE I KES I VGE IRRE I VSGLLAAVS SS KAS 
PAAEEMHKNVEQDELQQVI RE I KES I VGE I RRE I VSGLLAAVS SS KAS 
PAAEEMHKNVEQDELQQVI RE I KE S I VGE I RRE I VSGLLAAVS S S KAS 
PAAEEMHKNVEQDELQQVI RE I KESIVGEIRREIVSGLLAAVSSSKAS 
PAAEEMHKNVEQDELQQV I RE I KES I VGE I RRE I VSGLLAAVS S S KAS 
PAAEEMHKNVEQD ELQQVI RE I KES I VGE 1 RRE I VSGLLAAVS S S KAS 
PAAEEMHKNVEQDELQQVI REIKES I VGE I RRE I VSGLLAAVS SS KAS 
PAAEEMHKNVEQDELQQVI RE I KE S I VGE! RRE I VSGLLAAVS S S KAS 
PAAEEMHKNVEQDELQQVI RE I KE S I VGE I RRE I VSGLLAAVS SSKAS 
P AAEEMhInVEQDELQQVI RE I KE S I VGE I RRE I VSGLLAAVSSS KaT 




pg: 



The cleavage signal-1 protein (CS-1), a doublet antigen comprised of approximately 
14-kDa and 1 8-kDa proteins has been shown to be present on the surface of sperm of various 
mammalian species including humans. Polyclonal antibodies to CS-1 inhibit the early 
cleavage of fertilized eggs without apparently affecting sperm penetration and pronuclear 
formation. The human CS-1 cDNA has been cloned and expressed in vitro to obtain the 
recombinant protein (reCS-1) molecule. The CS-1 cDNA clone has been isolated by 
immunological screening of a human testis lambda gtl 1 cDNA library with mono-specific 
polyclonal antibody against CS-1 . The cDNA is 1 828 bp long; the start codon assigned to the 
first ATG (bp 98-100) encodes a protein with 249 amino acid residues terminating at TAA (bp 
845-847). 

XCS-1 is a maternally expressed gene product that is the Xenopus homologue of the 

human cleavage signal protein (CS-1). XCS-1 may play an important role in regulating mitosis 

during early embryogenesis in Xenopus laevis. XCS-1 transcripts have been detected in 

oocytes. During development the XCS-1 protein has been detected on the membrane and in 
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the nucleus of blastomeres. It has also been detected on the mitotic spindle in mitotic cells and 
on the centrosomes in interphase cells. Overexpression of myc-XCS-1 in Xenopus embryos 
results in abnormal mitoses with increased numbers of centrosomes, multipolar spindles, and 
abnormal distribution of chromosomes. Incomplete cytokinesis resulting in multiple nuclei 
5 residing in the same cytoplasm with the daughter nuclei in different phases of the cell cycle 
has been observed. The phenotype depended on the presence of the N terminus of XCS-1 (aa 
1-73) and a consensus NIMA kinase phosphorylation site (aal59-167). Mutations in this site 
affect the ability of the overexpressed XCS-1 protein to produce the phenotype. 

The disclosed NOV7 nucleic acid of the invention encoding a Cleavage signal- 1 

10 protein-like protein includes the nucleic acid whose sequence is provided in Table 7A, or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 7 A while still encoding a 
protein that maintains its Cleavage signal-1 protein-like activities and physiological functions, 
or a fragment of such a nucleic acid. The invention further includes nucleic acids whose 

1 5 sequences are complementary to those just described, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 

20 derivatized. These modifications are carried out at least in part to enhance the chemical 

stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 1 percent of the bases may be so changed. 

The disclosed NOV7 protein of the invention includes the Cleavage signal-1 protein- 

25 like protein whose sequence is provided in Table 7B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 2 while still encoding a protein that maintains its Cleavage signal- 1 protein-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 21 percent of the residues may be so changed. 

30 The invention further encompasses antibodies and antibody fragments, such as F ab or 

(F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Cleavage signal-1 protein-like 
protein (NOV7) is a member of a "Cleavage signal-1 protein family". Therefore, the NOV7 
nucleic acids and proteins identified here may be useful in potential therapeutic applications 
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implicated in (but not limited to) various pathologies and disorders as indicated below. The 
potential therapeutic applications for this invention include, but are not limited to: protein 
therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
5 delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV7 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in regulation of the cell cycle during early embryogenesis, 
and therefore may have potential application in the management of embryonic defects. 

10 Additionally, this antigen may also be involved in human immunoinfertility and therefore may 
have application in the treatment of infertility, and/or other diseases or pathologies. 

NOV7 nucleic acids and polypeptides are further useful in the generation of antibodies 
that bind immuno-specifically to the novel NOV7 substances for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 

15 art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" 
section below. The disclosed NOV7 protein has multiple hydrophilic regions, each of which 
can be used as an immunogen. These novel proteins can be used in assay systems for 
functional analysis of various human disorders, which will help in understanding of pathology 
of the disease and development of new drug targets for various disorders. 



A disclosed NOV8 nucleic acid of 2838 nucleotides (also referred to as 153472451) 
encoding a novel Matriptase-like protein is shown in Table 8A. An open reading frame was 
identified beginning with an TAG initiation codon at nucleotides 8-10 and ending with a TGA 
25 codon at nucleotides 2279-2281 . The start and stop codons are in bold letters in Table 8A, and 
the 5' and 3' untranslated regions, if any, are underlined. 



GGGGACCATGGGGAGCGATCGGGCCCGCAAGGGCGGAGGGGGCCCGAAGGACTTCGGCGCGGGACTCAAGTA 
CAACTCCCGGCACGAGAAAGTGAATGGCTTGGAGGAAGGCGTGGAGTTCCTGCCAGTCAACAACGTCAAGAA 
GGTGGAAAAGCATGGCCCGGGGCGCTGGGTGGTGCTGGCAGCCGTGCTGATCGGCCTCCTCTTGGTGGAGGA 
GGCCGAGCGCGTCATGGCCGAGGAGCGCGTAGTCATGCTGCCCCCGCGGGCGCGCTCCCTGAAGTCCTTTGT 
GGTCACCTCAGTGGTGGCTTTCCCCACGGACTCCAAAACAGTACAGAGGACCCAGGACAACAGCTGCAGCTT 
TGGCCTGCACGCCCGCGGTGTGGAGCTGATGCGCTTCACCACGCCCGGCTTCCCTGACAGCCCCTACCCCGC 
TCATGCCCGCTGCCAGTGGGCCCTGCGGGGGGACGCCGACTCAGTGCTGAGCCTCACCTTCCGCAGCTTTGA 
CCTTGCGTCCTGCGACGAGCGCGGCAGCGACCTGGTGACGGTGTACAACACCCTGAGCCCCATGGAGCCCCA 
CGCCCTGGTGCAGTTGTGTGGCACCTACCCTCCCTCCTACAACCTGACCTTCCACTCCTCCCAGAACGTCCT 
GCTCATCACACTGATAACCAACACTGAGCGGCGGCATCCCGGCTTTGAGGCCACCTTCTTCCAGCTGCCTAG 



20 



NOV8 



Table 8 A. NOV8 nucleotide sequence (SEQ ID NO:43). 
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GATGAGCAGCTGTGGAGGCCGCTTACGTAAAGCCCAGGGGACATTCAACAGCCCCTACTACCCAGGCCACTA 
CCCACCCAACATTGACTGCACATGGAACATTGAGGTGCCCAACAACCAGCATGTGAAGGTGAGCTTCAAATT 
CTTCTACCTGCTGGAGCCCGGCGTGCCTGCGGGCACCTGCCCCAAGGACTACGTGGAGATCAATGGGGAGAA 
ATACTGCGGAGAGAGGTCCCAGTTCGTCGTCACCAGCAACAGCAACAAGATCACAGTTCGCTTCCACTCAGA 
TCAGTCCTACACCGACACCGGCTTCTTAGCTGAATACCTCTCCTACGACTCCAGTGACCCATGCCCGGGGCA 
GTTCACGTGCCGCACGGGGCGGTGTATCCGGAAGGAGCTGCGCTGTGATGGCTGGGCCGACTGCACCGACCA 
CAGCGATGAGCTCAACTGCAGTTGCGACGCCGGCCACCAGTTCACGTGCAAGAACAAGTTCTGCAAGCCCCT 
CTTCTGGGTCTGCGACAGTGTGAACGACTGCGGAGACAACAGCGACGAGCAGGGGTGCAGTTGTCCGGCCCA 
GACCTTCAGGTGTTCCAATGGGAAGTGCCTCTCGAAAAGCCAGCAGTGCAATGGGAAGGACGACTGTGGGGA 
CGGGTCCGACGAGGCCTCCTGCCCCAAGGTGAACGTCGTCACTTGTACCAAACACACCTACCGCTGCCTCAA 
TGGGCTCTGCTTGAGCAAGGGCAACCCTGAGTGTGACGGGAAGGAGGACTGTAGCGACGGCTCAGATGAGAA 
GGACTGCGACTGTGGGCTGCGGTCATTCACGAGACAGGCTCGTGTTGTTGGGGGCACGGATGCGGATGAGGG 
CGAGTGGCCCTGGCAGGTAAGCCTGCATGCTCTGGGCCAGGGCCACATCTGCGGTGCTTCCCTCATCTCTCC 
CAACTGGCTGGTCTCTGCCGCACACTGCTACATCGATGACAGAGGATTCAGGTACTCAGACCCCACGCAGTG 
GACGGCCTTCCTGGGCTTGCACGACCAGAGCCAGCGCAGCGCCCCTGGGGTGCAGGAGCGCAGGCTCAAGCG 
CATCATCTCCCACCCCTTCTTCAATGACTTCACCTTCGACTATGACATCGCGCTGCTGGAGCTGGAGAAACC 
GGCAGAGTACAGCTCCATGGTGCGGCCCATCTGCCTGCCGGACGCCTCCCATGTCTTCCCTGCCGGCAAGGC 
CATCTGGGTCACGGGCTGGGGACACACCCAGTATGGAGGCACTGGCGCGCTGATCCTGCAAAAGGGTGAGAT 
CCGCGTCATCAACCAGACCACCTGCGAGAACCTCCTGCCGCAGCAGATCACGCCGCGCATGATGTGCGTGGG 
CTTCCTCAGCGGCGGCGTGGACTCCTGCCAGGGTGATTCCGGGGGACCCCTGTCCAGCGTGGAGGCGGATGG 
GCGGATCTTCCAGGCCGGTGTGGTGAGCTGGGGAGACGGCTGCGCTCAGAGGAACAAGCCAGGCGTGTACAC 
AAGGCTCCCTCTGTTTCGGGACTGGATCAAAGAGAACACTGGGGTATAG GGGCCGGGGCCACCCAA ATGTGT 
ACACCTGCGGGGCCACCCATCGTCCACCCCAGTGTGCACGCCTGCAGGCTGGAGACTGGACCGCTGACTGCA 
CCAGCGCCCCCAGAACATACACTGTGAACTCAATCTCCAGGGCTCCAAATCTGCCTAGAAAACCTCTCGCTT 
CCTCAGCCTCCAAAGTGGAGCTGGGAGGTAGAAGGGGAGGACACTGGTGGTTCTACTGACCCAACTGGGGGC 
AAAGGTTTGAAGACACAGCCTCCCCCGCCAGCCCCAAGCTGGGCCGAGGCGCGTTTGTGTATATCTGCCTCC 
CCTGTCTGTAAGGAGCAGCGGGAACGGAGCTTCGGAGCCTCCTCAGTGAAGGTGGTGGGGCTGCCGGATCTG 
GGCTGTGGGGCCCTTGGGCCACGCTCTTGAGGAAGCCCAGGCTCGGAGGACCCTGGAAAACAGACGGGTCTG 
AGACTGAAATTGTTTTACCAGCTCCCAGGGTGGACTTCAGTGTGTGTATTTGTGTAAATGGGTAAAACAATT 
TATTTCTTTTTAAAAAAAAAAAAAAAAAAA 



The disclosed NOV8 nucleic acid sequence has 2644 of 2678 bases (98%) identical to 
a gb:GENB ANK-1D: AF 1 18224|acc:AFl 18224.2 mRNA from Homo sapiens (matriptase 
mRNA, complete cds) (E = 0.0). 
5 The disclosed NOV8 polypeptide (SEQ ID NO:44) encoded by SEQ ID NO:43 has 

757 amino acid residues is presented in Table 8B using the one-letter amino acid code. Signal 
P, Psort and/or Hydropathy results predict that NOV8 has a signal peptide and is likely to be 
localized in the plasma membrane^with a certainty of 0.81 10. Alternatively, NOV8 is 
predicted to be localized to the Golgi body with a certainty of 0.3000, to the endoplasmic 
10 reticulum (membrane) with a certainty of 0.2000, or to the microbody (peroxisome) with a 
certainty of 0.1 527. The most likely ceavage site for NOV8 is between positions 8 and 9, 
ARK-GG. 



Table 8B. Encoded NOV8 protein sequence (SEQ ID NO:44). 

MGSDRARKGGGGPKDFGAGLKYNSI^EKWGLEEGVEFLPVNNVKKVEKHGPGRWWLAAVLIGLLiLVEEAE 
RVMAEERWMLPPRARSLKSFWTSWAFPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPAHA 
RCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPSYNLTFHSSQNVLLI 
TLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQHVKVSFKFFY 
LLEPGVPAGTCPKDYVE INGE KYCGERSQFWTSNSNKITVRFHSDQSYTDTGFLAEYLS YDS SDPCPGQFT 
CRTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTF 
RCSNGKCLSKSQQCNGKDDCGDGSDEASCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDC 
DCGLRSFTRQARWGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYIDDRGFRYSDPTQWTA 
FLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDASHVFPAGKAIW 
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VTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRI 
FQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV 

A BLASTX of NOV 8 shows that it has 699 of 729 amino acid residues (95%) identical 
to, and 702 of 729 amino acid residues (96%) similar to, the 855 amino acid residue 
ptnr:SPTREMBL-ACC:Q9Y5Y6 protein from Homo sapiens (Human) (Matriptase) (E = 0.0). 
5 NOV8 is predicted to be expressed in at least the following tissues: Adrenal 

Gland/Suprarenal gland, Aorta, Ascending Colon, Bone Marrow, Brain, Bronchus, Cartilage, 
Colon, Duodenum, Gall Bladder, Heart, Islets of Langerhans, Kidney, Kidney Cortex, Lung, 
Mammary gland/Breast, Ovary, Pancreas, Parathyroid Gland, Parotid Salivary glands, 
Peripheral Blood, Pituitary Gland, Placenta, Prostate, Small Intestine, Stomach, 
10 Thymus,Thyroid, Tonsils, Uterus, Vulva, Whole Organism. 

In addition, NOV8 is predicted to be expressed in breast cancer, according to NOV8 
nucleic acids, polypeptides, and antibodies.Accordingly to the invention will have diagnostic 
and therapeutic applications for the detection of breast cancer. 

The disclosed NOV8 polypeptide has homology to the amino acid sequences shown in 
1 5 the BLASTP data listed in Table 8C. 



Table 8C. BLAST results for NOV8 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%> 


Positives 
(%) 


Expect 


gi 1 102 573 9 0 |gb| AAG1 
5395 . 1 | AF057145_1 
(AF057145) 


serine protease 
TADG15 [Homo 
sapiens] 


855 


691/691 
(100%) 


691/691 
(100%) 


0 . 0 


gi | 11415040 | ref |NP_ 
068813. 1( 
(NM_021978) 


suppression of 
tumorigenicity 14 

(colon carcinoma, 
matriptase , 
epithin) ; 
suppression of 
tumorigenicity 14 

(colon 
carcinoma) ; 
matriptase [Homo 
sapiens] 


855 


690/691 
(99%) 


690/691 
(99%) 


0.0 


gi | 1224 9015 | db j | BAB 
20376. l| (AB030036) 


prostamin [Homo 
sapiens] 


855 


689/691 
(99%) 


689/691 
(99%) 


0 . 0 


gi | 7363445 | ref |NP_0 
35306. 2 | 
(NM 011176) 


protease, serine, 
14 (epithin) [Mus 
musculus] 


855 


573/691 
(82%) 


633/691 
(90%) 


0.0 


gi | 16758444 | ref |NP_ 
446087. l| 
(NM_053635) 


suppression of 
tumorigenicity 14 
(colon carcinoma, 
matriptase, 
epithin) [Rattus 
norvegicus] 


855 


571/691 
(82%) 


632/691 
(90%) 


0.0 
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The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 8D. In the ClustalW alignment of the NOV8 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence {i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 8D. ClustalW Analysis of NOV8 



1) Novel NOV8 (SEQ ID NO: 44) 

2) gi | 10257390 | gb | AAG153 95 . 1 |AF057145_1 (AF057145) serine protease TADG15 [Homo 
sapiens] (SEQ ID NO: 3 52) 

3) gi | 11415040 | ref |NP_068813 . 1 | (NM_021978) suppression of tumorigenici ty 14 (colon 
carcinoma, matriptase, epithin) ; suppression of tumor igeni city 14 (colon carcinoma) ; 
matriptase [Hojno sapiens] (SEQ ID NO: 353) 

4) gi 1 1224 9 015 | dbj | BAB2037 6 . 1 1 (AB030036) prostamin [Homo sapiens] (SEQ ID NO.-354) 

5) gi J 7363445 |ref | NP_0353 06 . 2 | (NM_011176) protease, serine, 14 (epithin) [Mas 
musculus] (SEQ ID NO: 3 55) 

6) gi | 16758444 | ref |NP_446087 . 1 | 
carcinoma, matriptase, epithin) 



<NM_053635) suppression of tumorigenici ty 14 
[Rattus norvegicus] (SEQ ID NO: 3 56) 



{colon 



NOV8 

gi| 10257390 | 
gij 11415040 | 
gij 12249015 | 
gi|7363445 | 
gi|l6758444| 



NOV8 

gi 1 10257390 | 
gi j 11415040 | 
gij 12249015 | 
gi j 7363445 | 
gi 1 16758444 | 



NOV8 


61 




gi 


10257390 | 


61 




gi 


11415040 | 


61 




gi 


12249015 j 


61 




gi 


7363445| 


61 


ftjFS 


gi 


16758444| 


61 


f^FS 



NOV8 


66 


gi 


10257390 | 


121 


gi 


11415040 j 


121 


gi 


12249015 j 


121 


gi 


7363445 | 


121 


gi 


16758444 | 


121 



83 

181 

181 

181 

181 

181 




250 



260 



270 ■ 
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280 



290 



300 




NOV8 

gi|l0257390| 
gi | 11415040 j 
gi | 12249015 j 
gi (7363445 | 
gi 1 16758444 | 




10 



15 



20 



25 



30 



35 



40 



45 



50 



310 
. . I . . 



320 



330 



350 



360 



NOV8 


203 


gi|l0257390 | 


301 


gX j 114X3040 | 


3 01 


gi j 12249015 j 


301 


gi)7363445 | 


301 


gi |16758444 | 


301 


NOV8 


263 


gi | 10257390 | 


361 


gi j 11415040 | 


361 


gi | 12249015 | 


361 


gi |7363445 | 


361 


gi |16758444 | 


361 


NOV8 


323 


gi | 10257390 | 


421 


gi 1 11415040 j 


421 


gij 12249015] 


421 


gi|7363445 | 


421 


gi|16758444| 


421 


NOV8 


383 


gi| 10257390 | 


481 


gij 11415040 j 


481 


gij 12249015 j 


481 


gi|7363445 | 


481 


gij 16758444 | 


481 



YNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHY 
YNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHY 
YNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHY 
YNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRL RKA QGTFNSPYYPGHY 
YNLTFfflsSQNVgL|TLITNTgRRHPGFEATFFQLP|MSSCGGraL^gQGTF|SPYYPGHY 
YNL T FfflS S QNvBlST L I TNTilsRRHPG FE AT F FQ L P&M S S CGGHLSIaOGT F*S P Y Y P GH Y 



ITNTlRRHPGFEATFFQLPgMSSCGG |LgggQGTF|S PYYPGH^ 
ITNtSrRHPGFEATFFQLpSmSSCGGIL SgAQGTF ! S PYYPGH'V 



262 
360 
360 
360 
360 
360 



410 



420 



NOV8 


443 z 


gi 1 10257390 | 


541 s 


gi j 11415040 j 


541 i 


gi 1 12249015 j 


541 : 


gi|7363445 | 


541 j 


gij 16758444 | 


541 2 




55 



60 



NOV8 

gi|l0257390| 
gij 11415040 | 
gi|12249015 j 
gi|7363445 | 
gij 16758444 | 



610 



620 



630 



I 



640 
I 



650 



660 
I 



503 
601 
601 
601 
601 
601 K 



DCDCGLRSFTRQARWGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYID 

dcdcglrsftrqarwggtdadegewpwqvslhalgqghicgaslispnwlvsaahcyid 
dcdcglrsftrqarwggtdadegewpwqvslhalgqghicgaslispnwlvsaahcyid 
dcdcglrsftrqarwggtdadegewpwqvslhalgqghicgaslispnwlvsaahcyid 



CDCGLRS FT0QAR WGGTglADEGE WPWQVSLHALGQGHgCGASL I S PjgWLVS AAHC|5Bd 
CDCGLRS FtBqAR WGGtSaDEGEWP WQ VS LHALGQGHiCGASL I S pSwL VS AAHC| | D 



562 
660 
660 
660 
660 
660 



65 



70 



NOV8 


563 


gi 


10257390 | 


661 


gi 


11415040 j 


661 


gi 


12249015 j 


661 


gi 


7363445 | 


661 


gi 


16758444 | 


661 



670 



680 



690 



700 



710 



720 



DRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKP 

drgfrysdptqwtaflglhdqsqrsapgvqerrlkriishpffndftfdydiallelekp 
drgfrysdptqwtaflglhdqsqrsapgvqerrlkriishpffndftfdydiallelekp 
drgfrysdptqwt3flglhdqsqrsapgvqerrlkriishpffndftfdydiallelekp 

D SFgYSDBTfflwTAFLGLgDQsSRSAHGVQE^LKRIl3HP@FNDFTFDYDIALLELEKp 
FSYSDBTfflwTAFLGLfflDQsiRSABGVQEffiLKRI I«(HpBfNDFTFDYDIALLELEKP 



82 



622 
720 
720 
720 
720 
720 




780 



NOV8 

gi |10257390 | 
gi j 11415040 j 
gi [12249015 j 
gi |7363445 | 
gi 1 16758444 | 




10 



15 



NOV8 


683 


gi 


10257390] 


781 


gi 


11415040 | 


781 


gi 


12249015 j 


781 


gi 


7363445 | 


781 


gi 


16758444 | 


781 



790 



800 



810 



820 



830 



840 
•-I 



pqqitprmmcvgflsggvdscqgdsggplssveadgrifqagvvswgdgcaqrnkpgvyt 
pqqitprmmcvgflsggvdscqgdsggplssveadgrifqagwswgdgcaqrnkpgvyt 
pqqitprmmcvgflsggvdscqgdsggplssveadgrifqagvvswgdgcaqrnkpgvyt 
p qq i t p rmmc vg flsgg vd s c qgdsgg p l s s ve adgr i fqag v vs wgdgc aqrnk p g v y t 
pqqitprmmcvgflsggvdscqgdsggplssSeBdgr^fqagvvswg||gcaqrnkpgvyt 
pqqitprmmcvgflsggvdscqgdsggplssveHdgrifqagvvswgSgcaqrnkpgvyt 



20 



25 



850 



NOV8 


743 


gi | 10257390 | 


841 


gi | 11415040 | 


841 


gi | 12249015 j 


841 


gi (7363445 ) 


841 


gi j 16758444 | 


841 




Tables 8E-8R list the domain descriptions from DOMAIN analysis results against 
30 NOV8. This indicates that the NOV8 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 8E. Domain Analysis of NOV8 

gnl | Smart | smart 0 0 020 , Tryp_SPc, Trypsin- like serine protease; Many of 
these are synthesised as inactive precursor zymogens that are cleaved 
during limited proteolysis to generate their active forms. A few, 
however, are active as single chain molecules, and others are inactive 
due to substitutions of the catalytic triad residues. (SEQ ID 
NO: 8 04) 

CD-Length = 23 0 residues, 100.0% aligned 
Score = 259 bits (662) , Expect = 4e-70 



35 I I I II Mill I+++IIII I 1 + 

R I VGG S E AN I G S F P WQ VS LQ Y RGGRH FCGGS LIS PRW VLT AAHC VYGSAPSSIR 54 

AFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLP 63 5 

II III I I ++I II +1 l + l I I I I l + l + l I MIMII 

40 Sbjct: 55 VRLGSHDLS - - SGEETQTVKVSKVIVHPNYNPSTYDNDIALLKLSEPVTLSDTVRPICLP 112 



45 



NOV 8: 


516 


Sbjct : 


1 


NOV 8: 


576 


Sbjct ; 


55 


NOV 8: 


636 


Sb j ct : 


113 


NOV 8: 


693 


Sb j Ct : 


173 



+ + III kill I 1+ 11+ + +++ II II h 

;SGYNVPAGTTCTVSGWGRTSESSGSLPDTLQEVNVPIVSNATCRRAYSGGPAITDNML( 
'GFLSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWG-IXSCAQRNKPGVYTRLPLFRDW: 

I I II MINIMI! I I I Mill Ilk milllk < II 
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Table 8F. Domain Analysis of NOV8 

gnl | Pf am| pf am00089 , trypsin, Trypsin. Proteins recognized include all 
proteins in families SI, S2A, S2B, S2C, and S5 in the classification 
of peptidases. Also included are proteins that are clearly members, 
but that lack peptidase activity, such as haptoglobin and protein Z 
(PRTZ*) . (SEQ ID NO: 805) 

CD-Length = 217 residues, 100.0% aligned 
Score = 201 bits (510) , Expect = 2e-52 



5 



NOV 8 : 


517 


WGGTDADEGE W P WQ VS LHALGQGH I CGAS L I S PNWL VS AAH C Y I DDR GFR YSDPT QWT A 


576 






+ iii +t i +111111 + ii ii mi 11+++1111 i + 




Sbjct : 


1 


IVGGREAQAGSFPWQVSLQ-VSSGHFCGGSLISENWVLTAAHCV SGASSVRV 


51 


NOV 8 : 


577 


FLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPD 


636 






II 1+ 1+ +1+11 II +1 1 IIIII+I+ 1 IMIIII 




Sbjct : 


52 


VLGEHNLGTTEG- TEQKFDVKKI I VHPNYNPDT - - NDI ALLKLKS PVTLGDTVRPI CLPS 


108 


NOV 8 : 


637 


ASHVFPAGKAI WVTGWGHTQYGGTGALILQKGE I RVINQTTCENLLPQQI TPRMMCVGFL 


696 






II II l+lll 1+ II + 11+ + ++++ II + +11+111 




Sbjct : 


109 


ASSDLPVGTTCSVSGWGRTKNLGT-SDTLQEVWPIVSRETCRSAYGGTVTDTMICAGAL 


167 


NOV 8 : 


697 


SGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWI 751 




Sbjct : 


168 


II l + MIMMII l + llll Ml 1 IIIIII+ + Ml 

- GGKDACQGDSGGPL VCSDGELVGI VSWGYGCAVGNYPGVYTRVSRYLDWI 217 





Table 8G. Domain Analysis of NOV8 

gnl|Pfam|pfam00431, CUB, CUB domain (SEQ ID NO: 80 6) 
CD-Length = 110 residues, 100.0% aligned 
Score = 99.0 bits (245), Expect = 9e-22 



NOV 8: 242 CGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQHVKVSFKFFYLLEPGVPAGTCPK 301 

20 III I ++ 1+ +11 M MM +1 I I I I+++I+ I I I 

Sbjct: 1 CGGVLTE S S GS I S S PN YPND Y P PNKE C VWT I RAP PG YR VELT FQD FDL EDHTGCRY 56 

NOV 8: 302 DYVEI NGEKYCGERSQFWTSNSNKITVRFHSDQSYTDTGFLAEY 346 

Mill l + M + I + II++I++I 111+ MM 

25 Sbjct: 57 DYVEIRDGDGSSSPLLGKFCGSGPPEDIVSSSNRMTIKFVSDASVSKRGFKATY 110 



Table 8H. Domain Analysis of NOV8 

gnl|Pfam|pfam0043l, CUB, CUB domain (SEQ ID NO: 806) 
CD-Length = 110 residues, 90.9% aligned 
Score = 62.4 bits (150), Expect = 9e-ll 



NOV 8: 12 9 RFTTPGFPDSPYPAHARCQWALRGDADSVLSLTFRSFDLASCDERGSDIjVTVYNTLiSPME 18 8 

+ +I +1+ II + I I +1 + IM+ III M + + 

30 Sbjct: 11 SISSPNYPN-DYPPNKECVWTIRAPPGYRVELTFQDFDLEDHTGCRYDYVEIRDGDGSSS 69 

NOV 8: 189 PHALVQLCGTYPPSYNLTFHSSQNVLLITLITNTERRHPGFEATF 233 

I I + 11+ II M I + I +++ M + II + 

Sbjct: 70 PL-LGKFCGSGPP EDIVSSSNRMTIKFVSDASVSKRGFKATY 110 

35 
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Table 81. Domain Analysis of NOV8 

gnl | Smart | smart 0 0 042 , CUB, Domain first found in Clr, Cls, uEGF, and 
bone morphogenetic protein.; This domain is found mostly among 
developmental ly- regulated proteins. Spermadhesins contain only this 
domain. (SEQ ID NO: 8 07) 

CD-Length = 114 residues, 99.1% aligned 
Score = 97.4 bits (241), Expect - 3e-21 



NOV 8: 242 CGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQHVKVSFKFFYLLEPGVPAGTCPK 3 01 

III I + II II II II I++I II I +++ I I I - I 

Sbjct: 1 CGGTLTAS SGT ITS PNYPNS Y PNNLNCVWT I SAP PGYRI ELKFTDFDLE SSDNCTY 5 6 

NOV 8: 302 DYVEI-NGE KYCG-ERSQFWTSNSNKITVRFHSDQSYTDTGFLAEYLS 348 

Mill +1 + +II I +++I+II HI I II I II I I + 

Sbjct: 57 DYVEIYDGPSTSSPLLGRFCGSELPPPIISSSSNSMTVTFVSDSSVQKRGFSARYSA 113 
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Table 8 J. Domain Analysis of NOV8 



gnl | Smart | smart 0 0 042 , CUB, Domain first found in Clr, Cls, uEGF, and 
bone morphogenetic protein. ; This domain is found mostly among 
developmental ly- regulated proteins. Spermadhesins contain only this 
domain. (SEQ ID NO: 807) 

CD-Length = 114 residues, 89.5% aligned 
Score = 58.5 bits (140), Expect = le-09 
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NOV 8: 12 9 



Sbjct: 11 



NOV 8 : 



189 



Sbjct: 70 



RFTTPGFPDSPYPAHARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPME 

1-1 +|+| II + I I + + II III I I I I + l + I 

T I TSPNYPNS-YPNNLNCWT I SAP PGYRI ELKFTDFDLE SSDNCTYDYVE I YDGPSTSS 



188 



69 



PHALVQLCGTYPPSYNLTFHSSQNVLLITLITNTERRHPGFEATFF 

I I + I |+ I I I I + +1 + + + + + 111 + 

PL-LGRFCGSELP- - PPIISSSSNSMTVTFVSDSSVQKRGFSARYS 



234 



112 



Table 8K. Domain Analysis of NOV8 

gnl | Smart | smart 0 0 192 , LDLa, Low- density lipoprotein receptor domain 
class A; Cysteine-rich repeat in the low-density lipoprotein (LDL) 
receptor that plays a central role in mammalian cholesterol 
metabolism. The N-terminal type A repeats in LDL receptor bind the 
lipoproteins. Other homologous domains occur in related receptors, 
including the very low-density lipoprotein receptor and the LDL 
receptor-related protein/alpha 2 -macroglobulin receptor, and in 
proteins which are functionally unrelated, such as the C9 component of 
complement. Mutations in the LDL receptor gene cause familial 
hypercholesterolemia. (SEQ ID NO: 8 08) 
CD- Length =38 residues, 94.7% aligned 
Score = 58.5 bits (140), Expect - le-09 



20 NOV 8: 427 CPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCP 4 62 

II |+| ||+|+ I |+| Illllllll -II 

Sbjct: 2 CP PGE FQ C KNGRC I P L S W VC DG VD DCGDG S DE ENC P 37 
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Table 8L. Domain Analysis of NOV8 

gnl | Smart | smart 00 192 , LDLa, Low-density lipoprotein receptor domain 
class A; Cysteine-rich repeat in the low-density lipoprotein (LDL) 
receptor that plays a central role in mammalian cholesterol 
metabolism. The N- terminal type A repeats in LDL receptor bind the 
lipoproteins- Other homologous domains occur in related receptors, 
including the very low-density lipoprotein receptor and the LDL 
receptor-related protein/alpha 2-macroglobulin receptor, and in 
proteins which are functionally unrelated, such as the C9 component of 
complement. Mutations in the LDL receptor gene cause familial 
hypercholesterolemia. (SEQ ID NO: 808) 
CD-Length =38 residues, 92.1% aligned 
Score = 52.0 bits (123), Expect = le-07 



NOV 8: 3 56 PGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSC 3 90 

II - I 1+ I I I I III II I III I I 

Sbjct: 4 PGEFQCKNGRCIPLSWVCDGVDDCGDGSDEENCPS 38 

5 



Table 8M. Domain Analysis of NOV8 

gnl | Smart | smart 00192 , LDLa, Low-density lipoprotein receptor domain 
class A; Cysteine-rich repeat in the low-density lipoprotein (LDL) 
receptor that plays a central role in mammalian cholesterol 
metabolism. The N- terminal type A repeats in LDL receptor bind the 
lipoproteins. Other homologous domains occur in related receptors, 
including the very low-density lipoprotein receptor and the LDL 
receptor-related protein/alpha 2-macroglobulin receptor, and in 
proteins which are functionally unrelated, such as the C9 component of 
complement. Mutations in the LDL receptor gene cause familial 
hypercholesterolemia. (SEQ ID NO: 80 8) 
CD-Length = 3 8 residues, 89.5% aligned 
Score = 52.0 bits (123), Expect = le-07 



NOV 8: 3 94 HQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSC 427 

+1 III I II MM Mill III- I 
Sbjct: 5 GEFQCKNGRCIPLSWVCDGVDDCGDGSDEENCPS 3 8 

10 



Table 8N. Domain Analysis of NOV8 

gnl ( Smart | smart00192, LDLa, Low-density lipoprotein receptor domain 
class A; Cysteine-rich repeat in the low-density lipoprotein (LDL) 
receptor that plays a central role in mammalian cholesterol 
metabolism. The N-terminal type A repeats in LDL receptor bind the 
lipoproteins. Other homologous domains occur in related receptors, 
including the very low-density lipoprotein receptor and the LDL 
receptor-related protein/alpha 2-macroglobulin receptor, and in 
proteins which are functionally unrelated, such as the C9 component of 
complement. Mutations in the LDL receptor gene cause familial 
hypercholesterolemia. (SEQ ID NO: 808) 
CD-Length =38 residues, 94.7% aligned 
Score = 45.1 bits (105), Expect = le-05 



NOV 8: 4 68 TCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDC 5 04 

II ++| II |+ III +11 IIHI++I 

Sbjct: 1 TCPPGEFQCKNGRC I PLS WV - CDGVDDCGDGS DEENC 3 6 

15 
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Table 80. Domain Analysis of NOV8 

gnl | Pf am | pf am00 0 57 , ldl_recept_a , Low-density lipoprotein receptor 
domain class A (SEQ ID NO: 809) 
CD-Length = 3 9 residues, 92.3% aligned 
Score = 53.1 bits (126), Expect = 5e-08 



NOV 8: 42 7 CPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCP 4 62 

i 1+1 i i+i ii inn +i 

Sbjct: 3 CGPNEFQCGSGECIPMSWVCDGDPDCEDGSDEKNCA 3 8 

5 



Table 8P. Domain Analysis of NOV8 

gnl | Pf am ] pf am0 0 0 57 , ldl_recept_a , Low-density lipoprotein receptor 
domain class A (SEQ ID NO: 809) 
CD-Length =39 residues, 8 7.2% aligned 
Score = 47.4 bits (111), Expect = 3e-06 



NOV 8: 3 56 PGQFTCRTGRC I RKELRCDGWADCTDHSDELNCS 3 89 

I -I I -I II III II I III II- 

Sbjct: 5 PNE FQCGSGE C I PMS W VCDGDPDCEDG S DE KNC A 3 8 

10 



Table 8Q. Domain Analysis of NOV8 

gnl | Pfam|pfam00057, ldl_recept_a, Low-density lipoprotein receptor 
domain class A (SEQ ID NO: 809) 
CD-Length = 3 9 residues, 84.6% aligned 
Score = 44.3 bits (103), Expect = 3e-05 



NOV 8: 3 94 HQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCS 426 

++i i + i i- mi n i in- i + 

Sbjct: 6 NEFQCGSGECIPMSWVCDGDPDCEDGSDEKNCA 3 8 

15 



Table 8R. Domain Analysis of NOV8 

gnl | Pfam|pfam00057, ldl_recept_a , Low-density lipoprotein receptor 
domain class A (SEQ ID NO: 809) 
CD-Length =39 residues, 92.3% aligned 
Score - 42.0 bits (97), Expect = le-04 



NOV 8: 46 8 TCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDC 5 04 

II + ++I -I 1+ + III II illlll+l 

Sbjct: 2 TCGPNEFQCGSGECIPM-SWVCDGDPDCEDGSDEKNC 37 

20 

The predicted sequence described here belongs to the leucine-rich repeat protein 
family. It is homologous to insulin like growth factor binding protein (IGFBP) and RP105, 
novel B cell surface molecule. It contains five leucine-rich repeat domains. Leucine-rich 
repeats (LRRs) are relatively short motifs (22-28 residues in length) found in a variety of 
25 cytoplasmic, membrane and extracellular proteins (1) . A common property of this protein 
family involves protein -protein interaction. Other functions of LRR-containing proteins 
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include, for example, binding to enzymes and vascular repair (1) . LRRs form elongated non- 
globular structures and are often flanked by cysteine rich domains. The circulating insulin-like 
growth factors (IGF-I and -II) occur largely as components of a 140kDa protein complex with 
IGF binding protein-3 and the acid-labile subunit (ALS). This ternary complex regulates the 
5 metabolic effects of the serum IGFs by limiting their access to tissue fluids. 

Because of the presence of the Leucine rich repeat domains and the homology to the 
IGFBP and RP105, we anticipate that the novel sequence described here will have useful 
properties and functions similar to these genes. 

The NOV8 nucleic acid and polypeptide contain structural motifs (i.e. leucine rich 

10 repeat domains) that are characteristics of proteins belonging to the leucine-rich repeat protein 
family. Accordingly, the various NOV8 nucleic acids and polypeptides of the invention are 
useful, inter alia, as novel members of this protein family. 

The disclosed NOV8 nucleic acid of the invention encoding a Insulin like growth 
factor binding protein-like protein includes the nucleic acid whose sequence is provided in 

15 Table 8A, or a fragment thereof. The invention also includes a mutant or variant nucleic acid 
any of whose bases may be changed from the corresponding base shown in Table 8A while 
still encoding a protein that maintains its Insulin like growth factor binding protein-like 
activities and physiological functions, or a fragment of such a nucleic acid. The invention 
further includes nucleic acids whose sequences are complementary to those just described, 

20 including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 

25 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acid, up to about 2 percent of the bases may be so changed. 

The disclosed NOV8 protein of the invention includes the Insulin like growth factor 
binding protein-like protein whose sequence is provided in Table 8B. The invention also 

30 includes a mutant or variant protein any of whose residues may be changed from the 

corresponding residue shown in Table 8B while still encoding a protein that maintains its 
Insulin like growth factor binding protein-like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 1 8 percent of the 
residues may be so changed. 
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The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fab)2,that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Insulin like growth factor binding 
protein-like protein (NOV8) is a member of a "Insulin like growth factor binding protein 
5 family". Therefore, the NOV8 nucleic acids and proteins identified here may be useful in 
potential therapeutic applications implicated in (but not limited to) various pathologies and 
disorders as indicated below. The potential therapeutic applications for this invention include, 
but are not limited to: protein therapeutic, small molecule drug target, antibody target 
(therapeutic, diagnostic, drug target ing/cytotoxic antibody), diagnostic and/or prognostic 

10 marker, gene therapy (gene delivery/gene ablation), research tools, tissue regeneration in vivo 
and in vitro of all tissues and cell types composing (but not limited to) those defined here. 

The NOV8 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in diabetes, obesity, Von Hippel-Lindau (VHL) syndrome, 
Alzheimer's disease, stroke, tuberous sclerosis, hypercalceimia, Parkinson's disease, 

15 Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, 
ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, anxiety, pain, 
neuroprotection, cirrhosis, transplantation, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, autoimmume disease, allergies, immunodeficiencies, graft versus 
host disease (GVHD), lymphaedema, and other diseases, disorders and conditions of the like. 

20 NOV8 nucleic acids and polypeptides are further useful in the generation of antibodies 

that bind immuno-specifically to the novel NOV8 substances for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 
art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" 
section below. The disclosed NOV8 protein has multiple hydrophilic regions, each of which 

25 can be used as an immunogen. These novel proteins can be used in assay systems for 

functional analysis of various human disorders, which will help in understanding of pathology 
of the disease and development of new drug targets for various disorders. 



NOV9 

30 NOV9 includes three novel Neuropeptide Y/Peptide YY receptor -like proteins 

disclosed below. The disclosed sequences have been named NOV9a, and NOV9b. 

NOV9a 
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A disclosed NOV9a nucleic acid of 2276 nucleotides (also referred to as CG56554-01) 
encoding a novel Neuropeptide Y/Peptide YY receptor -like protein is shown in Table 9A. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 370- 
372 and ending with a TAA codon at nucleotides 1 549-1 55 1 . A putative untranslated region 
5 upstream from the initiation codon and downstream from the termination codon is underlined 
in Table 9A. The start and stop codons are in bold letters. 



Table 9A. NOV9a nucleotide sequence (SEQ ID NO:45). 

GGCCAGA A GGCGGGGAGCCAGAGGCGGCAGGACCCTAGCGTGGCGCTCCAGCACCCCAGACCQTGGCGGCGC 
CTCGCCTTAGGGAAGAGCAAGGGAAGAACTTTATTTGAACCGCGAACATTTTTTGGTCACTGAGATCGAGTC 
TCCCAGTGCTTTGGCTTCCCGCCTCTTTATCGTGGGTTTGATCCCTGAGCTGCTCTCCTTTCCCGAACCTCC 
CGGGGTGCAGCCTAGAGCCCTCCCGCGCGGCTGACTCCAGAGTAGAGGAAGGGAGGCGGCCTCCGGCTGGTC 
CCCCGAAGCCCTCGCTGCCCCGCAGATGCGGATGGCCAGCCAGTAGCGGGCGGTGGCCCCGCGTCCCGGGAG 
CGCACAGCA ATGCAGGCGCTTAACATTACCCCGGAGCAGTTCTCTCGGCTGCTGCGGGACCACAACCTGACG 
CGGGAGCAGTTCATCGCTCTGTACCGGCTGCGACCGCTCGTCTACACCCCAGAGCTGCCGGGACGCGCCAAG 
CTGGCCCTCGTGCTCACCGGCGTGCTCATCTTCGCCCTGGCGCTCTTTGGCAATGCTCTGGTGTTCTACGTG 
GTGACCCGCAGCAAGGCCATGCGCACCGTCACCAACATCTTTATCTGCTCCTTGGCGCTCAGTGACCTGCTC 
ATCACCTTCTTCTGCATTCCCGTCACCATGATCCAGAACATTTCCGACAACTGGCTGGAGGGTGCTTTCATT 
TGCAAGATGGTGCCATTTGTCCAGTCTACCGCTGTTGTGACAGAAATCCTCACTATGACCTGCATTGCTGTG 
GAAAGGCACCAGGGACTTGTGCATCCTTTTAAAATGAAGTGGCAATACACCAACCGAAGGGCTTTCACAATG 
CTAGGTGTGGTCTGGCTGGTGGCAGTCATCGTAGGATCACCCATGTGGCACGTGCAACAACTTGAGATCAAA 
TATGACTTCCTATATGAAAAGGAACACATCTGCTGCTTAGAAGAGTGGACCAGCCCTGTGCACCAGAAGATC 
TACACCACCTTCATCCTTGTCATCCTCTTCCTCCTGCCTCTTATGGAGAAGAAACGAGCTGTCATTATGATG 
GTGACAGTGGTGGCTCTCTTTGCTGTGTGCTGGGCACCATTCCATGTTGTCCATATGATGATTGAATACAGT 
AATTTTGAAAAGGAATATGATGATGTCACAATCAAGATGATTTTTGCTATCGTGCAAATTATTGGATTTTCC 
AACTCCATCTGTAATCCCATTGTCTATGCATTTATGAATGAAAACTTCAAAAAAAATGTTTTGTCTGCAGTT 
TGTTATTGCATAGTAAATAAAACCTTCTCTCCAGCACAAAGGCATGGAAATTCAGGAATTACAATGATGCGG 
AAG AAAG C AAAGTT T TC C CT C AGAG AG AAT C CAGTGG AGG AAAC C AAAGGAG AAGCATTCAGTGATGG CAAC 
ATTGAAGTCAAATTGTGTGAACAGACAGAGGAGAAGAAAAAGCTCAAACGACATCTTGCTCTCTTTAGGTCT 
GAACTGGCTGAGAATTCTCCTTTAGACAGTGGGCATTAA TTATAACAATATCTTCATAATTAATGCCCTTCA 
GATTGTAACCCAAAGAGAAAATTATTTTGAGCAAAGGTCAAATACTCTTTTTATTCTTAAGATGATGACAAG 
AAGAAAACAAATCATGTTTCCATTAAAAAATGACACGAGGCTAGTCCAAGTGCAGTGATGTTTACAACCAAT 
TGATCACAATCATTTAACAGATTTCTGTGTTCCTTCTCATTCCCACTGCTTCACTTGACTAGCCTTAAAAAA 
GCAACATGGAAGGCCAGGCACGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCTAGACGGGCGGAT 
CACGAGGTCAGGAGATCAAAACCATCCTGGCTAACACGGTGAAACCCCATCTCTGCTAAAAATACAAAAATT 
AGCCGGGCGTGGTGGCGGGCACCTGTAGTCCCAGCTACTTGGGAGCCTCAGGCGGGAGAATGGTGTGAACCC 
GGGAGGCGGAGCTTGCAGTGATCCGAGATCATGCCACTGCACTCCAGCCTGGGCGAAAGAGCGAGACTCCCC 
GTCTCAAAAAAAATTTTTTTGAAAAATTCGTAAACCATACTTTTAAGATTATTTCAGTGGATTTTTAAAAAT 
CTTGTACAGAAATCAGGGTTCTTAGCTAGCAGTTTTTCTCCCACGCAGTCACTGTAATGTGACTATGTATTG 
CTAGATTGAATAAGAAAATAAAATAATATCTTCTTCCTTGAAAA 



In a search of public sequence databases, the NOV9a nucleic acid sequence, localized 
10 to chromosome 4, has 372 of 434 bases (85%) identical to a gb:GENBANK- 

ID:HSA400877|acc:AJ400877.1 mRNA from Homo sapiens (ASCL3 gene, CEGP1 gene, 
CI lorfl4 gene, CI lorfl5 gene, CI lorfl6 gene and CI lorfl7 gene) (E - 2.5e 61 ). 

The disclosed NOV9a polypeptide (SEQ ID NO:46) encoded by SEQ ID NO:45 has 
393 amino acid residues and is presented in Table 9B using the one-letter amino acid code. 
15 Signal P, Psort and/or Hydropathy results predict that NOV9a has no signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV9a may also localize to the Golgi body with a certainty of 0.4000, the endoplasmic 
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reticulum (membrane) with a certainty of 0.3000, or in the microbody (peroxisome) with a 
certainty of 0.3000. The most likely cleavage site for NOV9a is between positions 64 and 65: 
GNA-LV. 



Table 9B. Encoded NOV9a protein sequence (SEQ ID NO:46). 

MQALNITPEQFSRLLRDHNLTREQFI ALYRLRPLVYTPELPGRAKLALVLTGVLI FALALFGNALVFYWTR " 
SKAMRTVTMI FICSLALSDLL ITFFCI PVTM IQNI SDNWLEGAFI CKMVPFVQSTAWTEI LTMTCI AVERH 
QGLVHPFKMKWQYTNRRAFTMLGWWLVAVIVGSPMWHVQQLEIKYDFLYEKEHICCLEEWTSPVHQKIYTT 
FILVILFLLPLMEKKRAVIMMVTWALFAVCWAPFHVVHMMIEYSNFEKEYDDVTIKMrFAIVQIIGFSNSI 
CNPIVYAFMNENFKKNVLSAVCYCIVNKTFSPAQRHGNSGITMMRKKAKFSLRENPVEETKGEAFSDGNIEV 
KLCEQTEEKKKLKRHLALFRSELAENSPLDSGH 

5 ' """" ~ — — ■■ ~ - — - 

A search of sequence databases reveals that the NOV9a amino acid sequence has 63 of 
184 amino acid residues (34%) identical to, and 107 of 184 amino acid residues (58%) similar 
to, the 377 amino acid residue ptnr:SPTREMBL-ACC:073733 protein from Brachydanio 
rerio (Zebrafish) (Zebra danio) (Neuropeptide Y/Peptide YY Receptor YA) (E = 0.0). 
10 NOV9a is predicted to be expressed in at least kidney. This information was derived by 

determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or RACE 
sources. 

In addition, the sequence is predicted to be expressed in lower small intestine, colon, 
15 and pancreas, brain, hypothalamus because of SAGE tags identifed for AI308124 and 

AI307658, ESTs which match to the sequence of the invention: pancreatic cancer, prostate, 
prostate cancer, brain, glioblastoma, astrocytoma, normal human luminar mammary epithelial 
cells, breast cancer, ovary, cystadenoma. The SAGE data is reproduced in Example 5. The 
sequence is also predicted to be expressed in the following tissues because of the expression 
20 pattern of related genes in the Neuropeptide Y/Peptide YY/ Orexin/ Galanin/ Cholecystokinin 
receptor family. 
NOV9b 

In the present invention, the target sequence identified previously, NOV9a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 

25 starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 
examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 

30 silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
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sequence of the target sequence, or by translated homology of the predicted exons to closely 
related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
5 brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spina! cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 

10 other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV9b. This differs from the previously 

15 identified sequence (NOV9a) in having 38 less amino acids and 3 different ones. 

A disclosed NOV9b nucleic acid of 1472 nucleotides (also referred to as CG56554-02) 
encoding a novel Neuropeptide Y/Peptide YY receptor -like protein is shown in Table 9C. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 42- 
44 and ending with a TAA codon at nucleotides 1335-1337. A putative untranslated region 

20 upstream from the initiation codon and downstream from the termination codon is underlined 
in Table 9C. The start and stop codons are in bold letters. 



Table 9C. NOV9b nucleotide sequence (SEQ ID NO:47). 

CAGTAGCGGGCGGTGGCCCCGCGTCCCGGGAGCGCACA GCAATGCAGGCGCTTAACATTACCCCGGAGCAGT 
TCTCTCGGCTGCTGCGGGACCACAACCTGACGCGGGAGCAGTTCATCGCTCTGTACCGGCTGCGACCGCTCG 
TCTACACCCCAGAGCTGCCGGGACGCGCCAAGCTGGCCCTCGTGCTCACCGGCGTGCTCATCTTCGCCCTGG 
CGCTCTTTGGCAATGCTCTGGTGTTCTACGTGGTGACCCGCAGCAAGGCCATGCGCACCGTCACCAACATCT 
TTATCTGCTCCTTGGCGCTCAGTGACCTGCTCATCACCTTCTTCTGCATTCCCGTCACCATGCTCCAGAACA 
TTTCCGACAACTGGCTGGGGGGTGCTTTCATTTGCAAGATGGTGCCATTTGTCCAGTCTACCGCTGTTGTGA 
CAGAAATCCTCACTATGACCTGCATTGCTGTGGAAAGGCACCAGGGACTTGTGCATCCTTTTAAAATGAAGT 
GGCAATACACCAACCGAAGGGCTTTCACAATGCTAGGTGTGGTCTGGCTGGTGGCAGTCATCGTAGGATCAC 
CCATGTGGCACGTGCAACAACTTGAGATCAAATATGACTTCCTATATGAAAAGGAACACATCTGCTGCTTAG 
AAGAGTGGACCAGCCCTGTGCACCAGAAGATCTACACCACCTTCATCCTTGTCATCCTCTTCCTCCTGCCTC 
TTATGGTGATGCTTATTCTGTACAGTAAAATTGGTTATGAACTTTGGATAAAGAAAAGAGTTGGGGATGGTT 
CAGTGCTTCGAACTATTCATGGAAAAGAAATGTCCAAAATAGCCAGGAAGAAGAAACGAGCTGTCATTATGA 
TGGTGACAGTGGTGGCTCTCTTTGCTGTGTGCTGGGCACCATTCCATGTTGTCCATATGATGATTGAATACA 
GTAATTTTGAAAAGGAATATGATGATGTCACAATCAAGATGATTTTTGCTATCGTGCAAATTATTGGATTTT 
CCAACTCCATCTGTAATCCCATTGTCTATGCATTTATGAATGAAAACTTCAAAAAAAATGTTTTGTCTGCAG 
TTTGTTATTGCATAGTAAATAAAACCTTCTCTCCAGCACAAAGGCATGGAAATTCAGGAATTACAATGATGC 
GGAAGAAAGCAAAGTTTTCCCTCAGAGAGAATCCAGTGGAGGAAACCAAAGGAGAAGCATTCAGTGATGGCA 
ACATTGAAGTCAAATTGTGTGAACAGACAGAGGAGAAGAAAAAGCTCAAACGACATCTTGCTCTCTTTAGGT 
CTGAACTGGCTGAGAATTCTCCTTTAGACAGTGGGCATTAA TTATAACAATATCTTCATAATTAATGCCCTT 
CAGATTGTAACCCAAAGAGAAAATTATTTTGAGCAAAGGTCAAATACTCTTTTTATTCTTAAGATGATGACA 
AGAAGAAAACAAATATGTTTCATTAAAAATGA 
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In a search of public sequence databases, the NOV9b nucleic acid sequence, localized 
to chromosome 4, has 403 of 656 bases (61%) identical to a gb:GENBANK- 
ID:AB040103|acc:AB040103.1 mRNA from Rattus norvegicus (Rattus norvegicus OT7T022 
5 mRNA for RFamide-related peptide receptor, complete cds) (E = 7.8e~ 13 ). 

The disclosed NOV9b polypeptide (SEQ ID NO:48) encoded by SEQ ID NO:47 has 
393 amino acid residues and is presented in Table 9D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV9b has no signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
10 NOV9b may also localize to the Golgi body with a certainty of 0.4000, the endoplasmic 

reticulum (membrane) with a certainty of 0.3000, or in the microbody (peroxisome) with a 
certainty of 0.3000. The most likely cleavage site for NOV9b is between positions 64 and 65: 
GNA-LV. 



15 



Table 9D. Encoded NOV9b protein sequence (SEQ ID NO:48). 

MQALNITPEQFSRLLRDHNLTREQFIALYRLRPLWTPELPGRAKIALVLTGVLIFALALFGNALVFYVVTR 
SKAMRTVTNIFICSLALSDLLITFFCIPVTMIQNISDNWLEGAFICKMVPFVQSTAWTEILTMTCIAVERH 
QGLVHPFKMKWQYTNRRAFTMLGVWLVAVIVGSPMWIIVQQLEIKYDFLYEKEHICCLEEWTSPVHQKIYTT 
FILVILFLLPLMEKKRAVir^VTWALFAVCWAPFHVVHMMIEYSNFEKEYDDVTIKMIFAIVQIIGFSNSI 
CNPIVYAFIWENFKKNVLSAVCYCIVNKTFSPAQRHGNSGITMMRKKAKFSLRENPVEETKGEAFSDGNIEV 
KLCEQTEEKKKLKRHLALFRSELAENSPLDSGH 



A search of sequence databases reveals that the NOV9b amino acid sequence has 108 
of 315 amino acid residues (34%) identical to, and 180 of 315 amino acid residues (57%) 
similar to, the 522 amino acid residue ptnr:SWISSNEW-ACC:Q9Y5X5 protein from Homo 
sapiens (Human) (Neuropeptide Ff Receptor 2 (Neuropeptide G Protein-Coupled Receptor) 

20 (G-Protein-Coupled Receptor HLWAR77)) (E = 5. 2c" 46 ). 

NOV9b is predicted to be expressed in at least the following tissues: lower small 
intestine, colon, and pancreas, brain, hypothalamus, kidney, pancreatic cancer, prostate, 
prostate cancer, glioblastoma, astrocytoma, normal human luminar mammary epithelial cells, 
breast cancer, ovary, cystadenoma. . 

25 The disclosed NOV9a polypeptide has homology to the amino acid. sequences shown 

in the BLASTP data listed in Table 9E. 



93 



H - it M 



Table 9E. BLAST results for NOV9a 




Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


rri M >=;fifi "3 4 7 1 crk> 1 AAL2 
64 8 8 . 1 | AF411117_1 
(AF411117) | 


G protein- coupled 
receptor [Homo 
sapiens] 


455 


382/393 
(97%) 


384/393 
(97%) 


0 . 0 


gi | 1302743 8 | ref | NP_ 
076470 . 1 | 
(NM023980) 


neuropeptide FF 
receptor 2 
[Rattus 
norvegicus] 


417 


99/314 
(31%) 


157/314 
(49%) 


J e - 3 1 


gi | 41063 97 | gb | AAD02 
833. l| (AF073925) 


neuropept ide 
Y/peptide YY 
receptor Yb 
[Gadus morhua] 


374 


90/320 
(28%) 


16 9/ JzU 
(52%) 


4e-37 


gi | 4758820 | ref |NP_0 
04876 . 1 | 
(NM_004885) 


neuropeptide G 
protein- coupled 
receptor; 
neuropeptide FF 2 
[Homo sapiens] 


522 


98/317 
(30%) 


159/317 
(49%) 


4e-37 


gi | 13 878604 | sp | Q9Y5 
X5 | NF F 2 HUMAN 


NEUROPEPTIDE FF 
RECEPTOR 2 
(NEUROPEPTIDE G 
PROTEIN- COUPLED 
RECEPTOR) (G- 
PROTEIN- COUPLED 
RECEPTOR HLWAR77) 


522 


98/317 
(30%) 


159/317 
(49%) 


4e-37 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 9F. In the ClustalW alignment of the NOV9 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence {i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



1 o Table 9F. ClustalW Analysis of NO V9 

1) Novel NOV9a (SEQ ID NO: 46) 

1) Novel NOV9b (SEQ ID NO: 48) 

2) gi|l6566347|gb|AAL26488.l|AF411117_l (AF411117) G protein- coupled receptor [Homo 
sapiens] (SEQ ID NO: 3 57) 

15 3) gi | 13027438 | ref | NP_076470 . 1 | (NM_023980) neuropeptide FF receptor 2 [Rattus 

norvegicus] (SEQ ID NO: 3 58) 

4) gi | 4106397 | gb | AAD02833 . 1 | (AF073925) neuropeptide Y/peptide YY receptor Yb 
[Gadus morhua] (SEQ ID NO: 35 9) 

5) gi | 4758820 | ref |NP_004876 . 1 | (NM_004885) neuropeptide G protein- coupled receptor; 
20 neuropeptide FF 2 [Homo sapiens] (SEQ ID NO: 3 60) 

6) gi | 13878604 | sp | Q9Y5X5 | NFF2_HUMAN NEUROPEPTIDE FF RECEPTOR 2 (NEUROPEPTIDE G 
PROTEIN -COUPLED RECEPTOR) (G-PROTEIN-COUPLED RECEPTOR HLWAR77) (SEQ ID NO:361) 



10 20 30 40 50 60 

25 . ... | .... | .... | .... | .... | .... I .... I - - - - I .... I - • - ■ I - - - ■ I - - - • I 

NOV9a l x 

NOV9b 1 1 

gi | 16566347 | 1 MICCSALSPRIHLSFHRSL 19 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



gi | 13027438 | 
gi|4106397| 
gi|4758820| 
gi|l3878604| 



NOV9a 
NOV9b 

gi | 16566347 | 
gi | 13027438 j 
gi | 4106397 | 
gi j 4758820 | 
gi j 13878604 | 



NOV9a 
NOV9b 

gi| 16566347 | 
gi j 13027438 j 
gi | 4106397 | 
gi|4758820| 
gi|l3878604 | 



MNSFFGTPAASWCLLESDVSSAPDKEAGRERRALSVQQRGGPAWSGSLEWSRQSAGDRRR 6 0 
MNSFFGTPAASWCLLESDVSSAPDKEAGRERRALSVQQRGGPAWSGSLEWSRQSAGDRRR 6 0 



120 



I 



70 80 90 100 110 

- - - 1 — I — I — I — I — I — I -a- -L — I — 1 „ 

x M^ALNITPE^FSRLgRQ 

x MgAE.NITPEpFSRLpRD 

2 0 TGIVLANSSLDIVLHDTYYVVAHCGGNVRRLHCGGPASRERTAMpALjSlTPEj^FSRI^y^ 

± MG|SRW§SNSSGSWDHMWS 

■ MEEAFDQFHME 

LGLSRQTAKSSWSRSRDRTCCCRRAWWILVPAADRARRERFIMI^KWDTNSSHNWHP 

lglsrqtaksswsrsrdrtcccrrawwilvpaadrarrerfimnbkw^tnss;enwhp| 



1 

61 
61 



18 

18 

80 

19 

12 

121 

121 



130 

JL — i- 

jREQF I AL YRXiR PIT 
P^EQFlSLYRllRPXf 
^EQFlKLYRIiRPki' 



140 
■ I 

PEL 
PEL 
PEL 

LHQg-HVTAV 

|qi1a||dqeecpssksgttf: 
xty^nHylhcS- pVAAI 
1 T J3? N S YLH QH " 9 VAAI 




150 

GRAKLALVLTGVT 
GRAKLALyL^GvB 

MaklalVSgv| 

Isl 




NOV9a 


78 


TVTNg 


fi 


cs 


LA 




SDLL 


XTF 


NOV9b 


78 


TVTNgj 


fi 


cs 


LA 




SDLL 


XTF 


gi 


16566347 | 


140 


tvtnB 


FI 




LA 




SDLL 


XTF 


gi 


13027438| 


78 


tvtn| 


FI 


fJ 


LA 


I 


SDLL 


VG 


I 


gi 


4106397 | 


70 


JjVTNW 


FI 


SnlI 


C 


SD0L 


MC 


I 


gi 


4758820 j 


180 


tvtn| 


FI 






SDLL 


VG 

* 


I 


gi 


13878604 | 


180 


tvtn| 


FI 




I 


SDLL 


VG 


I 




250 



260 



280 



290 



300 



NOV9a 
NOV9b 

gi | 16566347 | 
gi|l3027438| 
gi | 4106397 | 
gi j 4758820 j 
gi|l3878604| 



NOV9a 
NOV9b 

gi 1 16566347 | 
gi | 13027438 | 
gi |4106397 | 
gi | 4758820 | 
gi j 13878604 | 



NOV9a 
NOV9b 

gi|l6566347| 
gi j 13027438 | 
gi j 4106397 | 
gi|4758820| 
gi|l3878604 | 



NOV9a 
NOV9b 

gi | 16566347 | 




138 
138 
200 
138 
130 
240 
240 



310 

195 EKEHIC 

195 EgEHIC 

257 EKEHIC 

196 SHNKTSTVjX- 
188 FPG§DH--|?. 

sQnktsp 
scJnktsp" 




I 




330 



298 
298 




340 



QiLEIKYDF 

qlSikydf 

QLEIKYDF 
SAIlJjLjgjgEE&YRpE 
FLSFT VXlDN LPLQNpSg 
|SAV^]Lgg^EKYYR^I 
|S AvflL^^EE]|Y YR§" 

350 



360 



....|....|....| 

227 

227 

289 

IV I MHARHgAS fflF^T S AH S T G 2 55 

IAAcHl SDFLRHRRRK - DMVE 244 

JvIM^RUGIsHf'rAAVPHTG 3 57 

SvIM^ROGIsBfMaAVPHTG 357 



370 



380 



390 



400 



I 



227 ME 

227 ME 

289 WK 

256 fQRLEQWHVS-lK|_ 

245 Sards SRDNRApGSRRl 

3 58 pKNQEQWHWsMKjgQKXXj: 

3 58 BKNQEQWHWslKgQKI^- 



|p|Agi3 
IcRk^i 




2KVXP 




410 420 

»■ • -I- ••'•• H l 

I^NFEKEYDD^TIKMjiF| 2 76 

^FEKEYDD^TIKMM^ 276 

JSNFEKEYDDgTIKMpM 33 8 

BISlSPNKIIr^iSiY^ 314 

(4H- -HELM^SCQHNLyI| 3 02 

^"BLSPNEOQ^lSlIYBiS 417 

r)LSPNEilQiINIYai 417 



430 



440 



450 



460 



470 



480 



I 



2 77 AIVQISEGgS^ICj 

2 77 AIVQljfcHsSBlCSii 

3 39 AIVQIgGgS^I 




;KNVLSgvCYC| 

knvlsSvcyq; 

,SgVCYC| 

95 




FSBASRPGNSGITMMRKraA 33 6 
_JCWB ™jdLJEJ 336. 

398 




10 



15 



gi | 13027438 | 
gi|4106397 | 
gi|4758820| 
gi|13878604| 



NOV9a 
NOV9b 

gi 1 16566347 | 
gi|13027438 j 
gi |4106397 | 
gi j 4758820 | 
gi |13878604 | 



337 
337 
399 
367 
349 
472 
472 




;SGFiDgFQF- - CQfgKVKjgQ 

^QLpATLSHC gCWG. 



|gf^eBfqlq| 




490 



500 



510 



520 



530 



I 



KFSpRENPviETlGE. 
KFSflRENPVlETpGE. 

kfsBrenpvIet^ge. 





PASQ|PgG@NLGC: 

hmskgSisihs 

STgofPilGiTLlYR: 




HpALFpSELAEpSPLDSGH 
.HtfALFySELAEllsPLDSGH 
FgSELAEgSPLDSGH 
iSLMEETGEATNSTET- 




SppQlLVlEEI^lETTNSgEI - 
*p|jQ||LV§JEEL§ETTNS§E I - 



393 
393 
455 
417 
374 
522 
522 



20 



Tables 9G-9H list the domain descriptions from DOMAIN analysis results against 
NOV9. This indicates that the NOV9 sequence has properties similar to those of other 
proteins known to contain this domain. 



gnl ( Pf am | pf amOOOOl , 7tm_l, 
family). (SEQ ID NO:810) 

CD-Length = 254 residues, 100.0% aligned 
Score = 146 bits (368) , Expect = 2e-36 



Table 9G Domain Analysis of NOV9 

7 transmembrane receptor (rhodopsin 



25 



30 



35 



40 



NOV 9: 


62 


Sbjct : 


1 


NOV 9: 


122 


Sbjct : 


61 


NOV 9: 


182 


Sbjct : 


121 


NOV 9: 


228 


Sbjct: 


177 


NOV 9: 


274 


Sbjct : 


234 



GNALVF YWTRS KAMRTVTN I F I CS LALS DLL I TF FC I P VTM I QN I S DNWLEGAF I CKMV 

II II |+ | + | +11 t I I 1+ +1 I++I II I + + +1+ I + l M 

GNLLVI LVI LRTKKLRTPTNI FLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 

PFVQSTAWTEILTMTCIAVERHQGLVHPFKMKWQYTNRRAFTMLGVVWLVAVIVGSPMW 

|| +| + +1 I I + + I II I +1 I++I+ + + I 

GALFWNGYAS I LLLTAI S I DRY LAI VH P LR YRR I RT P RRAKVL I LLVWVLALLLS LP PL 

HVQQLE I KYDFLYEKEHI CCLEEWTS PVHQKI YTTF I LVI LFLLPL 

| + | | | ++ ++ I ++ l + l I I 

LFSWLR TVEEGNTTVCLI DFPEESVKRS YVLLSTLVGFVLPLLVI LVCYTRI LRTL 



121 
60 
181 
120 
227 
176 
273 



MEKKP^VIMMVTWALFAVCWAPFHVVHMMIEYSNFEKEYDDVTIK 

+ + +I |++ I I +1 +11 l + M 
RKRARSQRSLKRRSSSERKAAKMLLVVVVVFVLCWLPYHIVLLL DSLCLLSIWRVLP 23 3 

MI FAIVQI IGFSNSI CNPIVY 2 94 

I + + I I I I l + l 
TALL I TLWLAYVNS CLNP I I Y 254 
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Table 9H Domain Analysis of NOV9 

gnl|Pfam|pfam01604, 7tm_5, 7TM chemoreceptor . This large family of 
proteins are related to pfamOOOOl. They are 7 transmembrane receptors. 
This family does not include all known members, as there are problems 
with overlapping specificity with pfamOOOOl. This family is greatly 
expanded in the nematode worm C. elegans . (SEQ ID NO: 811) 
CD-Length = 297 residues, 83.8% aligned 
Score = 38.1 bits (87), Expect = 0.001 



10 



15 



20 



25 



30 



35 



NOV 9: 


55 


Sbjct : 


16 


NOV 9: 


110 


Sbjct : 


76 


NOV 9: 


166 


Sbjct: 


135 


NOV 9: 


226 


Sbjct: 


195 


NOV 9: 


285 


Sbjct : 


247 



I FALALFGNALVFYVVTR - - S KAMRTVTN IFICSLALSDLLITFFCIPVTMIQNISD 109 

| ++| + | | + | |++| I I I I ++I M 

ITIISLPIHI FGFYCI LFKTPKKMKSVKWSLLNLHFWSALLDLYLS FLTI PYLFFPVLAG 7 5 

NWLEGAFI CKMVPFVQSTAWTE I LTMTC I AV E RHQGL VH P F KM KWQ YTNRRAFTM 16 5 

| + +| | | + + I I I 1+ I+ + 

YPLGLLSYLGVPTSIQIYIGVTILGWAVSIILLFENRHNSLVNINN-KFRIWKWIRILY 134 



| + +++I 1+ |++ + + + I l ++ I 



1 + 



+ + + + + + ++ + + + + + 

-LIVILQIIFFVSLIFYYLKILKNSTMSKKTRKLQ- 



I 1+ +1+ 
- KKFFI ALCIQVSI P 24 6 



I I++I 



The NOV9 nucleic acids and polypeptides share structure similarity to members to the 
Neuropeptide Y/Peptide YY/ Orexin/ Galanin/ Cholecystokinin/pancreatic polypeptide 
receptor family Neuropeptide Y (NPY) is one of the most abundant neuropeptides in the 
mammalian nervous system and exhibits a diverse range of important physiologic activities, 
including effects on psychomotor activity, food intake, regulation of central endocrine 
secretion, and potent vasoactive effects on the cardiovascular system. It shows sequence 
homology to peptide YY and over 50% homology in amino acid and nucleotide sequence to 
pancreatic polypeptide. Neuropeptide Y (NPY) signals through a family of G protein-coupled 
receptors present in the brain and sympathetic neurons. At least 3 types of neuropeptide Y 
receptor have been defined on the basis of pharmacologic criteria, tissue distribution, and 
structure of the encoding gene. The NPY Yl receptors have been identified in a variety of 
tissues, including brain, spleen, small intestine, kidney, testis, placenta, and aortic smooth 
muscle. The Y2 receptor is found mainly in the central nervous system. 

Orexin A and Orexin B, are derived from the same precursor, orexin, or hypocretin 
(HCRT), by proteolytic processing. One receptor, designated OX2R, binds both orexin A and 
orexin B. The predicted amino acid sequences of human and rat OX2R are 95% identical and 
contain 7 putative transmembrane domains. The other receptor, designated OX1R (HCRTR1), 
binds orexin A only and has 64% identity to OX2R. Northern blot analysis revealed that in the 
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rat a 3.5-kb OX2R mRNA is expressed exclusively in the brain. When administered 
intracerebroventricularly to rats, orexin A and orexin B stimulated food consumption. In 
addition, preproorexin mRNA levels are upregulated upon fasting, thust these peptides are 
mediators in the central feedback mechanism that regulates feeding behavior. 
5 PYY is secreted from endocrine cells in the lower small intestine, colon, and pancreas. 

It acts through the pancreatic polypeptide receptors in the gastrointestinal tract as an inhibitor 
of gastric acid secretion, gastric emptying, digestive enzyme secretion by the pancreas, and gut 
motility. 

The disclosed NOV9 nucleic acid of the invention encoding a Neuropeptide Y/Peptide 
10 YY receptor -like protein includes the nucleic acid whose sequence is provided in Table 9A or 
a fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 9A while still encoding a 
protein that maintains its Neuropeptide Y/Peptide YY receptor -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
15 nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
20 are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 1 5 percent of the bases may be so 
changed. 

25 The disclosed NOV9 protein of the invention includes the Neuropeptide Y/Peptide YY 

receptor -like protein whose sequence is provided in Table 9B. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 9B while still encoding a protein that maintains its Neuropeptide 
Y/Peptide YY receptor -like activities and physiological functions, or a functional fragment 

30 thereof. In the mutant or variant protein, up to about 70 percent of the residues may be so 
changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fab>2, that bind immunospecifically to any of the proteins of the invention. 
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The above disclosed information suggests that this Neuropeptide Y/Peptide YY 
receptor -like protein (NOV9) is a member of a "Neuropeptide Y/Peptide YY receptor 
family". Therefore, the NOV9 nucleic acids and proteins identified here may be useful in 
potential therapeutic applications implicated in (but not limited to) various pathologies and 
5 disorders as indicated below. The potential therapeutic applications for this invention include, 
but are not limited to: protein therapeutic, small molecule drug target, antibody target 
(therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or prognostic 
marker, gene therapy (gene delivery/gene ablation), research tools, tissue regeneration in vivo 
and in vitro of all tissues and cell types composing (but not limited to) those defined here. 
10 The NOV9 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in obesity, diabetes, kidney disorders, cardiovascular 
disorders, anorexia, eating disorders, gastrointestinal and digestive diseases, metabolic 
diseases,CNS disorders, cancer, autoimmune disease, inflammation, and/or other pathologies 
and disorders. 

1 5 NOV9 nucleic acids and polypeptides are further useful in the generation of antibodies 

that bind immuno-specifically to the novel NOV9 substances for use in therapeutic or 
diagnostic methods. These antibodies may be generated according to methods known in the 
art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies' 7 
section below. The disclosed NOV9 protein has multiple hydrophilic regions, each of which 

20 can be used as an immunogen. These novel proteins can be used in assay systems for 

functional analysis of various human disorders, which will help in understanding of pathology 
of the disease and development of new drug targets for various disorders. 



NOV10 

25 A disclosed NOV 10 nucleic acid of 985 nucleotides (also referred to as CG5 5964-01) 

encoding a novel G-Protein Coupled Receptor-like protein is shown in Table 10A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 33-35 and 
ending with a TGA codon at nucleotides 981-983.* A putative untranslated region upstream 
from the initiation codon is underlined in Table 10A. The start and stop codons are in bold 

30 letters. 
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Table 10A. NOVIO nucleotide sequence (SEQ ID NO:49). 

CAAATCTACCACTTGATTCTGATGAACAAATC ATGCCGACATTCAATGGCTCAGTCTTCATGCCCTCTGCGT 
TTATACTAATTGGGATTCCTGGTCTGGAGTCAGTGCAGTGTTGGATTGGGATTCCTTTCTCTGCCATGTATC 
TTATTGGTGTGATTGGAAATTCCCTAATTTTAGTTATAATCAAATATGAAAACAGCCTCCATATACCCATGT 
AC ATTT TTT TGGCC ATGT T GG C AGC C AC AG AC ATTGC ACTT AAC AC C TG C AT TC TT C C C AAAATGTT AGG C A 
TCTTCTGGTTTCATTTGCCAGAGATTTCTTTTGATGCCTGTCTTTTTCAAATGTGGCTTATTCACTCATTCC 
AGG C AAT TG AAT CGGGT AT C CT TCT GG C AATGG CCC TGG AT CG CT ATGTGGC C AT CTGT AT C C C CTT GAG AC 
ATGCCACCATCTTTTCCCAGCAGTTCTTAACTCATATTGGACTTGGGGTGACACTCAGGGCTGCCATTCTTA 
T AAT AC CTT CCTT AGGGCT C AT C AAATG C T GTC TG AAAC AC T AT CG AAC T AC AGTC ATCT CT C AC T CT T ACT 
GTGAGCACATGGCCATCGTGAAGCTGGCTACTGAAGATATCCGAGTCAACAAGATATATGGCCTATTCGTTG 
CCTTTGCAATCCTAGGGTTTGACATAATATTTATAACCTTGTCCTATGTCCAAATTTTTATCACTGTCTTTC 
AGCTGCCCCAGAAGGAGGCACGATTCAAGGCCTTTAATACATGCATTGCCCACATTTGTGTCTTCCTACAGT 
T C T AC CTTCTTGCCTTCTTCTCTTT CT TC AC AC AC AGG T TTGGTT C AC AC AT AC CAC CAT AT ATT CAT AT C C 
TCTTGTCAAATCTTTACCTGTTAGTCCCACCTTTTCTCAACCCTATTGTCTATGGAGTGAAGACCAAGCAAA 
TTCGTGACCATATTGTGAAAGTGTTTTTCTTCAAAAAAGTAACTTGATC 



In a search of public sequence databases, the NOVIO nucleic acid sequence has 789 of 
974 bases (81%) identical to a gb:GENBANK-ID:AF133300|acc:AFl 33300.2 mRNA from 
Mus musculus (MOR 3'Betal, MOR 3'Beta2, MOR 3'Beta3, and MOR 3'Beta4 genes, 
5 complete cds; Cbx3 pseudogene, complete sequence; and MOR 3'Beta5 and MOR 3'Beta6 
genes, complete cds) (E = 4.3e~ 136 ). 

The disclosed NOVIO polypeptide (SEQ ID NO:50) encoded by SEQ ID NO:49 has 
316 amino acid residues and is presented in Table 10B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOVlOb has a signal peptide and is 
10 likely to be localized to the endoplasmic reticulum (membrane) with a certainty of 0.6850. 

Alternatively, NOV10 may also localize to the plasma membrane with a certainty of 0.6400, 
the Golgi body with a certainty of 0.4600, or in the endoplasmic reticulum (lumen) with a 
certainty of 0.1000. The most likely cleavage site for NOV10 is between positions 24 and 25: 
LES-VQ. 

15 

Table 10B. Encoded NOV10 protein sequence (SEQ ID NO:50). 

MPTFNGSVFMPSAFILIGIPGLESVQCWIGIPFSAMYLIGVIGNSLILVI IKYENSLHIPMYIF 
LAMLAATDIALNTCILPKMLGIFWFHLPEISFDACLFQMWLIHSFQAIESGILLAMALDRYVAI 
CIPLRHATIFSQQFLTHIGLGVTLRAAILI IPSLGLIKCCLKHYRTTVISHSYCEHMAIVKLAT 
EDIRVNKIYGLFVAFAILGFDI IFITLSYVQI FITVFQLPQKEARFKAFNTCIAHICVFLQFYL 
LAFFSFFTHRFGSHIPPYIHILLSNLYLLVPPFLNPIVYGVKTKQIRDHIVKVFFFKKVT 



A search of sequence databases reveals that the NOV10 amino acid sequence has 316 
of 316 amino acid residues (100%) identical to, and 316 of 316 amino acid residues (100%) 
similar to, the 316 amino acid residue ptnr:TREMBLNEW-ACC:AAG42368 protein from 
20 Homo sapiens (Human) (Odorant Receptor HOR3'BETA5) (E = 5.7e" 169 ). 
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NOV 10 is predicted to be expressed in at least Apical microvilli of the retinal pigment 
epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma cell lines, corpus 
callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral tissue, 
cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary artery and 
5 umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal hematopoietic 
cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, lung lymphoma 
cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express MHC II and III 
nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, pons, prostate, 
putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary artery in aortic) 

10 spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, thalamus, and thymus 
tissue. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 

The disclosed NOV 10 polypeptide has homology to the amino acid sequences shown 

1 5 in the BLASTP data listed in Table 1 0C. 



Table 10C. BLAST results for NOV10 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Po 
sitives 
(%) 


Expect 


gi | 11991867 | gb | AAG4 
2368 . 1 | (AF289204) 


odorant receptor 
HOR3 ' betas [Homo 
sapiens] 


316 


316/316 
(100%) 


316/316 
(100%) 


e-148 


gi | 7305351 | ref | NP_0 
38648 . 1 | 
(NM_013620) 


olfactory- 
receptor 68 [Mus 
musculus] 


315 


258/314 
(82%) 


281/314 
(89%) 


e-122 


gi | 7305353 | ref j NP_0 
38649.1 | 
(NM 013621) 


olfactory 
receptor 6 9 [Mus 
musculus] 


316 


255/314 
(81%) 


279/314 
(88%) 


e-120 


gi | 11908221 | gb | AAG4 
1685. 1| (AF133300) 


MOR 3 ' Beta6 [Mus 
musculus] 


316 


238/311 
(76%) 


268/311 
(85%) 


e-115 


gi | 6912560 | ref | NP_0 
36507. 1| 
(NM 012375) 


olfactory 
receptor, family 
52, subfamily A, 
member 1 [Homo 
sapiens] 


312 


233/310 
(75%) 


263/310 
(84%) 


e-110 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 10D. In the ClustalW alignment of the NOV 10 protein, as 
20 well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
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can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 10D. ClustalW Analysis of NOV10 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



1) Novel NOV10 (SEQ ID NO: 50) 

2) gi | 11991867 | gb | AAG4 236 8 . 1 | 
{SEQ ID NO: 362) 

3) gi | 7305351 | ref |NP_03864 8 . 1 | 
(SEQ ID NO:363) 

4) gi | 7305353 | ref | NP_03 864 9 . 1 | 
(SEQ ID NO: 3 64) 

5) gi | 11908221 | gb| AAG41685 . 1 | 

6) gi | 6912560 | ref |NP_036507 . 1 | 



(AF289204) odorant receptor HOR3'beta5 [Homo sapiens] 

(NM_013620) olfactory receptor 68 [Mus musculus] 

(NM_013621) olfactory receptor 69 [Mus musculus] 

(AF133300) MOR 3'Beta6 [Mus musculus] (SEQ ID NO:365) 
(NM_0123 75) olfactory receptor, family 52, subfamily 



A, member 1 [Homo sapiens] (SEQ ID NO: 366) 



NOV10 

gi | 11991867 | 
gi|730535l| 
gi|7305353 | 
gi | 11908221 | 
gi | 6912560 | 



NOV10 

gi|H991867 | 
gi |7305351| 
gi|7305353 j 
gi|ll90822l| 
gi | 6912560 | 



NOV10 

gi | 11991867 | 
gi|730535l| 
gi|7305353 j 
gi | 11908221 | 
gi | 6912560 | 



NOV10 

gi | 11991867 | 
gi |7305351| 
gi|7305353| 
gi|H90822l| 
gi|6912560| 



NOV10 

gi|H991867 | 
gi|730535l| 
gi |7305353 | 
gi j 11908221 | 
gi | 6912560 | 



NOV10 



241 
241 
241 
241 
241 
241 




20 



I 



30 
.|.. 



L|GI PGLESVQCWIGI PF 
LOG I PGLE S VQCW I G I P F 
L [GI PGLESVQCWIGI PF 
L|GI PGLESVQCWIGI PF 
lHgI PGLESVQCWIGI PF 
IlBgI PGLESVQCWIGI PF 





110 



120 



I 



QMWLIHSFQAIESGILLAM 
QMWLIHSFQAIESGILLAM 

qmElihs fqaHesgi LLAM 

QMgL I H S F Q AgE SGI Li LAM 

|qmwl i hHfqmi E sgi lBam 

!QMW[aiHSjQHlESGIL^aM 



120 
120 
120 
120 
120 
120 



130 



140 



150 



160 




170 



180 



LI KCHLKgYRTTVI SHS 
LIKCMLKgYRTTVISHS 

LI kcBlkSjIrttSIi SHS 
likJmlkS§RttRishs 

L I KCgLKjaYgT T VgS H S 

t t x^RsmIySitv i s hs 



190 



200 



210 



220 



230 



180 
180 
180 
180 
180 
180 



240 




250 



260 



270 



280 



290 



300 



I 



FNTCI AHI CVF 
FNTCIAHICVF 
FNTCI AHI CVF 
FNTCIAHICVF 
FNTCIAHICVF 
FNTCIAHICVF 




310 
I 



301 m 



T 316 



102 



gi 1 11991867 | 
gi|730535l| 
gi|7305353 | 
gi|ll90822l| 
gi|6912560| 




T 316 

isggH- 315 

IslBaPL 316 

GNYSY 316 

S 312 
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Table 10E lists the domain description from DOMAIN analysis results against 
NOV 1 0. This indicates that the NOV 1 0 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 10E Domain Analysis of NOV10 

gnl | Pf am | pf amOOOOl , 7tm_l, 7 transmembrane receptor (rhodopsin 
family). (SEQ ID NO:810) 

CD-Length = 254 residues, 100.0% aligned 
Score = 67.8 bits (164),, Expect = 9e-13 



15 



20 



25 



30 



35 



40 



NOV10 : 


43 


Sbjct : 


1 


NOV10 : 


103 


Sbjct : 


61 


NOV10 : 


163 


Sbjct : 


121 


NOV10 : 


220 


Sbjct : 


177 


NOV10 : 


278 


Sbjct : 


237 



GNSLILVIIKYENSLHIPMYIFLAMLAATDIALNTCILPKMLGIFWFHLPEISFDACLFQ 102 

|| | ++++ | I I III II 1+ +11 I 

GNLLVI LVI LRTKKLRTPTNI FLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 6 0 

MWLIHSFQAIESGILLAMALDRYVAICIPLRHATIFSQQFLTHIGLGVTLRAAILIIPSL 162 

| + | |+++| ll+ll 111+ I + + + I I + I -I +1 I 

GALFVWGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILLVWVLALLLSLPPL 12 0 



III 



I 



MM 



G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
protein-coupled receptors in a number of species. These receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 

103 
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Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

5 The disclosed NOV 10 nucleic acid of the invention encoding a G-Protein Coupled 

Receptor -like protein includes the nucleic acid whose sequence is provided in Table 1 OA or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 10A while still encoding a 
protein that maintains its G-Protein Coupled Receptor-like activities and physiological 

10 functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of nonlimiting example, 

15 modified bases, and nucleic acids whose sugar phosphate backbones are modified or 

derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 19 percent of the bases may be so changed. 

20 The disclosed NOV 10 protein of the invention includes the G-Protein Coupled 

Receptor-like protein whose sequence is provided in Table 10B. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 10B while still encoding a protein that maintains its G-Protein 
Coupled Receptor-like activities and physiological functions, or a functional fragment thereof. 

25 In the mutant or variant protein, up to about 25 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G-Protein Coupled Receptor-like 
protein (NOV 10) is a member of a "G-Protein Coupled Receptor family". Therefore, the 

30 NOV10 nucleic acids and proteins identified here may be useful in potential therapeutic 

applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 



104 




delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV 10 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in developmental diseases, MHCII and III diseases 
5 (immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 

Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), non insulin-dependent diabetes mellitus (NIDDM1), bacterial, 

10 fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 
Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 

15 ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 

disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 

20 NOV 10 nucleic acids and polypeptides are further useful in the generation of 

antibodies that bind immuno-specifically to the novel NOV 10 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobic ity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV 10 protein has multiple hydrophilic 

25 regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 



30 NOV11 

A disclosed NOV1 1 nucleic acid of 1014 nucleotides (also referred to as Curagen 
Accession No. CG55966-01) encoding a novel G-Protein Coupled Receptor -like protein is 
shown in Table 1 1 A. An open reading frame was identified beginning with an ATG initiation 
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codon at nucleotides 2-4 and ending with a TGA codon at nucleotides 947-949. Putative 
untranslated regions upstream from the initiation codon and downstream of the termination 
codon are underlined in Table 1 1 A. The start and stop codons are in bold letters. 



Table 11 A. NOV11 nucleotide sequence (SEQ ID NO:51). 

AATGATTACTTCAGTAAGCCCTAGCACCAGCACGAATTCTTCCTTTCTTCTCACTGGATTTTCTG 
GCATGGAGCAGCAATACCCCTGGTTTTCCATCCCCTTCTCCTCAATCTATGCCATGGTGCTTTTG 
GGCAATTGCATGGTTCTCCATGTGATATGGACTGAGCCAAGCCTGCACCAGCCTATGTTTTACTT 
CCTGTCCATGCTGGCCCTCACTGACCTGTGCATGGGGCTGTCCACTGTGTACACAGTGCTGGGGA 
TCCTGTGGCGGATCATTCGAGAGATCAGCTTGGATTCCTGCATTGCCCAGTCCTATTTCATCCAT 
GGTCTGTCCTTCATGGAGTCCTCTGTCCTCCTCACTATGGCCTTTGACCGGTACATTGCAATTTG 
CAATCCACTACGTTATTCCTCCATCCTGACTAATTCCAGAATTATCAAAATTGGGCTCACTATAA 
TAGGTAGGAGTTTTTTCTTTATTACACCCCCCATCATCTGTCTGAAATTTTTTAATTACTGTCAT 
TTCCACATCCTTTCTCACTCTTTCTGCCTGCACCAGGATCTTCTCCGCTTAGCCTGTTCAGACAT 
CCGATTCAATAGTTACTATGCCCTGATGCTGGTTATTTGCATACTGTTGTTGGATGCTATACTCA 
TCCTTTTCTCCTACATCCTGATTCTTAAGTCAGTCCTGGCAGTTGCCTCTCAGGAAGAGAGGCAT 
AAATTATTTCAGACCTGCATCTCCCACATCTGTGCTGTCCTTGTGTTCTACATCCCTATCATTAG 
CCTCACAATGGTGCACCGTTTTGGCAAGCACCTTTCCCCCGTGGCCCACGTTCTCATTGGCAACA 
T CTAC AT C C TT TT C CC AC CTTT AAT G AAT C C CAT CAT C T AC AGT GT C AAG AC C C AAC AG AT T CAT 
ACCAGAATGCTTAGACTCTTTTCTCTGAAAAGATATTG AGAGATATTGAGATGTATTGCCTAAAA 
AAAAG AAAG AAAAG C AG C AAC AAT AAT AAAC AAAAAT C A 



The disclosed NOV1 1 polypeptide (SEQ ID NO:52) encoded by SEQ ID NO:51 has 
315 amino acid residues and is presented in Table 1 IB using the one-letter amino acid code. 



Table 11B. Encoded NOV11 protein sequence (SEQ ID NO:52). 

MITSVSPSTSTNSSFLLTGFSGMEQQYPWFSIPFSSIYAMVLLGNCMVLHVIWTEPSLHQPMFY 
FLSMLALTDLCMGLSTVYTVLGILWRIIREISLDSCIAQSYFIHGLSFMESSVLLTMAFDRYIA 
ICNPLRYSSILTNSRI IKIGLTIIGRSFFFITPPIICLKFFNYCHFHILiSHSFCLHQDLLRLAC 
SDIRFNSYYALMLVICILLLDAILILFSYILILKSVLAVASQEERHKLFQTCISHICAVLVFYI 
P 1 1 S LTMVHR FGKHLS PVAHVLIGNIYILFPPLMNPIIYSVKTQQIHTRMLRLFSLKRY 



10 A search of sequence databases reveals that the NOV1 1 amino acid sequence has 165 

of 302 amino acid residues (54%) identical to, and 222 of 302 amino acid residues (73%) 
similar to, the 31 1 amino acid residue ptnr: SPTREMBL-ACC:Q9WVN4 protein from Mus 
musculus (Mouse) MOR 5 'BETA 1 (E = 7.0e" 88 ). 

The disclosed NOV1 1 polypeptide has homology to the amino acid sequences shown 

1 5 in the BLASTP data listed in Table 1 1 C. 



Table 1 1 C. BLAST results for NOV1 1 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 11991863 | gb |AAG4 
2364 . 1 | (AF289204) 


odorant receptor 
HOR3'betal [Homo 
sapiens] 


321 


315/315 
(100%) 


315/315 
(100%) 


e-139 
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gi | 119 0 8218 | gb | AAG4 
1683. l| (AF137396) 


HOR5 «Beta5 [Homo 
sapiens] 


312 


165/307 
(53%) 


231/307 
(74%) 


4e-78 


gi | 17456753 | ref |XP_ 
061614 . 1 | 
<XM_061614) 


similar to MOR 
3Beta4 (H. 
sapiens) [Homo 
sapiens] 


315 


163/307 
(53%) 


223/307 
(72%) 


le-77 


gi | 7305345 | ref |NP_0 
38645 . 1 | 
(NM_013617) 


olfactory 
receptor 65 [Mus 
musculus] 


307 


164/305 
(53%) 


223/305 
(72%) 


5e-77 


gi | 174 56767 | ref |XP_ 
061618 . 1 | 
(XM_061618) 


similar to 
prostate specific 
G-protein coupled 
receptor (H. 
sapiens) [Homo 
sapiens] 


879 


162/303 
(53%) 


226/303 
(74%) 


2e-76 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 1 ID. In the ClustalW alignment of the NOV 1 1. protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 

Table 11D. ClustalW Analysis of NO VI 1 

10 1) Novel NOV11 (SEQ ID NO: 52) 

2) gi | 119 918 6 3 | gb |AAG4 23 64 . 1 | (AF2 8 92 04) odor ant receptor HOR3'betal [Homo sapiens] 
(SEQ ID NO: 3 67) 

3) gi | 11908218 |gb | AAG41683 . 1 | (AF137396) HOR5 » Beta5 [Homo sapiens] (SEQ IDNO:368) 

4) gi | 17456753 j ref | XP_06 16 14 . 1 1 (XM_061614) similar to MOR 3Beta4 (H. sapiens) 
15 [Homo sapiens] (SEQ ID NO: 3 69) 

5) gi | 7305345 | ref |NP_038645 . 1 | (NM_013617) olfactory receptor 65 [Mus musculus] 
(SEQ ID NO:370) 

6) gi | 17456767 |.ref |XP_061618 . 1 | (XM_061618) similar to prostate specific G-protein 
coupled receptor (H. sapiens) [Homo sapiens] (SEQ ID NO: 371) 



25 



30 



35 



40 



NOVll 

gi | 11991863 | 
gi | 11908218 | 
gi j 17456753 j 
gi |7305345 | 
gi | 17456767 | 



NOVll 

gi | 11991863 | 
gi | 11908218 | 
gi j 17456753 j 
gi | 7305345 | 
gi (17456767 | 



10 20 30 40 50 60 

....|....|....|....|....|....|....|....|----|....|...-|.-.-| 

1 1 

1 1 

1 1 

! 1 

! 1 

1 MSLALDLCPLSQRLEAFPSSI VLFFQTAPAVRHPKGLLELHKTVPTS I KEELKGFFPTSD 6 0 

70 80 90 100 110 120 

....|....|....|....|....|....|....|..-.|....|.---|----|.--.| 

! 1 

! 1 

! 1 

! 1 

1 1 

61 HFIITDFIAKYHTDLKWAVLGIATPRQQFKALNTCISHICAVLIFYVPTLSAAMLHQFAR 120 

130 140 150 160 170 180 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



NOVll 

gi | 11991863 | 
gi|11908218| 
gi|l7456753 | 
gi | 7305345 | 
gi j 17456767 | 



NOVll 

gi | 11991863 | 
gi|H908218| 
gi | 17456753 | 
gi j 7305345 | 
gi j 17456767 | 



NOVll 

gi | 11991863 | 
gi | 11908218 j 
gi j 17456753 | 
gi j 7305345 | 
gi | 17456767 | 



NOVll 

gi|H991863| 
gi j 11908218 j 
gi | 17456753 | 
gi|7305345 | 
gi | 17456767 | 



NOVll 

gi | 11991863 | 
gi j 11908218 | 
gi|l7456753| 
gi | 73053.45 | 
gi j 17456767 | 



NOVll 

gi | 11991863 | 
gi|H908218| 
gi j 17456753 j 
gi |7305345 | 
gi | 17456767 | 



NOVll 

gi | 11991863 | 
gi j 11908218 | 
gi | 17456753 | 
gi|7305345 | 
gi | 17456767 | 



x 1 

! 1 

121 DVSPMIHVLMADIFLLVPPLLNPIVYCVKTHQIREKWGKLCPKNCFLKSKILPRCSFVP 180 

190 200 210 220 230 240 

....|....|....|--..|....|....|....|-.-.|....|.--.|..-.|---.| 



1 1 

! 1 

181 GFRLAYYYLPPHPSKVSFLDPVEKANRSAPTQFSPMPSADASLLADLGTFSSLQRATFFL 24 0 

250 260 270 280 290 300 

....|....|....|....|....|-...|---.|---.|....|....|.--.|-...| 



1 1 

I 1 

241 TGFQGLEGLHGWI SIP FCF I YLTVI LGNLT I LHVI CTDATLHGPM YYFLGMLAVTDLGLC 300 



310 320 330 340 350 360 

..|....|....|...-|....|....|...-|....|.---|..-.|....| 



1 1 

x 1 1 

3 01 LSTLPTVLGIFWFDTREIGIPACFTQLFFIHTLSSMESSVLLSMSIDRYVAVCNPLHDST 3 60 

370 380 390 400 410 420 



! 1 

! 1 

3 61 VLTPACIVKMGLSSVLRSALLILPLPFLLKRFQYCHSHVLAHAYCLHLEIMKLACSSIIV 420 



430 



440 



450 



460 



470 



480 



! 1 

! 1 

421 NHI YGLFWACTVGVDSLL I FLS YALI LRTVLS I ASHQERLRALNTCVSHI CAVLLFYI P 4 80 



490 



500 



510 



520 



530 



540 



|....|..-.| 



I---- I 



1 1 

4 81 MIGLSLVHRFGEHLPRWHLFMSYVYLLVPPLMNPIIYSIKTKQIRQRIIKKFQFIKSLR 54 0 



550 



560 



570 



580 



NOVll 

gi | 11991863 | 
gi | 11908218 | 
gi j 17456753 j 
gi|7305345| 

gi j 17456767 | 541 CNHQYCLNLLQDFGGHPPSPLSPHTMTlGSLG||sgSgV§g' 



MlTSVgPgTSTNpS 

MFLS SRMI TS vSpBtJItII^ 3 

I'-MSSSSg-- -jjfHP 

MGDWNNSD-gvEPI 



-MWSNI@ A^P 




108 



640 



650 



660 



NOV11 

gi | 11991863 | 
gi j 11908218 j 
gi j 17456753 | 
gi|7305345 | 
gi | 17456767 | 601 LCFI 




NOV11 

gi|H991863 | 
gi j 11908218 | 
gi j 17456753 j 
gi | 7305345 | 
gi|l7456767 | 



NOV11 

gi | 11991863 | 
gi | 11908218 | 
gi |17456753 j 
gi | 7305345 | 
gi | 17456767 | 



NOV11 

gi | 11991863 | 
gi|H908218| 
gi j 17456753 | 
gi|7305345 | 
gi|l7456767 | 



94 


EI 




100 


EI 


sld| 


89 


EI 


gspM 


93 


EI 


QASg 


89 


EI 


rhgB 


661 


EI 


shdS 



154 
160 
149 
153 
149 
721 



NOV11 


214 


AIL 


gi| 11991863 | 


220 




gi|H908218| 


209 




gi|l7456753 j 


213 




gi|7305345| 


209 


i 


gi|l7456767| 


781 






SHIiSP 273 

SP 279 

gQVPH 2 68 

* SP 272 

PH 268 

»APH 840 



Table 1 1 E lists the domain description from DOMAIN analysis results against 
NOV1 1 . This indicates that the NOV1 1 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table HE Domain Analysis of NOV11 

gnl | Pf am | pf amOOOOl , 7tm_l, 7 transmembrane receptor (rhodopsin 
family). (SEQ ID NO:810) 

CD-Length = 254 residues, 100.0% aligned 
Score = 71.2 bits (173), Expect = 8e-14 



NOVll: 44 GNCMVLHVIWTEPSLHQPMFYFL1SMLALTDLCMGL1STVYTVLGILWRI IREISLDSCIAQ 103 

I I +1+ I I I I II ll+ll 1+ II I 

Sbjct : 1 GNLLVILVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPW7U.YYLVGGDWVFGDALCKLV 60 

NOVll: 104 SYFIHGLSFMESSVLLTMAFDRYIAICNPLRYSSILTNSRIIKIGLTIIGRSFFFITPPI 163 

109 



1 i 



15 



Sbjct : 


61 


NOV11 : 


164 


Sbjct : 


121 


NOV11 : 


224 


Sbjct : 


172 


NOV11 : 


273 


Sbjct : 


232 



+1 ++ lll + ll + 1111 I I I + I + + 11+ 

GALFWNGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILLVWVLALLLSLPPL 12 0 

ICLKFFTJYCHFHIIiSHSFCLHQDLLRLACSDIRFNSYYALMiVICJil/I/DAJiJI.FSYIL 223 
+ I + + II + 11+ + +1 + +II I 



I I+++ I III + + I I + I ++ I + 

10 Sbjct: 172 ILRTLRKRARSQRSLKRRSSSERKAAKMLLVVVVVFVLCWLPYHIVLLLDSLCLLSIWRV 231 



+11 +111 



G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
20 and are likely to underlie the recognition and G-protein-mediated transduction of various 

signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

25 Olfactory receptors (ORs) have been identified as extremely large subfamily of G 

protein-coupled receptors in a number of species. These receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
Previously, OR genes cloned in different species were from random locations in the respective 

30 genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV1 1 nucleic acid of the invention encoding a G-Protein Coupled 
Receptor -like protein includes the nucleic acid whose sequence is provided in Table 1 1 A or a 

35 fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 

bases may be changed from the corresponding base shown in Table 1 1 A while still encoding a 
protein that maintains its G-Protein Coupled Receptor-like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 

40 that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of nonlimiting example, 

110 




modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. 
5 The disclosed NOV1 1 protein of the invention includes the G-Protein Coupled 

Receptor-like protein whose sequence is provided in Table 1 IB. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 1 IB while still encoding a protein that maintains its G-Protein 
Coupled Receptor -like activities and physiological functions, or a functional fragment thereof. 

1 0 In the mutant or variant protein, up to about 47 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G-Protein Coupled Receptor-like 
protein (NOV1 1) is a member of a "G-Protein Coupled Receptor family". Therefore, the 

15 NOV1 1 nucleic acids and proteins identified here may be useful in potential therapeutic 

applications implicated in (but not limited to) various pathologies and disorders. The potential 
therapeutic applications for this invention include, but are not limited to: protein therapeutic, 
small molecule drug target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 

20 research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing 
(but not limited to) those defined here. 

The NOV1 1 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in developmental diseases, MHCII and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 

25 Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 

30 cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 

uterus cancer), anorexia, bulimia, asthma, Parkinsons disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 
Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
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disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other pathologies and disorders. 
5 - NOV1 1 nucleic acids and polypeptides are further useful in the generation of 

antibodies that bind immuno-specifically to the novel NOV1 1 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. These novel proteins can be used in assay systems for 
10 functional analysis of various human disorders, which will help in understanding of pathology 
of the disease and development of new drug targets for various disorders. 

NOV12 

A disclosed NOV 12 nucleic acid of 1 067 nucleotides (also referred to as Curagen 
Accession No. CG56003-01) encoding a novel G-Protein Coupled Receptor -like protein is 
15 shown in Table 12A. An open reading frame was identified beginning with an ATG initiation 
codon at nucleotides 15-17 and ending with a TGA codon at nucleotides 1023-1025. The 
untranslated regions are underlined and the start and stop codons are in bold letters in Table 
12A. 



Table 12A. NOV12 nucleotide sequence (SEQ ID NO:53). 

Taaaacctgacataaa tgaacaacaatacaacatgtattcaaccatctatgatctcttccatggctttaccaa 
tcatttacatcctcctttgtattgttggtgtttttggaaacactctctctcaatggatatttttaacaaaaa 
taggtaaaaaaacatcaacgcacatctacctgtcacaccttgtgactgcaaacttacttgtgtgcagtgcca 
tgcctttcatgagtatctatttcctgaaaggtttccaatgggaatatcaatctgctcaatgcagagtggtca 
attttctgggaactctatccatgcatgcaagtatgtttgtcagtctcttaattttaagttggattgccataa 
gccgctatgctaccttaatgcaaaaggattcctcgcaagagactacttcatgctatgagaaaatattttatg 
gcc att tactg aaaaaatt t cgc cagc c c aact ttgct agaaaact atg c attt ac at atgggg agttgt ac 
t gggc ataat c att cc agt t ac cgt at act act c agtc at ag agg c t ac ag aaggag aag ag ag cc t atgct 
acaatcggcagatggaactaggagccatgatctctcagattgcaggtctcattggaaccacatttattggat 
tttcctttttagtagtactaacatcatactactcttttgtaagccatctgagaaaaataagaacctgtacgt 
ccattatggagaaagatttgacttacagttctgtgaaaagacatcttttggtcatccagattctactaatag 

TTTGCTTCCTTCCTTATAGTATTTTTAAACCCATTTTTTATGTTCTACACCAAAGAGATAACTGTCAGCAAT 
TGAATTATTTAATAGAAACAAAAAACATTCTCACCTGTCTTGCTTCGGCCAGAAGTAGCACAGACCCCATTA 
TATTTCTTTTATTAGATAAAACATTCAAGAAGACACTATATAATCTCTTTACAAAGTCTAATTCAGCACATA 
TGCAATCATATGGTTGA CTTTTGAATGGAAAACCCCACAATATTAAGAAAAGCATTCAT 

20 " ^ 

The disclosed NOV 12 polypeptide (SEQ ID NO 54) encoded by SEQ ID NO:53 has 
336 amino acid residues and is presented in Table 12B using the one-letter amino acid code. 
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Table 12B. Encoded NOV12 protein sequence (SEQ ID NO:54). 

MNNNTTCIQPSMISSMALPI IYILLCIVGVFGNTLSQWI FLTKIGKKTSTHI YLSHLVTANLLV 
CSAMPFMSIYFLKGFQWEYQSAQCRWNFLGTLSMHASMFVSLLILSWIAISRYATLMQKDSSQ 
ETTSCYEKIFYGHLLKKFRQPNFARKLiCIYIWGWLGI I IPVTVYYSVIEATEGEESLCYNRQM 
ELGAMISQIAGLIGTTFIGFSFLWLTSYYSFVSHLRKIRTCTSIMEKDLTYSSVKRHLLVIQI 
LLIVCFLPYSIFKPIFYVLHQRDNCQQLNYLIETKNILTCLASARSSTDPIIFLLLDKTFKKTLYNLFT 
KSNSAHMQSYG 



A search of sequence databases reveals that the NOV 12 amino acid sequence has 52 of 
179 amino acid residues (29%) identical to, and 86 of 179 amino acid residues (48%) similar 
to, the 339 amino acid residue ptnr: SWISSPROT-ACC:Q13304 protein from Homo sapiens 
5 Putative G Protein-Coupled Receptor GPR1 7 (R12) (E = 1 .6e' 22 ). 

G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
10 signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

The disclosed NOV 12 polypeptide has homology to the amino acid sequences shown 
15 in the BLASTP data listed in Table 12C. 



Table 12C. BLAST results for NOV12 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 182 0187 0 | ref | NP_ 
543007 . 1 | 
(NM_08 0817) 


G protein- coupled 
receptor 82 [Homo 
sapiens] 


336 


336/336 
(100%) 


336/336 
(100%) 


e-170 


gi | 4885301 | ref |NP_0 
0 52 82. l| 
(NM_005291) 


G protein- coupled 
receptor 17 [Homo 
sapiens] 


367 


85/322 
(26%) 


144/322 
(44%) 


6e-21 


gi | 174 6216 9 | ref | XP_ 
002705 . 4 | 
(XM_002705) 


G protein-coupled 
receptor 17 [Homo 
sapiens] 


339 


85/322 
(26%) 


144/322 
(44%) 


2e-20 


gi | 2695876 | emb | CAB0 
8108. l| (Z94155) 


P2Y-like G- 
protein coupled 
receptor [Homo 
sapiens] 


298 


80/302 
(26%) 


135/302 
(44%) 


3e-18 


gi | 5757634 |gb| AAD50 
531 . 1 | AF039686_1 
(AF039686) 


G-protein coupled 
receptor GPR34 
[Homo sapiens] 


381 


77/323 
(23%) 


152/323 
(46%) 


4e-18 
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The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 12D. In the ClustalW alignment of the NOV 12 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 

Table 12D. ClustalW Analysis of NOV12 

1) Novel NOV12 (SEQ ID NO: 54) 

9) gi | 18201870 | ref |NP_543007 . 1 | <NM_080817) G protein- coupled receptor 82 [Homo 
sapiens] {SEQ ID NO: 372) 

9) gi | 4885301 | ref |NP__005282 . 1 | (NM_005291) G protein-coupled receptor 17 [Homo 
sapiens] (SEQ ID NO: 3 73) 

9) gi | 17462169 |ref |XP_002705 .4 | (XM_002705) G protein- coupled receptor 17 [Homo 
sapiens] (SEQ ID NO: 3 74) 

9) gi | 2695876 | emb | CAB08108 . 1 | (Z94155) P2Y-like G-protein coupled receptor [Homo 
sapiens] (SEQ ID NO: 3 75) 

9) gi | 5757634 |gb|AAD50531 . 1 | AF039686_1 (AF039686) G-protein coupled receptor GPR34 
[Homo sapiens] (SEQ ID NO: 376) 



NO VI 2 

gi | 18201870 | 
gi|488530l| 
gi|l7462169| 
gi|2695876| 
gi | 5757634 | 



I 



10 



I 



20 



30 



40 



-fkfNNT' 



rtNNNT' 

mskrswwagsrkppremlklsgsdssqsmnglevappglitnfsSataei 
mnglevappgl i tnfsMataei 

mrshtitmtttsvsswpysshrmrfitnhsdqppqnfs- - atpnvttgpmdekglgt 55 




NOV12 

gi|l8201870| 
gi|488530l| 
gi |17462169 | 
gi|2695876| 
gi |5757634 | 




NOV12 

gi | 18201870 | 
gi|4885301| 
gi j 17462169 | 
gi|2695876| 
gi|5757634| 




J 



170 
I 



|lsgf|ATplQKDS 

jByatLmqkds 

flNRSgiQ- 



180 
....| 
QETTSCYEK 13 6 
QETTSCYEK 136 

167 

139 

98 

162 
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NOV12 

gi|l8201870| 
gi|488530l| 
gi|l7462169| 
gi|2695876| 
gi [5757634 | 



137 ifyghll: 

13 7 ifyghllk 

167 L 

139 L 

98 L 

162 




220 



230 



240 



|tEyysv]P|ategees~ 
|YYSv|j|ategees; 
ispqt&tI- -; 

|SPQTfeT|l-- 
|SPQTg^Td-- 
jLTLKKGGH-- 




I 



OM 196 



196 



RQMU 

rqm| 

iLQLYRgKAi 218 

LQLYrHkA^ 19 0 

LQLYrHkaM 14 9 

FHYRDKHNA 213 



NO VI 2 



250 260 270 280 290 

197 Mlf Q^GL|GTT[3l Ggsj^Vv|gsiYS F|SHj3g|i RTCTS I^Ed0TYS! 
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300 



I 256 



10 



15 



20 



25 



gi | 18201870 | 
gi [4885301 | 
gi | 17462169 | 
gi | 2695876 | 
gi j 5757634 j 



197 MlStQX^GL^CGTT 3lG 3s : 
|. j •.. i% s» am 

219 HHMUgS- 

191 HH§L|f|- 

150 HH^L^- 

214 KGEAJFNF§ 



rYSSgiRHSLyjQI 256 




NO VI 2 


257 


fill 


gi 1 18201870 | 


257 


mm 


gi | 4885301 1 


271 




gi | 17462169 | 


243 




gi | 2695876 | 


202 




gi j 5757634 j 


274 


llFil 




310 320 
I • 

~ ";Fk5pJ EYVLH^RD - 1 
* &SHG, 
RSHGAg 
JdHTO^HGAE 
SSQLtlVS-B 



330 





360 

JCqLLgDI^ 314 

xfeLLlbk 314 

^Hf^A^ 33 0 

FTO| 3 02 

FpAlB 261 

}S 332 



370 



380 



390 



410 




400 

•J 

SgSAH^QgYG 33 6 

pSsAHMQSYG 33 6 

FEGKMeSsSsIkIeL- 3 67 



NO VI 2 

gi 1 18201870 | 
gi | 4885301 | 
gi | 17462169 | 
gi|2695876| 
gi j 5757634 j 



Table 12E lists the domain description from DOMAIN analysis results against 
NOV 12. This indicates that the NOV 12 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 12E Domain Analysis of NOV12 

gnl | Pf am | pf amOOOOl , 7tm_l, 7 transmembrane receptor (rhodopsin 
family). (SEQ ID NO: 810) 

CD-Length = 254 residues, 99.6% aligned 
Score = 82.0 bits (201), Expect - 5e-17 
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NOV12 : 


32 


Sbjct : 


1 


NO VI 2 : 


92 


Sbjct : 


61 


NOV12 : 


152 


Sbjct : 


101 


NOV12 ; 


212 


Sbjct : 


160 


NOV12 : 


271 


Sbjct : 


220 



GNTLSQWIFLTKIGKKTSTHI YLSHLVTANLLVCSAMPFMSIYFLKGFQWEYQSAQCRW 91 

II I + I +1 l + l + l +1 Ml +1 ++I + I I I + I I++I 

GNLLVILVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 6 0 

NFLGTLSMHASMFVSLLILSWIAISRYATLMQKDSSQETTSCYEKIFYGHLLKKFRQPNF 151 

I ++ +1 1+ +1+ MM I + ++ I I 

GAL F WNGY AS I L LLTAISIDRYLA IVHPLRYRRIRTPRR 10 0 

ARKLCIYIWGWLGIIIPVTVYYSVIEATEGEESLCYNRQMELGAMISQIAGLIGTTFIG 211 

1+ I + +1 + I + +1 ++ + II ++I I I + |+ 

AKVLILLVWVLALLLSLPPLLFSWLRTVEEGNTTVCLIDFPEESVKRSYVLLSTLVGFV- 15 9 

FSFLWLTSYYSFVSHLRK- IRTCTSIMEKDLTYSSVKRHLLVIQILLIVCFLPYSIFKP 270 

11 + 11 + I I I 1+ 1+ + + + I I 1+ ++ ++I + I I I I 

LPLLVILVCYTRILRTLRKRARSQRSLKRRSSSERKAAKMLLVVVVVFVLCWLPYHIVLL 219 



+ + 

LDSLCLLSIWRVLPT- - 



I II I +111 

- ALL I TLWLAYVNS CLNP 1 1 



308 



253 



Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
protein-coupled receptors in a number of species. These receptors share a seven 
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transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
5 displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV 12 nucleic acid of the invention encoding a G-Protein Coupled 
Receptor -like protein includes the nucleic acid whose sequence is provided in Table 12A or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 

10 bases may be changed from the corresponding base shown in Table 12A while still encoding a 
protein that maintains its G-Protein Coupled Receptor-like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 

15 includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 

include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 

20 binding nucleic acids in therapeutic applications in a subject. 

The disclosed NOV 12 protein of the invention includes the G-Protein Coupled 
Receptor-like protein whose sequence is provided in Table 12B. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 12B while still encoding a protein that maintains its G-Protein 

25 Coupled Receptor -like activities and physiological functions, or a functional fragment thereof. 
In the mutant or variant protein, up to about 77 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G-Protein Coupled Receptor-like 

30 protein (NOV 12) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
NOV 12 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
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targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV 12 nucleic acids and proteins of the invention are useful in potential 
5 therapeutic applications implicated in developmental diseases, MHCII and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 

10 starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 

fungal, protozoal and viral infections (particularly infections caused by HIV-l or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 

15 Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 

20 or Gilles de la Tourette syndrome, and/or other pathologies and disorders. 

NOV 12 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV 12 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

25 NOVX Antibodies" section below. The disclosed NOV 12 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

30 NOV13 

NOV 13 includes three novel G-Protein Coupled Receptor -like proteins disclosed 
below. The disclosed sequences have been named NOV 13a and NOV 13b. 
NOV13a 
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A disclosed NOV13a nucleic acid of 961 nucleotides (also referred to as CG56075-01) 
encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 13A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 12-14 and 
ending with a TGA codon at nucleotides 936-938. The start and stop codons are shown in 
5 bold in Table 13 A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 13A. NOV13a nucleotide sequence (SEQ ID NO:55). 

GACAACAAACTA TGAGACAGATAAATCAGACACAAGTGACAGAATTCCTCCTTCTGGGACTCTCTGATGGGC 
CACACACCGAGCAGCTGCTATTTATCGTATTATTGGGTGTCTACCTGGTCACTGTGCTTGGAAATCTGCTTC 
TAATCTCCCTTGTTCATGTTGACTCCCAACTTCACACACCCATGTATTTTTTTCTCTGCAACTTGTCTCTGG 
CTGACCTCTGTTTCTCTACCAACATAGTTCCTCAGGCACTAGTCCACCTGCTTTCCAGAAAGAAGGTCATTG 
CATTCACACTTTGCGCAGCTCGACTTCTCTTTTTCCTCATTTTTGGGTGTACCCAGTGCGCCCTTCTTGCAG 
TGATGTCCTATGATCGCTATGTTGCAATCTGCAATCCTCTGCGTTACCCTAACATCATGACCTGGAAAGTGT 
GTGT C C AGC TGGC AACAGGATC ATGG AC C AGTGGC ATT CTGGTGTCTGTGGT AG AC AC C ACCT TC AC AC TGA 
GGCTACCCTACCGAGGCAGTAACAGCATTGCTCATTTCTTTTGTGAGGCCCCTGCACTATTGATCTTAGCAT 
CCACAGACACCCATGCATCAGAGATGGCCATTTTTCTTACGGGGGTTGTGATTCTCCTCATACCTGTTTTTC 
TGATTCTGGTATCCTATGGCCGTATCATAGTAACTGTGGTCAAGATGAAGTCAACTGTGGGGAGTCTCAAGG 
CATTTTCTACCTGTGGCTCCCACCTCATGGTGGTCATACTTTTTTATGGATCAGCAATTATCACTTACATGA 
CACCCAAGTCTTCCAAACAGCAGGAAAAATCGGTGTCTGTTTTCTATGCAATAGTGACTCCCATGCTTAATC 
CCCTCATCTATAGCCTGAGAAACAAGGATGTGAAGGCAGCTCTGAGGAAAGTAGCCACAAGGAATTTCCCAT 
G AAGGCTTGGAAT C T CAC ACTG AC A 



The disclosed NOV13a polypeptide (SEQ ID NO:56) encoded by SEQ ID NO:55 has 
308 amino acid residues and is presented in Table 13B using the one-letter amino acid code. 
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Table 13B. Encoded NOV13a protein sequence (SEQ ID NO:56). 

MRQINQTQ^TEFLLLGLSDGPHTEQLLFIVLLGVYLVTVLGNLLLISLVHVDSQLHTPMYFFLC 
NLSLADLCFSTNIVPQALVHLLSRKKVIAFTLCAARLLFFLIFGCTQCALLAVMSYDRYVAICN 
PLRYPNIMTWKVCVQLATGSWTSGILVSWDTTFTLRLPYRGSNSIAHFFCEAPALLILASTDT 
HASEMAIFLTGWILLIPVFLILVSYGRIIVTWKMKSTVGSLKAFSTCGSHLMWILFYGSAI 
ITYMTPKSSKQQEKSVSVFYAIVTPMLNPLIYSLRNKDVKAALRKVATRNFP 



A search of sequence databases reveals that the NOV13a amino acid sequence has 216 
of 217 amino acid residues (99%) identical to, and 217 of 217 amino acid residues (100%) 
similar to, the 217 amino acid residue ptnr: SPTREMBL-ACC:095224 protein from Homo 
1 5 sapiens (Human) (Olfactory Receptor) (E = 2.2e~ 109 ). 

NOV13b 

A disclosed NOV 13b nucleic acid of 961 nucleotides (also referred to as CG56021-02) 
encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 13C. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 12-14 and 
20 ending with a TGA codon at nucleotides 936-938. A putative untranslated region upstream 
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from the initiation codon is underlined in Table 13C. The start and stop codons are in bold 
letters. 

Table 13C. NOV13b nucleotide sequence (SEQ ID NO:57). 

~GACAACAAACT ATGAGACAGATAAATCAGACACAAGTGACAGAATTCCTCCTTCTGGGACTCTGTGATCC;nr 
CACACACCGAGCAGCTGCTATTTATCGTATTATTGGGTGTCTACCTGGTCACTGTGCTTGGAAATCTGCTTC 
TAATCTCCCTTGTTCATGTTGACTCCCAACTTCACACACCCATGTATTTTTTTCTCTGCAACTTGTCTCTGG 
CTGACCTCTGTTTCTCTACCAACATAGTTCCTCAGGCACTAATCCACCTGCTTTCCAGAAAGAAGGTCATTG 
CATTCACACTTTGCGCAGCTCGACTTCTCTTTTTCCTCATTTTTGGGTGTACCCAGTGCGCCCTTCTTGCAG 
TGATGTCCTATGATCGCTATGTTGCAATCTGCAATCCTCTGCGTTACCCTAACATCATGACCTGGAAAGTGT 
GTGTCCAGCTGGCAACAGGATCATGGACCAGTGGCATTCTGGTGTCTGTGGTAGACACCACCTTCACACTGA 
GGCTACCCTACCGAGGCAGTAACAGCATTGCTCATTTCTTTTGTGAGGCCCCTGCACTATTGATCTTAGCAT 
CCACAGACACCCATGCATCAGAGATGGCCATTTTTCTTATGGGGGTTGTGATTCTCCTCATACCTGTTTTTC 
TGATTCTGGTATCCTATGGCCGTATCATAGTAACTGTGGTCAAGATGAAGTCAACTGTGGGGAGTCTCAAGG 
CATTTTCTACCTGTGGCTCCCACCTCATGGTGGTCATACTTTTTTATGGATCAGCAATTATCACTTGCATGA 
C AC C C AAGT CTTCC AAAC AGC AGGAAAAAT CGGTGTCTGTTT T CT ATGC AAT AGTG ACT C C C ATG CT T AATC 
CCCTCATCTATAGCCTGAGAAACAAGGATGTGAAGGCAGCTCTGAGGAAAGTAGCCACAAGGAATTTCCCAT 
GAAGGCTTGGAATCTCACACTGACA 



In a search of public sequence databases, the NOV 13b nucleic acid sequence has 648 
5 of 653 bases (99%) identical to a gb:GENBANK-ID:AF065876|acc:AF065876.1 mRNA from 
Homo sapiens (olfactory receptor (OR2D2) gene, partial cds) (E = 2.8e' 139 ). 

The disclosed NOV13b polypeptide (SEQ ID NO:58) encoded by SEQ ID NO:57 has 
308 amino acid residues and is presented in Table 13D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV 13b has a signal peptide and is 
10 likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV 13b may also localize to the Golgi body with a certainty of 0.4000, the endoplasmic 
reticulum (membrane) with a certainty of 0.3000, or in the microbody (peroxisome) with a 
certainty of 0.3000. The most likely cleavage site for NOV 13b is between positions 53 and 54: 
VDS-QL. 

15 

Table 13D. Encoded NOV13b protein sequence (SEQ ID NO:58). 

MRQINQTQVTEFLLLGLCDGPHTEQLLFIVLLGVYLVTVLGNLLLISLVHVDSQLHTPMYFFLCNLSLADLC 
FSTNIVPQALIHLLSRKKVIAFTLCAARLLFFLIFGCTQCALLAVMSYDRYVAICNPLRYPNIMTWKVCVQL 
ATGSWTSGILVSWDTTFTLRLPYRGSNSIAHFFCEAPALLILASTDTHASEMAIFLMGWILLIPVFLILV 
SYGRIIVTWKMKSTVGSLKAFSTCGSHLMWILFYGSAIITCMTPKSSKQQEKSVSVFYAIVTPMLNPLIY 
SLRNKDVKAALRKVATRNFP 



A search of sequence databases reveals that the NOV1 3 amino acid sequence has 52 of 
179 amino acid residues (29%) identical to, and 86 of 179 amino acid residues (48%) similar 
to, the 339 amino acid residue ptnr: SWISSPROT-ACC:Q 13304 protein from Homo sapiens 
20 Putative G Protein-Coupled Receptor GPR17 (R12) (E - 3.3e" 157 ). 

NOV1 3b is predicted to be expressed in at least Apical microvilli of the retinal 
pigment epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma cell lines, 
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corpus callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral tissue, 
cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary artery and 
umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal hematopoietic 
cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, lung lymphoma 
cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express MHC II and III 
nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, pons, prostate, 
putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary artery in aortic) 
spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, thalamus, and thymus 
tissue. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 

The disclosed NOV 13a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 13E. 



Table 13E. BLAST results for NOV13a 



Gene Index/ 
identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 14423 8 07 | sp | Q9H2 
10 |02D2_HUMAN 


OLFACTORY 
RECEPTOR 2D2 

(OLFACTORY 
RECEPTOR 11-610) 

(OR11-610) (HB2) 


308 


307/308 
(99%) 


308/308 
(99%) 


e-148 


gi| 17461460)ref |XP 
062286 . 1 | 
(XM__062286) 


similar to hB2 
olfactory- 
receptor (H. 
sapiens) [Homo 
sapiens] 


308 


308/308 
(100%) 


308/308 
(100%) 


e-148 


gi | 120074 09 | gb | AAG4 
5183 . 1 [ (AF321233 ) 


B2 olfactory- 
receptor [Mus 
musculus] 


314 


261/305 
(85%) 


278/305 
(90%) 


e-127 


gi | 3 831619 |gb| AAC70 
020. 1| (AF065876) 


olfactory 
receptor [Homo 
sapiens] 


217 


216/217 
(99%) 


2'17/217 
(99%) 


e-100 


gi | 15293767 | gb | AAK9 
5076. 1 | (AF399591) 


olfactory 
receptor [Homo 
sapi ens] 


214 


213/214 
(99%) 


214/214 
(99%) 


e-100 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 13F. In the ClustalW alignment of the NOV13 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 13F. ClustalW Analysis of NOVO 



1) Novel NOV13a (SEQ ID NO: 56) 

2) Novel NOV13b (SEQ ID NO: 58) 

3) gi | 14423 807 | sp | Q9H210 |02D2_HUMAN OLFACTORY RECEPTOR 2D2 (OLFACTORY RECEPTOR 11- 
610) (OR11-610) (HB2) (SEQ ID NO:377) 

4) gi | 17461460 | ref |XP_062286 . 1 | (XM_062286) similar to hB2 olfactory receptor (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 378) 

5) gi | 12007409 |gb|AAG45183 . 1 | (AF321233) B2 olfactory receptor [Mus musculus] (SEQ 
ID NO: 3 79) 

6) gi| 3831619 |gb|AAC70020.l| (AF065876) olfactory receptor [Homo sapiens] (SEQ ID 
NO : 3 8 0 ) 

7) gi | 15293767 |gb | AAK95076 . 1 | (AF399591) olfactory receptor [Homo sapiens] (SEQ ID 
NO: 381) 



NOV13a 


1 


M 


NOV13b 


1 


M 


gi|l4423807| 


1 


M 


gij 17461460| 


1 


M 


gi | 12007409 j 


1 




gi j 3831619 | 


1 




gij 15293767 | 


1 






20 



LLGL 


S 


D 




PHT 


EQ 


LLFI 


1 

i 


LG 


LLGL 




D 




PHT 


KG 


LLFI 


LG 


LLGL 


S 


D 




PHT 


EQ 


LLFI 




LG 


LLGL 




D 




PHT 


EQ 


LLFI 


t 


LG 


LLGL 


S 


D 




PHT 


QK, 


LLFI 




LG 




60 
• I 



DSiLHTPMY 

ds-Slhtpmy 
dsSlhtpmy 
dsSlhtpmy 
dsSlhtpmy 



60 

60 

60 

60 

59 

1 

1 



70 



80 



NOV13a 

NO VI 3 b 

gi|l4423807| 

gi|l746146oj 

gi | 12007409 j 

gi|3831619| 

gi|!5293767| 



NOVl3a 


121 


NO VI 3b 


121 


gi | 14423807 | 


121 


gijl746146oj 


121 


gi | 12007409 j 


120 


gi|3831619| 


55 


gi | 15293767 | 


54 


NOV13a 


181 


NOV13b 


181 


gi| 14423807 | 


181 


gijl746146oj 


181 


gi jl2007409 j 


180 


gi|3831619| 


115 


gi | 15293767 | 


114 



I 



100 

. - 1 . . 



110 

. - 1 . . 
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I 



90 

| | „,_,___ 

61 120 

61 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 120 

1 ^^^^^^^^^^^^^^maai^Baaag^^^^^^^^^S 54 

1 -MMJ^iSSGBStW! ?^^ 53 

130 140 150 160 170 180 
I .... I I I | I I I I .... I .... I .... | 



DRYVAI 
DRYVAI 
DRYVAI 
DRYVAI 
DRYVAI 
DRYVAI 
DRYVAI 



CNPLRYPffilMTWKVCVQLATGSWTSG 

cnplryp|imtwkvcvqlatgswtsg 
cnplrypglmtwkvcvqlatgswtsg 
cnplrypSimtwkvcvqlatgswtsg 

CNPLlSY ^ IMTWigvC^OLATgjsWTSG 

cnplrypHimtwkvcvqlatgswtsg 
cnplrypHimtwkvcvqlatgswtsg 



190 



200 



210 



ILVSVVDTTFTLRLPYRGSNS 
ILVSWDTTFTLRLPYRGSNS 
ILVSVVDTTFTLRLPYRGSNS 
ILVSVVDTTFTLRLPYRGSNS 
ILVSVVDTTFTLRLPYRGSNS 
ILVSWDTTFTLRLPYRGSNS 
ILVSWDTTFTLRLPYRGSNS 



220 



IAHFFCE 

iahffce 

IAHFFCE 
IAHFFCE 
IAHFFCE 
IAHFFCE 
IAHFFCE 



180 
180 
180 
180 
179 
114 
113 




250 



260 



270 



280 



NOV13a 


241 


NOV13b 


241 


gi| 14423807 | 


241 


gi|l7461460| 


241 


gij 12007409 | 


240 


gij3831619| 


175 


gi j 15293767 | 


174 



I 



290 



I 



CGSHLMWILFYGSAI 
CGSHLMWILFYGSAI 
CGSHLMWILFYGSAI 
CGSHLMWILFYGSAI 
CGSHLMWI LFYGSAI 
CGSHLMWILFYGSAI 
CGSHLMWILFYGSAI 



I TYMTPKS S KQQE KS VS VFYAI VT PMLNPL I YSLRNKDVKgAL 
ITgMTPKSSKQQEKSVSVFYAIVTPMLNPLIYSLRNKDVKgAL 
ITYMTPKSSKQQEKSVSVFYAIVTPMLNPLIYSLRNKDVKgAL 
ITYMTPKSSKQQEKSVSVFYAIVTPMLNPLIYSLRNKDVKgAL 
I TYMTPKS S KgjQEKjjvS VFYAjJVT PML NPLI YSLRNKDV K§AL 
ITYMTPKSSKQQEKSVSVFYAIVTPML 
ITYMTPKSSKQQEKSVSVFYAIVTPl 



300 
I 

R 
R 
R 
R 
W 



300 
300 
300 
300 
299 
217 
214 



310 
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10 



NOVl3a 

NO VI 3 b 

gi 1 14423807 | 

gi j 17461460 | 

gi | 12007409 j 

gi|3831619| 

gi j 15293767 | 




Table 13G lists the domain description from DOMAIN analysis results against 
NOVO. This indicates that the NOV 13 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 13G Domain Analysis of NOV13 



gnl | Pfam|pfam00001, 7tm_l, 7 transmembrane receptor (rhodopsin 
family). (SEQ ID NO: 810) 

CD-Length = 254 residues, 94.9% aligned 
Score = 93.2 bits (230), Expect = 2e-20 



NOV13 : 54 QLHTPMYFFLCNLSLADLCFSTNIVPQALVHLLSRKKVIAFTLCAARLLFFLIFGCTQCA 113 

+ 1 II II II++III I + I I I +1+ I || |++ | 

Sbjct: 14 KLRTPTNIFLI^JLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLVGALFVVNGYASIL 73 

NOV13 : 114 LLAVMSYDRYVAICNPLRYPNIMTWKVCVQLATGSWTSGILVSWDTTFTLRLPYRGSNS 173 

II +1 lll + ll +1 I I I I I - I I + | + |+ |+ | + 

Sbjct: 74 LLTAISIDRYIAIVHPLRYRRIRTPRP^KVLILLVWLALLLSLPPLLFSWLRTVEEGNT 133 
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30 



35 
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NOV13 : 
Sbjct : 

NOV13 : 
Sbjct : 
NOV13 : 
Sbjct : 



174 IAHFFC EAPALLILASTDTHASEMAIFLTGWJMXPVFLJI/VSYGRIIVTWKM 22 8 

+ ++I++ + + I | | | + | + | 

13 4 TVCLIDFPEESVKRSYVLLSTLVGFVLPLLVILV CYTRI LRTLRKR 17 9 



22 9 KSTVGSLK AFSTCGSHLMWILFYGSAI TTYMTPKSSKQQEKSVSVFYAX - 

+ III I + + | + + | + + + + | 

18 0 ARSQRSLKRRSSSERKAAKMLLVVVVVFVLCWLPYHIVLLLDSLCLLSIWRVLPTALLIT 

27 9 VTPMLNPLIY 288 

I lll+ll 

24 0 LWLAYVNSCLNPIIY 2 54 



278 
239 
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G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 
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Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
protein-coupled receptors in a number of species. These receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
5 Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV 13 nucleic acid of the invention encoding a G-Protein Coupled 
10 Receptor -like protein includes the nucleic acid whose sequence is provided in Table 13 A, 

14C or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 13 A, or 14C while 
still encoding a protein that maintains its G-Protein Coupled Receptor -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
1 5 nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
20 are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 1 percent of the bases may be so 
changed. 

25 The disclosed NOV 13 protein of the invention includes the G-Protein Coupled 

Receptor -like protein whose sequence is provided in Table 13B, or 14D. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 13B, or 14D while still encoding a protein that 
maintains its G-Protein Coupled Receptor -like activities and physiological functions, or a 

30 functional fragment thereof. In the mutant or variant protein, up to about 15 percent of the 
residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fab>2, that bind immunospecifically to any of the proteins of the invention. 
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The above disclosed information suggests that this G-Protein Coupled Receptor -like 
protein (NOV1 3) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
NOV13 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
5 below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

10 The NOV 13 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in developmental diseases, MHCII and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 

15 disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NTDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 

20 hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 

Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 

25 autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 

NOV 13 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV 13 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 

30 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

NOVX Antibodies" section below. The disclosed NOV 13 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
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understanding of pathology of the disease and development of new drug targets for various 



A disclosed NOV 14 nucleic acid of 986 nucleotides (also referred to as CG56023-01) 



5 encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 14A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 23-25 and 
ending with a TGA codon at nucleotides 974-976. The start and stop codons are shown in 
bold in Table 14 A, and the 5' and 3' untranslated regions, if any, are underlined. 



CTGGGGATTTATGCCCATACTT ATGGCTATAGGAAACTGGACAGAAATAAGTGAATTTATCCTCATGAGCTT 
CTCTTCCCTACCTACTGAAATACAGTCATTGCTCTTCCTGACATTTCTAACTATCTATTTGGTTACTCTGAA 
GGGAAACAGCCTCATCATTCTGGTTACCCTAGCTGACCCCATGCTACACAGCCCCATGTACTTCTTCCTCAG 
AAACTTATCTTTCCTGGAGATTGGCTTCAACCTAGTCATTGTGCCCAAAATGCTGGGGACCCTGCTTGCCCA 
GGACACAACCATCTCCTTCCTTGGCTGTGCCACTCAGATGTATTTCTTCTTCTTCTTTGGGGTAGCTGAATG 
CTTCCTCCTGGCTACCATGGCATATGACCGCTATGTGGCCATCTGCAGTCCCTTGCACTACCCAGTCATCAT 
GAACCAAAGGACACGGGCCAAACTGGCTGCTGCTTCCTGGTTCCCAGGCTTTCCTGTAGCTACTGTGCAGAC 
CACATGGCTCTTCAGTTTTCCATTCTGTGGCACCAACAAGGTGAACCACTTCTTCTGTGACAGCCCGCCTGT 
GCTGAAGCTGGTCTGTGCAGACACAGCACTGTTTGAGATCTACGCCATCGTCGGAACCATTCTGGTGGTCAT 
GATCCCCTGCTTGCTGATCTTGTGTTCCTATACTCGCATTGCTGCTGCTATCCTCAAGATCCCATCAGCTAA 
AGGGAAGCATAAAGCCTTCTCTACGTGCTCCTCACACCTCCTTGTTGTCTCTCTTTTCTATATATCTTCTAG 
CCTCACCTACTTCTGGCCTAAATCAAATAATTCTCCTGAGAGCAAGAAGTTGTTATCATTATCCTACACTGT 
TGTGACTCCCATGTTGAACCCCATTATCTACAGCTTGAGAAATAGCGAGGTGAAGAATGCCCTCAGCAGGAC 
CTT CCAC AAGGT C C TAG C C CT C AG AAACTG TAT C C C AT AGAC C TTAGGAA 



The disclosed NOV 14 polypeptide (SEQ ID NO:60) encoded by SEQ ID NO:59 has 
321 amino acid residues and is presented in Table 14B using the one-letter amino acid code. 



Table 14B. Encoded NOV14 protein sequence (SEQ ID NO:60). 

MPILMAIGNWTEISEFILMSFSSLPTEIQSLLFLTFLTIYLVTLKGNSLIILVTLADPMLHSPM 
YFFLRNLSFLEIGFNLVIVPKMLGTLLAQDTTISFLGCATQMYFFFFFGVAECFLLATMAYDRY 
VAICSPLHYPVIMNQRTRAKLAAASWFPGFPVATVQTTWLFSFPFCGTNKVNHFFCDSPPVLKL 
VCADTALFEIYAIVGTILWMIPCLLILCSYTRIAAAILKIPSAKGKHKAFSTCSSHLLWSLF 
YISSSLTYFWPKSNNSPESKKLLSLSYTWTPMLNPIIYSLRNSEVKNALSRTFHKVLALRNCIP 

15 A search of sequence databases reveals that the NOV 14 amino acid sequence has 234 

of 310 amino acid residues (75%) identical to, and 264 of 310 amino acid residues (85%) 
similar to, the 3 15 amino acid residue ptnr: SPTREMBL-ACC:Q9JKA6 protein from Mus 
musculus (Mouse) (OLFACTORY RECEPTOR P2) (E = 4.0e" 124 ). 

The disclosed NOV 14 polypeptide has homology to the amino acid sequences shown 

20 in the BLASTP data listed in Table 14C. 



disorders. 



NO VI 4 



Table 14A. NOV14 nucleotide sequence (SEQ ID NO:59). 



10 
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Table 14C. BLAST results for NOV14 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 14423 805 | sp | Q9H2 
07 | OAA5_HUMAN 


OLFACTORY 
RECEPTOR 10A5 
(HP3) 


317 


317/317 
(100%) 


317/317 
(100%) 


e-154 


gi | 12007437 j gb | 7AAG4 
5207 . 1 | AF321237 4 
(AF321237) 


hP4 olfactory 
receptor [Homo 
sapiens] 


317 


300/317 
(94%) 


305/317 
(95%) 


e-145 


gi | 12007412 |gb | AAG4 
5186. l| (AF321233) 


P3 olfactory 
receptor [Mus 
musculus] 


317 


292/316 
(92%) 


302/316 
(95%) 


e-140 


gi | 154195 83 | gb j AAK9 
7076 . 1 |AF2 93 08 0 1 
(AF2 93 0 80) 


olfactory 
receptor P3 [Mus 
musculus] 


324 


294/320 
(91%) 


304/320 
(94%) 


e-140 


gi | 12 0 07411 | gb | AAG4 
5185.1 | (AF321233) 


P4 olfactory 
receptor [Mus 
musculus] 


317 


281/316 
(88%) 


296/316 
(92%) 


e-136 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 14F. In the ClustalW alignment of the NOV14 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 



15 



20 



1) Novel 

2) gi I 144 

3) gi|l20 
sapiens] ( 

4) gi|l20 
ID NO: 3 84) 

5) gi|l54 
musculus] 

6) gi|l20 
ID NO:386) 



Table 14D. ClustalW Analysis of NOV14 

N0V14 (SEQ ID NO: 60) 

23 805 |sp|Q9H207|OAA5_HUMAN OLFACTORY RECEPTOR 10A5 (HP3 ) (SEQ ID NO:382) 
07437|gb|AAG45207.l|AF321237_4 (AF321237) hP4 olfactory receptor [Homo 
SEQ ID NO: 3 83) 

07412 |gb|AAG45186.1 I (AF321233) P3 olfactory receptor [Mus musculus] (SEQ 

19583 |gb|AAK97076 . 1 |AF293080_1 (AF293080) olfactory receptor P3 [Mus 
(SEQ ID NO: 3 85) 

07411 |gb|AAG45185 . 1 1 (AF321233) P4 olfactory receptor [Mus musculus] (SEQ 



25 



30 



35 



NO VI 4 

gi I 14423805 | 
gi|l2007437 | 
gi j 12007412 j 
gi j 15419583 j 
gi|l200741l| 



NOV14 

gi 1 14423805 I 
gi|l2007437 j 
gi I 12007412 | 



58 
54 
54 
54 



10 20 30 40 50 60 
I I I j I j j j , I j I j 

1 TO Jlol^^ 53 

1 MLFMLIpMT^Q^OTr^^^^^^^^^ ^TOT^^H A^^^^ ^^B L ^^^^^^^^ffi 6 0 

1 K&qS^ 53 

70 80 90 100 110 120 
I I 1 I I .... I . . . . I I I I I I 



PMLgSPMYFFLRNLSFLEIGFNLVIVPKMLGTLgAQDTglSFLGCATQMYFFFFFGVAEC 
PMLgSPMYFFLRNLSFLEIGFNLVIVPKMLGTL|AQDT|lSFLGCATQMYFFFFFGVAEC 

pmlgspmyfflrnlsfleigfnlvivpkmlgtl^qdtglsflgcatqmyfffffgvaec 
pmlBspmyfflrnlsfleigfn lvivpkmlgt lBaqdt§isflgcatqmyfffffgvaec 



117 
113 
113 
113 



126 



t f u p 7' S"? l .„ O $ 3 1 s.;i «■ 



10 



15 



20 



25 



30 



35 



gi|l5419583 | 


61 


gij 12007411 | 


54 


NO VI 4 


118 


gi | 14423805 | 


114 


gi j 12007437 j 


114 


gi | 12007412 | 


114 


gi | 15419583 | 


121 


gij 12007411] 


114 


NO VI 4 


178 


gi 1 14423805 | 


174 


gi j 12007437 j 


174 


gi j 12007412 | 


174 


gi | 15419583 j 


181 


gi j 12007411 j 


174 



NO VI 4 


238 


gi 


|14423805| 


234 


gi 


12007437 j 


234 


gi | 12007412 | 


234 


gij 15419583| 


241 


gi j 12007411 j 


234 


NO VI 4 


298 


gi 


14423805 | 


294 


gi 


12007437 j 


294 


gi 


12007412 j 


2 94 


gi 


15419583 j 


301 


gi 


12007411 j 


294 



PMLffiSPMYFFLRNLSFLEIGFNLVIVPKMLGTLHAQDTg ISFLGCATQMYFFFFFGVAEC 
PMLgSPMYFFLRNLSFlEIGFNLVigPKMLBTLaAQDTBISFLGCATQMYFFFFFGVAEC 



120 
113 



130 
I I . . 



140 
. - I . . 



FLLATMAYDRYVAICSPLHYPVIMN 
FLLATMAYDRYVAI CS PLHYPVIMNQ 
FLLATMAYDRYVAICSPLHYPVIMNQ 
FLLATMAYDRYVAI CS PLHYPVIMNQ 
FLLATMAYDRYVAICSPLHYPVIMNQ 
FLLATMAYDRYVAI CS PLHYPBlMNQ 





250 260 270 28*0 290 
. . I I I I I | | I I I 



300 
• - I 



gkhkafstcsshllwslfyg 
gkhkafstcsshllwslfy! 
gk^kafstcsshllwslfyj 
gkhkafstcsshllwslfyL 
gkhkafstcsshllwslfyS 
gkhkafstcsshlSwslfyS 



ISSSLTYFmPKSNNSPESKKLLSLSYTVVTPMLNPIIYSL 
SSSLTYFggPKSNNSPESKKLLSLSYTVVTPMLNPIIYSL 
SjJSLTYFRPKSNNSPEgKKLLSLSYTVtjTPMLNPIIYSL 
jSSSLTYFRPKSNNSPESKKLLSLSYTVVTPMLNPI IYSL 
jSSSLTYFRPKSNNSPESKKLLSLSYTWTPMLNPIIYSL 
|S[ISLTYFRPKS[5NSPESKKLLSLSYTVVTPiLNPIIYSL 



297 
293 
293 
293 
300 
293 



310 



320 



E VKNALS RT FHKjgLAL RNWI 
EVKNALSRTFHKQLALRNSI 
RNNE VKNAL S RT^KALALRNH I 
RNNE VKjgAL S RT FHKAL AL RnS I 
RNNE VK^ALS RT FHKALALRNgl 
RNNEVKNALSRTFHKALALREaSl 



321 
317 
317 
317 
324 
317 
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Table 14E lists the domain descriptions from DOMAIN analysis results against 
NOV 14. This indicates that the NOV 14 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 14E Domain Analysis of NO VI 4 

gnl | Pf am|pf amOOOOl, 7tm_l, 7 transmembrane receptor (rhodopsin 
family). (SEQ ID NO:810) 

CD-Length = 254 residues, 100.0% aligned 
Score = 103 bits (256) , Expect = 2e-23 



NOV14 : 


46 


Sbjct : 


1 


NOV14 : 


106 


Sbjct : 


61 


NOV14 : 


166 


Sbjct : 


114 



GNSLIILVTLADPMLHSPMYFFLRNLSFLEIGFNLVIVPKMLGTLLAQDTTISFLGCATQ 105 

II Mil I I +1 1 1 1 1+ ++ I I + I I 1+ I I 

GNLLVILVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 60 

MYFFFFFGVAECFLLATMAYDRYVAI CSPLHYPVIMNQRTRAKLAAAS WFPGFPVATVQT 165 

I M M ++ lll+ll II I I I I I 
GALFWNGYAS ILLLTAI S I DRYLAI VHPLRYRR I RTPRRAKVL I LLVWVLAL 113 

TWLFSFPFCGTNKVNHFFCDSPPVLKLVCADTALFEIYAIVGTILWMIPCLLILCSYTR 225 

III + + + I + + | ++ |++ ++ | |+| | (I) 

- -LLSLPPLLFSWLRTVEEGNTTVCLIDFPEESVKRSYVLLSTLVGFVLPLLVILVCYTR 171 
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NOV14: 226 IA AAILKIPSAKGKHKAFSTCSSHLLWSLFY ISSSLTYFWPKSNNS 272 

I II 1+ + I ++ I + + 

^ Sbjct: 172 ILRTLRKRARSQRSLKRRSSSERKAAKMLLWVWFVLCWLPYHIVLLLDSLCLLSIWR 231 

NOV14 : 273 PESKKLLSLS YTWTPMLNPI IY 295 

+ I++I I I I I I I I 

Sbjct: 23 2 LPTALLITLWLAYVNSCLNPIIY 2 54 

10 G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 

subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
signals. Previously, GPCR genes cloned in different species were from random locations in the 

15 respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
protein-coupled receptors in a number of species. These receptors share a seven 

20 transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 

25 epithelium. 

The disclosed NOV 14 nucleic acid of the invention encoding a G-Protein Coupled 
Receptor -like protein includes the nucleic acid whose sequence is provided in Table 14A or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 14A while still encoding a 

30 protein that maintains its G-Protein Coupled Receptor -like activities and physiological 

functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 

35 include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. 
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The disclosed NOV 14 protein of the invention includes the G-Protein Coupled 
Receptor -like protein whose sequence is provided in Table 14B. The invention also includes 
a mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 14B while still encoding a protein that maintains its G-Protein 
5 Coupled Receptor -like activities and physiological functions, or a functional fragment thereof. 
In the mutant or variant protein, up to about 12 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G-Protein Coupled Receptor -like 

10 protein (NOV14) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
NOV 14 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 

15 targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV 14 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in developmental diseases, MHCII and III diseases 

20 (immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 

Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 

25 fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 
Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 

30 ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 

disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 
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NOV14 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV 14 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
5 NOVX Antibodies" section below. The disclosed NOV 14 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 
10 NOV15 

NOV 15 includes three novel G-Protein Coupled Receptor -like proteins disclosed 
below. The disclosed sequences have been named NOV 15a and NOV 15b. 

NO VI 5a 

A disclosed NOV15a nucleic acid of 943 nucleotides (also referred to as CG56065-01) 
15 encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 1 5 A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 2-4 and 
ending with a TGA codon at nucleotides 935-937. The start and stop codons are shown in 
bold in Table 15 A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 15A. NOV15a nucleotide sequence (SEQ ID NO:61). 

AATGGCAGCAGAAAACCATTCTTTTGTGACTAAGTTTATTCTGGTTGGGCTAACAGAGAAGTCAG 
AGCTACAGCTGCCCCTCTTCCTCGTCTTCCTGGGAATCTATGTAGTCACAGTGCTGGGGAACCTG 
GGCATGATCACACTGATTGGGCTCAGTTCTCACCTGCACACACCTATGTACTGTTTCCTCAGCAG 
TCTGTCCTTCATTGACTTCTGCCATTCCACTGTCATTACCCCTAAGATGCTGGTGAACTTTGTGA 
CAGAGAAGAACATCATCTCCTACCCTGAATGCATGACTCAGCTCTACTTCTTCCTCGTTTTTGCT 
ATTGCAGAGTGTCACATGTTGGCTGCAATGGCATATGACGGCTACGTGGCCATCTGTAGCCCCTT 
GCTGTACAGCATCATCATATCCAATAAGGCTTGCTTTTCTCTGATTTTAGTGGTGTATGTAATAG 
GCCTGATTTGTGCGTCAGCTCATATAGGCTGTATGTTTAGGGTTCAATTCTGCAAATTTGATGTG 
AT C AA CC AT T ATTT CT GT G AT C T T ATTT CT AT C TT G AAG C T C T C CTGTT C T AGT AC T T AC AT T AA 
TGAGTTACTGATTTTAATCTTTAGTGGAATTAACATCCTTGTCCCCAGCCTGACCATCCTCAGCT 
CTTACATCTTCATCATTGCCAGCATCCTCCGCATTCGCTACACTGAGGGCAGGTCCAAAGCCTTC 
AGCACTTGCAGCTCCCACATCTCGGCTGTTTCTGTTTTCTTTGGGTCTGCAGCATTCATGTACCT 
GCAGCCATCATCTGTCAGCTCCATGGACCAGGGGAAAGTGTCCTCTGTGTTTTATACTATTGTTG 
TGCCCATGCTGAACCCCCTGATCTACAGCCTGAGGAATAAAGATGTCCACGTTGCCCTGAAGAAA 
AC G C T AGGG AAAAG AAC AT T CT T AT G AAC AG AA 

20 1 

The disclosed NOV15a polypeptide (SEQ ID NO:62) encoded by SEQ ID NO:61 has 
31 1 amino acid residues and is presented in Table 15B using the one-letter amino acid code. 
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Table 15B. Encoded NOV15a protein sequence (SEQ ID NO:62). 

MAAENHSFVTKFILVGLTEKSELQLPLFLVFLGIYWTVLGNLGMITLIGLSSHLHTPMYCFLS 
SLSFIDFCHSTVITPKMLVNFVTEKNIISYPECMTQLYFFLVFAIAECHMLAAMAYDGYVAICS 
PLLYSIIISNKACFSLILWYVIGLICASAHIGCMFRVQFCKFDVINHYFCDLISILKLSCSST 
YINELLILIFSGINILVPSLTILSSYIFIIASILRIRYTEGRSKAFSTCSSHISAVSVFFGSAA 
FMYLQPSSVS SMDQGKVS SVFYTIWPMLNPLIYSLRNKDVHVALKKTLGKRTFL 



A search of sequence databases reveals that the NOV 15a amino acid sequence has 235 
of 31 1 amino acid residues (75%) identical to, and 270 of 31 1 amino acid residues (86%) 
similar to, the 31 1 amino acid residue ptnr: SPTREMBL-ACC:0351 84 protein from Rattus 
norvegicus (Rat) (Olfactory Receptor) (E = 9.9e" 121 ). 

NO VI 5b 

A disclosed NOV 15b nucleic acid of 943 nucleotides (also referred to as CG56065-02) 
encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 1 5C. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 2-4 and 
ending with a TGA codon at nucleotides 935-937. The start and stop codons are shown in 
bold in Table 15C, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 15C. NOV15b nucleotide sequence (SEQ ID NO:63). 

AATGGCAGCAGAAAACCATTCTTTTGTGACTAAGTTTATTCTGGTTGGGCTAACAGAGAAGTCAGAGCTACA 
GCTGCCCCTCTTCCTCGTCTTCCTGGGAATCTATGTAGTCACAGTGCTGGGGAACCTGGGCATGATCACACT 
GATTGGGCTCAGTTCTCACCTGCACACACCTATGTACTGTTTCCTCAGCAGTCTGTCCTTCATTGACTTCTG 
C C ATTC C AC TGT C ATT AC C C CT AAG AT GCTGGT G AACT TTGTG AC AGAG AAGAAC AT C ATCT C CT AC CCTG A 
ATGCATGACTCAGCTCTACTTCTTCCTCGTTTTTGCTATTGCAGAGTGTCACATGTTGGCTGCAATGGCATA 
TGACGGCTACGTGGCCATCTGTAGCCCCGTGCTGTACAGCATCATCATATCCAATAAGGCTTGCTTTTCTCT 
GATTTTAGTGGTGTATGTAATAGGCCTGATTTGTGCGTCAGCTCATATAGGCTGTATGTTTAGGGTTCAATT 
CTGCAAATTTGATGTGATCAACCATTATTTCTGTGATCTTATTTCTATCTTGAAGCTCTCCTGTTCTAGTAC 
TTACATTAATGAGTTACTGATTTTAATCTTTAGTGGAATTAACATCCTTGTCCCCAGCCTGACCATCCTCAG 
CTCTTACATCTTCATCATTGCCAGCATCCTCCGCATTCGCTACACTGAGGGCAGGTCCAAAGCCTTCAGCAC 
TTGCAGCTCCCACATCTCGGCTGTTTCTGTTTTCTTTGGGTCTGCAGCATTCATGTACCTGCAGCCATCATC 
TGT C AG CTC C ATGG ACC AGGGG AAAGTGT CCT CTGTGTT TT AT ACT ATTGT TGTGC C CGTGC TG AAC CCC CT 
GATCTACAGCCTGAGGAATAAAGATGTCCACGTTGCCCTGAAGAAAACGCTAGGGAAAAGAACATTCTTATG 
AACAGAA 



In a search of public sequence databases, the NOV15b nucleic acid sequence, localized 
to chromosome 4, has 770 of 937 bases (82%) identical to a gbrGENBANK- 
ID:AF282271 |acc:AF282271 .1 mRNA from Mus musculus (odorant receptor Kl 1 gene, 
complete cds) (E - 5.2e" 135 ). 

The disclosed NOV15b polypeptide (SEQ ID NO:64) encoded by SEQ ID NO:63 has 

31 1 amino acid residues and is presented in Table 15D using the one-letter amino acid code. 

Signal P, Psort and/or Hydropathy results predict that NOV15b has no signal peptide and is 

likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 

NOV 1 5b may also localize to the Golgi body with a certainty of 0.4000, the endoplasmic 
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reticulum (membrane) with a certainty of 0.3000, or in the microbody (peroxisome) with a 
certainty of 03000. The most likely cleavage site for NOV 15b is between positions 41 and 42: 
VLG-NL. 



Table 15D. Encoded NOV15b protein sequence (SEQ ID NO:64). 

MAAENHSFVTKFILVGLTEKSELQLPLFLVFLGIYWTVLGNLGMITLIGLSSHLHTPMYCFLSSLSFIDFC 
HSTVITPKMLVNFVTEKNIISYPECMTQLYFFLVFAIAECHMLAAMAYDGYVAICSPVLYSIIISNKACFSL 
ILWYVIGLICASAHIGCMFRVQFCKFDVINHYFCDLISILKLSCSSTYINELLILIFSGINILVPSLTILS 
SYIFIIASILRIRYTEGRSKAFSTCSSHISAVSVFFGSAAFMYLQPSSVSSMDQGKVSSVFYTIWPVLNPL 
IYSLRNKDVHVALKKTLGKRTFL 

5 

A search of sequence databases reveals that the NOV15b amino acid sequence has 237 
of 31 1 amino acid residues (76%) identical to, and 273 of 31 1 amino acid residues (87%) 
similar to, the 314 amino acid residue ptnnTREMBLNE W-ACC : A AG3 9856 protein from 
Mus musculus (Mouse) (Odorant Receptor Kl 1) (E = 2.6e" 125 ). 

10 NOV 15b is predicted to be expressed in at least the following tissues: Apical microvilli 

of the retinal pigment epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma 
cell lines, corpus callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral 
tissue, cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary 
artery and umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal 

15 hematopoietic cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, 
lung lymphoma cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express 
MHC II and III nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, 
pons, prostate, putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary 
artery in aortic) spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, 

20 thalamus, and thymus tissue. This information was derived by determining the tissue sources 
of the sequences that were included in the invention including but not limited to SeqCalling 
sources, Public EST sources, Literature sources, and/or RACE sources. 

The disclosed NOV 15a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 15E. 
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Table 15E. BLAST results for NO VI 5a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 174 72 6 72 | ref | XP 
061794. l| 
(XM_061794) 


similar to 
odorant receptor 
Kll (H. sapiens) 
[Homo sapiens] 


311 


311/311 
(100%) 


311/311 
(100%) 


e-140 
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gi | 116 92 519 |gb |AAG3 
9 8 5 6 . 1 | AF2 822 7 1_1 
(AF282271) 


odorant receptor 

V 1 n T H/f-t i «-*• 

Kll LiVIIiS 

musculus] 


314 


239/311 

V / o ^ ) 


273/311 
( 86% ) 


e-110 


gi | 116 92 527 | gb | AAG3 
9860.1 | AF2 82275_1 
(AF282275) 


odorant receptor 
K15 [Mus 
musculus] 


311 


236/311 
(75%) 


271/311 
(86%) 


e-108 


gi | 17472662 | ref | XP_ 
061790. 1| 
(XM_061790) 


similar to 
odorant receptor 
K4hll (H. 
sapiens) [Homo 
sapiens] 


593 


233/301 
(77%) 


261/301 
(86%) 


e-105 


gi | 23177 04 | gb | AAB66 
333. l| (AF010293) 


olfactory 
receptor [Rattus 
norvegicus] 


311 


235/311 
(75%) 


270/311 
(86%) 


e-105 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 15F. In the ClustalW alignment of the NOV15 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 Table 15F. ClustalW Analysis of NOV15 

1) Novel NOV15a {SEQ ID NO: 62) 

2) Novel NOV15b (SEQ ID NO: 64) 

3) gi | 17472672 | ref | XP_06l794 . 1 | (XM_061794) similar to odorant receptor Kll (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 3 87) 

15 4) gi | 11692519 |gb | AAG39856 . 1 | AF282271_1 (AF282271) odorant receptor Kll [Mus 

musculus] (SEQ ID NO: 388) 

5) gi|ll692527|gb|AAG39860.l|AF282275_l (AF282275) odorant receptor K15 [MUS 
musculus] (SEQ ID NO: 389) 

6) gi | 17472662 | ref |XP_061790 . 1 | (XM061790) similar to odorant receptor K4hll (H. 
20 sapiens) [Homo sapiens] (SEQ ID NO: 3 90) 

7) gi | 2317704 |gb|AAB66333 .1 | (AF010293) olfactory receptor [Rattus norvegicus] (SEQ 
ID N0:391) 



10 20 30 40 50 60 

25 

NOV15a 1 

NOV15b 1 

gi | 17472672 | 1 

gi I 11692519 | 1 

30 gi 1 11692527 J 1 

gi|l7472662| 1 

gi|2317704| 1 



35 

NOVlSa 1 

NOV15b 1 

gi | 17472672 | 1 

gi|H692519| 1 

40 gi | 11692527 j 1 

gi|l7472662| 61 

gi|2317704| 1 



1 

1 

1 

1 

1 

MVKGNHSTVTEFNLAGLTDKPELQLPLFLLFLGIYWTWGNLSMITLIGFSSHLHTPMY 6 0 
1 

70 80 90 100 110 120 

1 

1 

1 

1 

1 

HFLSSLSFIDLCQSSVITPKWLVNFVSERNIISYPACMTQLYFFLVLVISECHMLAAMAY 12 0 
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NOV15a 

NOV15b 

gi | 17472672 | 

gi j 11692519 | 

gi | 11692527 | 

gi|l7472662| 

gi | 2317704 | 



NOV15a 
NOVlSb 
gi|l7472672 | 
gi|11692519 j 
gi|ll692527| 
gi|l7472662 | 
gi|2317704| 



NOVlSa 

NOV15b 

gi | 17472672 | 

gi | 11692519 j 

gi | 11692527 j 

gij 17472662 j 

gi j 2317704 | 



NOVlSa 

NOVl5b 

gi | 17472672 | 

gi j 11692519 | 

gi | 11692527 | 

gi | 17472662 | 

gij 2317704 | 



NOV15a 

NOV15b 

gi | 17472672 | 

gi|H692519| 

gi | 11692527 j 

gi|l7472662 | 

gi|2317704 | 
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|....|....| 



140 



160 
|....|.. 



170 
..|....| 



180 



! 1 

121 DHYIAICNPLLYHVAMSYQVCSWMWEVYFMGFIGATCSHSLHAKSAFLLTILSSYIFIV 180 
! 1 



190 



210 



220 



230 



240 



|....|....|....|....|....|---.|...-|-...|....|....| 



! 1 

181 ASILCIRSTEGRSKTFSTCSSHISAVSVFFGGTSRSRFQVLGLEVRSVRLGGCPDAGQTP 24 0 
! 1 



250 



260 
..|.. 



270 



280 



! 

! 

1 

! 

1 

241 etqppvqslfsghrnlapsaramekknvqpwtlaermetvdki^dpgH 

i Btgij 



-MNDI 




300 



Sjgl 


17 


,Sgl 


17 


jSgl 


17 


jHgl 


20 


,Hgl 


17 


Mgl 


300 


.Hgl 


17 



360 



NOV15a 


78 


NOVl5b 


78 


gi| 17472672 | 


78 


gi|H692519| 


81 


gi|H692527| 


78 


gi j 17472662 | 


361 


gi|2317704 | 


78 




13 8 NgACFE 

13 8 NKACFg 

13 8 nMaCF^ 

141 YpIYI 

13 8 YQIY' 

421 YHHCF 

13 8 YQSYI 




480 
[■■■■I 



LgLgCSSTYINEL 
LgLgCSSTYINEL 

lSlIcsstyinel 



|L|LScs|JiTYINE| 

lBlBBsstySnel 
lSlScsstyinel 
lIlBSsstyEnel 



197 
197 
197 
200 
197 
480 
197 



500 



530 



540 



NOVlSa 


198 


LIL 


I 


F 


SGIJ 


NOV15b 


198 


LIL 


I 


F 


SGIj 


gi| 17472672 | 


198 


LIL 


I 


F 


SGT\\ 


gi | 11692519 j 


201 


LIL 




F 


GTLj 


gi j 11692527 | 


198 


L I L 




I 


GTFi 


gi j 17472662 j 


481 


L i L 




LSAFii 


gi|2317704 | 


198 


1^ 


I 


CGTCJ 
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NOV15a 

NOV15b 

gi | 17472672 | 

gi 1 11692519 | 

gi j 11692527 | 

gi|l7472662| 

gi j 2317704 | 
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Table 15G lists the domain description from DOMAIN analysis results against 
NOV15. This indicates that the NOV15 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 15G Domain Analysis of NOV15 

gnl | Pf am|pf amOOOOl , 7tm_l, 7 transmembrane receptor (rhodopsin 
family). (SEQ ID NO: 810) 

CD-Length = 254 residues, 100.0% aligned 
Score = 86.7 bits (213), Expect = 2e-18 



20 



25 



30 



35 



40 



NOV15 : 


41 


Sbjct : 


1 


NOV15 : 


101 


Sbjct : 


61 


NO VI 5 : 


161 


Sbjct : 


121 


NOV15 : 


216 


Sbjct : 


181 


NOV15 : 


272 


Sbjct : 


236 



GNLGMITLIGLSSHLHTPMYCFLSSLSFIDFCHSTVITPKMLVNFVTEKNIISYPECMTQ 10 0 

III +1+1 + III I i +1+ I +111 + I 

GNLLVILVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWFGDALCKLV 6 0 



l+l I +1 I++ I l+ll III I + + III+I+I+ 1+ + + 

GAL FWNGYAS I L LLT AI S I DR YL AI VHP LR Y RR I RT P RRAK VL. I LL V WVLAL L LS L P P L 12 0 

GCMFRVQFCKFDVINHYFCD LISILKLSCSSTYINELLILIFSGINILVPSLTIL 215 

I + II M+ + + II 



1+ I + I I + I + +1 I 

-RSLKRRSSSERKAAKMLLVVVVVFVLCWLPYHIVLLLDSLCLLSIWRVLPTA 23 5 



45 



- +++ I lll+ll 

L I TL WLA Y VNS C LNP I I Y 254 

G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 
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Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
protein-coupled receptors in a number of species. These receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
5 Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV 15 nucleic acid of the invention encoding a G-Protein Coupled 
10 Receptor -like protein includes the nucleic acid whose sequence is provided in Table 15 A, 

1 5C or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 15A or 15C while 
still encoding a protein that maintains its G-Protein Coupled Receptor -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
1 5 nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
20 are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 18 percent of the bases may be so 
changed. 

25 The disclosed NOV1 5 protein of the invention includes the G-Protein Coupled 

Receptor -like protein whose sequence is provided in Table 15B, or 15D. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 15B, or 15D while still encoding a protein that 
maintains its G-Protein Coupled Receptor -like activities and physiological functions, or a 

30 functional fragment thereof. In the mutant or variant protein, up to about 23 percent of the 
residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as_F ab or 
(F a t>)2, that bind immunospecifically to any of the proteins of the invention. 
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The above disclosed information suggests that this G-Protein Coupled Receptor -like 
protein (NOV15) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
NOV 15 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
5 below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 
10 The NOV 15 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in developmental diseases, MHC1I and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
1 5 disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
20 hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 

Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
25 autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 

NOV 15 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV 15 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
30 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

NOVX Antibodies" section below. The disclosed NOV 15 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 




understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NO VI 6a 

A disclosed NOV 16a nucleic acid of 891 nucleotides (also referred to as CG56067-01) 
5 encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 16A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 5-7 and 
ending with a TAA codon at nucleotides 878-880. The start and stop codons are shown in 
bold in Table 16a, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 16A. NOV16a nucleotide sequence (SEQ ID NO:65). 

~GAAAATGTCAGCAGGAAACCATTCCTCAGTGACTGAGTTCATTCTGGCTGGGCTCTCAGAACAGCCAGAGCT 
CCAGCTGCGCCTCTTCCTCCTGTTCTTAGGAATCTATGTGGTCACAGTGGTGGGCAACTTGAGCATGATCAC 
ACTGATTGGGCTCAGTTCTCACCTGCATACCCCCATGTACTATTTCCTCAGTGGTCTGTCCTTCATTGATAT 
CTGCCATTCCACTATCATTACCCCCAAAATGCTGGTGAACTTTGTGACAGAGAAGAACATCATCTCCTACCC 
TGAATGCATGACTCAGCTTTACTTCTTCCTCATTTTTGCTATTGCAGAGTGTCACATGTTGGCTGTAACGGC 
ATATGACCGCTATGTTGCCATCTGCAGCCCCTTGCTGTACAATGTCATCATGTCCTATCACCACTGCTTCTG 
GCTCACAGTGGGAGTTTACATTTTAGGCATCCTTGGATCTACAATTCACACCGGCTTTATGTTGAGACTCTT 
TTTGTGCAAGACTAATGTGATTAACCATTATTTTTGTGATCTCTTCCCTCTCTTGGGGCTCTCCTGCTCCAG 
CACCTACATCAATGAATTACTGGTTCTGGTCTTGAGTGCATTTAACATCCTGACGCCTGCCTTAACCATCCT 
TGCTTCTTACATCTTTATCATTGCCAGCATCCTCCGCATTCGCTCCACTGAGGGCAGGTCCAAAGCCTTCAG 
CACTTGCAGCTCCCACATCTTGGCTGTTGCTGTTTTCTTTGGGTCTGCAGCATTCATGTACCTGCAGCCATC 
ATCTGTCAGCTCCATGGACCAGGGGAAAGTGTCCTCTGTGTTTTATACTATTGTTGTGCCCATGCTGAACCC 
C C AAT CT AT AG CCT A AG AAAT AAGG AT 

io — — " 

In a search of public sequence databases, the NOV16a nucleic acid sequence, localized 
to chromosome 4, has729 of 888 bases (82%) identical to a gb:GENBANK- 
lD:AF282293|acc:AF282293.1 mRNA from Mus musculus (odorant receptor K4hl 1 gene, 

1 5 complete cds) (E = 9.8e" 127 ). 

The disclosed NOV 16a polypeptide (SEQ ID NO:66) encoded by SEQ ID NO:65 has 
311 amino acid residues and is presented in Table 16B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV 16a has no signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 

20 NOV 1 6A may also localize to the Golgi body with a certainty of 0.4000, the endoplasmic 
reticulum (membrane) with a certainty of 0.3000, or in the microbody (peroxisome) with a 
certainty of 0.3000. The most likely cleavage site for NOV16A is between positions 41 and 
42: VVG-NL. 



Table 16B. Encoded NOV16a protein sequence (SEQ ID NO:66). 

MSAGNHSSVTEFILAGLSEQPELQLRLFLLFLGIYWTWGNLSMITLIGLSSHIiHTPMYYFLS 
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GLSFIDICHSTIITPKMLVNFVTEKNIISYPECMTQLYFFLIFAIAECHMLAVTAYDRYVAICS 
PLLYNVIMSYHHCFWLTVGVYILGILGSTIHTGFMLRLFLCKTNVINHYFCDLFPLLGLSCSST 
YINELLVLVLSAFNILTPALTILASYIFIIASILRIRSTEGRSKAFSTCSSHILAVAVFFGSAA 
FMYLQPSSVSSMDQGKVSSVFYTI WPMLNPQSIA 

A search of sequence databases reveals that the NOV 16a amino acid sequence has 232 
of 287 amino acid residues (80%) identical to, and 253 of 287 amino acid residues (88%) 
similar to, the 307 amino acid residue ptnr:TREMBLNEW-ACC:AAG39878 protein from 
5 Mus musculus (Mouse) (Odorant Receptor K4H1 1) (E = 5.1e" 122 ). 

NOV 16a is predicted to be expressed in at least Apical microvilli of the retinal pigment 
epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma cell lines, corpus 
callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral tissue, 
cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary artery and 
10 umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal hematopoietic 
cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, lung lymphoma 
cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express MHC II and III 
nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, pons, prostate, 
putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary artery in aortic) 
15 spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, thalamus, and thymus 
tissue. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 
NOV16b 

20 A disclosed NOV 16b nucleic acid of 939 nucleotides (also referred to as CG56753-02) 

encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 16C. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 1-3 and 
ending with a TAG codon at nucleotides 934-936. The start and stop codons are shown in 
bold in Table 16C, and the 5' and 3' untranslated regions, if any, are underlined. 

25 
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Table 16C. NOV16b nucleotide sequence (SEQ ID NO:67). 

ATGTCAGGAGAAAATAATTCCTCAGTGACTGAGTTCATTCTGGCTGGGCTCTCAGAACAGCCAGAGCTCCAG 
CTGCCCCTCTTCCTCCTGTTCTTAGGAATCTATGTGGTCACAGTGGTGGGCAACCTGGGCATGACCACACTG 
ATTTGGCTCAGTTCTCACCTGCACACCCCTATGTACTATTTCCTCAGCAGTCTGTCCTTCATTGACTTCTGC 
CATTCCACTGTCATTACCCCTAAGATGCTGGTGAACTTTGTGACAGAGAAGAACATCATCTCCTACCCTGAA 
TGCATGACTCAGCTCTACTTCTTCCTCGTTTTTGCTATTGCAGAGTGTCACATGTTGGCTGCAATGGCGTAT 
GACCGTTACATGGCCATCTGTAGCCCCTTGCTGTACAGTGTCATCATATCCAATAAGGCTTGCTTTTCTCTG 
ATTTTAGGGGTGTATATAATAGGCCTGGTTTGTGCATCAGTTCATACAGACAGTATGTTTAGGGTTCAATTC 
TGCAAATTTGATTTGATTAACCATTATTTCTGTGATCTTCTTCCCCTCCTAAAGCTCTCTTGCTCTAGTATC 
TATGTCAACAAACTACTTATTCTATGTGTTGGTGCATTTAACATCCTTGTCCCCAGCCTGACCATCCTTTGC 
TCTTACATCTTTATTATTGCCAGCATCCTCCACATTCGCTCCACTGAGGGCAGGTCCAAAGCCTTCAGCACT 
TGTAGCTCCCACATGTTGGCGGTTGTAATCTTTTTTGGATCTGCAGCATTCATGTACTTGCAGCCATCTTCA 
ATCAGCTCCATGGACCAGGGGAAAGTATCCTCTGTGTTTTATACTATTATTGTGCCCATGTTGAACCCTCTG 
AT T T AT AGC CTGAGGAAT AAAG ATG T C CATGTTT CC CT G AAGAAAATG C T AC AG AGAAG AAC ATT AT TGT AA 
ACA 



In a search of public sequence databases, the NOV 16b nucleic acid sequence has 770 
of 935 bases (82%) identical to a gb:GENBANK-ID:AF282271|acc:AF282271.1 mRNA from 
Mus musculus (odorant receptor Kl 1 gene, complete cds) (E = 1.3e' 136 ). 
5 The disclosed NOV 16b polypeptide (SEQ ID NO:68) encoded by SEQ ID NO:67 has 

3 1 1 amino acid residues and is presented in Table 16D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV16b has A signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV 16b may also localize to the Golgi body with a certainty of 0.4000, the endoplasmic 
1 0 reticulum (membrane) with a certainty of 0.3000, or in the endoplasmic reticulum (lumen) 

with a certainty of 0.3000. The most likely cleavage site for NOV16b is between positions 41 
and 42: VVG-NL. 



Table 16D. Encoded NOV16b protein sequence (SEQ ID NO:68). 

MSGENNSSVTEFIIAGLSEQPELQLPLFLLFLGIYVVTVVGNLGMTTLIWLSSHLHTPMYYFLSSLSFIDFC 
HSTVITPKMLVNFVTEKNIISYPECMTQLYFFLVFAIAEC 

ILGVYIIGLVCASVHTDSMFRVQFCKFDLINHYFCDLLPLLKLSCSSIYVNKLLILCVGAFNILVPSLTILC 
SYIFIIASILHIRSTEGRSKAFSTCSSHMLAWIFFGSAAFMYLQPSSISSMDQGKVSSVFYTIIVPMLNPL 
I Y S LRNKDVH VS L KKMLQRRTLL 



15 A search of sequence databases reveals that the NOV 16b amino acid sequence has 238 

of 31 1 amino acid residues (76%) identical to, and 274 of 31 1 amino acid residues (88%) 
similar to, the 314 amino acid residue ptnr : SPTREM BL- ACC : Q9EQB 8 protein from Mus 
musculus (Mouse) (Odorant Receptor Kl 1) (E = 1 .Oe" 127 ). 

NOV 16b is predicted to be expressed in at least the following tissues: Apical microvilli 
20 of the retinal pigment epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma 

cell lines, corpus callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral 
tissue, cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary 
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artery and umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal 
hematopoietic cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, 
lung lymphoma cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express 
MHC II and III nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, 
pons, prostate, putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary 
artery in aortic) spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, 
thalamus, and thymus tissue. This information was derived by determining the tissue sources 
of the sequences that were included in the invention including but not limited to SeqCalling 
sources, Public EST sources, Literature sources, and/or RACE sources. 

The disclosed NOV 16a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed.in Table 16E. 



Table 16E. BLAST results for NOV16a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%> 


Expect 


gi | 17472662 | ref | XP_ 
061790 . 1 | 
(XM_061790) 


similar to 
odorant receptor 
K4hll (H. 
sapiens) [Homo 
sapi ens] 


593 


265/284 
(93%) 


267/284 ! 
(93%) 


e-121 


gi | 11692519 | gb | AAG3 
9856 . 1 | AF282271_1 
(AF282271) 


odorant receptor 
Kll [Mus 
musculus] 


314 


223/287 
(77%) 


250/287 
(86%) 


e-104 


gi | 11692563 | gb | AAG3 
9878 . 1 | AF282293_1 
(AF282293) 


odorant receptor 
K4hll [Mus 
musculus] 


307 


232/287 
(80%) 


253/287 
(87%) 


e-102 


gi | 17472672 | ref | XP_ 
061794 .1 | 
(XM_061794) 


similar to 
odorant receptor 
Kll (H. sapiens) 
[Homo sapiens] 


311 


226/287 
(78%) 


252/287 
(87%) 


e-102 


gi | 11692527 | gb | AAG3 
9860 . 1 | AF282275_1 
(AF282275) 


odorant receptor , 
K15 [Mus 
musculus] 


311 


224/287 
(78%) 


246/287 
(85%) 


e-102 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 16F. In the ClustalW alignment of the NOV 16 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 16F. ClustalW Analysis of NOV16 



1) Novel NOV16a (SEQ ID NO: 66) 

2) Novel NOV16b (SEQ ID NO: 68) 

3) gi | 17472662 | ref |XP_061790 . 1 | (XM_061790) similar to odorant receptor K4hll (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 3 90) 

4) gi|H692519|gb|AAG39856.l|AF282271_l (AF282271) odorant receptor Kll [Mus 
wusculus] (SEQ ID NO: 388) 

5) gi | 11692563 |gb | AAG39878 . 1 | AF282293_1 (AF282293) odorant receptor K4hll [Mus 
wusculus] (SEQ ID NO: 392) 

6) gi | 17472672 | ref |XP_061794 . 1 | (XM_061794) similar to odorant receptor Kll (H. 
sapiens) [Homo sapiens] (SEQ ID NO:393) 

7) gi | 11692527 |gb|AAG39860 . 1 | AF282275_1 (AF282275) odorant receptor K15 [Mus 
wusculus] (SEQ ID NO: 389) 



10 



20 



30 



40 



NOV16a 

NOV16b 

gi 1 17472662 | 

gi 1 11692519 j 

gi|11692563 j 

gi 1 17472672 j 

gi 1 11692527 j 



NOV16a 
NOV16b 
gi|l7472662 | 
gi 1 11692519 j 
gi j 11692563 | 
gi j 17472672 j 
gi 1 11692527 j 



NOV16a 

N0V16b 

gi | 17472662 | 

gi | 11692519 j 

gi j 11692563 j 

gi j 17472672 | 

gi j 11692527 j 



NOV16a 
NOVl6b 
NOV16b 
gi |17472662 | 
gi | 11692519 j 
gi|H692563 j 
gi 1 17472672 j 
gi j 11692527 | 



50 



60 



! 1 

1 I^KGNHSTVTEFNIxAGLTDKPELQLPLFLLFLGIYWTVVGNLSMITLIGFSSHLHTPMY 60 



70 



80 



90 



100 



110 



120 
•-I 



NOV16a 


1 


NOV16b 


1 


gi 


17472662 | 


61 


gi 


11692519 | 


1 


gi 


11692563 j 


1 


gi 


17472672 j 


1 


gi 


11692527 j 


1 



130 
..|.. 



140 



150 



I 



I 



160 
..|.. 



I 



170 



180 



1 1 

121 DHYIAICNPLLYHVAMSYQVCSWMWEVYFMGFIGATCSHSLHAKSAFLLTILSSYIFIV 180 

1 1 

1 1 

1 1 

1 1 



190 



200 
..|.. 



210 
..|.. 



220 
..|.. 



230 
..|.. 



240 
•♦I 



! 1 

181 AS I LCIRSTEGRSKTFSTCSSHI SAVSVFFGGTSRSRFQVLGLEVRSVRLGGCPDAGQTP 24 0 
1 



250 260 270 280 

|....|....|....|....|....|....|....|.. 



290 



300 



1 

x 

! 1 

241 ETQPPVQSLFSGHRNLAPSARAMEKKNVQPWTLAERMETVDKI; 

i mnd| 

x 

! 

1 
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310 



320 



330 



340 



350 



360 



10 



15 



20 



25 



30 



35 



40 



45 



50 



NOV16a 

NOV16b 

gi|l7472662 

gi|ll692519 

gi|ll692563 

gi j 17472672 

gij 11692527 



NOV16a 


78 


NO VI 6 b 


78 


gi 


17472662 


361 


gi 


11692519 


81 


gi 


11692563 


78 


gi 


17472672 


78 


gi 


11692527 


78 



NOV16a 

NOV16b 

gi | 17472662 | 

gi | 11692519 | 

gi | 11692563 | 

gi j 17472672 j 

gi | 11692527 | 



NOV16a 
NOV16b 
gi|17472662| 
gij 11692519 | 
gi | 11692563 j 
gi|l7472672| 
gi | 11692527 | 




490 



500 



510 



NOV16a 


198 


LiVL 


NOV16b 


198 


l|Jl 


gi | 17472662 | 


481 


LVL 


gi | 11692519 | 


201 


L0L 


gi | 11692563 j 


198 


LVL 


gi | 17472672 | 


198 


L||L 


gi|H692527| 


198 


LVL 




I 



I 



520 
. . I . . 



I 



530 
• - I - - 



540 

- -_L 



SYIFIIASI 
SYIFIIASI 
SYIFIIASI 
SYIFIIASI 
SYlgll^SI 
SYIFIIASI 
SYIFIIAjSl 




LRIRSTEGRSKAFSTCSSHILAVAVFFGSAAF 
LgjlRSTEGRSKAFSTCSSHlLAVffijFFGSAAF 

lrljj|stegrskafstcsshilavavffgsaaf 
lrirstegrskafstcsshilavavffgsjjaf 
lrirstegrskafstcsshiHavaJJfJSgsaaf 

LRIRgTEGRSKAFSTCSSHIgAVgVFFGSAAF 

lrirstHgrskafstcsshilavavffgsaaf 



590 
I 

291 
iRTLL 311 
iTAC- 5 93 
IkTFM 314 
307 

|K>RTFL 311 

:i|klfm 311 



257 
257 
540 
260 
257 
257 
257 



Table 16G lists the domain description from DOMAIN analysis results against 
NOV 16. This indicates that the NOV 16 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 16G Domain Analysis of NOV16 

gnl | Pf am | pf amOOOOl , 7tm_l, 7 transmembrane receptor (rhodopsin 

family). (SEQ ID NO: 810) 

CD-Length = 254 residues, 98.8% aligned 

Score = 85.9 bits (211), Expect = 3e-18 



GNLSMITLIGLSSHLHTPMYYFLSGLSFIDICHSTIITPKMLVNFVTEKNIISYPECMTQ 10 0 

III +1+1 + III II 1+ 1+ + I I I + I 

GNLLVILVILRTKKLRTPTNI FLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 6 0 

LYFFL I FA I AE CHMLAVT AYD R YVA I CS PLLYNVIMS YHHCFWLTVGVYI LGI LGSTIHT 16 0 

I++ I +1 + MMI II I I + I + I++I +1 I 

GALFVWGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILLVWVLALLLSLPPL 12 0 

GFMLRLFLCKTNVINHYFCDLFPLLG LSCSSTYINELLVLVLSAFNILTPALTIL 215 

I + + I + II IM+ + + II 

LFSWLRTVEEGNTTVCLIDFPEESVKRSYVLLSTLVGFVLPLLVILVCYTRIL RT 175 

ASYIFIIASILRIRSTEGRSKAFSTCSSHILAVAVFFGSAAFMYL QPSSVSSMDQG 271 

|+ ||+ | | ++ I + +| | 

LRKRARSQRSLKRRSSSERKAAKMLLVWWFVLCWLPYHIVLLLDSLCLLSIWRVIiPTA 23 5 

KVSSVFYTIWPMLNP 2 87 

+ + + + I III 
LLITLWLAYVNSCLNP 251 

G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
25 and are likely to underlie the recognition and G-protein-mediated transduction of various 

signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

30 Olfactory receptors (ORs) have been identified as extremely large subfamily of G 

protein-coupled receptors in a number of species. These receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
Previously, OR genes cloned in different species were from random locations in the respective 

35 genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV 16 nucleic acid of the invention encoding a G-Protein Coupled 

Receptor -like protein includes the nucleic acid whose sequence is provided in Table 16A or a 

40 fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 

bases may be changed from the corresponding base shown in Table 16A while still encoding a 
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NOV18: 41 

^ Sbjct: 1 

NOV18: 101 

Sbjct: 61 

10 NOV18: 161 

Sbjct: 121 

NOV18: 216 

15 

Sbjct: 176 

NOV18 : 272 

20 Sbjct: 236 



protein that maintains its G-Protein Coupled Receptor -like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
5 includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 

include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 

10 binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 18 percent of the bases may be so changed. 

The disclosed NOV 16 protein of the invention includes the G-Protein Coupled 
Receptor -like protein whose sequence is provided in Table 16B. The invention also includes 
a mutant or variant protein any of whose residues may be changed from the corresponding 

15 residue shown in Table 16B while still encoding a protein that maintains its G-Protein 

Coupled Receptor -like activities and physiological functions, or a functional fragment thereof. 
In the mutant or variant protein, up to about 23 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 

20 The above disclosed information suggests that this G-Protein Coupled Receptor -like 

protein (NOV16) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
NOV 16 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 

25 protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV 16 nucleic acids and proteins of the invention are useful in potential 

30 therapeutic applications implicated in developmental diseases, MHCII and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
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starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
5 hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 

Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 

10 autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 

NOV 16 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV 16 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 

15 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

NOVX Antibodies" section below. The disclosed NOV 16 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 

20 disorders. 

NOV17 

NOV 17 includes three novel G-Protein Coupled Receptor -like proteins disclosed 
below. The disclosed sequences have been named NOV 17a, NOV 17b, NOV 17c, and 
NOV17d. 
25 NO VI 7a 

A disclosed NOV 17a nucleic acid of 962 nucleotides (also referred to as CG56657-01) 
encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 17A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 18-20 and 
ending with a TAG codon at nucleotides 954-956. The start and stop codons are shown in 
30 bold in Table 17A, and the 5' and 3' untranslated regions, if any, are underlined. 



146 




Table 17A. NOV17a nucleotide sequence (SEQ ID NO:69). 

GAT CGT ATG AATG C CC C ATGGAAAATT AC AAT C AAACGTC AAC TG ATTT C ATCT T AT TGGGG C TGTT CCC AC 
CATCAAAAATTGGCCTTTTCCTCTTCATTCTCTTTGTTCTCATTTTCCTAATGGCTCTAATTGGAAACCTAT 
CCATGATTCTTCTCATCTTCTTGGACACCCATCTCCACACACCCATGTATTTCCTGCTTAGTCAGCTCTCCC 
TCATTGACCTAAATTACATCTCTACGATTGTTCCTAAGATGGCTTCTGATTTTCTGTATGGAAACAAGTCTA 
TCTCCTTCATTGGGTGTGGGATTCAGAGTTTCTTCTTCATGACTTTTGCAGGTGCAGAAGCGCTGCTCCTGA 
CATCAATGGCCTATGATCGTTATGTGGCCATTTGCTTTCCTCTCCACTATCCCATCCGTATGAGCAAAAGAA 
TGTATGTGCTGATGATAACAGGATCTTGGATGATAGGCTCCATCAACTCTTGTGCTCACACAGTATATGCAT 
TCCGTATCCCATATTGCAAGTCCAGAGCCATCAATCATTTTTTCTGTGATGTTCCAGCTATGTTGACATTAG 
CCTGTACAGACACCTGGGTCTATGAGTACACAGTGTTTTTGAGCAGCACCATCTTTCTTGTGTTTCCCTTCA 
CTGGCATTGCGTGTTCCTATGGCTGGGTTCTCCTTGCTGTCTACCGCATGCACTCTGCAGAAGGGAGGAAAA 
AGG C CT ATT CG AC C TG C AGC AC C C AC C TC ACTGT AGT AACTTT CT ACT ATGC AC CCT TTGCT T AT AC CT ATC 
TATGTCCAAGATCCCTGCGATCTCTGACAGAGGACAAGGTTCTGGCTGTTTTCTACACCATCCTCACCCCAA 
TGCTCAACCCCATCATCTACAGCCTGAGAAACAAGGAGGTGATGGGGGCCCTGACACGAGTGATTCAGAATA 
TCTTCTCGGTGAAAATGTAGACATAC 



The disclosed NOV 17a polypeptide (SEQ ID NO:70) encoded by SEQ ID NO:69 has 
312 amino acid residues and is presented in Table 17B using the one-letter amino acid code. 



Table 17B. Encoded NOV17a protein sequence (SEQ ID NO:70). 

MENYNQTSTDFIL.LGLFPPSKIGLFLFILFVLIFLMALIGNLSMILLIFLDTHLHTPMYFLLSQ 
LSLIDLNYISTIVPKMASDFLYGNKSISFIGCGIQSFFFMTFAGAEALLLTSMAYDRYVAICFP 
LiHYPIRMSKRMYVLMITGSWMIGSINSCAHTVYAFRIPYCKSRAINHFFCDVPAMLTLACTDTW 
VYEYTVFLSSTI FLVFPFTGIACSYGWVLLAVYRMHSAEGRKKAYSTCSTHLTWTFYYAPFAY 
TYLCPRSLRSLTEDKVLAVFYTILTPMLNPIIYSLRNKEVMGALTRVIQNIFSVKM 

5 u ~^~ " ^ """ 

A search of sequence databases reveals that the NOV 17a amino acid sequence has 148 
of 305 amino acid residues (48%) identical to, and 192 of 305 amino acid residues (62%) 
similar to, the 316 amino acid residue ptnr: TREMBLNEW-ACC:AAG45 1 96 protein from 
Mus musculus (Mouse) (T2 OLFACTORY RECEPTOR) (E = 8.0e" 73 ). 
10 NOV17b 

A disclosed NOV17b nucleic acid of 962 nucleotides (also referred to as CG56657-01) 
encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 1 7C. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 18-20 and 
ending with a TAG codon at nucleotides 954-956. The start and stop codons are shown in 
15 bold in Table 17C, and the 5' and 3' untranslated regions, if any, are underlined. 
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Table 17C. NOV17b nucleotide sequence (SEQ ID NO:71). 

GATCGTATGAATGCCCC ATGGAAAATTACAATCAAACGTCAACTGATTTCATCTTATTGGGGCTGTTCCCAC 
CATCAAAAATTGGCCTTTTCCTCTTCATTCTCTTTGTTCTCATTTTCCTAATGGCTCTAATTGGAAACCTAT 
CCATGATTCTTCTCATCTTCTTGGACACCCATCTCCACACACCCATGTATTTCCTGCTTAGTCAGCTCTCCC 
TCATTGACCTAAATTACATCTCTACGATTGTTCCTAAGATGGCTTCTGATTTTCTGTATGGAAACAAGTCTA 
TCTCCTTCATTGGGTGTGGGATTCAGAGTTTCTTCTTCATGACTTTTGCAGGTGCAGAAGCGCTGCTCCTGA 
CATCAATGGCCTATGATCGTTATGTGGCCATTTGCTTTCCTCTCCGCTATCCCATCCGTATGAGCAAAAGAA 
TGTATGTGCTGATGATAACAGGATCTTGGATGATAGGCTCCATCAACTCTTGTGCTCACACAGTATATGCAT 
TCCGTATCCCATATTGCAAGTCCAGAGCCATCAATCATTTTTTCTGTGATGTTCCAGCTATGTTGACATTAG 
CCTGTACAGACACCTGGGTCTATGAGTACACAGTGTTTTTGAGCAGCACCATCTTTCTTGTGTTTCCCTTCA 
CTGGCATTGCGTGTTCCTATGGCTGGGTTCTCCTTGCTGTCTACCGCATGCACTCTGCAGAAGGGAGGAAAA 
AGGCCTATTCGACCTGCAGCACCCACCTCACTGTAGTAACTTTCTACTATGCACCCTTTGCTTATACCTATC 
TATGTCCAAGATCCCTGCGATCTCTGACAGAGGACAAGGTTCTGGCTGTTTTCTACACCATCCTCACCCCAA 
TGCTCAACCCCATCATCTACAGCCTGAGAAACAAGGAGGTGATGGGGGCCCTGACACGAGTGATTCAGAATA 
TCTTCTCGGTGAAAATGTAG ACATAC 



In a search of public sequence databases, the NOV 17b nucleic acid sequence, localized 
to chromosome 4, has321 of 342 bases (93%) identical to a gbiGENBANK- 
ID:HSHTPRH07|acc:X64978.1 mRNA from Homo sapiens (H.sapiens mRNA HTPCRH07 
5 for olfactory receptor) (E = 2.9e" 62 ). 

The disclosed NOV17b polypeptide (SEQ ID NO:72) encoded by SEQ ID NO:71 has 
3 1 1 amino acid residues and is presented in Table 1 7D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV 17b has no signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.4600. Alternatively, 
10 NOV 17b may also localize to the microbody (peroxisome) with a certainty of 0.231 1, the 
endoplasmic reticulum (membrane) with a certainty of 0.1000, or in the endoplasmic 
reticulum (lumen) with a certainty of 0.1000. The most likely cleavage site for NOV1 7B is 
between positions 43 and 44: NLS-MI. 



Table 17D. Encoded NOV17b protein sequence (SEQ ID NO:72). 

MENYNQTSTDFILLGLFPPSKIGLFLFILFVLIFLMALIGNLSMILLIFLDTHLHTPMYFLLSQLSLIDLNY 
ISTIVPKMASDFLYGNKSISFIGCGIQSFFFMTFAGAEALLLTSMAYDRYVAICFPLRYPIRMSKRMYVIiMI 
TGSWMIGSINSCAHTWAFRIPYCKSRAINHFFCDVPAMLTLACTDTWVYEYTVFLSSTIFLVFPFTGIACS 
YGWVLLAVYRMHSAEGRKKAYSTCSTHLTWTFYYAPFAYTYLCPRSLRSLTEDKVLAVFYTILTPMLNPII 
YSLRNKEVMGAXiTRVIQNIFSVKM 

15 ' ~ ' ~ 

A search of sequence databases reveals that the NOV 17b amino acid sequence has 148 
of 305 amino acid residues (48%) identical to, and 191 of 305 amino acid residues (62%) 
similar to, the 316 amino acid residue ptnr:TREMBLNEW-ACC:AAG45 1 96 protein from 
Mus musculus (Mouse) (T2 Olfactory Receptor) (E = 8.0e" 73 ). 
20 NOV 17b is predicted to be expressed in at least the following tissues: Apical microvilli 

of the retinal pigment epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma 
cell lines, corpus callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral 
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tissue, cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary 
artery and umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal 
hematopoietic cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, 
lung lymphoma cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express 
5 MHC II and III nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, 
pons, prostate, putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary 
artery in aortic) spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, 
thalamus, and thymus tissue. This information was derived by determining the tissue sources 
of the sequences that were included in the invention including but not limited to SeqCalling 
10 sources, Public EST sources, Literature sources, and/or RACE sources. 
NO VI 7c 

A disclosed NOV17c nucleic acid of 883 nucleotides (also referred to as CG56659-01) 
encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 17E. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 44-46 and 
15 ending with a TAG codon at nucleotides 875-877. The start and stop codons are shown in 
bold in Table 17E, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 17E. NOV17c nucleotide sequence (SEQ ID NO:73). 

AATTGGCCTTTTCGTATTCACCCTCATTTTTCTCATTTTCCTAA TGGCTCTAATTGGAAATCTATCCATGAT 
TCTTCTCATCTTTTTGGACATCCATCTCCACACACCTATGTATTTCCTACTTAGTCAGCTCTCCCTCATTGA 
CCTAAATTACATCTCCACCATTGTTCCAAAGATGGTTTATGATTTTCTGTATGGAAACAAGTCTATCTCCTT 
CACTGGATGTGGGATTCAGAGTTTCTTCTTCTTGACTTTAGCAGTTGCAGAAGGGCTGCTCCTGACATCAAT 
GGCCTATGATCGTTATGTGGCCATTTGCTTTCCTCTCCACTATCCCATCCGTATAAGCAAAAGAGTGTGTGT 
GATGATGATAACAGGATCTTGGATGATAAGCTCTATCAACTCTTGTGCTCACACAGTATATGCACTCTGTAT 
CCCATATTGCAAGTCCAGAGCCATCAATCATTTTTTCTGTGATGTTCCAGCTATGTTGACGCTAGCCTGCAC 
AGACACTTGGGTCTATGAGAGCACAGTGTTTTTGAGCAGCACCATCTTTCTTGTGCTTCCTTTCACTGGTAT 
TGCATGTTCCTATGGCCGGGTTCTCCTTGCTGTCTACCGCATGCACTCTGCAGAAGGGAGGAAGAAGGCCTA 
TTCAACCTGTAGCACCCACCTCACTGTAGTGTCCTTCTACTATGCACCCTTTGCTTATACCTATGTACGTCC 
AAGATCCCTGCGATCTCCAACAGAGGACAAGATTCTGGCTGTTTTCTACACCATCCTCACCCCAATGCTCAA 
CCCCATCATCTACAGCCTGAGAAACAAGGAGGTGATGGGGGCCCTGACACAAGTGATTCAGAAAATCTTCTC 
AGT G AAAAT GT AGACAT AC 



The disclosed NOV 17c polypeptide (SEQ ID NO:74) encoded by SEQ ID NO:73 has 
20 277 amino acid residues and is presented in Table 17F using the one-letter amino acid code. 



Table 17F. Encoded NOV17c protein sequence (SEQ ID NO:74). 

MALIGNLSMILLIFLDIHLHTPMYFLLSQLSLIDLNYISTIVPKMVYDFLYGNKSISFTGCGIQ 
SFFFLTLAVAEGLLLTSMAYDRYVAICFPLHYPIRISKRVCVMMITGSWMISSINSCAHTVYAL 
CIPYCKSRAINHFFCDVPAMLTLACTDTWVYESTVFLSSTIFLVLPFTGIACSYGRVLLAVYRM 
HSAEGRKKAYSTCSTHLTWSFYYAPFAYTYVRPRSLRSPTEDKILAVFYTILTPMLNPIIYSLRNKEVM 
GALTQVIQKIFSVKM 
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A search of sequence databases reveals that the NOV 17c amino acid sequence has 139 
of 272 amino acid residues (51%) identical to, and 181 of 272 amino acid residues (66%) 
similar to, the 316 amino acid residue ptnr: TREMBLNEW-ACC:AAG45 1 96 protein from 
Mus musculus (Mouse) (T2 OLFACTORY RECEPTOR) (E - 4.0e" 71 ). 
5 NOV17d 

A disclosed NOV17d nucleic acid of 926 nucleotides (also referred to as CG56659 02) 
encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 17G. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 87-89 and 
ending with a TAG codon at nucleotides 918-920. The start and stop codons are shown in 
10 bold in Table 17G, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 17G* NOV17d nucleotide sequence (SEQ ID NO:75). 

catcaactgatttcatcttattggggctgttcccacaatcaagaattggccttttcgtattcaccctcattt" 
ttctcattttcctaa tggctctaattggaaatctatccatgattcttctcatctttttggacatccatctcc 
acacacctatgtatttcctacttagtcagctctccctcattgacctaaattacatctccaccattgttccaa 
agatggtttatgattttctgtatggaaacaagtctatctccttcactggatgtgggattcagagtttcttct 
tcttgactttagcagttgcagaagggctgctcctgacatcaatggcctatgatcgttatgtggccatttgct 
ttcctctccactatcccatccgtataagcaaaagagtgtgtgtgatgatgataacaggatcttggatgataa 
gctctatcaactcttgtgctcacacagtatatgcactctgtatcccatattgcaagtccagagccatcaatc 
attttttctgtgatgttccagctatgttgacgctagcctgcacagacacttgggtctatgagagcacagtgt 
ttttgagcagcaccatctttcttgtgcttcctttcactggtattgcatgttcctatggccgggttctccttg 
ctgtctaccgcatgcactctgcagaagggaggaagaaggcctattcaacctgtagcacccacctcactgtag 

TGT C CT T CT AC T AT GC AC C CT T TGC TT AT AC C T ATGT ACGT C C AAG AT C C C TG CGAT CT C C AAC AGAGGAC A 
AGAT T C TGGCT GTTT TC T AC AC C AT CC T C AC CC C AATG CTC AAC CC C AT C AT CT AC AGC C TG AG AAAC AAGG 
AGGTGATGGGGGTCCTGACACAAGTGATTCAGAAAATCTTCTCAGTGAAAATGTAGACATAC 



In a search of public sequence databases, the NOV17d nucleic acid sequence has343 of 
343 bases (100%) identical to a gb:GENBANK-ID:HSHTPRH07|acc:X64978.1 mRNA from 

15 Homo sapiens (H.sapiens mRNA HTPCRH07 for olfactory receptor) (E = 5.4e~ 71 ). 

The disclosed NOV17D polypeptide (SEQ ID NO:76) encoded by SEQ ID NO:75 has 
277 amino acid residues and is presented in Table 17H using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV17d has no signal peptide and is 
likely to be localized to the endoplasmic reticulum (membrane) with a certainty of 0.6850. 

20 Alternatively, NOV17d may also localize to the plasma membrane with a certainty of 0.6400, 
the Golgi body with a certainty of 0.4600, or in the endoplasmic reticulum (lumen) with a 
certainty of 0.1000. The most likely cleavage site for NOV17D is between positions 22 and 
23: HTP-MY. 



Table 17H. Encoded NOV17d protein sequence (SEQ ID NO:76). 

MALIGNLSMILLIFLDIHLHTPMYFLLSQLSLIDLNYISTIVPKMVYDFLYGNKSISFTGCGIQSFFFLTLA 
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VAE GLLLTSMA YD R Y VA I C F P LH Y P I R I S KR VC VMM I TG S WM I S S I N S C AHT V Y AL C I P Y C K S RAI NHF FC D 
VPAMLTLACTDTWVYESTVFLSSTIFLVLPFTGIACSYGRVLLAVYRMHSAEGRKKAYSTCSTHLTWSFYY 
APFAYTYVRPRSLRSPTEDKILAVFYTILTPMLNPI I YSLRNKEVMGVLTQVIQKIFSVKM 



A search of sequence databases reveals that the NOV17d amino acid sequence has 138 
of 269 amino acid residues (51%) identical to, and 183 of 269 amino acid residues (68%) 
similar to, the 316 amino acid residue ptnr:SPTREMBL-ACC:Q9D3U9 protein from Mus 
5 musculus (Mouse) (4933433E02rik Protein) (E - 3.9e 71 ). 

NOV17d is predicted to be expressed in at least the following tissues: Apical microvilli 
of the retinal pigment epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma 
cell lines, corpus callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral 
tissue, cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary 
10 artery and umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal 

hematopoietic cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, 
lung lymphoma cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express 
MHC II and III nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, 
pons, prostate, putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary 
15 artery in aortic) spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, 

thalamus, and thymus tissue. This information was derived by determining the tissue sources 
of the sequences that were included in the invention including but not limited to SeqCalling 
sources, Public EST sources, Literature sources, and/or RACE sources. 

The disclosed NOV 17a polypeptide has homology to the amino acid sequences shown 
20 in the BLASTP data listed in Table 1 71. 



Table 171. BLAST results for NOV17a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


>gi | 17445356 | ref | XP 

060561 . 1 | 
(XM_060561) 


similar to 
OLFACTORY 
RECEPTOR 2T1 

{ OLFACTORY 
RECEPTOR 1-25) 

(ORl-25) (H. 
sapiens) [Homo 
sapiens] 


312 


312/312 
(100%) 


312/312 
(100%) 


e-149 


gi | 1744534 8 | ref |XP_ 
060559 . 1 | 
(XM_060559) 


similar to 
OLFACTORY 
RECEPTOR 2T1 

(OLFACTORY 
RECEPTOR 1-25) 

(ORl-25) (H. 
sapiens) [Homo 
sapiens] 


533 


199/233 
(85%) 


206/233 
(88%) 


le-95 
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gi | 17437047 | ref | XP_ 
060312 . 1 | 
( XM 0 6 0 3 1 0 ) 


similar to 
OLFACTORY 
RECEPTOR 2T1 

(OLFACTORY 
RECEPTOR 1-25) 

(ORl-25) (H. ; 
sapiens) [Homo 
sapiens] 


472 


149/299 
(49%) 


211/299 
(69%) 


5e-78 


gi | 17437056 | ref | XP_ 
060314 . 1 | 
(XM_060314) 


similar to 
OLFACTORY 
RECEPTOR 2T1 

(OLFACTORY 
RECEPTOR 1-25) 

(ORl-25) (H . 
sapiens ) [Homo 
sapiens] 


695 


155/295 
(52%) 


209/295 
(70%) 


le-74 


gi | 174 5 65 95 | ref |XP_ 
065073 . 1 | 
(XM_065073) 


similar to 
olfactory- 
receptor (H. 
sapiens) [Homo 
sapiens] 


638 


138/296 
(46%) 


193/296 
(64%) 


le-73 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 17J. In the ClustalW alignment of the NOV 17 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 



15 



20 



25 



30 



35 



Table 17J. ClustalW Analysis of NOV17 



Novel NOV17a 
Novel NOV17b 
Novel NOV17C 
Novel NOV17d 



(SEQ ID NO: 70) 

(SEQ ID NO: 72) 

(SEQ ID NO: 74) 

(SEQ ID NO: 76) 



1) 
2) 
2) 
2) 

3) gi | 17445356 | ref |XP_060561 . 1 | (XM_060561) s 
(OLFACTORY RECEPTOR 1-25) (ORl-25) (H. sapiens 

4) gi | 17445348 | ref |XP_060559 . 1 | (XM_060559) s 
(OLFACTORY RECEPTOR 1-25) (ORl-25) (H. sapiens 

5) gi | 17437047 | ref |XP_060312 . 1 | (XM_060312) s 
(OLFACTORY RECEPTOR 1-25) (ORl-25) (H. sapiens 

6) gi | 17437056 | ref |XP_060314 . 1 | (XM_06 03 14 ) s 
(OLFACTORY RECEPTOR 1-25) (ORl-25) (H. sapiens 

7) gi | 17456595 | ref |XP_065073 . 1 | (XM_065073) s 
sapiens) [Homo sapiens] (SEQ ID NO: 3 98) 



imilar to OLFACTORY RECEPTOR 2T1 
) [Homo sapiens] (SEQ ID NO: 3 94) 
imilar to OLFACTORY RECEPTOR 2T1 
) [Homo sapiens] (SEQ ID NO: 3 95) 
imilar to OLFACTORY RECEPTOR 2T1 
) [Homo sapiens] (SEQ ID NO: 3 96) 
imilar to OLFACTORY RECEPTOR 2T1 
) [Homo sapiens] (SEQ ID NO: 3 97) 
imilar to olfactory receptor (H. 



NOV17a 
NOV17b 
NOV17C 
NOV17d 
gi|l7445356| 
gi j 17445348 | 
gi j 17437047 j 
gi j 17437056 j 
gi j 17456595 j 



10 20 30 40 50 60 

....|....|....|....|....|....|....|....|..-.|--.-|--..|-.-.| 

1 1 

x 1 

1 1 

1 1 

1 1 

1 1 

1 - MDGLARLEE E PQARGAAE AM AW AQG - - SCKVGTEDKEATVAAAQG 43 

1 MCSGNQTSQNQTASTDFTLTGLFAESKHAALLYTVTFLLFLMALTGNALLILLIHSEPRL 60 

! 1 
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70 



90 



NO VI 7 a 

NOVl7b 

NOV17 c 

NOV17d 

gi | 17445356 | 

gi | 17445348 | 

gi | 17437047 j 

gi|l7437056| 

gi|l7456595| 



NOV17a 

NOV17b 

NOV17 c 

NOV17d 

gi | 17445356 | 

gi | 17445348 | 

gi | 17437047 j 

gi | 17437056 | 

gi|l7456595| 



NO VI 7 a 

NO VI 7b 

NO VI 7 c 

NO VI 7 d 

gi | 17445356 | 

gi | 17445348 | 

gi | 17437047 | 

gi | 17437056 | 

gi|l7456595| 



NOV17a 
NO VI 7b 
NO VI 7c 
NO VI 7 d 
gi|l7445356| 
gi j 17445348 | 
gi j 17437047 | 
gi | 17437056 | 
gi|l7456595 j 



NO VI 7 a 

NO VI 7b 

NOV17C 

NO VI 7 d 

gi | 17445356 | 

gi | 17445348 j 

gi j 17437047 j 

gi | 17437056 | 

gi j 17456595 | 



1 
1 
1 
1 
1 
1 

43 
61 
1 



1 
1 
1 
1 
1 
1 

146 



100 



110 
..|.. 



120 



- -QTDWSRREI ISEDKMFRTTTAGFQAESGVAG CTGPDVTLMWLRLD - LEGFMR 95 

HTPMYFFISQLALMDLMYLCVTVPKMLVGQVTGDDTISPSGCGIQMFFYLTLAGAEVFLL 12 0 
1 



130 



140 



150 



160 



170 
..|.. 



180 



! 

x 

96 QQGDRGKVRGTTRPLAWKLHPDG — TLRSVTSTADLSHLDRVLLPP SWSLCLP 146 

121 AAMAYDRYAAVCRPLHYPLLMNQRVCQLLVSACWVLGMVDGLLLTPITMSFPFCQSRKIL 18 0 

1 1 



190 
..|.. 



200 



210 



220 
..|.. 



230 
..|.. 



240 



1 
1 
1 
1 
1 
1 

146 



181 SFFCETPALLKLSCSDVSLYKTLMYLCCILMLLAPIMVISSSYTLILHLIHRMNSAAGHR 24 0 
1 1 



250 
..|.. 



260 



270 



280 



290 



300 
•-I 



1 

1 

147 VALG 150 

241 KALATCSSHMIIVLLLFGASFYTYMLPSSYHTAEQDMMVSAFYTIFTPVLNPLIYSLRNK 3 00 

1 1 



350 



360 




370 



NOV17 a 


48 1 






NOVl7b 


4 8 




JSm 


NO VI 7c 


13 I 


F 


MIH 


NO VI 7 d 


13 






gi 1 17445356 | 








gi j 17445348 | 


49 | 






gi j 17437047 j 


202 j 


Q 





I 



380 
• - I - 



390 



400 



LHTPMYFLLSQLSL 
LHTPMYFLLSQLSL 
LHTPMYFLLSQLSL 
LHTPMYFLLSQLSL 
LHTPMYFLLSQLSL 
LHTPMYFLLSQLS 
LHTPMYFLLSQLS 




410 



420 



82 
82 
47 
47 
82 



VS FLSQSKTI RFLGCE I QTYVFLALG 108 
A 23 6 



153 




gi|l7437056 


361 


gi jl7456595 


49 


NOV17 a 




NO VI 7b 


8 2 


NOV17C 


4 7 


NOV17d 


47 


gi|l7445356 


82 


gi | 17445348 


109 


gij 17437047 


236 


gi|l7437056 


395 


gi | 17456595 


83 


NO VI 7 a 


82 


NO VI 7b 


82 


NOV17C 


47 


NOV17d 


47 


gi|l7445356| 82 


gi| 17445348 


169 


gi | 17437047 


i 236 


gi | 17437056 


395 


gi j 17456595 


1 83 


NO VI 7 a 


82 


NO VI 7b 


82 


NO VI 7c 


47 


NOVl7d 


47 



gi | 17445356 | 
gij 17445348 | 
gij 17437047 | 
gi|l7437056 j 
gijl7456595j 



236 
395 
83 



NO VI 7 a 


82 


NO VI 7b 


82 


NOV17C 


47 


NO VI 7 d 


47 


gi 


17445356 | 


82 


gi 


17445348 j 


289 


gi 


17437047 j 


236 


gi 


17437056 j 


395 


gi 


17456595 j 


83 



NOV17a 


128 


NO VI 7b 


128 


NOV17C 


93 


NOV17d 


93 


gi |17445356 


128 


gi j 17445348 


349 


gij 17437047 


282 


gij 17437056 


441 


gi|l7456595 


129 


NOV17 a 


188 


NO VI 7b 


188 


NO VI 7c 


153 


NOV17d 


153 


gi|l7445356| 188 



395 

83 



430 



440 



450 



460 
..|.. 



470 



480 



82 

82 

47 

47 

82 

GTEALLLGFMSYDRYVAICHPLHYPMLMSKKICCLMVACAWASGSINAFIHTLYVFQLPF 16 8 

236 

395 

83 



490 



500 
..|.. 



510 



520 
..|.. 



530 



540 



82 

82 

47 

47 

82 

228 

236 

395 

83 



550 



560 



570 
..|.. 



580 



590 



600 
•I 



82 

82 

47 

47 

82 82 

2 2 9 SSGKGQAKAVSTCSSHLIVASLFYATTLFTYTRPHSLRSPSRDKAVAVFYTI VTPLLNPF 28 8 



236 
395 
83 



660 




670 



680 



690 




CAHTVY 


AFR 


CAHTVY 


AFR 


SiSSSuflAXiC 


CAHTVY 


Aiic 


CAHTVY 


AFR 


CAHTVY 





^GFLLHPITMNVO 
FLLHP VTjjyjQ ; 

Ssiqhsit; 




730 
. . | . . 



740 

A.. 



750 



760 



CTDTWVYEQTVFLSgTIFLVgPF 

ctdtwvyeHtvflsItiflvjSpf 

CTDTWVYeBtVFLS| TI FLVfflPF 

ctdtwvyebtvfls|tiflvgpf 
ctdtwvyeStvflsItiflvHpf 




I 



770 
• ■ L- 



SAEGRKKAY 
SAEGRKKAY 
SAEGRKKAY 
SAEGRKKAY 
SAEGRKKAY 
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gi 1 17445348 I 
gi j 17437047 j 
gi|l7437056 j 
gi | 17456595 | 



NOV17 a 


248 






NOV17b 


248 


T 


M, 


NOV17C 


213 


1 




NOV17d 


213 




gi 


17445356 | 


248 




gi 


17445348 | 


469 






gi 


17437047 | 


402 


s 




gi 


17437056 j 


561 




gi 


17456595 j 


249 


GL| 




NOV17a 


298 


NOV17b 


298 


NO VI 7 c 


263 


NOV17d 


263 


gi | 17445356 | 


298 


gi j 17445348 | 


519 


gi 1 17437047 j 


452 


gi|l7437056| 


611 


gi j 17456595 j 


309 


NOV17a 


312 


NOV17b 


312 


NOV17c 


277 


NOV17d 


277 


gi | 17445356 | 


312 


gi j 17445348 | 


533 


gi | 17437047 | 


472 


gij 17437056] 


652 


gi | 17456595 | 


369 



NOVl7a 

NOVl7b 

NOV17C 

NO VI 7 d 

gi | 17445356 | 

gi | 17445348 j 

gij 17437047 j 

gij 17437056 j 

gi|l7456595| 



NOV17a 
NOV17b 
NO VI 7c 



312 
312 
277 
277 
312 
533 
472 
695 
429 



NOVl7a 


312 


NOV17b 


312 


NOV17C 


277 


NO VI 7 d 


277 


gi 


17445356 


312 


gi 


17445348 


533 


gi 


17437047 


472 


gi 


17437056 


695 


gi 


17456595 


489 



312 
312 
277 



850 




860 
..| 



870 



880 




890 



900 
•I 



Rfa ccssaqkvaJcsda 

l|O^VGRM- - EWKTLPFQALQVRCVKWRRsMlVSSFIA TER 

PSARPLNGP^HAg^TCSGRCLPGESHVSLISLVEPPAVE*SVfGASVKGCPRTWCLPREQ 



312 
312 
277 
277 
312 
533 
472 
651 
368 



910 



920 
..| 



930 



940 
..|.. 



950 



960 



TLADTSHSSSHAEFPERGVR-MNCSKLFSLVEEPVTSLGDLFNFR 

VLWDGPDSGTSLESKQPHQEGLSDMHLSNTICTLVSELNQFWAYPIQHDLPKEVLLTPAP 



312 
312 
277 
277 
312 
533 
472 
695 
428 



970 



980 
..|.. 



990 
..|.. 



1000 

..I... 



1010 

..j... 



102C 
-•I 



CKVGAII IHPAAREDTLNTSQETPGTPKCYRGKNIKGVKEGKAEPEGPVGPETVGSKTEM 



312 
312 
277 
277 
312 
533 
472 
695 
488 



1030 



1040 
..|... 



1050 



1060 



1070 
..|... 



1080 
■-I 



312 
312 
277 
277 
312 
533 
472 
695 
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1100 
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1130 
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--- 312 
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NOVl7d 

gi | 17445356 | 

gi | 17445348 j 

gi | 17437047 j 
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gi | 17437047 | 
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54 9 S I RRQRE FMPEE KKDTV YWE KRR KNNE AAKRSRE KRRLND AA I EGRLAALMEENALLKGE 608 

1150 1160 1170 

....|. ...|....|....|....|....| 

312 312 

312 312 

277 277 

277 277 

312 312 

533 533 

472 472 

695 695 

609' LKALKLRFGLLPLTGSAIGSPLDWGPPAWG 638 
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Table 17F lists the domain description from DOMAIN analysis results against NOV 17. 
This indicates that the NOV 17 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 17F Domain Analysis of NO VI 7 

gnl | Pfam|pfam00001, 7tm_l , 7 transmembrane receptor (rhodopsin 
family). (SEQ ID NO:810) 

CD-Length = 254 residues, 100.0% aligned 
Score = 99.4 bits (246), Expect = 3e-22 
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NOV1 7 : 
Sbjct : 


40 
1 


GNLSMILLIFLDTHLHTPMYFLLSQLSLIDLNYISTIVPKMASDFLYGNKSISFIGCGIQ 

III +U + 1 1 II 1 I++ II ++ 1+ 1 + 1+ 1 + 
GNLLVI LVI LRTKKLRTPTNI FLLNLAVADLLFLLTLPPWAL YYLVGGDWVFGDALCKLV 


99 
60 


NOV17: 


100 


S FFFMTFAGAEALLLTSMAYDRYVAI CFPLHYP I RMS KRMYVLMI TGS WMI GS INS CAHT 


159 


Sbjct : 


61 


1+ 1 IIII+++ lll+ll III + I ++I I++ + 1 
GALFWNGYAS I LLLTAI S I DRYLAI VHPLRYRRI RTPRRAKVLI LLVWVLALLLSLPPL 


120 


NOV17 : 


160 


VYAFRIPYCKSRAINHFFCDVPAMLTLACTDTWVYEYTVFLSSTIFLVFPFTGIACSYGW 


219 


Sbjct : 


121 


+ + + 1 1 11+ + 1 1 1 1 
LF SWLRTVEEGNTTVCLIDFPEESVKRSYVLLSTLVGFVLPLLVILVCYTR 


171 


NOV17 : 


220 


VLLAV YRMHS AEGRKKAYSTCS THLTWTF Y Y APFAYT YLCPRS LRS 


266 


Sbjct : 


172 


+1 + + 1+ III + 1 + + 1 
ILRTLRKRARSQRSLKRRSSSERKAAKMLLVWWFVLCWLPYHIVLLLDSLCLLSIWRV 


231 


NOV17 : 


267 


LTEDKVLAVFYTI LTPMLNPI I Y 289 

1 + + ++ + 1 1 1 1 1 1 
LPT ALL I TLWLAYVNS CLNP I I Y 254 




Sbjct : 


232 





45 

G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
50 signals. Previously, GPCR genes cloned in different species were from random locations in the 
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respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
5 protein-coupled receptors in a number of species. These receptors share a seven 

transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
10 displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV 17 nucleic acid of the invention encoding a G-Protein Coupled 
Receptor -like protein includes the nucleic acid whose sequence is provided in Table 17A, 
17C or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 

15 whose bases may be changed from the corresponding base shown in Table 17A or 17C while 
still encoding a protein that maintains its G-Protein Coupled Receptor -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 

20 invention additionally includes nucleic acids or nucleic acid fragments, or complements 

thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 

25 antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 7 percent of the bases may be so 
changed. 

The disclosed NOV 17 protein of the invention includes the G-Protein Coupled 
Receptor -like protein whose sequence is provided in Table 17B or 17D. The invention also 
30 includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 17B or 17D while still encoding a protein that 
maintains its G-Protein Coupled Receptor -like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 54 percent of the 
residues may be so changed. 
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The invention further encompasses antibodies and antibody fragments, such as F ab or 
(F a b)2,that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G-Protein Coupled Receptor -like 
protein (NOV 17) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
5 NOV 17 nucleic acids and proteins identified here may be useful in potential therapeutic 

applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

10 delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV17 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in developmental diseases, MHCII and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 

15 Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NTDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV- 1 or HIV-2), pain, 

20 cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 

uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 
Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 

25 disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 

mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 

NOV 17 nucleic acids and polypeptides are further useful in the generation of 

30 antibodies that bind immuno-specifically to the novel NOV17 substances for use in 

therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV1 7 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
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assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV18 

5 NOV1 8 includes three novel G-Protein Coupled Receptor -like proteins disclosed 

below. The disclosed sequences have been named NOV 18a and NOV 18b. 
NO VI 8a 

A disclosed NOV18a nucleic acid of 1062 nucleotides (also referred to as CG56663- 
01) encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 18 A. An 
10 open reading frame was identified beginning with an ATG initiation codon at nucleotides 1 0- 
12 and ending with a TAA codon at nucleotides 948-950. The start and stop codons are 
shown in bold in Table 18A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 18A. NOV18a nucleotide sequence (SEQ ID NO:77). 

TAGAGATGGATGGAACCAATGGCAGCACCCAAACCCATTTCATCCTACTGGGATTCTCTGACCGACCCCATC 
TGGAGAGGATCCTCTTTGTGGTCATCCTGATCGCGTACCTCCTGACCCTCGTAGGCAACACCACCATCATCC 
TGGTGTCCCGGCTGGACCCCCACCTCCACACCCCCATGTACTTCTTCCTCGCCCACCTTTCCTTCCTGGACC 
TCAGTTTCACCACCAGCTCCATCCCCCAGCTGCTCTACAACCTTAATGGATGTGACAAGACCATCAGCTACA 
TGGGCTGTGCCATCCAGCTCTTCCTGTTCCTGGGTCTGGGTGGTGTGGAGTGCCTGCTTCTGGCTGTCATGG 
CCTATGACCGGTGTGTGGCTATCTGCAAGCCCCTGCACTACATGGTGATCATGAACCCCAGGCTCTGCCGGG 
GCTTGGTGTCAGTGACCTGGGGCTGTGGGGTGGCCAACTCCTTGGCCATGTCTCCTGTGACCCTGCGCTTAC 
CCCGCTGTGGGCACCACGAGGTGGACCACTTCCTGCGTGAGATGCCCGCCCTGATCCGGATGGCCTGCGTCA 
GCACTGTGGCCATCGAAGGCACCGTCTTTGTCCTGAAAAAAGGTGTTGTGCTGTCCCCCTTGGTGTTTATCC 
TGCTCTCTTACAGCTACATTGTGAGGGCTGTGTTACAAATTCGGTCAGCATCAGGAAGGCAGAAGGCCTTCG 
GCACCTGCGGCTCCCATCTCACTGTGGTCTCCCTTTTCTATGGAAACATCATCTACATGTACATGCAGCCAG 

GAG C C AGTT CT TC C C AGG ACC AGGG C ATGTT C C TC ATGCT C T T C T ACAAC ATTGTC AC CCCCCTCCT C AAT C 
CTCTCATCTACACCCTCAGAAACAGAGAGGTGAAGGGGGCACTGGGAAGGTTGCTTCTGGGGAAGAGAGAGC 
TAGGAAAGGAGTA AAGGCATCTCCACCTGACTTCACTTCCATCCAGGGCCACTGGCAG CATCTGGAACGGCT 
GAATTCCAGCTGATATTAGCCCACGACTCCCAACTTGCCTTTTTCTGGACTTTT 



The disclosed NOV 18a polypeptide (SEQ ID NO:78) encoded by SEQ ID NO:77 has 
314 amino acid residues and is presented in Table 18B using the one-letter amino acid code. 



Table 18B. Encoded NO VI 8a protein sequence (SEQ ID NO:78). 

MDGTNGSTQTHFILLGFSDRPHLERILFWILIAYLLTLVGNTTIILVSRLDPHLHTPMYFFLA 
HLSFLDLSFTTSSIPQLLYNLNGCDKTISYMGCAIQLFLFLGLGGVECLLLAVMAYDRCVAICK 
PLHYMVIMNPRLCRGLVSVTWGCGVANSLAMSPVTLRLPRCGHHEVDHFLREMPALIRMACVST 
VAIEGTVFVLKKGWLSPLVFILLSYSYIVRAVLQIRSASGRQKAFGTCGSHLTWSLFYGNII 
YMYMQPGASSSQDQGMFLMLFYNIVTPLLNPLIYTLRNREVKGALGRLIiLGKRELGKE 

A search of sequence databases reveals that the NOV 18a amino acid sequence has 194 
20 of 237 amino acid residues (81%) identical to, and 215 of 237 amino acid residues (90%) 
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similar to, the 237 amino acid residue ptnr: SPTREMBL-ACC:Q9R0G5 protein from 
Marmota marmota (European marmot) (Olfactory Receptor) (E = 3.5e~ 102 ). 



NOV18b 

A disclosed NOV 18b nucleic acid of 1062 nucleotides (also referred to as CG56663- 
02) encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 18C. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 6-8 
and ending with a TAA codon at nucleotides 948-950. The start and stop codons are shown in 
bold in Table 18C, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 18C. NOV18b nucleotide sequence (SEQ ID NO:79). 



TAG AG AT GG ATGG AAC C AATGGC AG CAC CC AAAC C C ATTT CAT C C TACTGGGAT TCT C TGAC CG AC C C CAT C 
TGGAGAGGATCCTCTTTGTGGTCATCCTGATCGCGTACCTCCTGACCCTCGTAGGCAACACCACCATCATCC 
TGGTGTCCCGGCTGGACCCCCACCTCCACACCCCCATGTACTTCTTCCTCGCCCACCTTTCCTTCCTGGACC 
TCAGTTTCACCACCAGCTCCATCCCCCAGCTGCTCTACAACCTTAATGGATGTGACAAGACCATCAGCTACA 
TGGGCTGTGCCATCCAGCTCTTCCTGTTCCTGGGTCTGGGTGGTGTGGAGTGCCTGCTTCTGGCTGTCATGG 
CCTATGACCGGTGTGTGGCTATCTGCAAGCCCCTGCACTACATGGTGATCATGAACCCCAGGCTCTGCCGGG. 
GCTTGGTGTCAGTGACCTGGGGCTGTGGGGTGGCCAACTCCTTGGCCATGTCTCCTGTGACCCTGCGCTTAC 
CCCGCTGTGGGCACCACGAGGTGGACCACTTCCTGCGTGAGATGCCCGCCCTGATCCGGATGGCCTGCGTCA 
GCACTGTGGCCATCGACGGCACCGTCTTTGTCCTGGCGGTGGGTGTTGTGCTGTCCCCCTTGGTGTTTATCC 
TGCTCTCTTACAGCTACATTGTGAGGGCTGTGTTACAAATTCGGTCAGCATCAGGAAGGCAGAAGGCCTTCG 
GCACCTGCGGCTCCCATCTCACTGTGGTCTCCCTTTTCTATGGAAACATCATCTACATGTACATGCAGCCAG 
GAGCCAGTTCTTCCCAGGACCAGGGCATGTTCCTCATGCTCTTCTACAACATTGTCACCCCCCTCCTCAATC 
CTCTCATCTACACCCTCAGAAACAGAGAGGTGAAGGGGGCACTGGGAAGGTTGCTTTTGGGGAAGAGAGAGC 
TAGGAAAGGAGTA AAGGCATCTCCACCTGACTTCACTTCCATCCAGGGCCACTGGCA GCATCTGGAACGGCT 
GAATTCCAGCTGATATTAGCCCACGACTCCCAACTTGCCTTTTTCTGGACTTTT 



In a search of public sequence databases, the NOV 18b nucleic acid sequence has600 of 
710 bases (84%) identical to a gb:GENBANK-ID:AX008326|acc:AX008326.1 mRNA from 
Marmota marmota (Sequence 24 from Patent W09967282) (E = 8.8e 109 ). 

1 5 The disclosed NOV1 8D polypeptide (SEQ ID NO:80) encoded by SEQ ID NO:79 has 

314 amino acid residues and is presented in Table 18D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV18b has A signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV18b may also localize to the Golgi body with a certainty of 0.4000, the endoplasmic 

20 reticulum (membrane) with a certainty of 0.3000, or in the endoplasmic reticulum (lumen) 

with a certainty of 0.3000. The most likely cleavage site for NOV 18b is between positions 42 
and 43: LVG-NT. 



Table 18D. Encoded NOV18b protein sequence (SEQ ID NO:80). 

MDGTNGSTQTHFI LLGFSDRPHLERI LFWI LI AYLLTLiVGNTTI I LVSRLDPHLHTPMYFFLAHLSFLDLS~ 



160 




FTTSSIPQLLYNLNGCDKTISYMGCAIQLFLFI J GLGGVECLLLAVMAYDRCVAICKPLHYMVI^4NPRLCRGL 
VSVTWGCGVANSLAMSPVTLRLPRCGHHEVDHFLREMPALIRMACVSTVAIDGTVFVLAVGWLSPLVFILL 
SYSYIVRAVLQIRSASGRQKAFGTCGSHLTWSLFYGNIIYMYMQPGASSSQDQGMFLMLFYNIVTPLLNPL 
I YTLRNREVKGALGRLLLGKRELGKE 

A search of sequence databases reveals that the NOV 18b amino acid sequence has 183 
of 305 amino acid residues (60%) identical to, and 237 of 305 amino acid residues (77%) 
similar to, the 320 amino acid residue ptnr:SPTREMBL-ACC:Q9Y3N9 protein from Homo 
5 sapiens (Human) (DJ88J8.1 (Novel 7 Transmembrane Receptor (Rhodopsin Family) 
(Olfactory Receptor Like) Protein) (HS6M1-15))) (E = 2.8e' 98 ). 

NOV 18b is predicted to be expressed in at least the following tissues: Apical microvilli 
of the retinal pigment epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma 
cell lines, corpus callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral 

10 tissue, cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary 
artery and umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal 
hematopoietic cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, 
lung lymphoma cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express 
MHC II and III nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, 

15 pons, prostate, putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary 
artery in aortic) spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, 
thalamus, and thymus tissue. This information was derived by determining the tissue sources 
of the sequences that were included in the invention including but not limited to SeqCalling 
sources, Public EST sources, Literature sources, and/or RACE sources. 

20 The disclosed NOV 18a polypeptide has homology to the amino acid sequences shown 

in the BLASTP data listed in Table 18E. 



Table 18E. BLAST results for NO VI 8a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 17445344 | ref |XP_ 
060558 . 1 | 
(XM_060558) 


similar to 
olfactory 
receptor (H. 
sapiens) [Homo 
sapiens] 


314 


314/314 
(100%) 


314/314 
(100%) 


e-164 . 


gi | 5901478 | gb| AAD55 
304 . 1 | AF044033_1 
(AF044033) 


olfactory 
receptor [Marmota 
marmot a] 


237 


194/237 
(81%) 


215/237 
(89%) 


2e-99 


gi | 13 6243 2 9 | ref |NP_ 
112165 . 1 | 
(NM_030903) 


olfactory 
receptor, family 
2, subfamily W, 
member 1 [Homo 
sapiens] 


320 


184/305 
(60%) 


236/305 
(77%) 


le-94 



161 




gi | 120 54431 | emb | CAC 
20523 . 1 | (AJ302603) 


olfactory 
receptor [Homo 
sapiens] 


320 


184/305 
(60%) 


236/305 
(77%) 


le-94 


gi | 12054429 | emb | CAC 
20522. l| (AJ302602) 


olfactory- 
receptor [Homo 
sapiens] 


320 


184/305 
(60%) 


235/305 
(76%) 


2e-94 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 1 8F. In the ClustalW alignment of the NOV1 8 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 Table 18F. ClustalW Analysis of NOV18 

1) Novel NOV18a (SEQ ID NO: 78) 

2) Novel NOV18b ' (SEQ ID NO: 80) 

3) gi | 17445344 | ref | XP_060558 . 1 | (XM_060558) similar to olfactory receptor (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 3 99) 

15 4) gi | 5901478 | gb | AAD55304 . 1 | AF044033_1 (AF044033) olfactory receptor [Marmota 

marmota] (SEQ ID NO: 4 00) 

5) gi | 13624329 | ref |NP_112165 . 1 | (NM_030903) olfactory receptor, family 2, subfamily 
W, member 1 [Homo sapiens] (SEQ ID NO: 4 01) 

6) gi | 12054431 | emb | CAC20523 . 1 | (AJ302603) olfactory receptor [Homo sapiens] (SEQ ID 
20 NO: 4 02) 

7) gi | 12054429 | emb |CAC20522 . l| (AJ302602) olfactory receptor [Homo sapiens] (SEQ ID 
NO : 4 0 3 ) 



25 



30 



35 



40 



45 



50 



NOVl8a 

NOV18b 

gi | 17445344 | 

gi | 5901478 | 

gi|l3624329| 

gi | 12054431 | 

gi | 12054429 | 



NOV18a 

NOV18b 

gi 1 17445344 | 

gi | 5901478 | 

gi j 13624329 | 

gi | 12054431 | 

gi | 12054429 | 



NOV18a 

NOVl8b 

gi | 17445344 | 

gi | 5901478 | 

gi|l3624329| 

gi |12054431| 




I 



FILLGFS 
FILLGFS 
FILLGFS 




50 


IIL 




3r 


LD 


P 


IIL 




1 R 


LD 


P 


IIL 




8r 


LD 


P 




Itlvgnt 


Jul 


i s B 


LD 




Jtlvgnt 


HlIL 




LD 




Itlvgnt 


Jul 


S S B 


LD 






110 



120 




162 



gi I 12054429 I 121 [^FT^ 



180 



190 



200 



210 



220 



230 



240 



NOV18a 

NOVl8b 

gi | 17445344 | 

gi | 5901478 | 

gi|l3624329| 

gi|l205443lj 

gi j 12054429 j 



NOV18a 


241 


NOV18b 


241 


gi| 17445344 | 


241 


gi| 5901478 | 


184 


gi|13624329| 


241 


gi|l205443l| 


241 


gi j 12054429 | 


241 


NOVl8a 


301 


NOV18b 


301 


gi| 17445344 | 


301 


gi|5901478| 


237 


gi|l3624329| 


301 


gi|l205443l| 


301 


gi | 12054429 j 


301 




CGSHLTWSgFYGiJI 

cgshltwsSfygSi 
cgshltwsSfygSi 
cgshltwsSfygSi 
cgshltwsSfygBi 

CGS HLT VVSiSf YGIil 

cgshltvysSfygHi 



310 



320 




lgkrelc 

LGKRBLG 
iGKRBLG 




314 

314 

314 

237 

IRFHHIstHiKRNCKS 320 

iRFHHKSTgiKRNCKS 320 

iRFHHKSTSlKRNCKS 320 



Tables 18G lists the domain descriptions from DOMAIN analysis results against 
NOV1 8. This indicates that the NOV1 8 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 18G Domain Analysis of NOV18 

gnl | Pfam |pf amOOOOl , 7tm_l / 7 transmembrane receptor (rhodopsin 
family). { SEQ ID NO: 810) 

CD-Length = 254 residues, 100.0% aligned 
Score = 95.1 bits (235), Expect = 5e-21 



NOV18 : 


41 


Sbjct : 


1 


NOV18 : 


101 


Sbjct : 


61 


NOV18 : 


161 


Sbjct : 


119 


NOV18 : 


221 


Sbjct : 


172 


NOV18 : 


268 



GNTTI ILVSRLDPHLHTPMYFFLAHX,SFiDLSFTTSSIPQLLYNLNGCDKTISYMGCAIQ 100 

II + ||| III I I +1+ I I I + I I I I I I I + 

GNLLVILVILRTKKLRTPTNI FLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 6 0 



I |+ I III + + II +11 III I II + 1+ + I + M 

GALFWNGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILLVWVLALLLSLP- 



III 



l + l ,+ 



+ + | + 



1+ ll + l 1+ I 

ISLKRRSSSERKAA] 

:VTPLLNPLIY 2: 
I lll+ll 



118 



11+ 11+ 1+ 
-PLLVILVCYTR 171 
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Sbjct: 232 LPTALLITLWLAYVNSCLNPIIY 2 54 

G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
5 seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 

10 olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
protein-coupled receptors in a number of species. These receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 

15 Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV 18 nucleic acid of the invention encoding a G-Protein Coupled 
20 Receptor -like protein includes the nucleic acid whose sequence is provided in Table 1 8A, 

20C or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 18A or 20C while 
still encoding a protein that maintains its G-Protein Coupled Receptor -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
25 nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
30 are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 16 percent of the bases may be so 
changed. 
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The disclosed NOV18 protein of the invention includes the G-Protein Coupled 
Receptor -like protein whose sequence is provided in Table 18B or 20D. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 18B or 20D while still encoding a protein that 
5 maintains its G-Protein Coupled Receptor -like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 40 percent of the 
residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 

10 The above disclosed information suggests that this G-Protein Coupled Receptor -like 

protein (NOV 18) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
NOV 18 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 

1 5 protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV 18 nucleic acids and proteins of the invention are useful in potential 

20 therapeutic applications implicated in developmental diseases, MHCII and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 

25 starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 

fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 

30 Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
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autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 

NOV 18 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV 18 substances for use in 
5 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV18 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
10 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV19 

NOV19 includes three novel G-Protein Coupled Receptor -like proteins disclosed 
below. The disclosed sequences have been named NOV 19a and NOV 19b. 
15 NOV19a 

A disclosed NOV 19a nucleic acid of 1046 nucleotides (also referred to as CG56665- 
01) encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 19A. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 14- 
16 and ending with a TGA codon at nucleotides 1019-1021. The start and stop codons are 
20 shown in bold in Table 19A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 19A. NOV19a nucleotide sequence (SEQ ID NO:81). 

TCAACATTATTACATGAACATTTCAGATGTCATCTCCTTTGATATTTTQGTTTCAGCCATGAAAACAGGAAA 
TCAAAGTTTTGGGACAGATTTTCTACTTGTTGGTCTTTTCCAATATGGCTGGATAAACTCTCTTCTCTTTGT 
CGTCATTGCCACCCTCTTTACAGTTGCTCTGACAGGAAATATCATGCTGATCCACCTCATTCGACTGAACAC 
CAGACTCCACACTCCAATGTACTTTCTGCTCAGTCAGCTCTCCATCGTTGACCTCATGTACATCTCCACCAC 
AGTG C C C AAG ATGG C AGT C AGCTT C C T CTC AC AGAGTAAGACC ATT AGATTT TTGGG CTGTG AGATT C AAAC 
GTATGTGTTCTTGGCCCTTGGTGGAACTGAAGCCCTTCTCCTTGGTTTTATGTCTTATGATCGCTATGTAGC 
TATCTGTCACCCTTTACATTATCCTATGCTTATGAGCAAGAAGATCTGCTGCCTCATGGTTGCATGTGCATG 
GGCCAGTGGTTCTATCAATGCTTTCATACATACATTGTATGTGTTTCAGCTTCCATTCTGTAGGTCTCGGCT 
CATTAACCACTTTTTCTGTGAAGTTCCAGCTCTACTATCATTGGTGTGTCAGGACACCTCCCAGTATGAGTA 
TACAGTCCTCCTGAGTGGACTTATTATCTTGCTACTACCATTCCTAGCCATTCTGGCTTCCTATGCTCGTGT 
GCTTATTGTGGTATTCCAGATGAGCTCAGGAAAAGGACAGGCAAAAGCTGTTTCCACTTGTTCCTCCCACCT 
G ATTGT GGC AAG CC TGT TCT AT GCAAC C AC T CT CT TT AC C T AC AC AAGGC C AC ACT C C T TG CGTT CC CCT T C 
ACGGGATAAGGCGGTGGCAGTATTTTACACCATTGTCACACCTCTACTGAACCCATTTATCTACAGCCTGAG 
AAATAAGGAAGTGACGGGGGCAGTGAGGAGACTGTTGGGATATTGGATATGCTGTAGAAAATATGACTTCAG 
ATCTCTGTATTGATTGAGCATTAACAACATAAAAAGCT 



The disclosed NOV19a polypeptide (SEQ ID NO:82) encoded by SEQ ID NO:81 has 
335 amino acid residues and is presented in Table 19B using the one-letter amino acid code. 

25 
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Table 19B. Encoded NOV19a protein sequence (SEQ ID NO:82). 

MNISDVISFDILVSAMKTGNQSFGTDFLLVGLFQYGWINSLLFWIATLFTVALTGNIMLIHLI 

RLNTRLHTPMYFLLSQLSIVDLMYISTTVPKMAVSFLSQSKTIRFLGCEIQTYVFLALGGTEAL 

LLGFMSYDRYVAICHPLHYPMLMSKKICCLMVACAWASGSINAFIHTLYVFQLPFCRSRLINHF 

FCEVPALLSLVCQDTSQYEYTVLLSGLIILLLPFLAILASYARVLIWFQMSSGKGQAKAVSTC 

SSHLIVASLFYATTLFTYTRPHSLRSPSRDKAVAVFYTIVTPLLNPFIYSLRNKEVTGAVRRLLGYWIC 

CRKYDFRSLY 



A search of sequence databases reveals that the NOV19a amino acid sequence has 155 
of 309 amino acid residues (50%) identical to, and 199 of 309 amino acid residues (64%) 
similar to, the 316 amino acid residue ptnr: TREMBLNEW-ACC:AAG45196 protein from 
5 Mus musculus (Mouse) (T2 Olfactory Receptor) (E = 9.3e" 79 ). 



NOV19b 

A disclosed NOV 19b nucleic acid of 1046 nucleotides (also referred to as CG56665- 
02) encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 19C. An 
10 open reading frame was identified beginning with an ATG initiation codon at nucleotides 59- 
60 and ending with a TGA codon at nucleotides 1019-1021. The start and stop codons are 
shown in bold in Table 19C, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 19C. NOV19b nucleotide sequence (SEQ ID NO:83). 

TCAACATTATTACATGAACATTTCAGATGTCATCTCCTTTGATATTTTGGTTTCAGCC ATGAAAACAGGAAA 
TCAAAGTTTTGGGACAGATTTTCTACTTGTTGGTCTTTTCCAATATGGCTGGATAAACTCTCTTCTCTTTGT 
CGTCATTGCCACCCTCTTTACAGTTGCTCTGACAGGAAATATCATGCTGATCCACCTCATTCGACTGAACAC 
CAGACTCCACACTCCAATGTACTTTCTGCTCAGTCAGCTCTCCATCGTTGACCTCATGTACATCTCCACCAC 
AGTGCCCAAGATGGCAGTCAGCTTCCTCTCACAGAGTAAGACCATTAGATTTTTGGGCTGTGAGATTCAAAC 
GTATGTGTTCTTGGCCCTTGGTGGAACTGAAGCCCTTCTCCTTGGTTTTATGTCTTATGATCGCTATGTAGC 
TATCTGTCACCCTTTACATTATCCTATGCTTATGAGCAAGAAGATCTGCTGCCTCATGGTTGCATGTGCATG 
GGCCAGTGGTTCTATCAATGCTTTCATACATACATTGTATGTGTTTCAGCTTCCATTCTGTAGGTCTCGGCT 
CATTAACCACTTTTTCTGTGAAGTTCCAGCTCTACTATCATTGATGTGTCAGGACACCTCCCAGTATGAGTA 
TACAGTCCTCCTGAGTGGACTTATTATCTTGCTACTACCATTCCTAGCCATTCTGGCTTCCTATGCTCGTGT 
GCTTATTGTGGTATTCCAGATGAGCTCAGGAAAAGGACAGGCAAAAGCTGTTTCCACTTGTTCCTCCCACCT 
GATTGTGGCAAGCCTGTTCTATGCAACCACTCTCTTTACCTACACAAGGCCACACTCCTTGCGTTCCCCTTC 
ACGGG AT AAGG CGGTGG C AGT ATTTT AC AC C AT TGT C AC AC CT CTACTG AACCC ATT TAT CT AC AGC C TG AG 
AAATAAGGAAGTGACGGGGGCAGTGAGGAGACTGTTGGGATATTGGATATGCTGTAGAAAATATGACTTCAG 
ATCTCTGTATTGATTGAGCATTAACAACATAAAAAGCT 



15 In a search of public sequence databases, the NOV 19b nucleic acid sequence has 592 

of 910 bases (65%) identical to a gb:GENBANK-ID:GGCOR4GEN|acc:X94744.1 mRNA 
from Gallus gallus (G. gallus cor4 DNA for olfactory receptor 4) (E = 7.8e" 48 ). 

The disclosed NOV 19b polypeptide (SEQ ID NO: 84) encoded by SEQ ID NO:83 has 
320 amino acid residues and is presented in Table 19D using the one-letter amino acid code. 

20 Signal P, Psort and/or Hydropathy results predict that NOV 19b has A signal peptide and is 

likely to be localized to the plasma membrane with a certainty of 0.4600. Alternatively, 
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NOV 19b may also localize to the microbody (peroxisome) with a certainty of 0.2188, the 
endoplasmic reticulum (membrane) with a certainty of 0.1000, or in the endoplasmic 
reticulum (lumen) with a certainty of 0.1000. The most likely cleavage site for NOV 19b is 
between positions 40 and 41 : ALT-GN. 



Table 19D. Encoded NOV19b protein sequence (SEQ ID NO:84). 

MKTGNQSFGTDFLLVGLFQYGWINSLLFWIATLFTVALTGNIMLIHLIRIJSrTRLHTPMYFLLSQLSIVDLM 
YISTTVPKMAVSFLSQSKTIRFLGCEIQTYVFLALGGTEAIjLLGFMSYDRYVAICHPLHYPMLMSKKICCLM 
VACAWASGSINAFIHTLYVFQLPFCRSRLINHFFCEVPALLSLMCQDTSQYEYTVLLSGLIILLLPFLAILA 
SYARVLIWFQMSSGKGQAKAVSTCSSHLIVASLFYATTLFTYTRPHSLRSPSRDKAVAVFYTIVTPLLNPF 
IYSLRNKEVTGAVRRLLGYWICCRKYDFRSLY 

A search of sequence databases reveals that the NOV 19b amino acid sequence has 155 
of 306 amino acid residues (50%) identical to, and 198 of 306 amino acid residues (64%) 
similar to, the 316 amino acid residue ptnr:TREMBLNEW-ACC:BAB30304 protein from Mus 
musculus (Mouse) (Adult Male Testis cDNA, Riken Full-Length Enriched Library, 
Clone:4932441h21, Full Insert Sequence) (E = 1.3e 79 ). 

NOV19b is predicted to be expressed in at least the following tissues: Apical microvilli 
of the retinal pigment epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma 
cell lines, corpus callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral 
tissue, cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary 
artery and umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal 
hematopoietic cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, 
lung lymphoma cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express 
MHC II and III nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, 
pons, prostate, putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary 
artery in aortic) spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, 
thalamus, and thymus tissue. This information was derived by determining the tissue sources 
of the sequences that were included in the invention including but not limited to SeqCalling 
sources, Public EST sources, Literature sources, and/or RACE sources. 

The disclosed NOV 19a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 19E. 
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Table 19E. BLAST results for NOV19a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 17445348 | ref |XP_ 
060559 . 1 | 
(XM_060559) 


similar to 
OLFACTORY 
RECEPTOR 2T1 

(OLFACTORY 
RECEPTOR 1-25) 

(ORl-25) (H\ 
sapiens) [Homo 
sapiens] 


533 


300/301 
(99%) 


301/301 
(99%) 


e-143 


gi | 17437056 | ref | XP_ 
060314 . 1 | 
(XM_06 0314) 


similar to 
OLFACTORY 
RECEPTOR 2T1 

(OLFACTORY 
RECEPTOR 1-25) 

(ORl-25) (H. 
sapiens) [Homo 
sapiens] 


695 


169/310 
(54%) 


224/310 
(71%) 


5e-84 


gi | 174 4 53 56 | ref | XP_ 
060561 . 1 | 
(XM_060561) 


similar to 
OLFACTORY 
RECEPTOR 2T1 

(OLFACTORY 
RECEPTOR 1-25) 

(ORl-25) (H . 
sapiens) [Homo 
sapiens] 


312 


172/305 
(56%) 


223/305 
(72%) 


3e-80 


gi | 174 5 65 95 | ref | XP_ 

06S073.l| 

(XM 065073) 


similar to 
olfactory- 
receptor (H. 
sapiens) [Homo 
sapiens] 


638 


142/292 
(48%) 


188/292 
(63%) 


7e-78 


gi | 17475192 | ref |XP_ 
062796 . 1 | 
(XM_062796) 


similar to 
olfactory 
receptor (H. 
sapiens) [Homo 
sapiens] 


315 


154/299 
(51%) 


209/299 
(69%) 


2e-77 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 19F. In the ClustalW alignment of the NOV 19 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (/.<?., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 Table 19F. ClustalW Analysis of NOV19 

1) Novel NOV19a (SEQ ID NO: 82) 

2) Novel NOV19b (SEQ ID NO: 84) 

3) gi | 17445348 | ref |XP_060559 . 1 | (XM_060559) similar to OLFACTORY RECEPTOR 2T1 
(OLFACTORY RECEPTOR 1-25) (ORl-25) (H. sapiens) [Homo sapiens] (SEQ ID NO: 395) 

15 4) gi| 17437056 |ref | XP_0603 14 . 1 | (XM_060314) similar to OLFACTORY RECEPTOR 2T1 

(OLFACTORY RECEPTOR 1-25) (ORl-25) (H. sapiens) [Homo sapiens] (SEQ ID NO: 3 97) 
5) gi | 17445356 | ref | XP_06 0 56 1 . 1 | (XM_060561) similar to OLFACTORY RECEPTOR 2T1 
(OLFACTORY RECEPTOR 1-25) (ORl-25) (H, sapiens) [Homo sapiens] (SEQ ID NO: 3 94) 
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6) gi | 17456595 | ref |XP_065073 . 1 | <XM_065073) similar to olfactory receptor (H. 
sapiens) [Homo sapiens] (SEQ ID NO:398) 

7) gi | 17475192 | ref |XP_062796 . 1 | (XM_062796 ) similar to olfactory receptor (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 4 04) 



NOV19a 

NOV19b 

gi | 17445348 | 

gi|l7437056| 

gijl7445356 j 

gi|l7456595| 

gi | 17475192 | 



NOVl9a 


61 | 




NOV19b 


46 | 




gi|l7445348| 


46 




gij 17437056 | 


51 | 


L 


gi j 17445356 | 


45 j 


L 


gi j 17456595 j 


46 ) 




gij 17475192 j 


47 i 


3H 



NOV19a 

NOVl9b 

gi | 17445348 | 

gi | 17437056 j 

gi | 17445356 j 

gi | 17456595 j 

gij 17475192 | 



NOV19a 

NOVl9b 

gi | 17445348 

gijl7437056 

gi|l7445356 

gi j 17456595 

gi j 17475192 



NOV19a 
NOV19b 
gi|l7445348 
gi j 17437056 
gi j 17445356 
gi j 17456595 
gi j 17475192 



NOV19a 

NOV19b 

gi | 17445348 | 

gi | 17437056 | 

gi | 17445356 j 

gij 17456595 I 

gij 17475192 j 




NOV19a 
NOV19b 



317 
302 



370 



390 



400 
..|.. 



410 



170 



420 



317 

302 




gi 
gi 

gi 

gi 



17445348 | 
17437056 j 
17445356 j 
17456595 j 
17475192 j 



311 ISFTGCG 318 

351 IASNWKIILIHIDSRLHTPMYFLLSQLSLRDILYISTIVPKMLVDQVMSQRAISFAGCT 410 



301 



301 



332 ESHVSLISLVE 343 

303 303 



430 440 450 460 470 480 

....|....|....|....|....|....|....|....|....| |....|....| 

NOV19a 317 R&LGYW^C 325 

10 NOV19b 302 RIILGYW^C 310 

gi | 17445348 | 319 IQSFFFSALGGAEALLLASMAYDRYIAICFPLHYPIRMSKRMCVLMXTGSw|lGSINACA 378 

gi | 17437056 j 411 AQHFLYLTLAGAEFFLLGLMSYDRYVAICNPLHYPVLMSRKICWLIVAAAWlGGSIDGFL 470 

gi | 17445356 | 301 rMiQNIFS 309 

gi | 17456595 j 344 PPAVEWTGASVKGCPRTWCLPREQVLWDGPDSGTSLESKQPHQEG^SDMffiOSNTICTLV 403 

15 gi |17475192 j 303 KG&RCR& 312 



20 



25 



NOV19a 

NOV19b 

gi | 17445348 | 

gi | 17437056 | 

gi | 17445356 | 

gi j 17456595 | 

gi j 17475192 j 



490 500 510 520 530 540 

....|....|....| |....|....|....|....|....|....|....|....| 

3 25 CRKYDFRSLY 335 

310 C§KYDFRSLY 32 0 

3 79 HTV YVLHIPYCQSRAINHFFCDVPAMVTLACMDTWVYEGTVFLSTTIFLVFPFIA 4 33 

4 71 LTP VTMQFPFCASREINHFFCEVPALLKLSCTDTSAYETAMYVCCIMMLLIPFSV 52 5 

309 VpN 312 

4 04 SELNQFWAYPIQHDLPKEVLLTPAPCKVGAIIIHPAAREDTLNTSQETPGTPKCYRGKNI 463 



312 



315 



30 



35 



NOV19a 

NOV19b 

gi | 17445348 | 

gi | 17437056 j 

gi | 17445356 | 

gi | 17456595 | 

gi | 17475192 j 



550 



580 



590 



600 



I 



I 



I 



335 



335 



320 320 

4 34 ISCSYGRVLLAVYHMKSAEGRKKAYLTCSTHLTWTFYYAPFVYTYLRPRSLRSPTEDKV 4 93 
526 ISGSYTRILITVYRMSEAEGRGKAVATCSSHMVWSLFYGAAMYTYVLPHSYHTPEQDKA 5 85 



312 



312 



464 KGVKEGKAEPEGPVGPETVGSKTEMNFAGSEFKEVNFRCTASMHN — SPDVTSDPVLQAA 521 
315 315 



40 



45 



NOV19a 

NOVl9b 

gi | 17445348 | 

gi | 17437056 | 

gi | 17445356 j 

gi|l7456595| 

gi j 17475192 | 



610 



620 



630 



640 



650 



660 



335 335 

320 320 

4 94 LAVFYTTLTPMLNPII YSLRNKEVMGALTRVSQRICSGKM 53 3 

586 VSAFYTILTPMLNPLIYSLRNKDVTGALQKWGRMEWKTLPFQALQVRCVKWRRSVLVSS 64 5 



312 



312 



522 MDVGFSGLPDVSQSHSKTLWGARGRGPSIRRQREFMPEEK KDTVYWEKRRKNNEAAK 578 

315 315 



670 



680 



690 



700 



720 



50 



55 



NOV19a 

NOV19b 

gi | 17445348 | 

gi j 17437056 j 

gi j 17445356 j 

gi j 17456595 j 

gi I 17475192 j 



- 335 

- 320 

- 533 

- 695 

- 312 

579 RSREKRRLNDAAIEGRLAALMEENALLKGELKALKLRFGLLPLTGSAIGSPLDWGPPAWG 638 

315 315 



335 

320 

533 

64 6 FIATERTLADTSHSSSHAEFPERGVRMN CSKLFSLVEEPVTSLGDLFNFR- 

312 



NOV19a 


335 


335 


NOV19b 


320 


320 


gi| 17445348 | 


533 


533 


gij 17437056 j 


695 


695 


gi | 17445356 | 


312 


312 


gi | 17456595 j 


638 


638 


gi j 17475192 j 


315 


315 
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Table 19G lists the domain description from DOMAIN analysis results against 
NOV 19. This indicates that the NOV 19 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 19G Domain Analysis of NOV19 

gnl | Pf am | pf amOOOOl , 7tm_l / 7 transmembrane receptor (rhodopsin 
family). (SEQ ID NO: 810) 

CD-Length = 254 residues, 100.0% aligned 
Score a 91.3 bits (225), Expect = 8e-20 



10 



NOV19: 56 GNIMLIHLIRLNTRLHTPMYFLLSQLSIVDLMYISTTVPKMAVSFLSQSKTIRFLGCEIQ 115 

II+++I +1 +1 II I I++ II+++ 11+ I++ 

Sbjct: 1 GNLLVILVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 6 0 

NOV19: 116 TYVF LALGG TEALLL GFM S YDR YVA I CH P LHY PM LMS KK I CC L»M V AC AW AS G S I NA F I HT 175 

+ 1+ I III +1 I I l + l I I I I I + + + +++ I 

Sbjct: 61 GALFWNGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILLVWVLALLLSLPPL 120 



15 NOV19: 176 LYVFQLPFCRSRLINHFFCEVPALLSLVCQDTSQYEYTVLLSGI/JXLLLPFIiA XLASYAR 23 5 

1+ + I I I +I+ + 

Sbjct : 121 LFSWLRTVEEGNTTVCLIDFPEESVKRSYVLLSTLVGFVLPLLVILVCYTRILRTLRKRA 18 0 

NOV19: 236 VLIWFQMSSGKGQAKAVSTCSSHLIVASLFY ATTLFTYTRPHSLRSPSRDKAVAV 291 

20 + |+| ++ + | + | + + 

Sbjct: 181 RSQRSLKRRSSSERKAAKMLLVWWFVLCWLPYHIVLLLDSLCLLSIWRVLPTALLITL 24 0 

NOV19: 292 FYTIVTPLLNPFI Y 305 

+ I III II 

25 Sbjct: 241 WLAYVNSCLNPIIY 254 

G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
subfamily of G protein-coupled receptors in a number of species. These receptors share a 
seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
30 and are likely to underlie the recognition and G-protein-mediated transduction of various 

signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

35 Olfactory receptors (ORs) have been identified as extremely large subfamily of G 

protein-coupled receptors in a number of species. Th^se receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 
Previously, OR genes cloned in different species were from random locations in the respective 

40 genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
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displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

The disclosed NOV 19 nucleic acid of the invention encoding a G-Protein Coupled 
Receptor -like protein includes the nucleic acid whose sequence is provided in Table 19A, 
5 19C or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 1 9A or 1 9C while 
still encoding a protein that maintains its G-Protein Coupled Receptor -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to those just described, including nucleic 

10 acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 

15 chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
o variant nucleic acids, and their complements, up to about 35 percent of the bases may be so 
changed. 

The disclosed NOV 19 protein of the invention includes the G-Protein Coupled 
20 Receptor -like protein whose sequence is provided in Table 19B or 19D. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 19B or 19D while still encoding a protein that 
maintains its G-Protein Coupled Receptor -like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 52 percent of the 
25 residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G-Protein Coupled Receptor -like 
protein (NOV19) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
30 NOV 19 nucleic acids and proteins identified here may be useful in potential therapeutic 

applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
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delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV 19 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in developmental diseases, MHCII and III diseases 
5 (immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 

Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 
disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 

10 fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 
Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 

15 ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 

disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 
autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 

20 NOV19 nucleic acids and polypeptides are further useful in the generation of 

antibodies that bind immuno-specifically to the novel NOV 19 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV 19 protein has multiple hydrophilic 

25 regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV20 

30 A disclosed NOV20 nucleic acid of 1027 nucleotides (also referred to as CG56665-01) 

encoding a novel G-Protein Coupled Receptor -like protein is shown in Table 20A. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 1-3 and 
ending with a TAG codon at nucleotides 940-942. The start and stop codons are shown in 
bold in Table 20A, and the 5' and 3' untranslated regions, if any, are underlined. 
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Table 20A. NOV20 nucleotide sequence (SEQ ID NO:85). 

ATGATCTGCTCAGCTATCAACCTACACTTACTACTGGCAGTTAAGATGATTCACCCTGTCTGGATTCTTGCT 
CCTCGGGAGCAAGGGCTGTTTCTGCTGATTTATCTGGCAGTGCTGGTGGGGAACCTGCTCATCATTGCAGTC 
ATCACTCTCGATCAGCATCTTCACACACCCATGTACTTCTTCCTGAAGAACCTCTCCGTTTTGGATCTGTGC 
TACATCTCAGTCACTGTGCCTAAATCCATCCGTAACTCCCTGACTCGCAGAAGCTCCATCTCTTATCTTGGC 
TGTGTGGCTCAAGTCTATTTTTTCTCTGCCTTTGCATCTGCTGAGCTGGCCTTCCTTACTGTCATGTCTTAT 
GACCGCTATGTTGCCATTTGCCACCCCCTCCAATACAGAGCCGTGATGACATCAGGAGGGTGCTATCAGATG 
GCAGTCACCACCTGGCTAAGCTGCTTTTCCTACGCAGCCGTCCACACTGGCAACATGTTTCGGGAGCACGTT 
TGCAGATCCAGTGTGATCCACCAGTTCTTCCGTGACATCCCTCATGTGTTGGCCCTGGTTTCCTGTGAGGTT 
TTCTTTGTAGAGTTTTTGACCCTGGCCCTGAGCTCATGCTTGGTTCTGGGATGCTTTATTCTCATGATGATC 
TCCTATTTCCAAATCTTCTCAACGGTGCTCAGAATCCCTTCAGGACAGAGTCGAGCAAAAGCCTTCTCCACC 
TGCTCCCCCCAGCTCATTGTCATCATGCTCTTTCTTACCACAGGGCTCTTTGCTGCCTTAGGACCAATTGCA 
AAAGCTCTGTCCATTCAGGATTTAGTGATTGCTCTGACATACACAGTTTTGCCTCCCTTCCTCAATCCCATC 
AT AT AT AGT CTT AGGAATAAGG AGATT AAAAC AG C CATGTGG AGACTCTTTGTG AAG AT ATATTT T C TGC AA 
AAGT AG AAC AT C CTGGT CT TT ACT ATAG AAG AT CTGC AAC AAAAC C CC AAAAAAGC AT AAAT ACT TT ATG AC 
AAAAAAAGATGAAAAAATT 



The disclosed NOV20 polypeptide (SEQ ID NO:86) encoded by SEQ ID NO:85 has 
313 amino acid residues and is presented in Table 20B using the one-letter amino acid code. 

5 

Table 20B. Encoded NOV20 protein sequence (SEQ ID NO:86). 

MICSAINLHLLLAVKMIHPVWILAPREQGLFLLIYLAVLVGNLLIIAVITLDQHLHTPMYFFLK 
NLSVLDLiCYI SVTVPKS IRNSLTRRSS I SYLGCVAQVYFFSAFASAELAFLTVMSYDRYVAICH 
PLQYRAVMTSGGCYQMAVTTWLSCFSYAAVHTGNMFREHVCRSSVIHQFFRDIPHVLALVSCEV 
FFVEFLTLALSSCLVLGCFILMMISYFQIFSTVLRIPSGQSRAKAFSTCSPQLIVIMLFLTTGL 
FAALGPI AKALSIQDLVI ALTYTVLPPFLNPIIYSLRNKEIKTAMWRLFVKIYFLQK 

A search of sequence databases reveals that the NOV20 amino acid sequence has 134 
of 278 amino acid residues (48%) identical to, and 179 of 278 amino acid residues (64%) 
similar to, the 321 amino acid residue ptnr: SPTREMBL-ACC:Q9UGF5 BA150A6.4 protein 
1 0 from Homo sapiens (Human) (NOVEL 7 TRANSMEMBRANE RECEPTOR (RHODOPS1N 
FAMILY) (E - 2.4e" 64 ). 

The disclosed NOV20 polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 20C. 



Table 20C. BLAST results for NOV20 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
<%> 


Expect 


gi | 17437075 | ref |XP_ 

060319.1] 

<XM_060319) 


similar to 
OLFACTORY 
RECEPTOR 5U1 
(HS6M1-28) (H . 
sapiens) [Homo 
sapiens] 


311 


287/294 
(97%) 


288/294 
(97%) 


e-134 



175 



:1 ill O .r*Tii 1i ., f)t '""> 1 fl>"»- 



gi | 1744 53 73 | ref | XP_ 
060567 . 1 | 
(XM_0 6 056 7) 


similar to 
OLFACTORY 
RECEPTOR 5U1 
(HS6M1-28) (H. 
sapiens) [Homo 
sapiens] 


309 


147/272 
(54%) 


188/272 
(69%) 


8e-63 


gi|l74453 94|ref| XP_ 
060572 . 1 j 
(XM_060572) 


similar to 
OLFACTORY 
RECEPTOR 5U1 
(HS6M1-28) (H. 
sapiens) [Homo 
sapiens] 


316 


133/283 
(46%) 


187/283 
(65%) 


2e-61 


gi | 174 37 015 | ref |XP_ 
060307 . 1 | 
(XM_060307) 


similar to 
OLFACTORY 
RECEPTOR 5U1 
(HS6M1-28) (H. 
sapiens) [Homo 
sapiens] 


312 


139/291 
(47%) 


189/291 
(64%) 


9e-59 


gi | 174 643 51 | ref | XP_ 
069462 . 1 | 
(XM069462) 


similar to 
OLFACTORY 
RECEPTOR 5U1 
(HS6M1-28) (H. 
sapiens) [Homo 
sapiens] 


321 


133/278 
(47%) 


175/278 
(62%) 


3e-57 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 20D. In the ClustalW alignment of the NOV20 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 20D. ClustalW Analysis of NOV20 



1) Novel 

3) gi|l74 
(HS6M1-28) 

4) gi|l74 
(HS6M1-28) 

5) gi|l74 
(HS6M1-28) 

6) gi|l74 
(HS6M1-28) 

7) gi | 174 
(HS6M1-28) 



NOV2 0 (SEQ 
37075 | ref |XP 
(H. sapiens 
45373 I ref | XP 
(H. sapiens 
45394 | ref |XP 
(H. sapiens 
37015 | ref | XP 
(H. sapiens 
64351 | ref |XP 
(H. sapiens 



ID NO: 86 
060319 . 
) [Homo 

060567 . 
) [Homo 
060572 . 
) [Homo 
_060307 . 
) [Homo 
_069462 . 
) [Homo 



) 

l| (XM_06 
sapi ens] 
l| (XM_0 6 
sapiens] 
l| (XM _ 06 
sapiens] 
l| (XM 0 6 
sapiens] 
l| (XM_06 
sapiens] 



0319) 
(SEQ 
0567) 
(SEQ 
0572) 
(SEQ 
0307) 
(SEQ 
9462) 
(SEQ 



simil 
ID NO 

simil 
ID NO: 

simil 
ID NO: 

simil 
ID NO 

simil 
ID NO 



ar to 
405) 
ar to 
406) 
ar to 
407) 
ar to 
408) 
ar to 
409) 



OLFACTORY RECEPTOR 5U1 

OLFACTORY RECEPTOR 5U1 

OLFACTORY RECEPTOR 5U1 

OLFACTORY RECEPTOR 5U1 

OLFACTORY RECEPTOR 5U1 



25 



30 



NOV2 0 

gi | 17437075 | 
gi | 17445373 [ 
gi j 17445394 | 
gi | 17437015 j 
gi|l746435l| 




70 
■I-. 



80 
-I-- 



90 
■I-- 



100 



110 



120 



176 



NOV20 

gi | 17437075 | 
gi j 17445373 | 
gi j 17445394 j 
gi | 17437015 j 
gi|1746435l| 




LDLCYI SVTVPKS IgNSL 
LDLCYI SVTVPKS IgNSL 
LDLCjjl SVTVPKS IgNSL 
LDjgCYI 5VT|P,KS ifesL 
LDgCY'ISVTVPgsgSNSL 

ldlcBisvtvpSsiBnsl 




NOV20 

gi | 17437075 | 
gi 1 17445373 j 
gi j 17445394 j 
gi j 17437015 j 
gi |17464351| 



111 

109 
109 
120 
109 
109 




j jjGST 

;sgl3bsaai 



180 

^EHVfflR 170 

IREHVIR 168 

jSPSYIG 168 

iPFJG 179 

JpFiR 168 

jPLgG 168 



190 



200 



210 



240 



NOV20 


171 


gi | 17437075 | 


169 


gi 1 17445373 j 


169 


gi j 17445394 j 


180 


gi|l7437015 j 


169 


gi|l746435l| 


169 




sc. 
sc: 

ISCSjEfNLI 

IldpMviti 

ISCSQTgs 

csyeHi 



FVgFLTpALSL 
Fvfi FLT|KLSgC| 
SljAtijLI 
alGXffivFGlsl 




__ II 
©VSALG^ 

PTgSAAFIj 



230 
228 
228 
239 
228 
228 



250 



260 



270 



NOV20 


231 


gi 


17437075 | 


229 


gi 


17445373 | 


229 


gi 


17445394 j 


240 


gi 


17437015 j 


229 


gi 


17464351) 


229 




310 



320 



330 



NOV20 


291 


gi | 17437075 | 


289 


gi 1 17445373 | 


288 


gi j 17445394 | 


300 


gi j 17437015 j 


289 


gi| 17464351) 


289 




I 



]pFVglYFLM 313 




IYFLQK 



311 
309 

pCGP 316 

iKSDMKllFYSiNV 312 

jdR^Ls|EELP^KMCLKAMFKL 321 



Table 20E lists the domain descriptions from DOMAIN analysis results against 
NOV20. This indicates that the NOV20 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 20E Domain Analysis of NOV20 

gnl | Pf am | pf amOOOOl, 7tm_l / 7 transmembrane receptor (rhodopsin family) 
(SEQ ID NO: 810) 

CD-Length = 254 residues, 100.0% aligned 
Score = 83.6 bits (205), Expect = 2e-17 



NOV20 : 41 GNLLI I AVI TLDQHLHTPMYFFLKNLSVLDLCYI SVTVPKS IRNSLTRRSS IS YLGCVAQ 100 

HIM II + Ml II IM II ++ I ++ * I 

Sbjct : 1 GNLLVILVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 60 

NOV2 0 : 101 VYFFSAFASAELAFLTVMS YDRYVAI CHPLQYRAVMTSGGCYQMAVTTWLSCFS YAAVHT 16 0 

I I + I I +1 I I l + l I I I l + l I + I + + 1 + 

Sbjct : 61 G ALi FWNGYAS I L L LT A I S I D R YLA I VH P LR YRR I RT P R RAKVL I LL VWVLAL L LS L P P L 120 

177 



; o ' 



1i P „ ill :1 



7n o:e 



NOV20: 161 GNMFREHVCRSSVIHQFFRDIPHVLALVSCEVFFVEFLTLALSSCLVLGCFILMMISYFQ 220 

+ I + + + + I I II II + 

Sbjct: 121 LFSWLRTVEEGNTTVCLIDFPEESVKRSYVLLSTLVGFVLPLLVILVCYTRILRTLRKRA 180 

NOV20: 221 IFSTVLRIPSGQSRAKAFSTCSPQLIVIMLFLTTGLFAALGPIAKALSIQDLVIALT 277 

| + ||| ++ ++ + | + I + + I II 

Sbjct: 181 RSQRSLKRRSSSERKAAKMLLVVVWFVLCWLPYHIVLLLDSLCLLSIWRVLPTALLITL 24 0 

10 NOV20: 278 - YTVLPPFLNPIIY 290 

+ I M I I I 
Sbjct: 241 WLAYVNSCLNPIIY 254 

G-Protein Coupled Receptor (GPCRs) have been identified as extremely large 
1 5 subfamily of G protein-coupled receptors in a number of species. These receptors share a 

seven transmembrane domain structure with many neurotransmitter and hormone receptors, 
and are likely to underlie the recognition and G-protein-mediated transduction of various 
signals. Previously, GPCR genes cloned in different species were from random locations in the 
respective genomes. The human GPCR genes are intron less and belong to four different gene 
20 subfamilies, displaying great sequence variability. These genes are dominantly expressed in 
olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large subfamily of G 
protein-coupled receptors in a number of species. These receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
25 likely to underlie the recognition and G-protein-mediated transduction of odorant signals. 

Previously, OR genes cloned in different species were from random locations in the respective 
genomes. The human OR genes are intron less and belong to four different gene subfamilies, 
displaying great sequence variability. These genes are dominantly expressed in olfactory 
epithelium. 

30 The disclosed NOV20 nucleic acid of the invention encoding a G-Protein Coupled 

Receptor -like protein includes the nucleic acid whose sequence is provided in Table 20A or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 20A while still encoding a 
protein that maintains its G-Protein Coupled Receptor -like activities and physiological 

35 functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of nonlimiting example, 

40 modified bases, and nucleic acids whose sugar phosphate backbones are modified or 

derivatized. These modifications are carried out at least in part to enhance the chemical 

178 
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stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. 

The disclosed NOV20 protein of the invention includes the G-Protein Coupled 
Receptor -like protein whose sequence is provided in Table 20B. The invention also includes 
5 a mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 20B while still encoding a protein that maintains its G-Protein 
Coupled Receptor -like activities and physiological functions, or a functional fragment thereof. 
In the mutant or variant protein, up to about 54 percent of the residues may be so changed. 



10 (F ab )2,that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G-Protein Coupled Receptor -like 
protein (NOV20) is a member of a "G-Protein Coupled Receptor family". Therefore, the 
NOV20 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 

1 5 below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

20 The NOV20 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in developmental diseases, MHCII and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 

25 disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 

30 hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 

Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 



The invention further encompasses antibodies and antibody fragments, such as F a b or 
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autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome, and/or other diseases and pathologies. 

NOV20 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV20 substances for use in 
5 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV20 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
1 0 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV21 

NOV21 includes three novel adrenal secretory serine protease-like proteins disclosed 
below. The disclosed sequences have been named NOV21a and NOV21b. 
15 NOV21a 

A disclosed NOV21a nucleic acid of 1028 nucleotides (also referred to as CG56639- 
01) encoding a novel adrenal secretory serine protease-like protein is shown in Table 21 A. An 
open reading frame was identified beginning with an TCG initiation codon at nucleotides 1-3 
and ending with a TGA codon at nucleotides 769-771. The start and stop codons are shown in 
20 bold in Table 21 A, and the 5' and 3' untranslated regions, if any, are underlined. Because the 
start codon of NOV21a is not a traditional initiation codon, NOV21a could be a partial reading 
frame that extends further in the 5' direction. 



Table 21A. NOV21a nucleotide sequence (SEQ ID NO:87). 

TCGCCATTTCCAGACGCCCCGGAGGCCACCACACACACCCAGCTACCAGACTGTGGCCTGGCGCCGGCCGCG 
CTCACCAGGATTGTGGGCGGCAGCGCAGCGGGCCGTGGGGAGTGGCCGTGGCAGGTGAGCCTGTGGCTGCGG 
CGCCGGGAACACCGTTGCGGGGCCGTGCTGGTGGCAGAGAGGTGGCTGCTGTCGGCGGCGCACTGCTTCGAC 
GTCTACGGGGACCCCAAGCAGTGGGCGGCCTTCCTAGGCACGCCGTTCCTGAGCGGCGCGGAGGGGCAGCTG 
GAGCGCGTGGCGCGCATCTACAAGCACCCGTTCTACAATCTCTACACGCTCGACTACGACGTGGCGCTGCTG 
GAGCTGGCGGGGCCGGTGCGTCGCAGCCGCCTGGTGCGTCCCATCTGCCTGCCCGAGCCCGCGCCGCGACCC 
CCGGACGGCACGCGCTGCGTCATCACCGGCTGGGGCTCGGTGCGCGAAGGAGGCTCCATGGCGCGGCAGCTG 
CAGAAGGCGGCCGTGCGCCTCCTCAGCGAGCAGACCTGCCGCCGCTTCTACCCAGTGCAGATCAGCAGCCGC 
ATCTCTGAACCCCCTTTCTTCTCTCCCCAACAGGGTGACGCTGGGGGACCCCTGGCCTGCAGGGAGCCCTCT 
GGACGGTGGGTGCTAACTGGGGTCACTAGCTGGGGCTATGGCTGTGGCCGGCCCCACTTCCCAGGTGTCTAT 
ACCCGGGTGGCAGCTGTGAGAGGCTGGATAGGACAGCACATCCAGGAGTGA CCACCACGTGACTGCCCAGGC 
CGAGACTCTACGTGAAAGCAACAGGAGCAGCAGGCCACCCAACACCCCACCCCACCGTACCCTACCCAAGGA 
CGGGTGTGGGGGGGCTGTGGGTCATGGGGATGCATTTTGGTACCACCCTTTGTTCCAATAAACACAGCCCCT 
CCACCCTAGCTCACTGGCTCAGCACCTCAGTGTCACAGCGAGGACCACCTGCCTGGTGCTTCACCAGGACCC 
GGGGTGGAACGAAACAACCC 
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In a search of public sequence databases, the NOV2 la nucleic acid sequence, located 
on chromosome 19, has 296 of 466 bases (63%) identical to a gbrGENBANK- 
ID:E13204|acc:E 13204.1 mRNA from Homo sapiens (Human cDNA encoding a serine 
protease) (E = 3.9e~ 18 ). 

5 The disclosed NOV21a polypeptide (SEQ ID NO:88) encoded by SEQ ID NO:87 has 

256 amino acid residues and is presented in Table 21 B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV21a has A signal peptide and is 
likely to be localized to the microbody (peroxisome) with a certainty of 0.7480. Alternatively, 
NOV21a may also localize to the lysosome (lumen) with a certainty of 0.3168, or the 
10 mitochondrial matrix space with a certainty of 0.1000. The most likely cleavage site for 
NOV21a is between positions 68 and 69: SAA-HC. 



Table 21B. Encoded NOV21a protein sequence (SEQ ID NO:88). 

SPFPDAPEATTHTQLPDCGLAPAALTRIVGGSAAGRGEWPWQVSLWLRRREHRCGAVLVAERWLLSAAHCFD 
VYGDPKQWAAFLGTPFLSGAEGQLERVARIYKHPFYlSrL,YTLDYDVALLEIiAGPVRRSRLVRPICLPEPAPRP 
PDGTRCVITGWGSVREGGSMARQLQKAAVRLLSEQTCRRFYPVQISSRISEPPFFSPQQGDAGGPLACREPS 
GR WVLTG VT S WG YG CGR P H F P G V YTR VAAVRGW I GQH I Q E 

A search of sequence databases reveals that the NOV21a amino acid sequence has 99 
15 of 250 amino acid residues (39%) identical to, and 134 of 250 amino acid residues (53%) 

similar to, the 279 amino acid residue ptnr:SPTREMBL-ACC:Q9QZ74 protein from Rattus 
norvegicus (Rat) (Adrenal Secretory Serine Protease Precursor) (E = 1.5e" 42 ). 

NOV21a is predicted to be expressed in at least the following tissues: Ovary, kidney, 
breast, lung, muscle, liver, spleen, blood, lymphocyte. This information was derived by 
20 determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or RACE 
sources. 

NOV21b 

In the present invention, the target sequence identified previously, NOV21a, was 
25 subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 
examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
30 the reverse primer, until the stop codon was reached. Such primers were designed based on in 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 




sequence of the target sequence, or by translated homology of the predicted exons to closely 
related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
5 brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 

10 other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated Accession Number NOV21b. This differs 

15 from the previously identified sequence (NOV2 la) in being a splice variant and a mature 
protein starting with serine. 

A disclosed NOV21b nucleic acid of 785 nucleotides (also referred to as CG56639-02) 
encoding a novel adrenal secretory serine protease-like protein is shown in Table 21C. An 
open reading frame was identified beginning with an CTT initiation codon at nucleotides 1-3 

20 and ending with a TGA codon at nucleotides 783-785. The start and stop codons are shown in 
bold in Table 21C, and the 5' and 3' untranslated regions, if any, are underlined. Because the 
start codon of NO V2 lb is not a traditional initiation codon, NOV21b could be a partial 
reading frame that extends further in the 5' direction. 



Table 21C. NOV21b nucleotide sequence (SEQ ID NO:89). 

CTTCGCCATTTCCAGACGCCCCGGAGGCCACCACACACACCCAGCTACCAGACTGTGGCCTGGCGCCGGCCG 
CGCTCACCAGGATTGTGGGCGGCAGCGCAGCGGGCCGTGGGGAGTGGCCGTGGCAGGTGAGCCTGTGGCTGC 
GGCGCCGGGAACACCGTTGCGGGGCCGTGCTGGTGGCAGAGAGGTGGCTGCTGTCGGCGGCGCACTGCTTCG 
ACGTCTACGGGGACCCCAAGCAGTGGGCGGCCTTCCTAGGCACGCCGTTCCTGAGCGGCGCGGAGGGGCAGC 
TGGAGCGCGTGGCGCGCATCTACAAGCACCCGTTCTACAATCTCTACACGCTCGACTACGACGTGGCGCTGC 
TGGAGCTGGCGGGGCCGGTGCGTCGCAGCCGCCTGGTGCGTCCCATCTGCCTGCCCGAGCCCGCGCCGCGAC 
CCCCGGACGGCACGCGCTGCGTCATCACCGGCTGGGGCTCGGTGCGCGAAGGAGGCTCCATGGCGCGGCAGC 
TGCAGAAGGCGGCCGTGCGCCTCCTCAGCGAGCAGACCTGCCACCGCTTCTACCCAGTGCAGATCAGCAGCC 
GCATGCTGTGTGCCGGCTTCCCGCAGGGTGGCGTGGACAGCTGCTCGGGTGACGCTGGGGGACCCCTGGCCT 
GCAGGGAGCCCTCTGGACGGTGGGTGCTAACTGGGGTCACTAGCTGGGGCTATGGCTGTGGCCGGCCCCACT 
TCCCAGGTGTCTATACCCGGGTGGCAGCTGTGAGAGGCTGGATAGGACAGCACATCCAGGAGTGA 

25 " .^^^ — ^- 

In a search of public sequence databases, the NOV21b nucleic acid sequence, located 
on chromosome 19, has 160 of 162 bases (98%) identical to a gb:GENBANK- 
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ID:HUMLAMBBB|acc:M94363.1 mRNA from Homo sapiens (Human lamin B2 (LAMB2) 
gene and ppvl gene sequence) (E = 4.3e~ 59 ). 

The disclosed NOV21b polypeptide (SEQ ID NO:90) encoded by SEQ ID NO:89 has 
260 amino acid residues and is presented in Table 21D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV21b has A signal peptide and is 
likely to be localized to the microbody (peroxisome) with a certainty of 0.7480. Alternatively, 
NOV21b may also localize to the lysosome (lumen) with a certainty of 0.3082, or the 
mitochondrial matrix space with a certainty of 0.1000. The most likely cleavage site for 
NOV21b is between positions 68 and 69: SAA-HC. 



Table 21D. Encoded NOV21b protein sequence (SEQ ID NO:90). 



SPFPDAPEATTHTQLPDCGLAPAALTRIVGGSAAGRGEWPWQVSLWLRRREHRCGAVIiVAERWLLSAAHCFD 
VYGDPKQWAAFLGTPFLSGAEGQLERVARIYKHPFYNLYTLDYDVALLELAGPVRRSRLVRPICLPEPAPRP 
PDGTRCVITGWGSVREGGSMARQLQKAAVRLLSEQTCHRFYPVQISSRMLCAGFPQGGVDSCSGDAGGPLAC 
REPSGRWVLTGVTSWGYGCGRPHFPGVYTRVAAVRGWIGQHIQE 



A search of sequence databases reveals that the NOV21 b amino acid sequence has 123 
of 250 amino acid residues (49%) identical to, and 1 54 of 250 amino acid residues (61%) 
similar to, the 855 amino acid residue ptnr:SPTREMBL-ACC:Q9Y5Y6 protein from Homo 
sapiens (Human) (Matriptase) (E = 3.5e" 59 ). 

NOV21b is predicted to be expressed in at least the following tissues: adrenal gland, 
Ovary, kidney, breast, lung, muscle, liver, spleen, blood, lymphocyte. . 

The disclosed NOV21a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 21E.. 



Table 2 IE. BLAST results for NOV21a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 12 83 65 03 | db j | BAB 
23684 . 1 | (AK004939) 


data source : SPTR , 
source 
key:095519 / 
evidence : ISS~homo 
log to DJ1170K4 .4 
(NOVEL PROTEIN) 
(FRAGMENT) -putati 
ve [Mus musculus] 


799 


118/244 
(48%) 


153/244 
(62%) 


7e-55 


gi | 102573 90 | gb | AAG1 
5395 . 1 | AF057145_1 
(AF057145) 


serine protease 
TADG15 [Homo 
sapiens] 


855 


115/250 
(46%) 


146/250 
(58%) 


6e-52 
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gi | 11415 04 0 | ref |NP_ 
068813 . 1 | 
<NM_021978) 


suppression of 
tumorigenicity 14 
(colon carcinoma, 
matriptase , 
epithin) ,- 
suppression of 
tumorigenicity 14 
(colon 
carcinoma) ,- 
matriptase [Homo 
sapiens] 


855 


115/250 
(46%) 


146/250 
(58%) 


7e-52 


gi | 7363445 | ref | NP_0 
35306. 2| 
(NM 011176) 


protease, serine, 
14 (epithin) [Mus 
musculus] 


855 


115/250 
(46%) 


144/250 
(57%) 


8e-52 


gi | 16758444 | ref |NP_ 
446087. 1| 
(NM_053635) 


suppression of 
tumorigenicity 14 
(colon carcinoma, 
matriptase, 
epithin) [Rattus 
norvegicus] 


855 


112/247 
(45%) 


141/247 
(56%) 


le-51 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 21F. In the ClustalW alignment of the NOV21 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



10 



15 



20 



25 



Table 21F. ClustalW Analysis of NOV21 



1) Novel NOV21a (SEQ ID NO: 88) 

2) Novel NOV21b (SEQ ID NO: 90) 

3) gi | 12836503 |dbj | BAB23684 . 1 | (AK004939) data source :SPTR, source key:095519, 
evidence : ISS~homolog to DJ1170K4.4 (NOVEL PROTEIN) (FRAGMENT) -putative [Mus 
musculus] (SEQ ID NO: 410) 

4) gi | 10257390 |gb|AAG15395 . 1 | AF057145_1 (AF057145) serine protease TADG15 [Homo 
sapiens] (SEQ ID NO: 4 11) 

5) gi | 11415040 | ref |NP_068813 . 1 | (NM_02 1 97 8 ) suppression of tumorigenicity 14 (colon 
carcinoma, matriptase, epithin); suppression of tumorigenicity 14 (colon carcinoma) ; 
matriptase [Homo sapiens] (SEQ ID NO: 412) 

6) gi | 7363445 | ref |NP_035306 . 2 | (NM_011176) protease, serine, 14 (epithin) [Mus 
musculus] (SEQ ID NO: 413) 

7) gi | 16758444 | ref |NP_446087 . 1 | (NM_053635) suppression of tumorigenicity 14 (colon 
carcinoma, matriptase, epithin) [Rattus norvegicus] (SEQ ID NO: 414) 



10 



20 



30 



40 



50 



60 



30 



35 



NOV21a 

NOV21b 

gi | 12836503 | 

gi j 10257390 j 

gi j 11415040 j 

gi|7363445| 

gi | 16758444 | 



P T TE V POAAD 
GSDRARKGGG 

gsdrarKggg 
gsnrgr||agg 
gnnrgrkagg 

70 




peg: 



AAj 

I^ynsrhekvngl 1 
Mynsrhekvngl 
•Synsrlenmngf 
bynsrlenmngfgegve 




SNRD- -||fflRFTP 4 7 
j HGPGRjWfflVLAA 6 0 
j HGPGRpfflVLAA 60 

"rgprrIjRvlva 6 0 

.GPRRWfflVMVA 6 0 



80 90 



100 



110 



120 
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NOV2 lb 


1 




gi | 12836503 | 


48 


lllGL 


gi | 10257390 j 


61 


gij 11415040 j 


61 


fedlGL 


gij 7363445 | 


61 


pSFSF 


gi|l6758444 | 


61 






JSQDLGRREgl 
3AYENSNgTE 

dayensnBte 
7 : iHldayenstP 

LDAYENS 




NOV21a 

NOV21b 

gi| 12836503 | 

gij 10257390 j 

gij 11415040 j 

gi j 7363445 | 

gij 16758444 | 



1 

1 -- 

107 l^jjKMg 

119 IvKDAg 

119 KVlfoa 

119 b*VKE 

119 &VKE2 



130 
..|.. 



150 
..|.. 



160 



170 



180 



^EgVAST-R 

Cl£ysgvpf 

ClIySGVPF 
CLLYNEVPV 
O.MYSEVPV 




EgQRLTLSPEjIVRgLlI 165 

□HLVEEAERvHaeBr^ 178 

QpLVEEAERvllAEgRp 17 8 

PHLAEEVDRAMAVfiRiyj 17 8 

PHLEEEVDRAMAV§RyJ 17 8 
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NOV21a 
NOV21b 

gi 
gi 
gi 
gi 
gi 



12836503 | 
10257390 j 
11415040] 
7363445 | 
16758444 | 



NOV21a 
NOV21b 

gi 
gi 
gi 
gi 
gi 



12836503 | 
10257390 j 
11415040 j 
7363445 | 
16758444 I 



NOV21a 
NOV21b 
gi|l2836503 | 
gi j 10257390 j 
gi 1 11415040 j 
gi|7363445 | 
gi j 16758444 | 



200 



220 
..|.. 



230 
..|.. 



NOV21a 


1 


NOV21b 


1 


gi | 12836503 | 


166 


gi j 10257390 j 


179 


gij 11415040 j 


179 


gi |7363445 | 


179 


gij 16758444 | 


179 



dellsnsstlasyb 
^lpprarj3lks|f^ 

^LPPRARSLK^F^ 




IGLVIItBABVN^IVVLN- STLgCYRYS; 
DSKTVQRTQbHS CS FGLHARgVELMR^TTra 
DS KTjyqR^Q3E)^S CS FGLHAR@VELMR^TT 
ISWAFglDPRMI^'R^Qb^SCSFAL: 
|S WAF@I D PRMItQR|CQrts CS FAL: 



270 



280 
..|.. 



2VLPLK 224 

-FPD 235 

- — FPD 235 

-FPN 235 

- FPN 235 

300 
... | 



PESLMXjKVRLEWTRyDC: 



Bsgds&FRSFDllAS cHeRGSDLI 
ffls VljsXjTFRS FDgAS cBe RG S DLI 
Hs^sHTFRSFD^APcfflEHGSDLI 

gs^ijslfr frs fd&apcSghd sdl| 



310 



320 
..|.. 



330 



340 




|KRL|TS^Yg 279 

PHApVQXicffl 2 95 

PHApVQLCg 2 95 

PHA^VRLCg 2 95 

PHAWRlicB 2 95 



360 



1 

1 --■ 

2 80 CSRQEPvMEVlis 
2 96 TYPPSYN^TFHS 
2 96 TYPPSYNBTFHS 
2 96 TFSPSYNXjTFL^ 
2 96 TFSPSYNIiTFLS 



CQ^LTLEGRLDtG 
ILPlRkSSCGGRLRKAfegTF 351 




IvfvWKKGMHSY&PFLLSVlgV. 

iTLITNTERRMP GF32||r 

LiLXTL I TNTERRgP GFggT F^L PR|tgS CG RLRKA 

SfLVjTLi I TNTDRRHP GFEJsjTF 

gFLMTLITNTDRRlP GFBKTF 



390 




;SCGGFLSDTB 
SCGGLLSEAB 



|TF 351 
JTF 351 

STF 351 



SLDYGLAL^FD^ 
^ QHVKVS^KFF 
rNQHVKVR^KFF 

' rnvkvrI|klf 

RNVKVRgKLI 



qRRQKYNRL 
LjIJe P G VP AGT 
[jtjEPGVPAGT 
[^jDPNVPVGS 
O&DPNIPVGS 



^HgqwmHqnrrl 

jpfeDYVEHNGEilY 
|p||DYVEflNGEj 

:iDYVEHNGE; 

:iDYVEyNGEg: 



430 



NOV21a 

NOV21b 

gi | 12836503 | 

gijl0257390j 

gij 11415040 j 

gi [7363445 | 

gij 16758444 | 




440 



450 



460 
..|....| 



470 
..|.. 



lqpySrIpmv, 

WW 1 * 

S^FJjVTS: 
SQFMVSS., H 




1 

[S 453 

RTGRCIR 4 66 

RTGRCIR 466 

iKTGRCIR 4 66 

TGRCIR 466 



490 



500 



510 
...|.. 

185 



520 



530 



540 



NOV21b 
gi|l2836503| 
gi|l0257390 | 
gi j 11415040 | 
gi |7363445 | 
gi | 16758444 | 



NOV21a 
N0V21b 
gi|l2836503| 
gi j 10257390 | 
gi j 11415040 j 
gi |7363445 | 
gi j 16758444 | 



NOV21a 
NOV2 lb 
gi | 12836503 | 
gi j 10257390 | 
gi j 11415040 | 
gi|7363445 | 
gi j 16758444 | 



NOV21a 

NOV21b 

gi | 12836503 | 

gi | 10257390 j 

gi | 11415040 j 

gi|7363445| 

gi j 16758444 | 



NOV21a 

NOV21b 

gi | 12836503 j 

gi | 10257390 j 

gi j 11415040 j 

gi|7363445| 

gi | 16758444 | 



NOV21a 
NOV21b 
gi|l2836503| 
gi j 10257390 j 
gi | 11415040 | 
gi | 7363445 | 
gi j 16758444 | 



1 

453 VN-GI 

4 6 7 KELRCDGWADCTDHSDELNCSCDAGHQFTCKNKF 

4 6 7 KEL RCDGWADC TDH S DE LNC S CDAGHQ FT C KNKF 

4 67 KELRCDGWADCPDYSDERYCRCNATHQFTCKNQF 

4 67 KDLRCDGWADCPDYSDERHCRCNATHQFMCKNQF 



481 ^FQCQEDS' 

527 SqtfrcsIig: 
52 7 Bqtfrcs||g: 
527 BgsfkcsIg: 

527 SGSFKCsJiG] 
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NOV21a 

NOV21b 

gi | 12836503 | 

gi j 10257390 | 

gi j 11415040 j 

gi |7363445 | 

gij 16758444 | 




Gp^IQE 256 

GjQHIQE 26 0 

QCfvLT- 799 

GV 855 

GV 855 

;i|TGV 85 5 

;EQTGV 855 
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Tables 21G-H lists the domain descriptions from DOMAIN analysis results against 
NOV21 . This indicates that the NOV21 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 21G Domain Analysis of NOV21 

gnl | Smart | smart 0 0 02 0 , Tryp_SPc / Trypsin- like serine protease; Many of 
these are synthesised as inactive precursor zymogens that are cleaved 
during limited proteolysis to generate their active forms. A few, 
however, are active as single chain molecules, and others are inactive 
due to substitutions of the catalytic triad residues. (SEQ ID 
NO: 812) 

CD-Length = 230 residues, 100.0% aligned 
Score = 221 bits (563), Expect = 4e-59 



5 



NOV2 1 : 


27 


RIVGGSAAGRGEWPWQVSLWLRRREHRCGAVLVAERWLLSAAHCFDWGDPKQWAAFLGT 


86 






MINI 1 1 +IIIIII 1 1 II I++ ll + l + IMI 1 11 + 




Sbjct : 


1 


RI VGGSEANIGS FPWQ VSLQYRGGRHFCGGSL I S PRWVLTAAHCVY - GSAPSSI RVRLGS 


59 


NOV21 : 


87 


PFL-SGAEGQLERVARIYKHPFYNLYTLDYDVALLELAGPVRRSRLVRPICLPEPAPRPP 


145 






1 II 1 1 + I+ + + II II II I + MI + I + II 1 lllllll 1 




Sbjct : 


60 


HDLSSGEETQTVKVSKVIVHPNYNPSTYDNDIALLKLSEPVTLSDTVRPICLPSSGYNVP 


119 


NOV21 : 


146 


DGTRCVITGWGSVRE - GGSMARQLQKAAVRLLSEQTCRRFYPVQI SSRISEPPFFSPQ - - 


202 






II 1 ++III 1 11+ 11+ 1 ++I INI 1 + + 




Sbjct : 


120 


AGTTCTVSGWGRTSESSGSLPDTLQEVNVPIVSNATCRRAYSGGPAITDNMLCAGGLEGG 


179 


NOV21 : 


203 


QGDAGGPLACREPSGRWVLTGVTSWG- YGCGRPHFPGVYTRVAAVRGWI 2 50 








lll + llll 1 MM 1+ III III M+ IIIIIII++ II 




Sbjct : 


180 


KD ACQGD S GG PL VCN - -DPRWVLVGI VSWGSYGCARPNKPGVYTRVSS YLDWI 2 3 0 





Table 21H Domain Analysis of NOV21 

gnl | Pf am | pf am00089 , trypsin, Trypsin. Proteins recognized include all 
proteins in families SI, S2A, S2B, S2C, and S5 in the classification 
of peptidases. Also included are proteins that are clearly members, 
but that lack peptidase activity, such as haptoglobin and protein Z 
(PRTZ*) . (SEQ ID NO: 813) 

CD-Length = 217 residues, 100.0% aligned 
Score = 177 bits (448) , Expect = 9e-46 



NOV2 1 : 


28 


IVGGSAAGRGEWPWQVSLWLRRREHRCGAVLVAERWLLSAAHCFDVYGDPKQWAAFLGTP 


87 






MM 1 1 +IIIIII + 1 II I++I l + l + IMI M 




Sbj Ct : 


1 


IVGGREAQAGSFPWQVSLQVSSG- HFCGGSL I SENWVLTAAHCVS GAS SVRWLGEH 


56 


NOV2 1 : 


88 


FLSGAEGQLER- - VARIYKHPFYNLYTLDYDVALLELAGPVRRSRLVRPICLPEPAPRPP 


145 






1 II ++ 1 +1 II II 1 l+lll+l II lllllll + 1 




Sbj Ct : 


57 


NLGTTEGTEQKFDVKKIIVHPNYNPDT — NDIALLKLKSPVTLGDTVRPICLPSASSDLP 


114 


NOV2 1 : 


146 


DGTRCVITGWGSVREGGSMARQLQKAAVRLLSEQTCRRFYPVQISSR ISEPPFFSPQ 


202 






II 1 ++III + 1+ + 11+ 1 ++I +111 1 ++ 




Sbjct : 


115 


VGTT C S VS G WGRT KNL GT - S DTLQEVWP I VS RE T CR S A YGGT VTDTM I C AGALGG KD AC 


173 


NOV2 1 : 


203 


QGDAGGPLACREPSGRWVLTGVTSWGYGCGRPHFPGVYTRVAAVRGWI 250 








lll + llll 1 + 11+ MINI ++IIMIII+ II 




Sbjct : 


174 


QGDSGGPLVCSDG E L VG I VS WG YG C AVGNY PG VYTR VS R Y LDW I 217 
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Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being 
found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, 
including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 
families (denoted SI - S27) of serine protease have been identified, these being grouped into 6 
5 clans on the basis of structural similarity and other functional evidence. 

Tryptase is a tetrameric serine protease that is concentrated and stored selectively in 
the secretory granules of all types of mast cells, from which it is secreted during mast cell 
degranulation. Its exclusive presence in mast cells permits its use as a specific clinical 
indicator of mast cell activation by measurement of its level in biologic fluids and as a 
10 selective marker of intact mast cells using immunohistochemical techniques with antitryptase 
antibodies. 

In addition, NOV2I nucleic acids and polypeptides are useful, inter alia, as novel 
members of the protein families according to the presence of domains and sequences related to 
previously described proteins. For example, NOV21 nucleic acids and polypeptides contain a 

15 structural motif that is characteristic of protein sbelonging to the serine protease family of 
. proteins. Accordingly, NOV21 may be useful in the same ways other members of this family 
are useful as detailed above. 

The disclosed NOV21 nucleic acid of the invention encoding a Adrenal secretory 
serine protease -like protein includes the nucleic acid whose sequence is provided in Table 

20 2 1 A, 2 1 C or a fragment thereof. The invention also includes a mutant or variant nucleic acid 
any of whose bases may be changed from the corresponding base shown in Table 21 A or 21C 
while still encoding a protein that maintains its Adrenal secretory serine protease -like 
activities and physiological functions, or a fragment of such a nucleic acid. The invention 
further includes nucleic acids whose sequences are complementary to those just described, 

25 including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 

30 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 2 percent of the 
bases may be so changed. 
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The disclosed NOV21 protein of the invention includes the Adrenal secretory serine 
protease -like protein whose sequence is provided in Table 21 B or 24D. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 21B or 21D while still encoding a protein that 
5 maintains its Adrenal secretory serine protease -like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 54 percent of the 
residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(F a b)2,that bind immunospecifically to any of the proteins of the invention. 

10 The above disclosed information suggests that this Adrenal secretory serine protease - 

like protein (NOV21) is a member of a "Adrenal secretory serine protease family". Therefore, 
the NOV21 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 

15 protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV21 nucleic acids and proteins of the invention are useful in potential 

20 therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome, cirrhosis, 

transplantation, endometriosis, fertility, anemia, ataxia-telangiectasia, autoimmune disease, 
hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, allergies, 
immunodeficiencies, graft versus host disease (GVHD), lymphaedema, muscular dystrophy, 
Lesch-Nyhan syndrome, myasthenia gravis, and/or other diseases and pathologies. 

25 NOV21 nucleic acids and polypeptides are further useful in the generation of 

antibodies that bind immuno-specifically to the novel NOV21 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV21 protein has multiple hydrophilic 

30 regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV22 
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NOV22 includes three novel adrenal secretory serine protease-like proteins disclosed 
below. The disclosed sequences have been named NOV22a, and NOV22b. 
NOV22a 

A disclosed NOV22a nucleic acid of 796 nucleotides (also referred to as CG56643-01) 
5 encoding a novel adrenal secretory serine protease-like protein is shown in Table 22 A. An 
open reading frame was identified beginning with an ACC initiation codon at nucleotides 1-3 
and ending with a TGA codon at nucleotides 763-765. The start and stop codons are shown in 
bold in Table 22A, and the 5' and 3' untranslated regions, if any, are underlined. Because the 
start codon of NOV22a is not a traditional initiation codon, NOV22a could be a partial reading 
10 frame that extends further in the 5' direction. 



Table 22A. NOV22a nucleotide sequence (SEQ ID NO:91). 

ACCCGAGCAGGCCAAGATCCCCAGACCTGGTCTTGTGTCCTCCTTCCAGAATGTGGGGCCAGGCCTGCAATG 
GAGAAGCCCACCCGGGTCGTGCGCGGGTTCGGAGCTGCCTCCGGGGAGGTGCCCTGGCAGGTCAGCCTGAAG 
GAAGGGTCCCGGCACTTCTGCGGAGCAACTGTGGTGGGGGACCGCTGGCTGCTGTCTGCCGCCCACTGCTTC 
CATAGCACGAAGGTGGAGCAGGTTCGGGCCCACCTGGGCACTGCGTCCCTCCTGGGCCTGGGCGGGAGCCCG 
GTGAAGATCGGGCTGCGGCGGGTAGTGCTGCACCCCCTCTACAACCCTGGCATCCTGGACTTCGACCTGGCT 
GTCCTGGAGCTGGCCAGCCCCCTGGCCTTCAACAAATACATCCAGCCTGTCTGCCTGCCCCTGGCCATCCAG 
AAGTTCCCTGTGGGCCGGAAGTGCATGATCTCCGGATGGGGAAATACGCAGGAAGGAAATCTGCAGAAGGCG 
TCCGTGGGCATCATAGACCAGAAAACCTGTAGTGTGCTCTACAACTTCTCCCTCACAGACCGCATGATCTGC 
GCAGGCTTCCTGGAAGGCAAAGTCGACTCCTGCCAGGGTGACTCTGGGGGCCCCCTGGCCTGCGAGGAGGCC 
CCTGGCGTGTTTTATCTGGCAGGGATCGTGAGCTGGGGTATTGGCTGCGCTCAGGTTAAGAAGCCGGGCGTG 
TACACGCGCATCACCAGGCTAAAGGGCTGGATCATCCAGGAGTG ACCACCACGTGACTGCCCAGGCCGAGAC 
TCTA 



In a search of public sequence databases, the NOV22a nucleic acid sequence, located 

on chromosome 19, has 278 of 428 bases (64%) identical to a gb:GENBANK- 
15 ID:E13204|acc:E 13204.1 mRNA from Homo sapiens (Human cDNA encoding a serine 

protease) (E = 1.6e" 29 ). 

The disclosed NOV22a polypeptide (SEQ ID NO:92) encoded by SEQ ID NO:91 has 

254 amino acid residues and is presented in Table 22B using the one-letter amino acid code. 

Signal P, Psort and/or Hydropathy results predict that NOV22a has no signal peptide and is 
20 likely to be localized to the microbody (peroxisome) with a certainty of 0.5090. Alternatively, 

NOV22a may also localize to the cytoplasm with a certainty of 0.4500, to the lysosome 

(lumen) with a certainty of 0.2082, or the mitochondrial matrix space with a certainty of 

0.1000. 



Table 22B. Encoded NOV22a protein sequence (SEQ ID NO:92). 

TRAGQDPQTWSCVLLPECGARPAMEKPTRWRGFGAASGEVPWQVSLKEGSRHFCGATWGDRWLLSAAHCF 
HSTKVEQVRAHLGTASLLGLGGSPVKIGLRRWLHPLYNPGILDFDLAVLELASPLAFNKYIQPVCLPLAIQ 
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KFPVGRKCMISGWGNTQEGNLQKASVGIIDQKTCSVLYNFSLTDRMICAGFLEGKVDSCQGDSGGPLACEEA 
PG VF Y LAG I V S WG I GC AQ VK K PG VYT R I T RL KG W 1 1 Q E 

A search of sequence databases reveals that the NOV22a amino acid sequence has 100 
of 241 amino acid residues (41%) identical to, and 149 of 241 amino acid residues (61%) 
similar to, the 273 amino acid residue ptnr:TREMBLNEW-ACC:BAB20278 protein from Mus 
5 musculus (Mouse) (Type 1 Spinesin) (E = 3Ae~ 49 ). 

The adrenal secretory serine protease disclosed in this invention is predicted to be 
expressed in at least the following tissues: Ovary, kidney, breast, lung, muscle, liver, spleen, 
blood, lymphocyte. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
10 Public EST sources, Literature sources, and/or RACE sources. 

NOV22b 

In the present invention, the target sequence identified previously, NOV22a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 

1 5 downstream sequence available for the reverse primer. In each case, the sequence was 

examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 

20 sequence of the target sequence, or by translated homology of the predicted exons to closely 
related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 

25 lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 

gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 

30 ESTs were included as components for an assembly when the extent of their identity with 

another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV22b. This differs from the previously 
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identified sequence (NOV22a) in having 43 additional aminoacids and different N and C 
terminus. 

A disclosed NOV22b nucleic acid of 992 nucleotides (also referred to as CG56643-02) 
encoding a novel adrenal secretory serine protease-like protein is shown in Table 22C. An 
5 open reading frame was identified beginning with an ATG initiation codon at nucleotides 101- 
103 and ending with a TAA codon at nucleotides 920-922. The start and stop codons are 
shown in bold in Table 22C, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 22C. NOV22b nucleotide sequence (SEQ ID NO:93). 

GCTAGTCTATCCCGAGACCCCTCCCACTCCAACAGTTAATGCTTCCCTTGACCTCAGAATGGCCTCCTACAC 
CTTACCCAGGTGCTAGGGCGGCAGCCCCATGGGGACAGTGGGGAGACTCTTGCGCTCTGAGCGGGCCATCAG 
GCCCACCTCCTCCTCACTCTGTGGCTTTGTGAGATTCCTGCAACTCTGTGAGCCCTGGTTTCTTCGTCTGTG 
GGGTGGGGATGCTGCATCTCGGGGCTGTTATCGGAGCGGAACTGGAGCTGCTCTGATGATCACTGTGCACGT 
GGCCTTTCTGGCTCTTTCCCTGGTAGCCACCAAGCCCGAGCTCCTGCAGAAGGCGTCCGTGGGCATCATAGA 
CCAGAAAACCTGTAGTGTGCTCTACAACTTCTCCCTCACAGACCGCATGATCTGCGCAGGCTTCCTGGAAGG 
CAAAGTCGACTCCTGCCAGGGTGACTCTGGGGGCCCCCTGGCCTGCGAGGAGGCCCCTGGCGTGTTTTATCT 
GGCAGGGATCGTGAGCTGGGGTATTGGCTGCGCTCAGGTTAAGAAGCCGGGCGTGTACACGCGCATCACCAG 
GCTAAAGGGCTGGATCCTGGAGATCATGTCCTCCCAGCCCCTTCCCATGTCTCCCCCCTCGACCACAAGGAT 
GCTGGCCACCACCAGCCCCAGGACGACAGCTGGCCTCACAGTCCCGGGGGCCACACCCAGCAGACCCACCCC 
TGGGGCTGCCAGCAGGGTGACGGGCCAACCTGCCAACTCAACCTTATCTGCCGTGAGCACCACTGCTAGGGG 
ACAGACGCCATTTCCAGACGCCCCGGAGGCCACCACACACACCCAGCTACCAGGTACCGGGAGAGACGGAGG 
a A T PC r TQnOAOTGG AGGGT CC CATGTTAATCAGC CTGGG C TGC CT AAC AAG AC AT A ACGT C GTC C ACT T TG 
GGAGGCCGAGGCGGGCGGATCAAGAGGTCAGGAGATCGAGACCATCCTGGCGAACA 



10 In a search of public sequence databases, the NOV22b nucleic acid sequence, located 

on chromosome 19, has 203 of 294 bases (69%) identical to a gbiGENBANK- 
ID:AF133086|acc:AF 133086.1 mRNA from Homo sapiens (membrane-type serine protease 1 
mRNA, complete cds) (E = 3.6e 16 ). 

The disclosed NOV22b polypeptide (SEQ ID NO: 94) encoded by SEQ ID NO:93 has 

15 273 amino acid residues and is presented in Table 22D using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV22b has A signal peptide and is 
likely to be localized to the mitochondrial inner membrane with a certainty of 0.8723. 
Alternatively, NOV22b may also localize to the plasma membrane with a certainty of 0.6500, 
to the mitochondrial intermembrane space with a certainty of 0.5053, or the mitochondrial 

20 matrix space with a certainty of 0.3617. The most likely cleavage site for NOV22b is between 
positions 43 and 44: GDA-AS. 



Table 22D. Encoded NOV22b protein sequence (SEQ ID NO:94). 

MGTVGRLLRSERAIRPTSSSLCGFVRFLQLCEPWFLRLWGGDAASRGCYRSGTGAALMITVHVAFLALSLVA 
TKPELLQKASVGIIDQKTCSVLYNFSLTDRMICAGFLEGKVDSCQGDSGGPLACEEAPGVFYLAGIVSWGIG 
CAQVKKPGVYTRITRLKGWILEIMSSQPLPMSPPSTTRMLATTSPRTTAGLTVPGATPSRPTPGAASRVTGQ 
P ANS T L S AVS TT ARGQT P F PD AP E ATTHT QL PGTGRDGG IPGS GG S H VNQ PGL PNKT 
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A search of sequence databases reveals that the NOV22b amino acid sequence has 49 
of 90 amino acid residues (54%) identical to, and 63 of 90 amino acid residues (70%) similar 
to, the 277 amino acid residue ptnr:SPTREMBL-ACC:096899 protein from Scolopendra 
5 subspinipes (Plasminogen Activator Spa) (E = 4.3e" 24 ). 

NOV22b is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
10 salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. . 

NOV22c 

A disclosed NOV22c nucleic acid of 912 nucleotides (also referred to as CG56643-03) 
encoding a novel adrenal secretory serine protease-like protein is shown in Table 22E. An 
15 open reading frame was identified beginning with an ATG initiation codon at nucleotides 77- 
79 and ending with a TAAcodon at nucleotides 896-898. The start and stop codons are 
shown in bold in Table 22E, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 22E. NOV22c nucleotide sequence (SEQ ID NO:95). 

CACTCCAACAGTTAATGCTTCCCTTGACCTCAGAATGGCCTCCTACACCTTACCCAGGTGCTAGGGCGGCAG 
CCCC ATGGGGAC AGTGGGG AG AC TCTTGCGCTCT GAG CGGG C C AT C AGG C C CACC T C C T C C T C ACTC TGTGG 
CTTTGTGAGATTCCTGCAACTCTGTGAGCCCTGGTTTCTTCGTCTGTGGGGTGGGGATGCTGCATCTCGGGG 
CTGTTATCGGAGCGGAACTGGAGCTGCTCTGATGATCACTGTGCACGTGGCCTTTCTGGCTCTTTCCCTGGT 
AGCCACCAAGCCCGAGCTCCTGCAGAAGGCGTCCGTGGGCATCATAGACCAGAAAACCTGTAGTGTGCTCTA 
CAACTTCTCCCTCACAGACCGCATGATCTGCGCAGGCTTCCTGGAAGGCAAAGTCGACTCCTGCCAGGGTGA 
, CTCTGGGGGCCCCCTGGCCTGCGAGGAGGCCCCTGGCGTGTTTTATCTGGCAGGGATCGTGAGCTGGGGTAT 
TGGCTGCGCTCAGGTTAAGAAGCCGGGCGTGTACACGCGCATCACCAGGCTAAAGGGCTGGATCCTGGAGAT 
CATGTCCTCCCAGCCCCTTCCCATGTCTCCCCCCTCGACCACAAGGATGCTGGCCACCACCAGCCCCAGGAC 
GACAGCTGGCCTCACAGTCCCGGGGGCCACACCCAGCAGACCCACCCCTGGGGCTGCCAGCAGGGTGACGGG 
CCAACCTGCCAACTCAACCTTATCTGCCGTGAGCACCACTGCTAGGGGACAGACGCCATTTCCAGACGCCCC 
GGAGGCCACCACACACACCCAGCTACCAGGTACCGGGAGAGACGGAGGGATCCCTGGGAGTGGAGGGTCCCA 
TGTTAATCAGCCTGGGCTGCCTAACAAGACATAACGTCGTCCACTTTG 



20 In a search of public sequence databases, the NOV22c nucleic acid sequence, located 

on chromosome 19, has 203 of 294 bases (69%) identical to a gb : GENB ANK- 
ID:E13204|acc:E13204.1 mRNA from Homo sapiens (Human cDNA encoding a serine 
protease) (E = 1.3e" 18 ). 

The disclosed NOV22c polypeptide (SEQ ID NO:96) encoded by SEQ ID NO:95 has 

25 273 amino acid residues and is presented in Table 22F using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV22c has A signal peptide and is 
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likely to be localized to the mitochondrial inner membrane with a certainty of 0.8723. 
Alternatively, NOV22c may also localize to the plasma membrane with a certainty of 0.6500, 
to the mitochondrial intermembrane space with a certainty of 0.5053, or the mitochondrial 
matrix space with a certainty of 0.3617. The most likely cleavage site for NOV22c is between 
5 positions 43 and 44: GDA-AS. 



Table 22F. Encoded NOV22c protein sequence (SEQ ID NO:96). 

MGWGRLLRSERAIRPTSSSLCGFVRFLQLCEPWFLRLWGGDAASRGCYRSGTGAALMITVHVAFLALSLVA 
TKPELLQKASVGIIDQKTCSVLYNFSLTDRNICAGFLEGKVDSCQGDSGGPLACEEAPGVFYLAGIVSWGIG 
CAQVKKPGVYTRITRLKGWILEIMSSQPLPMSPPSTTRMLATTSPRTTAGLTVPGATPSRPTPGAASRVTGQ 
PANSTLSAVSTTARGQTPFPDAPEATTHTQLPGTGRDGGIPGSGGSHVNQPGLPNKT , , 



A search of sequence databases reveals that the NOV22c amino acid sequence has 49 
of 90 amino acid residues (54%) identical to, and 63 of 90 amino acid residues (70%) similar 

10 to, the 277 amino acid residue ptnr:SPTREMBL-ACC:096899 protein from Scolopendra 
subspinipes (Plasminogen Activator SPA) (E = 4.5 e" 24 ). 

NOV22c is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 

15 kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 

salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. . 

The disclosed NOV22a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 22G.. 

20 



Table 22G. BLAST results for NOV22a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 16758444 | ref | NP_ 
446087. 1| 
(NM_053635) 


suppression of 
tumorigenicity 14 
(colon carcinoma, 
matriptase, 
epithin) [Rattus 
norvegicus] 


855 


109/251 
(43%) 


148/251 
(58%) 


7e-55 


gi | 7363445 | ref |NP_0 
35306.2 | 
(NM_011176) 


protease, serine, 
14 (epithin) [Mus 
musculus] 


855 


110/248 
(44%) 


150/248 
(60%) 


7e-54 


gi | 9757702 | dbj | BAB0 
8218. 1| (AB038498) 


homo log of human 
MT-SP1 [Xenopus 
laevis] 


845 


113/261 
(43%) 


156/261 
(59%) 


2e-52 


gi | 10257390 | gb | AAG1 
5395 . 1 | AF057145 1 
(AF057145) 


serine protease 
TADG15 [Homo 
sapiens] 


855 


107/248 
(43%) 


145/248 
(58%) 


3e-52 
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gi | 1141504 0 | ref | NP_ 


suppression of 


855 


107/248 


145/248 


3e-52 


068813 . 1 | 


tumorigenicity 14 




(43%) 


(58%) 




(NM 021978) 


(colon carcinoma, 
matriptase , 
epithin) ; 
suppression of 
tumorigenicity 14 
(colon 
carcinoma) ; 
matriptase [Homo 
sapi ens] 











The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 22H. In the ClustalW alignment of the NOV22 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence {i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



1) Novel NOV22a (SEQ ID 

2) Novel NOV22b (SEQ ID 

3) Novel NOV22c (SEQ ID 

4) gi | 16758444 | ref |NP_446 
carcinoma, matriptase, epi 

5) gi | 7363445 | ref |NP_0353 
musculus] (SEQ ID NO: 4 13) 

6) gi | 9757702 | db j |BAB0821 
(SEQ ID NO:415) 

7) gi | 102 573 90 jgb | 7AAG153 9 
sapiens] (SEQ ID NO: 411) 

8) gi | 11415040 | ref |NP_068 
carcinoma, matriptase, epi 
matriptase [Homo sapiens] 



Table 22H. ClustalW Analysis of NOV22 

NO: 92) 
NO: 94) 
NO: 96) 

087.1 | (NM_053635) suppression of tumorigenicity 14 (colon 
thin) [Rattus norvegicus] (SEQ ID NO: 414) 
06.2 | (NM_011176) protease, serine, 14 (epithin) [Mus 

8.1 | (AB038498) homolog of human MT-SP1 [Xenopus laevis] 

5 . 1 | AF057145_1 (AF057145) serine protease T7ADG15 [Homo 

813.1 | (NM_021978) suppression of tumorigenicity 14 (colon 
thin) ; suppression of tumorigenicity 14 (colon carcinoma) ; 
(SEQ ID NO:412) 
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NOV22a 
NOV22b 
NOV22C 

gi | 16758444 | ref 
gi|7363445|ref | 
gi | 9757702 | db j | 
gij 10257390 |gb| 
gi j 11415040 j ref 
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30 
•I-- 
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MGNNRGRKAGGGSj 
MGSNRGRKAGGGS*{ 

M 

MGSDRARKGGGGP; 
MGSDRARKGGGGPi 




EEGVEFLP 
EEGVEFLP 
EEGVEFLP 
EEGVEFLP 
EEGVEFLP 




anSaSi 
Ssl 



NOV22a 
NOV22b 
NOV22C 

gi | 16758444 | ref 
gi|7363445|ref | 
gi | 9757702 | db j j 
gij 10257390 |gb| 
gi | 11415040 | ref 
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NOV22a 
NOV22b 
NOV22c 

gi | 16758444 | ref 
gi|7363445 |ref | 
gij 9757702 j db j j 
gi jl0257390 |gbj 
gij 11415040 | ref 



NOV22a 
NOV22b 
NOV22c 

gi | 16758444 | ref 
gi|7363445|ref | 
gi | 9757702 j dbj j 
gi j 10257390 | gb j 
gi | 11415040 j ref 



NOV22a 
NOV22b 
NOV22c 

gi | 16758444 | ref 
gi | 7363445 | ref | 
gi | 9757702 j dbj j 
gi | 10257390 | gb j 
gij 11415040 | ref 



NOV22a 
NOV22b 
NOV22C 



gi 
gi 
gi 
gi 



16758444 | ref 
7363445 | ref | 
9757702 jdbj j 
10257390 |gbj 
11415040 j ref 



NOV22a 
NOV22b 
NOV22C 

gi 1 16758444 | ref 
gij7363445|ref | 
gi j 9757702 j dbj | 
gij 10257390 | gb j 
gi | 11415040 j ref 



NOV22a 
NOV22b 
NOV22C 

gi | 16758444 | ref 
gij7363445|ref | 
gij 9757702 jdbj j 
gijl0257390|gbj 
gi j 11415040 j ref 



NOV22a 



14 0 
..|.. 



150 
..|.. 



160 



170 



180 
■-I 




190 
..|.. 



HKKST: 
Hp2SA5 ' 

HKESA T 
200 




210 
..|.. 



YYWSEFS 
YYWSEFS 
YYWSEFS 
YYWSEFS 
YYWSEFS 



220 
..|.. 



|PHLEEEVgRAM^VE|| 

"phlaeevSra^SveI 

,FREAAFEiiAk|ELp 

)Hlveeaervm4ee5 

>HLVE1 



1 
1 
1 

177 
177 
168 
177 
177 



230 
..|.. 



240 




prmlqrt'qdsj 

plMLgRT^DN 

pSiaSvfichs 

ISpTVclRTpDS 
|sgTVgRT@fJ 




GRTJ|T< 
GAA^TR 

;sngwa^ 

GVEgM^ 
GVEliMJRgT 




1 
1 
1 

237 
237 
225 
237 
237 




PSYNLTFfflSS 
PSYNLTFfflSS 
PSYNLTFgSS 

psynltfSss 
psynltfSss 




490 



500 



510 



0QT2|sgvLi]p; 
196 



520 
I- 



530 



540 
--I 



19 



.1 nrt ? 



fii 1 "as ;1i ihi 



10 



15 



20 



25 



30 



35 



40 



NOV2 2b 
NOV22C 

gi | 16758444 | ref 
gi|7363445 | ref | 
gi|9757702 j db j j 
gi|l0257390 |gb| 
gi | 11415040 j ref 



NOV22a 
NOV22b 
NOV22c 

gi | 16758444 | ref 
gi|7363445|ref | 
gi | 9757702 | dbj | 
gi | 10257390 |gb| 
gi | 11415040 j ref 



NOV22a 
NOV22b 
NOV22C 

gi | 16758444 | ref 
gi | 7363445 | ref | 
gi | 9757702 j db j j 
gi|l0257390|gb| 
gi | 11415040 j ref 



NOV22a 
NOV22b 
NOV22C 

gi | 16758444 | ref 
gi j 7363445 | ref | 
gi j 9757702 | db j j 
gij 10257390 |gb| 
gi | 11415040 j ref 




680 



690 




730 



740 



750 



760 



770 



780 



NOV22a 
45 NOV22b 
NOV22C 

gi | 16758444 | ref 
gi|7363445|ref | 
gi j 9757702 | db j | 
50 gi j 10257390 | gb j 
gi j 11415040 j ref 




74 

;VPGATPSRPT 2 06 
ITVPGATPSRPT 206 
772 
772 
I^J 762 
772 
772 



790 



800 



810 



820 



830 



840 



55 



60 



65 



70 



NOV22a 
NOV22b 
NOV22C 

gi | 16758444 | ref 
gi|7363445 |ref | 
gi | 9757702 j db j | 
gi|l0257390|gb| 
gi j 11415040 j ref 



NOV22a 
NOV22b 
NOV22C 

gi | 16758444 | ref 
gi (7363445 | ref | 



PK^SVjj}YNFS£BD| 

PGAASR^TG@ 




197 



gi I 9757702 |dbj | 
gi|10257390|gbj 
gi | 11415040 | ref 




Tables 22I-J lists the domain descriptions from DOMAIN analysis results against 
NOV22. This indicates that the NOV22 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 221 Domain Analysis of NOV22 

gnl | Smart | smart 00 020 , Tryp_SPc / Trypsin- like serine protease; Many of 
these are synthesised as inactive precursor zymogens that are cleaved 
during limited proteolysis to generate their active forms. A few, 
however, are active as single chain molecules, and others are inactive 
due to substitutions of the catalytic triad residues. (SEQ ID 
NO:812) 

CD-Length = 230 residues, 100.0% aligned 
Score = 220 bits (560) , ' Expect = 9e-59 





NOV2 2 : 


29 




Sbjct : 


1 


15 


N0V22 : 


88 




Sbjct : 


61 




NOV22 : 


148 


20 








Sbjct: 


120 




NOV22 : 


200 


25 


Sbjct: 


180 



RWRGFGAASGEVPWQVSLK- EGSRHFCGATWGDRWLLSAAHCFHSTKVEQVRAHLGTA 8 7 

l + l I I I IIIIII+ I 1 1 1 1 1 +++ li + i + MM + + + 1 11 + 

RI VGGS E ANIGS FPWQVS LQYRGGRHFCGGSL I S PRWVLTAAHCVYGS APS S I RVRLGSH 60 

SLLGLGGSPVK1 GLRRWLH P L YN PG I LD FDLAVLE LAS P L AFN K Y I Q P VC L P LA I Q KF P 14 7 

I + + + I++M III I I+I+I+I+ 1+ + ++I+III + I 

DLSSGEE-TQTVKVSKVIVHPNYNPSTYDNDIALLKLSEPVTLSDTVRPICLPSSGYNVP 119 

VGRKCMI SGWGNTQEGN LQKASVGI IDQKTCSVLY- -NFSLTDRMICAGFLEGK 199 

I I +1 1 1 1 I I + 11+ +1 1+ II I ++II l + lll Ml 

AGTTCTVSGWGRTSESSGSLPDTLQEVNVP I VSNATCRRAYSGGPAI TDNML C AGGL EGG 17 9 



VDSCQGDSGGPLACEEAPGVFYLAGIVSWG- IGCAQVKKPGVYTRITRLKGWI 

l + IIMIIIII I + +1 MUM III- IMIIII++ II 

KDACQGDSGGPLVCND - - PRWVLVGIVSWGSYGCARPNKPGVYTRVSSYLDWI 
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Table 22 J Domain Analysis of NOV22 

gnl | Pf am | pf am00089 , trypsin, Trypsin. Proteins recognized include all 
proteins in families SI, S2A, S2B, S2C, and S5 in the classification 
of peptidases. Also included are proteins that are clearly members, 
but that lack peptidase activity, such as haptoglobin and protein Z 
(PRTZ*). (SEQ ID NO:813) 

CD-Length = 217 residues, 100.0% aligned 
Score = 192 bits (488) , Expect = 2e-50 
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NOV22 : 


30 


Sbjct: 


1 


NOV22 : 


90 


Sbjct : 


59 


NOV22 : 


150 


Sbjct : 


117 



WRGFGAASGEVPWQVSLKEGSRHFCGATWGDRWLLSAAHCFHSTKVEQVRAHLGTASL 89 

+ 11 l+l IMIII+ I MM +++ + l + l + llll +1 I +1 

I VGGRE AQ AGS F P WQVS LQVS S GH FCGGS LIS ENWVLT AAHC VS GAS S VRWL - - GEHNL 58 

LGLGGSPVKIGLRRWLHPLYNPGILDFDLAVLELASPLAFNKYIQPVCLPLAIQKFPVG 14 9 

1+ I ++ + ++-MI Ml l + l + l + l 11+ ++I + MI I Ml 

GTTEGTEQKFDVKKI IVHPNYNPD- -TNDIALLKLKSPVTLGDTVRPICLPSASSDLPVG 116 

RKCM I SGWGNTQEGN LQKASVGIIDQKTCSVLYNFSLTDRMICAGFLEGKVDSCQG 205 

I +IMI 1+ II- I 1+ ++M I ++I I II II I I II l + lll 

TTCSVSGWGRTKNLGTSDTLQE VWPI VSRETCRSAYGGTVTDTMI CAGALGGK- DACQG 175 
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NOV2 2 : 2 06 D S GG P LAC E E APG VF YLAG I VS WG I GC AQ VK K PG VYT R I TRL KG W I 251 

linn i + i hum in minimi m 

Sbjct: 176 DSGGPLVCSDG ELVGIVSWGYGCAVGNYPGVYTRVSRYLDWI 217 

5 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being 

found in viruses, bacteria and eukaryotes [1] . They include a wide range of peptidase activity, 
including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 
families (denoted SI - S27) of serine protease have been identified, these being grouped into 6 
clans on the basis of structural similarity and other functional evidence [1]. 

10 Tryptase is a tetrameric serine protease that is concentrated and stored selectively in 

the secretory granules of all types of mast cells, from which it is secreted during mast cell 
degranulation. Its exclusive presence in mast cells permits its use as a specific clinical 
indicator of mast cell activation by measurement of its level in biologic fluids and as a 
selective marker of intact mast cells using immunohistochemical techniques with antitryptase 

15 antibodies. Vanderslice [2] demonstrated the existence of multiple tryptases. In this respect, 
mast cell tryptase is like other serine proteases such as glandular kallikrein and trypsin, which 
are also members of multigene families. Miller et al. [3] mapped both alpha-tryptase and beta- 
tryptase to human chromosome 16 by PCR analysis of DNA from human/hamster somatic cell 
hybrids. Miller et al. [3] cloned a second cDNA for human tryptase, called beta-tryptase, from 

20 a mast cell cDNA library. The 1,142 bases of beta-tryptase were found to encode a 30-amino 
acid leader sequence of 3,089 daltons and a 245-amino acid catalytic region of 27,458 daltons. 
The amino acid sequence of beta-tryptase was found to be 90% identical with that of alpha- 
tryptase, the first 20 amino acids of the catalytic portions being 100% identical. Both alpha- 
and beta-tryptase sequences were localized to human chromosome 16 by analysis of DNA 

25 preparations from 25 human/hamster somatic cell hybrids by PCR. 

Because of the presence of the trypsin domains and the homology to the adrenal 
secretory serine protease, we anticipate that the novel sequence described here will have useful 
properties and functions similar to these genes. 

The disclosed NOV22 nucleic acid of the invention encoding a Adrenal secretory 

30 serine protease -like protein includes the nucleic acid whose sequence is provided in Table 

22A, 25C, 25E or a fragment thereof. The invention also includes a mutant or variant nucleic 
acid any of whose bases may be changed from the corresponding base shown in Table 22A, 
25C, or 25E while still encoding a protein that maintains its Adrenal secretory serine protease 
-like activities and physiological functions, or a fragment of such a nucleic acid. The 

35 invention further includes nucleic acids whose sequences are complementary to those just 

described, including nucleic acid fragments that are complementary to any of the nucleic acids 
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just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
5 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 36 percent of the 
bases may be so changed. 

The disclosed NOV22 protein of the invention includes the Adrenal secretory serine 

10 protease -like protein whose sequence is provided in Table 22B, 25D, or 25F. The invention 
also includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 22B, 25D, or 25F while still encoding a protein that 
maintains its Adrenal secretory serine protease -like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 57 percent of the 

15 residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(FabK that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Adrenal secretory serine protease - 
like protein (NOV22) is a member of a "Adrenal secretory serine protease family". Therefore, 

20 the NOV22 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

25 delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV22 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome, cirrhosis, 
transplantation, endometriosis, fertility, anemia, ataxia-telangiectasia, autoimmune disease, 

30 hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, allergies, 

immunodeficiencies, graft versus host disease (GVHD), lymphaedema, muscular dystrophy, 
Lesch-Nyhan syndrome, myasthenia gravis, and/or other diseases and pathologies. 

NOV22 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV22 substances for use in 
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therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV22 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
5 assay systems for functional analysis of various human disorders, which will help in 

understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV23 

NOV23 includes three novel serine protease DESC1 protease-like proteins disclosed 
10 below. The disclosed sequences have been named NOV23a, NOV23b, NOV23c, and 
NOV23d. ' 
NOV23a 

The disclosed NOV23a nucleic acid of 1546 nucleotides (also referred to as CG56647- 
02) encoding a novel serine protease DESCl-like protein is shown in Table 23 A. An open 
15 reading frame was identified beginning with an ATG initiation codon at nucleotides 101-103 
and ending with a TAG codon at nucleotides 1481-1483. Putative untranslated regions, if any, 
found upstream from the initiation codon and downstream from the termination codon are 
underlined in Table 23 A, and the start and stop codons are in bold letters. 

Table 23A. NOV23a Nucleotide Sequence (SEQ ID NO:97) 

"gcccctgccataggaggcggggactgtcatttcaccgtctcctgatgccattccagaggttacgccctga 
a gtcagctcagatcctgggccaggcactgc atgggagacaggcatgagcaggacctctttctgccttcga 
ggaaaacacgggggcatctggggctcacttggcactcatccaccttgtgctgtacctggggacctccggc 
cttcctctctacacagggcttccacgtggaccacacggccgagctgcggggaatccggtggaccagcagt 

TTGCGGCGGGAGACCTCGGACTATCACCGCACGCTGACGCCCACCCTGGAGGCACTGTTTGTAAGTAGTT 
TT C AG AAGAC AGAGTTAGAGG C AAGC TGC GTGGGTTG CT CGGT ACTGAAT T AT AGGGATGGGAACT C C AG 
TGTCCTCGTACATTTCCAGCTGCACTTTCTGCTGCGACCCCTCCAGACGCTGAGCCTGGGCCTGGAGGAG 
GAGCTATTGCAGCGAGGGATCCGGGCAAGGCTGCGGGAGCACGGCATCTCCCTGGCTGCCTATGGCACAA 
TTGTGTCGGCTGAGCTCACAGGTAGACATAAGGGACCCTTGGCAGAAAGAGACTTCAAATCAGGTCGCTG 
TCCAGGGAACTCCTTTTCCTGCGGGAACAGCCAGTGTGTGACCAAGGTGAACCCGGAGTGTGACGACCAG 
GAGGACTGCTCCGATGGGTCCGACGAGGCGCACTGCGAGTGTGGCTTGCAGCCTGCCTGGAGGATGGCCG 
GCAGGATCGTGGGCGGCATGGAAGCATCCCCGGGGGAGTTTCCGTGGCAAGCCAGCCTTCGAGAGAACAA 
GGAGCACTTCTGTGGGGCCGCCATCATCAACGCCAGGTGGCTGGTGTCTGCTGCTCACTGCTTCAATGAG 
TTCCAAGACCCGACGAAGTGGGTGGCCTACGTGGGTGCGACCTACCTCAGCGGCTCGGAGGCCAGCACCG 
TGCGGGCCCAGGTGGTCCAGATCGTCAAGCACCCCCTGTACAACGCGGACACGGCCGACTTTGACGTGGC 
TGTGCTGGAGCTGACCAGCCCTCTGCCTTTCGGCCGGCACATCCAGCCCGTGTGCCTCCCGGCTGCCACA 
CACATCTTCCCACCCAGCAAGAAGTGCCTGATCTCAGGCTGGGGCTACCTCAAGGAGGACTTCGTGGTCA 
AGCCAGAGGTGCTGCAGAAAGCCACTGTGGAGCTGCTGGACCAGGCACTGTGTGCCAGCTTGTACGGCCA 
TTCACTCACTGACAGGATGGTGTGCGCTGGCTACCTGGACGGGAAGGTGGACTCCTGCCAGGGTGACTCA 
GGAGGACCCCTGGTCTGCGAGGAGCCCTCTGGCCGGTTCTTTCTGGCTGGCATCGTGAGCTGGGGAATCG 
GGTGTGCGGAAGCCCGGCGTCCAGGGGTCTATGCCCGAGTCACCAGGCTACGTGACTGGATCCTGGAGGC 
CACCGAAAGGTAG AAGATGATGTACGTGCCTATCTTGATTTAGGGAGAACGGATATCGTCATAGTATCTT 
CATAAT 
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The disclosed NOV23a nucleic acid sequence, located on chromosome 19, has 356 of 
566 bases (62%) identical to a gb:GENBANK-ID:AF133086|acc:AF 133086.1 mRNA from 
Homo sapiens (membrane-type serine protease 1 mRNA, complete cds) (E = l.le" 23 ) . 

A disclosed NOV23a polypeptide (SEQ ID NO:98) encoded by SEQ ID NO:97 is 460 
5 amino acid residues and is presented using the one-letter amino acid code in Table 23B. 

Signal P, Psort and/or Hydropathy results predict that NOV23a contains no signal peptide and 
is likely to be localized in the microbody (peroxisome) with a certainty of 0.5387. 

Table 23B. Encoded NOV23a protein sequence (SEQ ID NO:98). 

MGDRHEQDLFLPSRKTRGHLGLTWHSSTLCCTWGPPAFLSTQGFHVDHTAELRGIRWTSSLRRETSDYHRTLTPT 
LEALFVSSFQKTELEASCVGCSVLNYRDGNSSVLVHFQLHFLLRPLQTLSLGLEEELLQRGIRARLREHGISLAA 
YGTIVSAELTGRHKGPLAERDFKSGRCPGNSFSCGNSQCVTKVNPECDDQEDCSDGSDEAHCECGLQPAWRMAGR 
IVGGMEASPGEFPWQASLRENKEHFCGAAIINARWLVSAAHCFNEFQDPTKWVAYVGATYLSGSEASTVRAQWQ 
IVKHPLYNADTADFDVAVLELTSPLPFGRHIQPVCLPAATHIFPPSKKCLISGWGYLKEDFWKPEVLQKATVEL 
LDQALCASLYGHSLTDRMVCAGYLDGKVDSCQGDSGGPIiVCEEPSGRFFLAGIVSWGIGCAEARRPGVYARVTRL 
RDW I LEATER 

The disclosed NOV23a amino acid sequence has 1 12 of 248 amino acid residues 
10 (45%) identical to, and 157 of 248 amino acid residues (63%) similar to, the 422 amino acid 
residue ptnr:SPTREMBL-ACC:Q9UL52 protein from Homo sapiens (Human) (serine 
protease DESC1) (E = Lie 58 ). 

NOV23a is predicted to be expressed in at least Ovary, kidney, breast, lung, muscle, 
liver, spleen, blood and lymphocyte. This information was derived by determining the tissue 
1 5 sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, Public EST sources, and/or RACE sources. 

NOV23b 

A disclosed NOV23b nucleic acid of 1 777 nucleotides (also referred to as CG56647- 
03) encoding a novel serine protease DESCl-like protein is shown in Table 23C. An open 
20 reading frame was identified beginning with an ATG initiation codon at nucleotides 101-103 
and ending with a TAG codon at nucleotides 163 1-1633. Putative untranslated regions, if any, 
found upstream from the initiation codon and downstream from the termination codon are 
underlined in Table 23C, and the start and stop codons are in bold letters. 

Table 23C. NOV23b Nucleotide Sequence (SEQ ID NO:99) 

"gcccctgccataggaggcggggactgtcatttcaccgtctcctgatgccattccagaggttacgccctga 
agt c agc t c agat c ctggg c c aggc actg c atggg ag ac agg c atg ag c aggac ct ct ttctg c cttcg a 
ggaaaacacgggggcatctggggctcacttggcactcatccaccttgtgctgtacctggggacctccggc 

CTTCCTCTCTACACAGGGCTTCCACGTGGACCACACGGCCGAGCTGCGGGGAATCCGGTGGACCAGCAGT 
TTGCGGCGGGAGACCTCGGACTATCACCGCACGCTGACGCCCACCCTGGAGGCACTGTTTGTAAGTAGTT 
TTCAGAAGACAGAGTTAGAGGCAAGCTGCGTGGGTTGCTCGGTACTGAATTATAGGGATGGGAACTCCAG 
TGTCCTCGTACATTTCCAGCTGCACTTTCTGCTGCGACCCCTCCAGACGCTGAGCCTGGGCCTGGAGGAG 
GAGCTATTGCAGCGAGGGATCCGGGCAAGGCTGCGGGAGCACGGCATCTCCCTGGCTGCCTATGGCACAA 
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TTGTGTCGGCTGAGCTCACAGGGAGACATAAGGGACCCTTGGCAGAAAGAGACTTCAAATCAGGCCGCTG 
TCCAGGGAACTCCTTTTCCTGCGGGAACAGCCAGTGTGTGACCAAGGTGAACCCGGAGTGTGACGACCAG 
GAGGACTGCTCCGATGGGTCCGACGAGGCGCACTGCGAGTGTGGCTTGCAGCCTGCCTGGAGGATGGCCG 
GCAGGATCGTGGGCGGCATGGAAGCATCCCCGGGGGAGTTTCCGTGGCAAGCCAGCCTTCGAGAGAACAA 
GGAGCACTTCTGTGGGGCCGCCATCATCAACGCCAGGTGGCTGGTGTCTGCTGCTCACTGCTTCAATGAG 
TTCCAAGACCCGACGAAGTGGGTGGCCTACGTGGGTGCGACCTACCTCAGCGGCTCGGAGGCCAGCACCG 
TGCGGGCCCAGGTGGTCCAGATCGTCAAGCACCCCCTGTACAACGCGGACACGGCCGACTTTGACGTGGC 
TGTGCTGGAGCTGACCAGCCCTCTGCCTTTCGGCCGGCACATCCAGCCCGTGTGCCTCCCGGCTGCCACA 
CACATCTTCCCACCCAGCAAGAAGTGCCTGATCTCAGGCTGGGGCTACGTGCTGCAGAAAGCCACTGTGG 
AGCTGCTGGACCAGGCACTGTGTGCCAGCTTGTACGGCCATTCACTCACTGACAGGATGGTGTGCGCTGG 
CTACCTGGACGGGAAGGTGGACTCCTGCCAGGGTGACTCAGGAGGACCCCTGGTCTGCGAGGAGCCCTCT 
GGCCGGTTGTTTCTGGCTGGCATCGTGAGCTGGGGAATCGGGTGTGCGGAAGCCCGGCATCCAGGGGTCT 
ATGCCCGAGTCACCAGGCTACGCGACTGGATCCTGGAGGCCACCACCAAAGCCAGCATGCCTCTGGCCCC 
CACCATGGCTCCTGCCCCTGCCGCCCCCAGCACAGCCTGGCCCACCAGTCCTGAGAGCCCTGTGGTCAGC 
ACCCCCACCAAATCGATGCAGGCCCTCAGTACCGTGCCTCTTGACTGGGTCACCGTTCCTAAGCTACAAG 
GTATTTTCGGGGCAGAAAGGTAG AAGATGATGTACGTGCCTATCTTGATTTAGGGAGAACGGATATCGTC 
ATAGTATCTTCATAATTTTGGATCTTCCTGTTCAAGGAAAGGTCACATGTGTATCCGTTT ATTCCCATCT 
TACGTTGCGTGTACCCTCATGGTATCT 



The disclosed NOV23b nucleic acid sequence, located on chromosome 19, has 208 of 
327 bases (63%) identical to a gb:GENBANK-ID:AF098327|acc:AF098327.1 mRNA from 
Homo sapiens (putative mast cell mMCP-7-like II typtase gene, complete cds) (E = 2.8e" 14 ) . 
5 A disclosed NOV23b polypeptide (SEQ ID NO: 1 00) encoded by SEQ ID NO:99 is 

510 amino acid residues and is presented using the one-letter amino acid code in Table 23D. 
Signal P, Psort and/or Hydropathy results predict that NOV23b contains no signal peptide and 
is likely to be localized in the microbody (peroxisome) with a certainty of 0.5 131. 



Table 23D. Encoded NOV23b protein sequence (SEQ ID NO: 100). 

MGDRHEQDLFLPSRKTRGHLGLTWHSSTLCCTWGPPAFLSTQGFHVDHTAELRGIRWTSSLRRETSDYHRTLTPT 
LEALFVSSFQKTELEASCVGCSVLNYRDGNSSVLVHFQLHFLLRPLQTLSLGLEEELLQRGIRARLREHGISLAA 
YGTIVSAELTGRHKGPLAERDFKSGRCPGNSFSCGNSQCVTKVNPECDDQEDCSDGSDEAHCECGIiQPAWRMAGR 
IVGGMEASPGEFPWQASLRENKEHFCGAAIINARWLVSAAHCFNEFQDPTKWVAYVGATYLSGSEASTVRAQWQ 
IVKHPLYNADTADFDVAVLELTSPLPFGRHIQPVCLPAATHIFPPSKKCLISGWGYVLQKATVELLDQALCASLY 
GHSLTDRMVCAGYLDGKVDSCQGDSGGPLVCEEPSGRLFLAGIVSWGIGCAEARHPGVYARVTRLRDWILEATTK 
ASMPLAPTMAPAPAAPSTAWPTSPESPWSTPTKSMQAIiSTVPLDWVTVPKLQGIFGAER 

10 

The disclosed NOV23b amino acid sequence has 109 of 246 amino acid residues 
(44%) identical to, and 152 of 246 amino acid residues (61%) similar to, the 422 amino acid 
residue ptnr:SPTREMBL-ACC:Q9UL52 protein from Homo sapiens (Human) (serine 
protease DESC1) (E = 1.3e~ 55 ). 

15 NOV23b is predicted to be expressed in at least the following tissues: adrenal gland, 

bone marrow, brain-amygdala, brain-cerebellum, brain-hippocampus, brain-substantia nigra, 
brain-thalamus, brain-whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma-Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, 
skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea and 

20 uterus. Expression information was derived from the tissue sources of the sequences that were 
included in the derivation of the NOV23b sequence. 
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NOV23c 

A disclosed NOV23c nucleic acid of 815 nucleotides (also referred to as CG56647-01) 
encoding a novel adrenal secretory serine protease-like protein is shown in Table 23E. An 
open reading frame was identified beginning with a GGT initiation codon at nucleotides 1-3 
5 and ending with a TAA codon at nucleotides 787-789. The start and stop codons are shown in 
bold in Table 23E, and the 5' and 3' untranslated regions, if any, are underlined. Because the 
start codon of NOV23c is not a traditional initiation codon, NOV23c could be a partial reading 
frame that extends further in the 5' direction. 



Table 23E. NOV23c nucleotide sequence (SEQ ID NO:101). 

GGTCCTGCCTTCGTGGGGTCATGGCTGGTGACCTGCTGTCTTGCAGAGTGTGGCTTGCAGCCTGCCTGGAGG 
ATGGCCGGCAGGATCGTGGGCGGCATGGAAGCATCCCCGGGGGAGTTTCCGTGGCAAGCCAGCCTTCGAGAG 
AACAAGGAGCACTTCTGTGGGGCCGCCATCATCAACGCCAGGTGGCTGGTGTCTGCTGCTCACTGCTTCAAT 
GAGTTCCAAGACCCGACGAAGTGGGTGGCCTACGTGGGTGCGACCTACCTCAGCGGCTCGGAGGCCAGCACC 
GTGCGGGCCCAGGTGGTCCAGATCGTCAAGCACCCCCTGTACAACGCGGACACGGCCGACTTTGACGTGGCT 
GTGCTGGAGCTGACCAGCCCTCTGCCTTTCGGCCGGCACATCCAGCCCGTGTGCCTCCCGGCTGCCACACAC 
ATCTTCCCACCCAGCAAGAAGTGCCTGATCTCAGGCTGGGGCTACCTCAAGGAGGACTTCCGTAAGCATCTT 
CCTCTGCAGAAAGCCACTGTGGAGCTGCTGGACCAGGCACTGTGTGCCAGCTTGTACGGCCATTCACTCACT 
GACAGGATGGTGTGCGCTGGCTACCTGGACGGGAAGGTGGACTCCTGCCAGGGTGACTCAGGAGGACCCCTG 
GTCTGCGAGGAGCCCTCTGGCCGGTTCTTTCTGGCTGGCATCGTGAGCTGGGGAATCGGGTGTGCGGAAGCC 
CGGCGTCCAGGGGTCTATGCCCGAGTCACCAGGCTACGTGACTGGATCCTGGAGGCCACCCGTTCCTAAGCT 
ACAAGGTATTTTCGGGGCAGAAA 



10 

In a search of public sequence databases, the NOV23c nucleic acid sequence, located 
on chromosome 19, has 350 of 564 bases (62%) identical to a gb:GENBANK- 
ID:E13204|acc:E 13204.1 mRNA from Homo sapiens (Human cDNA encoding a serine 
protease) (E = 3.2e -26 ). 

15 The disclosed NOV23c polypeptide (SEQ ID NO: 102) encoded by SEQ ID NO: 101 

has 262 amino acid residues and is presented in Table 23F using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV23c has no signal peptide 
and is likely to be localized extracellularly with a certainty of 0.3750. Alternatively, NOV23c 
may also localize to the microbody (peroxisome) with a certainty of 0.1391, to the 

20 endoplasmic reticulum (membrane) with a certainty of 0. 1 000, or the endoplasmic reticulum 
(lumen) with a certainty of 0.1000. The most likely cleavage site for NOV23c is between 
positions 15 and 16: CLA-EC. 



Table 23F. Encoded NOV23c protein sequence (SEQ ID NO:102). 

GPAFVGSWLVTCCLAECGLQPAWRMAGRIVGGMEASPGEFPWQASLRENKEHFCGAAIINARWLVSAAHCFN 
EFQDPTKWVAYVGATYLSGSEASTVRAQWQIVKHPLYNADTADFDVAVLELTSPLPFGRHIQPVCLPAATH 
IFPPSKKCLISGWGYLKEDFRKHLPLQKATVELLDQALCASLYGHSLTDRMVCAGYLDGKVDSCQGDSGGPL 
VCEEPSGRFFLAGIVSWGIGCAEARRPGVYARVTRLRDWILEATRS 
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A search of sequence databases reveals that the NOV23c amino acid sequence has 1 14 
of 248 amino acid residues (45%) identical to, and 152 of 248 amino acid residues (61%) 
similar to, the 273 amino acid residue ptnnTREMBLNE W-ACC :B AB20278 protein from Mus 
5 musculus (Mouse) (Type 1 Spinesin) (E = Lie" 53 ). 

NOV23c is predicted to be expressed in at least the following tissues: Ovary, kidney, 
breast, lung, muscle, liver, spleen, blood, lymphocyte. This information was derived by 
determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or RACE 
1 0 sources. 

The disclosed NOV23a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 23G. 



Table 23G. BLAST results for NOV23 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
<%) 


Positives 
(%) 


Expect 


gi | 12 83 65 03 | db j | BAB 
23684. l| (AK004939) 


data source : SPTR, 

source 

key :095519, 

evidence : ISS~homo 

log to DJ1170K4.4 
(NOVEL PROTEIN) 
(FRAGMENT) -put at i 

ve [Mus musculus] 


799 


136/280 
(48%) 


180/280 
(63%) 


le-75 


gi | 16758444 | ref |NP 
446087. l| 
(NM_053635) 


suppression of 
tumorigenicity 14 
(colon carcinoma, 
matriptase , 
epithin) [Rattus 
norvegi cus] 


855 


133/289 
(46%) 


182/289 
(62%) 


3e-72 


gi | 10257390 | gb | AAG1 
5395 . 1 | AF057145 1 
(AF0 5714 5) 


serine protease 
TADG15 [Homo 
sapiens] 


855 


132/289 
(45%) 


185/289 
(63%) 


3e-72 


gi | 11415040 | ref |NP 
068813. l| 
(NM_021978) 


suppression of 
tumorigenicity 14 

(colon carcinoma, 
matriptase, 
epithin) ; 
suppression of 
tumorigenicity 14 

(colon 
carcinoma) ,- 
matriptase [Homo 
sapiens] 


855 


132/289 
(45%) 


185/289 
(63%) 


3e-72 


gi | 1224 9 015 | dbj | BAB 
20376. l| (AB030036) 


prostamin [Homo 
sapiens] 


855 


131/289 
(45%) 


184/289 
(63%) 


9e-72 



1 5 The homology between these and other sequences is shown graphically in the 

ClustalW analysis shown in Table 23H. In the ClustalW alignment of the NOV23 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 

205 



.1! nn7?r 



.ii if 



regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 

5 



Table 23H. ClustalW Analysis of NOV23 

1) Novel NOV23a (SEQ ID NO: 98) 

2) Novel NOV23b (SEQ ID NO: 10 0) 
2) Novel NOV23C (SEQ ID NO: 102) 

10 4) gi | 12836503 | dbj |BAB23684 . 1 | (AK004939) data source:SPTR, source key:095519, 
evidence : I SS -homo log to DJ1170K4.4 (NOVEL PROTEIN) (FRAGMENT) -putative [Mus 
muSCUlus] (SEQ ID NO: 416) 

5) gi | 16758444 | ref |NP_446087 . 1 | (NM_053635) suppression of tumorigenicity 14 (colon 
carcinoma, matriptase, epithin) [Rattus norvegicus] (SEQ ID NO: 4 17) 
15 6) gi | 10257390 | gb | AAG15395 . 1 | AF057145_1 (AF057145) serine protease TADG15 [Homo 

sapiens] (SEQ ID NO: 4 18) 

7) gi | 11415040 | ref |NP_068813 . 1 | (NM_021978) suppression of tumorigenicity 14 (colon 
carcinoma, matriptase, epithin) ; suppression of tumorigenicity 14 (colon carcinoma) ; 
matriptase [Homo sapiens] (SEQ ID NO: 419) 
20 8) gi | 12249015 | dbj [BAB20376 . 1 | (AB030036) prostamin [Homo sapiens] (SEQ IDNO:420) 



10 



25 



30 



35 



40 



45 



50 



55 



60 



NOV27a 
NOV27b 
NOV27C 
gi|l2836503| 
gi j 16758444 j 
gijl0257390| 
gi j 11415040 | 
gi | 12249015 | 



NOV27a 
NOV27b 
NOV27C 
gi|l2836503| 
gi j 16758444 j 
gi|l0257390| 
gi | 11415040 | 
gi | 12249015 | 



NOV27a 

NOV27b 

NOV27c 

gi 1 12836503 | 

gi 1 16758444 | 

gi|l0257390| 

gi j 11415040 | 

gi | 12249015 | 



23 
23 
1 



20 

iRHKQD--- 
MaDRHEQg- - - 



30 



50 60 
..|....|....|....| 

|S RKTRGHgGfflT- 23 

RKTRGHllGHT- 23 



MPTTE VPQAADGQGDAS I 
MGNNRGRKAGGGSQDFgi 
MGSDRARKGGGGPKDI 
MGSDRARKGGGGPKDI 
MGSDRARKGGGGPKDI 



)GEgAAE PEGK 

.glgyns rlenmng fe egve 
lGlIy^srhekvngleegve 
,glIy|s rhe kvngle eg ve 
iGlSy#srhekvngleegve 




[QPK NTKRKNRDYS7RFTP 4 7 

STAKQVE KRGPRRWVVMVA 60 

/KKVT:iCHGPGRwW|jAA 6 0 

/KKVE KHG PGR WvJvBaA 6 0 

ckvekhgpgrwvvHaa 6 0 



4 8 LLL - VLAALVS AGVML 

61 WFSFLLLSLMAGLLV 

61 VL I GLLLVLLG I GFL\ 

61 VL I GLLLVLLG I G FL\ 

61 VL I GLLLVLLG I GFLA/ 



^FLGYKAEVTpSQV^SGSLRVLNRHFSQDLGRRESI. 

]Jfhyr--nvrxqkv^nghlritnenfldayenstste| 

LjQYR - - DVR||Q KvfeNGYMR I TNENF VDAYENSNST: 
"_iQYR- - DVR^QKVgNGYMRITNENFVDAYENSNSTl 
LiQYR- - DVRfeQKVgNGYMRITNENFVDAYENSNSTE| 




130 140 150 160 170 180 

• • - - 1 — I - - - - I--- - — I — I — I - • L- - • I — I - 1 

4 0 -TgGFHVDHTpE^R--gIRWTSg LRRETSDJYHR-TLTP-TLgALF 8 0 

4 0 -TQGFHVDHt||eI|R- -gIRWTSg LRRETSD^HR-TLTP-TLgALF 80 

i i 

107 KAOkMLQEL VJ|STRL-HTYYNSHSWS FGEGPLTCFFWFI LDI PEgQRLTLSPEWRELL 165 

119 QVK>EALKLMYJ§*E§p^ 178 

119 KVKDALKLLY^Ggt FlHpYHKE^VTAFSEGSVIAYYWSEFSI PQiLVEEAERVMAgERV 178 

119 KV^DALKLLY^GJ?|PFlHpYHKeHaVTAFS EGS VI AYY WSE FS I PQQLVEEAERVMAgERV 17 8 

119 KV||DALKLLY^G^PFLgPYHKE§AVTAFS EGS VI AYY WSE FS I PQ^LVEEAERVMAgERV 17 8 



NOV27a 


81 


NOV27b 


81 


NOV27C 


1 


gi 1 12836503 | 


166 


gij 16758444] 


179 



SFQ- 
SFQ- 



200 210 220 230 

jCVGCSVLN YR 

CVGCSVLN YR 



240 



102 
102 
1 
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10 



15 



20 



25 



gi | 10257390 | 
gi j 11415040 j 
gij 12249015 | 



NOV27a 

NOV27b 

NOV27C 

gi | 12836503 | 

gi | 16758444 j 

gi|l0257390| 

gi j 11415040 j 

gi j 12249015 j 



NOV27a 
NOV27b 
NOV27C 
gi | 12836503 | 
gi | 16758444 j 
gi | 10257390 j 
gi j 11415040 j 
gij 12249015 | 



179 BMLPPRARSLKSFVVnsWAFPTDSKT^QRgQDNSCSFGLHARGVELMRFTTPG- -FPDS 236 
179 RpLPPRARSLKSFVVMSVVAFPTDSKT^ - FPDS 23 6 

179 B m LPPRARSLKSFw3sWAFPTDSKT|^R^QDNSCSFGLHARGVELMRFTTPG--FPDS 236 

250 260 270 280 290 300 

|....|....|....|....|....|....|....|....|....| 

102 - -DG1|SSVLVHFQ 113 

102 --DGNSSVLVHFjQ 113 

1 -- : " 1 

226 PDQQTTSCLWHL^GPEDLM I KVRLEWTRVDCRDR VAMYDAAGPLEKRLITSVYGC 280 

237 PYPA||feCQWVLgGDADSVLSLTFRSFDVAPCDGHDSDLVTVYDSLSPMEPHAVVRLCGT 296 

237 PYPAI^CQWALMGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGT 2 96 

23 7 P Y P AH^RCQWALMGD AD S VL S LT FR S FDL AS CDE RGS DL VT VYNT LS PME PHAL VQ LCGT 2 96 

23 7 PYPAH^RCQWALfGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGT 2 96 



310 
..|.. 



320 



330 



LH gLLRP- 

LH SlLRP- 



113 

113 

1 

2 81 SRQEPVMEVLASGSVMAVVWKKGMHS YYDP 
2 97 FSPSYNLTFLSSQNVFLVTLITNTDRRHPG 
2 97 YPPSYNLTFHSSQNVLLITLITNTERRHPG 
2 97 YPPSYNLTFHSSQNVLLITLITNTERRHPG 
2 97 Y P P SYNLT FHS S QNVLL I TL I TNTERRHPG 



340 
..|.. 



350 



360 

-LQ 122 
-LQ 122 



,ls vks vafqdcqvnltlegrldtqgflr 3 4 0 

:atffqlpkmsscgg llseaqgtfs 352 

jeatffqlprmsscgg rlrkaqgtfn 3 52 

iatffqlprmsscgg rlrkaqgtfn 3 52 

ktffqlprmsscgg rlrkaqgtfn 3 52 



30 



35 



40 



45 



50 



55 



60 



65 



70 



NOV27a 


123 


NOV27b 


123 


NOV27C 


1 


gi|l2836503 | 


341 


gij 16758444 | 


353 


gi j 10257390 j 


353 


gij 11415040 j 


353 


gij 12249015] 


353 


NOV27a 


145 


NOV27b 


145 


NOV27c 


1 


gi | 12836503 | 


401 


gi j 16758444 j 


413 


gi j 10257390 j 


413 


gi j 11415040 j 


413 


gi j 12249015 j 


413 


NOV27a 


160 


NOV27b 


160 


NOV27c 


1 



gi | 12836503 | 
gi j 16758444 j 
gi j 10257390 j 
gi j 11415040 j 
gi j 12249015 j 



NOV27a 

NOV27b 

NOV27C 

gi | 12836503 | 

gi j 16758444 j 

gi j 10257390 j 

gi j 11415040 j 

gijl2249015j 




400 



|pyypghyppninctwnikvpnnrnvkvrfkl: 
Ipyypghyppnidctwnievpnnqhvkvsfkff 
Ipyypghyppnidctwnievpnnqhvkvrfkff 
|pyypghyppnidctwnievpnnqhvkvrfkffy 




410 420 

HjRgRLREHG 145 

IR2RLREHG 14 5 

_' x 

RQKYNRLCTQGQWMIQNRRLCGF 4 00 

PN1;PVGSCTKDYVEINGEKFCGE 412 

^pgGTCPKDYVEINGEKYCGE 412 

Pg^PgGTCPKDYVEINGEKYCGE 412 

p§\|p0gtcpkdyveingekycge 412 



430 
..|.. 



440 
..|.. 



450 




460 

Jvsaelt-- 

XVSAELT-- 



RS- 



- Q FWS SNS SKI T VH FHS D] 
-QFWTSNSNKITVRFHSDQg - 
-QFWTSNSNKITVRFHSDQgY' 



490 
..|.. 



500 




470 480 

....|....|....| 

160 

160 

! 

QVYYSLYNQSDPCPGEFLCSV 4 54 

FljAEYLSYDSNDPCPGMFMCKTGRCIRK 4 67 

IFIIAEYLSYDSSDPCPGQFTCRTGRCIRK 4 67 

FllAEYLSYDSSDPCPGQFTCRTGRCIRK 4 67 

|fiJaEYLSYDSSDPCPGQFTCRTGRCIRK 4 67 

520 




4 54 NGLCVPACDG I KDCPNGLD 

4 68 DLRCDGWADCPDYSDERHCRCNATHQFMCKNQFCKPLFWVCDSVNDCGDGSD 

4 6 8 elrcdgwadctdhsdei^cscdaghqftcknkfckplfwvcdsvndcgdnsd 
4 6 8 elrcdgwadctdhsdelncscdaghqftcknkfckplfwcdsvl|dcgdnsd 
4 6 8 elrcdgwadctdhsdelncscdaghqftcknkfckplfwvcdsviIdcgdnsd 



550 



170 
170 
1 




0 



590 



SICVCRA 4 81 
SGCSCPA 527 
3CSCPA 527 
jCSCPA 527 
3CSCPA 527 

600 



pgnsesHoJsqI 
|pg|se|sBgSsq| 



3cli 



4 82 MFQCQEDSTCI SLPRVCDRQP 

52 8 GSFKCSNGKCLPQSQQCNGKDECGD 

52 8 QTFRCSNGKCLSKSQQCNGKD gCGD 

52 8 QTFRCSNGKCLSKSQQCNGKD 

52 8 QTFRCSNGKCLSKSQQCNGKD 



«CGD 
IcGD 



SDEEQCQEG- -VP 
S DEAS CDNVNAVS 
S DEAS C P KVNWT 
SDEASCPKVNWT 
SDEASCPKVNWT 
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1 n 



610 



10 



NOV27a 
NOV27b 
NOV27C 
gi|l2836503 | 
gi | 16758444 j 
gi j 10257390 j 
gi | 11415040 j 
gi|l2249015| 




620 
I 

pawrmag 

p|S|rmag 

P&RMAG 

- -^£glss 
-sifrRQ. 

-SltTRQ 

-Ijtrq. 



630 



640 




650 

L---L 

RENKB 
RENKE 
RENKE 
3IRGR 

halg§g 

fiALG^QG 
HAIiGQG 




15 



20 



NOV27a 
NOV27b 
NOV27c 
gi | 12836503 | 
gi | 16758444 j 
gi | 10257390 | 
gij 11415040 | 
gi | 12249015 | 



257 

60 

596 



257 WAR 



gAR 
NAR 
ADR 



647 $PE 

64 7 SPN 

64 7 pPN 

647 S>N 



wlvsaahcf 
wlvsaahcf 
w lvs aahcf 
w BE Baahcf 
wlvsaahc fh 
wlvsaahc 
wlvsaahc 
iwlvsaahc 



670 680 

■|.... I 
STB 

m 

DEDS MAS 

jbjET I FK|£Sj 
£I^DRGFRYS 
DDRGFR^S 



690 



700 



710 




iSE^STl 



SllL_ 

ll^DRGFRXj; 



DP TEW 

dptSw 



TYLE 

TYLgGSEgS 1 ] 
TYLgGSEgS'] 
iKMRQNSRWPGEj 
LLDQ&KRSgSC 

IlhdqBqrsBpc 
■hdqSqrsBpc 



AOVVppK 

ApfvQU^K 
iS-FK^|spiiFL 

jEHigiia^gx;T 

QERRlkl 

2 ! ERkq] 




730 



25 



30 



35 



40 



45 



50 



55 



60 



65 



NOV27a 
NOV27b 
NOV27c 
gi|l2836503 | 
gi|l6758444 j 
gi|l0257390 j 
gij 11415040 | 
gij 12249015 j 



NOV27a 

NOV27b 

NOV27C 

gi| 12836503 | 

gi j 16758444 j 

gi|l0257390| 

gi j 11415040 j 

gij 12249015 j 




3 71 AT^jELLE) j AL 

3 61 AT^eLlISal 

173 ATMELLfilAL 

711 vd^qlvJpid: 

766 GEiRVlkiT' 

7 66 Ge|rVXBjTT 

7 66 GEjgRynij ■ TT 



850 



790 
I 



740 
P|GRHX 

|lpI|grhpj 

P^GRHbp 
JSAT&j 

_J|ssMy; 
eyssmSr 

:is si w 

800 



750 



760 




810 



jPSgKCLlsJ 
PSgKCLks] 

jEPGQHCwi 

.g[Sai * 
g j aiw: 
.G j aiw: 
.GgAiw; 

820 




770 

.|....| 
iLKEDFWKPE ? V 

! ™ i» 
. $f 

lkedfr- khlp 
.qreggpvsn-t 
|tkegg-tgal| 
!tqygg-tgal| 
qygg-tgalS 

'QYGG - TGAL/i 



780 




■ 840 



766 




860 



NOV27a 


431 


WG 


I 


G 


NOV27b 


421 


WG 


I 


G 


N0V27c 


233 


WG 


I 


G 


gi 1 12836503 | 


771 


WG 




G 


gi j 16758444 j 


826 






e 


gi j 10257390 j 


826 


WG 




G 


gij 11415040 | 


826 


WG 




G 


gi j 12249015 j 


826 


WG 




G 




I 



880 
..|.. 



890 
..|.. 



I 



900 



910 



N0V27a 
NOV27b 
NOV27c 
gi 1 12836503 | 
gijl6758444 j 
gi j 10257390 j 
gi j 11415040 j 
gi j 12249015 j 



920 
..|.. 



870 

5R 460 

rKASMPLAPTMAPAPAAPSTAWPTSPESPWS 4 80 

262 

799 

855 

855 

855 

855 

930 



460 460 

481 TPTKSMQALSTVPLDWVTVPKLQGIFGAER 510 

262 

799 

855 

855 

855 

855 



262 
799 
855 
855 
855 
855 
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Tables 23I-L list the domain descriptions from DOMAIN analysis results against 
NOV23. This indicates that the NOV23 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 231 Domain Analysis of NOV23a 

gnl | Smart | smart 0 0 02 0 , TrypJPc, Trypsin- like serine protease; Many of 
these are synthesised as inactive precursor zymogens that are cleaved 
during limited proteolysis to generate their active forms. A few, 
however, are active as single chain molecules, and others are inactive 
due to substitutions of the catalytic triad residues. (SEQ ID 
NO: 812) 

CD-Length = 230 residues, 100.0% aligned 
Score = 269 bits (687) , Expect = 3e-73 





NOV23 : 


225 




Sbjct : 


1 


10 


NOV23 : 


284 




Sbjct : 


60 


15 


NOV23 : 


344 




Sbjct : 


119 




NOV23 : 


402 


20 


Sbjct : 


179 



RIVGGMEASPGEFPWQASLR-ENKEHFCGAAIINARWLVSAAHCFNEFQDPTKWVAYVGA 283 

1 1 1 1 1 11+ I 1 1 1 1 11+ Mil ++I + II+++IIII |+ +| + 

RIVGGSEANIGSFPWQVSLQYRGGRHFCGGSLISPRWVLTAAHCVYGSA- PSSIRVRLGS 5 9 

T Y LS G S E AS T VRAQ WQ I VKH P L YNADT AD FD VAVLE LT S P L P FGRH I Q P VCL P AATH I F 34 3 

II I + +| +++ | | || || | + | + | + | + |+ ++|+| | |+ + + 

HDLSSGEETQTV-KVSKVIVTIPNYNPSTYDNDIALLKLSEPVTLSDTVRPICLPSSGYNV 118 

PPSKKCLISGWGYLKEDFWKPEVLQKATVELLDQALCASLY- -GHSLTDRMVCAGYLDG 4 01 
I I + MII I 1+ 11+ I ++ I I II ++II l + MI M 

PAGTTCTVSGWGRTSESSGSLPDTLQEVNVPI VSNATCRRAYSGGPAITDNMLCAGGLEG 17 8 

KVDSCQGDSGGPDVCEEPSGRFFLAGIVSWG- I G CAE ARR PG VYARVT RL RDW I 4 54 

l + l I I II I I I I II 1+ I MINI III + 1 I I I 11+ III 

GKDACQGDSGGPLVCN- -DPRWVDVGIVSWGSYGCARPNKPGVYTRVSSYLDWI 23 0 



Table 23 J Domain Analysis of NOV23a 

gnl | Pf am | pf am00089 , trypsin, Trypsin. Proteins recognized include all 
proteins in families SI, S2A, S2B , S2C, and S5 in the classification 
of peptidases. Also included are proteins that are clearly members, 
but that lack peptidase activity, such as haptoglobin and protein Z 
(PRTZ*) . (SEQ ID NO: 813) 

CD-Length = 217 residues, 100.0% aligned 
Score = 223 bits (568), Expect - 2e-59 



25 



30 



35 



NOV27 : 


226 


Sbjct: 


1 


NOV27 : 


286 


Sbjct : 


58 


NOV2 7 : 


346 


Sbjct : 


116 


NOV2 7: 


406 


Sbjct : 


173 



IVGGMEASPGEFPWQASLRENKEHFCGAAI INARWLVSAAHCFNEFQDPTKWVAYVGATY 2 85 
Mil II I MM 11+ + I I I I ++I+ I+++IIII + + +| 

IVGGREAQAGSFPWQVSLQVSSGHFCGGSLISENWVLTAAHCVS GASSVRWLGEHN 57 

LSGSEASTVRAQWQIVKHPLYNADTADFDVAVLELTSPLPFGRHIQPVCLPAATHIFPP 34 5 
I +1 + + 1+1+ II II II l + l + l + l 11+ I ++I + IM + I+ | 

LGTTEGTEQKFDVKKIIVHPNYNPDT- -NDIALLKLKSPVTLGDTVRPICLPSASSDLPV 115 

SKKCLISGWGYLKEDFWKPEVLQKATVELLDQALCASLYGHSLTDRMVCAGYLDGKVDS 4 05 

I + MM I +11- I ++ + I I II ++II l + lll I II 1 + 

GTTCS VSGWGRTKNL - - GTSDTLQE WVP I VS RETCRS AYGGTVTDTM I CAGALGGK - DA 172 



CQGDSGGPLVCEEPSGRFFLAGIVSWGIGCAEARRPGVYARVTRLRDWI 

I M 1 1 1 I II 1 1 + I MINI III I II I ll + l III 

CQGDSGGPLVCSDG E L VG I VS WG YGC AVGNY PG VYTR VS R YLDW I 



454 



217 
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Table 23K Domain Analysis of NOV23b 

gnl | Smart | smartO 0192 , LDLa, Low-density lipoprotein receptor domain 
class A; Cysteine-rich repeat in the low-density lipoprotein (LDL) 
receptor that plays a central role in mammalian cholesterol 
metabolism. The N- terminal type A repeats in LDL receptor bind the 
lipoproteins. Other homologous domains occur in related receptors, 
including the very low-density lipoprotein receptor and the LDL 
receptor-related protein/alpha 2 -macroglobulin receptor, and in 
proteins which are functionally unrelated, such as the C9 component of 
complement. Mutations in the LDL receptor gene cause familial 
hypercholesterolemia. (SEQ ID NO: 8 14) 
CD-Length - 38 residues, 100.0% aligned 
Score = 50.4 bits (119), Expect = 2e-07 



NOV23 : 176 RCPGNSFSCGNSQCVTKVNPECDDQEDCSDGSDEAHCEC 

II M I +1+ II + || Mill +| 

Sbjct: 1 TCPPGEFQCKNGRCIPLSWV-CDGVDDCGDGSDEENCPS 



214 



38 



Table 23L Domain Analysis of NOV23b 

gnl | Pfam| pf am00057 , ldl_recept_a , Low-density lipoprotein receptor 
domain class A (SEQ ID NO: 815) 
CD-Length = 39 residues, 94.9% aligned 
Score =43.5 bits (101), Expect = 3e-05 



10 



15 



20 



25 



NOV23: 175 GRCPGNSFSCGNSQCVTKVNPECDDQEDCSDGSDEAHC 212 

I I I 11+ +1+ II II lllll -I 

Sbjct: 1 STCGPNEFQCGSGECIPMSW- VCDGDPDCEDGSDEKNC 3 7 

Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being 
found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, 
including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 
families (denoted SI - S27) of serine protease have been identified, these being grouped into 6 
clans on the basis of structural similarity and other functional evidence. 

Tryptase is a tetrameric serine protease that is concentrated and stored selectively in 
the secretory granules of all types of mast cells, from which it is secreted during mast cell 
degranulation. Its exclusive presence in mast cells permits its use as a specific clinical 
indicator of mast cell activation by measurement of its level in biologic fluids and as a 
selective marker of intact mast cells using immunohistochemical techniques with antitryptase 
antibodies. Vanderslice demonstrated the existence of multiple tryptases. In this respect, mast 
cell tryptase is like other serine proteases such as glandular kallikrein and trypsin, which are 
also members of multigene families. Miller et al. mapped both alpha-tryptase and beta-tryptase 
to human chromosome 16 by PGR analysis of DNA from human/hamster somatic cell hybrids. 
Miller et al. cloned a second cDNA for human tryptase, called beta-tryptase, from a mast cell 
cDNA library. The 1,142 bases of beta-tryptase were found to encode a 30-amino acid leader 
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sequence of 3,089 daltons and a 245-amino acid catalytic region of 27,458 daltons. The amino 
acid sequence of beta-tryptase was found to be 90% identical with that of alpha-tryptase, the 
first 20 amino acids of the catalytic portions being 100% identical. Both alpha- and beta- 
tryptase sequences were localized to human chromosome 16 by analysis of DNA preparations 
5 from 25 human/hamster somatic cell hybrids by PCR. 

Because of the presence of the trypsin domains and the homology to the adrenal 
secretory serine protease, it is anticipated that the novel sequences described here will have 
useful properties and functions similar to these proteins. 

The disclosed NOV23 nucleic acid of the invention encoding an Adrenal secretory 

10 serine protease -like protein includes the nucleic acids whose sequences are provided in Tables 
23A, 23C, 23E or a fragment thereof The invention also includes a mutant or variant nucleic 
acid any of whose bases may be changed from the corresponding base shown in Table 23 A, 
23c, or 23E while still encoding a protein that maintains its Adrenal secretory serine protease - 
like activities and physiological functions, or a fragment of such a nucleic acid. The invention 

15 further includes nucleic acids whose sequences are complementary to those just described, 
including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 

20 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 37 percent of the 
bases may be so changed. 

25 The disclosed NOV23 protein of the invention includes the Adrenal secretory serine 

protease -like protein whose sequence are provided in Table 23B, 23D, or 23F. The invention 
also includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 23B, 23D, or 23F while still encoding a protein that 
maintains its Adrenal secretory serine protease -like activities and physiological functions, or a 

30 functional fragment thereof. In the mutant or variant protein, up to about 55 percent of the 
residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(F a b)2,that bind immunospecifically to any of the proteins of the invention. 
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The above disclosed information suggests that these Adrenal secretory serine protease - 
like proteins (NOV23) is a member of a "Adrenal secretory serine protease family". 
Therefore, the NOV23 nucleic acids and proteins identified here may be useful in potential 
therapeutic applications implicated in (but not limited to) various pathologies and disorders as 
5 indicated below. The potential therapeutic applications for this invention include, but are not 
limited to: protein therapeutic, small molecule drug target, antibody target (therapeutic, 
diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene 
therapy (gene delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of 
all tissues and cell types composing (but not limited to) those defined here. 
10 The NOV23 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome, cirrhosis, 
transplantation, endometriosis, fertility, anemia, ataxia-telangiectasia, autoimmune disease, 
hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, allergies, 
immunodeficiencies, graft versus host disease (GVHD), lymphaedema, muscular dystrophy, 
15 Lesch-Nyhan syndrome, myasthenia gravis, and/or other diseases and pathologies. 

NOV23 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV23 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
20 NOVX Antibodies" section below. The disclosed NOV23 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 
25 NOV24 

NOV24 includes two novel parchorin-like proteins disclosed below. The disclosed 
sequences have been named NOV24a and NOV24b. 

NOV24a 

A disclosed NOV24a nucleic acid of 2091 nucleotides (also referred to as CG56455- 
30 01) encoding a novel parchorin-like protein is shown in Table 24A. An open reading frame 
was identified beginning with a ATG initiation codon at nucleotides 7-9 and ending with a 
TGA codon at nucleotides 2080-2082. The start and stop codons are shown in bold in Table 
24A, and the 5' and 3' untranslated regions, if any, are underlined. 
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Table 24A. NOV24a nucleotide sequence (SEQ ID NO: 103). 

GCGGCC ATGGCCGAGGCCGCGGAGCCGGAGGGGGTTGCCCCGGGTCCCCAGGGGCCGCCGGAGGTCCCCnrc; 
CCTCTGGCTGAGAGACCCGGAGAGCCAGGAGCCGCGGGCGGGGAGGCAGAAGGGCCGGAGGGGAGCGAGGGC 
GCAGAGGAGGCGCCGAGGGGCGCCGCCGCTGTGAAGGAGGCAGGAGGCGGCGGGCCAGACAGGGGCCCGGAG 
GCCGAGGCGCGGGGCACGAGGGGGGCGCACGGCGAGACTGAGGCCGAGGAGGGAGCCCCGGAGGGTGCCGAG 
GTGCC CC AAGG AGGGG AGG AG AC AAGCGG CGCGC AG C AGGTGG AGGGGG CG AG C C CGGG ACG CGG CG CG C AG 
GG CGAGC C C CGCGGGGAGG CT C AGAGGG AGC C CG AGGACT CTG CGG C CC C CGAG AGG C AGGAGGAGG CGG AG 
CAGAGGCCTGAGGTCCCGGAAGGTAGCGCGTCCGGGGAGGCGGGGGACAGCGTAGACGCGGAGGGCCCGCTG 
GGGGACAACATAGAAGCGGAGGGCCCGGCGGGCGACAGCGTAGAGGCGGAGGGCCGGGTGGGGGACAGCGTA 
GACGCGGAAGGTCCGGCGGGGGACAGCGTAGACGCGGAGGGCCCGCTGGGGGACAACATACAAGCCGAGGGC 
CCGGCGGGGGACAGCGTAGACGCGGAGGGCCGGGTGGGGGACAGCGTAGACGCGGAAGGTCCGGCGGGGGAC 
AGCGTAGACGCGGAGGGCCGGGTGGGGGACAGCGTAGAGGCGGGGGACCCGGCGGGGGACGGCGTAGAAGCG 
GGGGTCCCGGCGGGGGACAGCGTAGAAGCCGAAGGCCCGGCGGGGGACAGCATGGACGCCGAGGGTCCGGCA 
GGAAGGGCGCGCCGGGTCTCGGGTGAGCCGCAGCAATCGGGGGACGGCAGCCTCTCGCCCCAGGCCGAGGCA 
ATTGAGGTCGCAGCCGGGGAGAGTGCGGGGCGCAGCCCCGGTGAGCTCGCCTGGGACGCAGCGGAGGAGGCG 
GAGGTCCCGGGGGTAAAGGGGTCCGAAGAAGCGGCCCCCGGGGACGCAAGGGCAGACGCTGGCGAGGACAGG 
GTAGGGGATGGGCC ACAGC AGG AGC CGGGGG AGGACGAAGAGAG ACGAGAGCGGAGC C CGGAGGGG C CAAGG 
GAGGAGGAAGCAGCGGGGGGCGAAGAGGAATCCCCCGACAGCAGCCCACATGGGGAGGCCTCCAGGGGCGCC 
GCGGAGCCTGAGGCCCAGCTCAGCAACCACCTGGCCGAGGAGGGCCCCGCCGAGGGTAGCGGCGAGGCCGCG 
CGCGTGAACGGCCGCCGGGAGGACGGAGAGGCGTCCGAGCCCCGGGCCCTGGGGCAGGAGCACGACATCACC 
CTCTTCGTCAAGGCTGGTTATGATGGTGAGAGTATCGGAAATTGCCCGTTTTCTCAGCGTCTCTTTATGATT 
CTCTGGCTGAAAGGCGTTATATTTAATGTGACCACAGTGGACCTGAAAAGGAAACCCGCAGACCTGCAGAAC 
CTGGCTCCCGGAACAAACCCTCCTTTCATGACTTTTGATGGTGAAGTCAAGACGGATGTGAATAAGATCGAG 
GAGTTCTTAGAGGAGAAATTAGCTCCCCCGAGGTATCCCAAGCTGGGGACCCAACATCCCGAATCTAATTCC 
GCAGGAAATGACGTGTTTGCCAAATTCTCAGCGTTTATAAAAAACACGAAGAAGGATGCAAATGAGGTTCAT 
GAAAAGAACCTGCTGAAGGCCCTGAGGAAGCTGGATAATTACTTAAATAGCCCTCTGCCTGATGAAATAGAT 
GCCTACAGCACCGAGGATGTCACTGTTTCTGGAAGGAAGTTTCTGGATGGGGACGAGCTGACGCTGGCTGAC 
TGCAACCTCTTACCCAAGCTCCATATTATTAAGGTTCTTCATTTTCAGATTGTGGCCAAGAAGTACAGAGAT 
TTTGAATTTCCTTCTGAAATGACTGGCATCTGGAGATACTTGAATAATGCTTATGCTAGAGATGAGTTCACA 
AATACGTGTCCAGCTGATCAAGAGATTGAACACGCATATTCAGATGTTGCAAAAAGAATGAAATGAAGCTGG 



In a search of public sequence databases, the NOV24a nucleic acid sequence, located 
on chromosome 21, has 1347 of 1897 bases (71%) identical to a gb : GENB ANK- 
ID:AB035520|acc:AB035520.1 mRNA from Oryctolagus cuniculus {Oryctolagus cuniculus 
5 mRNA for parchorin, complete cds) (E = 2.4e~ 175 ). 

A disclosed NOV24a polypeptide (SEQ ID NO: 104) encoded by SEQ ID NO:103 has 
691 amino acid residues and is presented in Table 24B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV24a has no signal peptide and is 
likely to be localized to the nucleus with a certainty of 0.3000. Alternatively, NOV24a may 
10 also localize to the mitochondrial matrix space with a certainty of 0.1000, or to the lysosome 
(lumen) with a certainty of 0.1 000. 



Table 24B. Encoded NOV24a protein sequence (SEQ ID NO: 104). 

MAE AAE P EG V APG P QGP P E VP A P L AE R P G E P GAAGGE AE G PEGS EG AE E AP RGAAAVKE AGGGG PD RG P E AE 
ARGTRGAHGETEAEEGAPEGAEVPQGGEETSGAQQVEGASPGRGAQGEPRGEAQREPEDSAAPERQEEAEQR 
PEVPEGSASGEAGDSVDAEGPLGDNIEAEGPAGDSVEAEGRVGDSVDAEGPAGDSVDAEGPLGDNIQAEGPA 
GDSVDAEGRVGDSVDAEGPAGDSVDAEGRVGDSVEAGDPAGDGVEAGVPAGDSVEAEGPAGDSMDAEGPAGR 
ARRVSGE PQQSGDGS LS PQAEAI EVAAGE SAGRS PG E LAWDAAE E AE V PG VKG S E E AA PGD ARADAGE DRVG 
DGPQQEPGEDEERRERSPEGPREEEAAGGEEESPDSSPHGEASRGAAEPEAQLSNHLAEEGPAEGSGEAARV 
NGRREDGEASEPRALGQEHDITLF^/KAGYDGESIGNCPFSQRLFMILWLKGVIFNVTTVDLKRKPADLQNLA 
PGTNPPFMTFDGEVKTDVNKIEEFLEEKLAPPRYPKLGTQHPESNSAGNDVFAKFSAFIKNTKKDANEVHEK 
NLLKALRKLDNYLNSPLPDEIDAYSTEDVTVSGRKFLDGDELTLADCNLLPKLHI IKVLHFQIVAKKYRDFE 
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rFPSEMTGIWRYLNNAYARDEFTNTCPADQEIEHAYSDVAKRMK | 

A search of sequence databases reveals that the NOV24a amino acid sequence has 414 
of 655 amino acid residues (63%) identical to, and 453 of 655 amino acid residues (69%) 
similar to, the 637 amino acid residue ptnr:SPTREMBL-ACC:Q9N2G5 protein from 
5 Oryctolagus cuniculus (Rabbit) (Parchorin) (E = 2.5e~ 182 ). 

NOV24a is predicted to be expressed in at least the following tissues: brain, lung, and 
kidney. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 
10 In addition, the sequence is predicted to be expressed in gastric parietal cells, choroid 

plexus, salivary duct, lacrimal gland, kidney, airway epithelia and chorioretinal epithelia 
because of the expression pattern of (GENBANK-ID: gb:GENBANK- 

ID:AB035520|acc:AB035520.1) a closely related Oryctolagus cuniculus mRNA for parchorin, 
complete cds homolog. 

15 NOV24b 

A disclosed NOV24b nucleic acid of 859 nucleotides (also referred to as CG56455-02) 
encoding a novel parchorin-like protein is shown in Table 24C. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 1-3 and ending with a TGA 
codon at nucleotides 853-855. The start and stop codons are shown in bold in Table 24A, and 

20 the 5' and 3' untranslated regions, if any, are underlined. 

Table 24C. NOV24b nucleotide sequence (SEQ ID NO:105). 

ATGGCCGAGGCCGCGGAGCCTGAGGCCCAGCTCAGCAACCACCTGGCCGAGGAGGGCCCCGCCGAGGGTAGC 
GGCGAGGCCGCGCGTGTGAACGGCCGCCGGGAGGACGGAGAGGCGTCCGAGCCCCGGGCCCTGGGGCAGGAG 
CACGACATCACCCTCTTCGTCAAGGCTGGTTATGATGGTGAGAGTATCGGAAATTGCCCGTTTTCTCAGCGT 
CTCTTTATGATTCTCTGGCTGAAAGGCGTTATATTTAATGTGACCACAGTGGACCTGAAAAGGAAACCCGCA 
GACCTGCAGAACCTGGCTCCCGGAACAAACCCTCCTTTTATGACTTTTGATGGTGAAGTCAAGACGGATGTG 
AATAAGATCGAGGAGTTCTTAGAGGAGAAATTAGCTCCCCCGAGGTATCCCAAGCTGGGGACCCAACATCCC 
GAATCTAATTCCGCAGGAAATGACGTGTTTGCCAAATTCTCAGCGTTTATAAAAAACACGAAGAAGGATGCA 
AATGAGATTCATGAAAAGAACCTGCTGAAGGCCCTGAGGAAGCTGGATAATTACTTAAATAGCCCTCTGCCT 
GATGAAATAGATGCCTACAGCACCGAGGATGTCACTGTTTCTGGAAGGAAGTTTCTGGATGGGGACGAGCTG 
ACGCTGGCTGACTGCAACCTCTTACCCAAGCTCCATATTATTAAGATTGTGGCCAAGAAGTACAGAGATTTT 
GAATTTCCTTCTGAAATGACTGGCATCTGGAGATACTTGAATAATGCTTATGCTAGAGATGAGTTCACAAAT 
ACGTGTCCAGCTGATCAAGAGATTGAACACGCATATTCAGATGTTGCAAAAAGAATGAAATGAAGCT 



In a search of public sequence databases, the NOV24b nucleic acid sequence, located 
on the q22.12 region of chromosome 21, has 741 of 847 bases (87%) identical to a parchorin 
mRNA from oryctolagus cuniculus gb accno AB035520.1) (E = 3.2e~ 140 ). 
25 A disclosed NOV24b polypeptide (SEQ ID NO: 1 06) encoded by SEQ ID NO: 1 05 has 

284 amino acid residues and is presented in Table 24D using the one-letter amino acid code. 
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Signal P, Psort and/or Hydropathy results predict that NOV24b has no signal peptide and is 
likely to be localized to the nucleus with a certainty of 0.3000. Alternatively, NOV24b may 
also localize to the mitochondrial matrix space with a certainty of 0.1000, or to the lysosome 
(lumen) with a certainty of 0.1000. 

5 

Table 24D. Encoded NOV24b protein sequence (SEQ ID NO: 106). 

MAEAAEPEAQLSNHLAEEGPAEGSGEAARVNGRREDGEASEPRAIjGQEHDITLFVKAGYDGESIGNCPFSQR 
LFMILWLKGVIFNVTTVDLKRKPADLQNLAPGTNPPFMTFDGEVKTDVNKIEEFLEEKLAPPRYPKLGTQHP 
ESNSAGNDVFAKFSAFIKNTKKDANEIHEKNLLKALRKLDNYLNSPLPDEIDAYSTEDVTVSGRKFLDGDEL 
TLADCNLLPKLHI IKIVAKKYRDFEFPSEMTGIWRYLNNAYARDEFTNTCPADQEIEHAYSDVAKRMK 

A search of sequence databases reveals that the NOV24b amino acid sequence has 255 
of 281 amino acid residues (90%) identical to, and 263 of 281 amino acid residues (93%) 
similar to, the 637 amino acid residue ptnr:SPTREMBL-ACC:Q9N2G5 protein from 
10 Oryctolagus cuniculus (Rabbit) (Parchorin) (E = 1.6e~ 134 ). 

NOV24b disclosed in this invention is predicted to be expressed in at least the 
following tissues: heart, placent, skeletal muscle, stomach, and lung. . 

The disclosed NOV24a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 24E. 
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Table 24E. BLAST results for NOV24a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi| 75 9263 6 |dbj | BAA9 
4345. l| (AB035520) 


parchorin 
[Oryctolagus 
cuniculus] 


637 


436/715 
(60%) 


475/715 
(65%) 


e-130 


gi|6685319|sp| Q9Y6 9 
6 |CLI4_HUMAN 


CHLORIDE 
INTRACELLULAR 
CHANNEL PROTEIN 4 
( INTRACELLULAR 
CHLORIDE ION 
CHANNEL PROTEIN 
P64H1) 


253 


182/238 
(76%) 


207/238 
(86%) 


e-108 


gi | 73 3 033 5 | ref |NP_0 
39234 . 1 | 
(NM_013943) 


chloride 
intracellular 
channel 4; 
chloride 
intracellular 
channel 4 like 
[ Homo sap lens] 


253 


182/238 
(76%) 


208/238 
(86%) 


e-108 


gi | 7304963 | ref |NP_0 
3 8913 ,l| 
(NM_013885) 


chloride 
intracellular 
channel 4 
(mitochondrial ) 
[Mus musculus] 


253 


181/238 
(76%) 


207/238 
(86%) 


e-107 


gi | 458 8524 |gb | AAD2 6 
136 . 1 | AF109196_1 
(AF109196) 


intracellular 
chloride channel 
p64Hl [Homo 
sapiens] 


253 


180/238 
(75%) 


205/238 
(85%) 


e-106 
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The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 24R In the Clustal W alignment of the NOV24 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 24F. ClustalW Analysis of NOV24 

1) Novel NOV24a (SEQ ID NO: 104) 

2) Novel NOV24b (SEQ ID NO: 106) 

3) gi | 7592636 | dbj |BAA94345-l| (AB035520) parchorin [Oryctolagus cuniculus] (SEQ ID 
NO : 421) 

4) gi | 66 85319 | sp | Q9Y6 9 6 | CLI4_HUMAN CHLORIDE INTRACELLULAR CHANNEL PROTEIN 4 
(INTRACELLULAR CHLORIDE ION CHANNEL PROTEIN P64H1) (SEQ ID NO: 4 22) 

5) gi | 7330335 | ref |NP_039234 . 1 | (NM_013943) chloride intracellular channel 4; 
chloride intracellular channel 4 like [Homo sapiens] (SEQ ID NO: 423) 

6) gi | 7304963 | ref |NP_038913 . 1 | (NM_013885) chloride intracellular channel 4 
(mitochondrial) [Mus musculus] (SEQ ID NO: 424) 

7) gi | 4588524 | gb | AAD26136 . 1 | AF109196_1 (AF109196) intracellular chloride channel 
p64Hl [Homo sapiens] (SEQ ID NO: 425) 



NOV24a 
NOV24b 
gi|7592636 | 
gi | 6685319 | 
gi j 7330335 | 
gi j 7304963 | 
gi [4588524 | 



10 20 30 

....|....|....|....|....|....|....|. 
MAEAAEPEGVAPGPQGPPEVPAPLAERPGEPGAAG- 



40 



50 



60 



-GEAEGPEGSEGAEEAPRGAA 5 5 
1 

MAETAEPEGGAPSPQGPPEGSALLEERPGEPDPAGPEASEGAAKAPSGEGAGAAAKAGAT 6 0 

1 



NOV24a 

NOV24b 

gi | 7592636 | 

gi j 6685319 j 

gi|7330335 | 

gi|7304963 | 

gi j 4588524 j 



70 80 
....|....|....|....|.... 
AVKEAGGGGPDRGPEAEARGTRG- 



90 100 110 120 

.|....|....|....|....|....|....| 
- AHGE TE AE E GAP EGAE V PQGGE E T S G AQ Q VE 10 9 



56 

1 1 

61 EEASGGRDGEGAGEQAPDAGTESGGETPDAKGAQIEAEGAPEGTKAPQLGEEGSGGKQVE 12 0 

1 1 
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14 0 



150 



160 
..|.. 



170 
..|.. 



180 
•-I 



NOV24a 


110 


NOV24b 


1 


gi 


7592636 | 


121 


gi 


6685319 j 


1 


gi 


7330335 j 


1 


gi 


7304963 j 


1 


gi 


4588524 j 


1 



-GDSVDAEG- 



--S 163 
1 



190 200 210 220 230 240 

....|....|....|....|....|....|....|....|....|....|....|....| 
NOV24a 170 IEAEGPAGDSVEAEGRVGDSVDAEGPAGDSVDAEGPLGDNIQAEGPAGDSVDAEGRVGDS 229 

NOV24b 1 1 

gi | 7592636 | 164 IDAGG SVDAAG SVDAGG SIDAGGSM-D AGGSVDAGG S 199 

gi|6685319| 1 1 
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gi |7330335 | 
gi |7304963 | 
gi j 4588524 | 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



NOV24a 

NOV24b 

gi|7592636| 

gi|6685319| 

gi|7330335 j 

gi | 7304963 j 

gi|4588524| 



NOV24a 

NOV24b 

gi |7592636 | 

gi j 6685319 | 

gi | 7330335 | 

gi|7304963 | 

gi|4588524| 



NOV24a 

NOV24b 

gi |7592636 | 

gi | 6685319 j 

gi |7330335 j 

gi |7304963 j 

gi | 4588524 | 



250 260 270 280 290 300 

....|....|....|....|....|....|....|....|....|....|....|....| 
23 0 VDAEGPAGDS VDAEGRVGDS VEAGDPAGDGVEAGVPAGDS VEAEG - - PAGDSMDAEGPAG 2 87 

1 1 

200 IDTGG SVDAAG S VDAGGS I DTG — RNVDAGGS I DAGGS VDAGGSMDAEGPAG 249 

1 1 

! 1 

! 1 

! 1 



310 320 330 340 350 360 

. ... | .... | .... | .... | .... | .... | .... | .... | .... | .... | .... | .... | 
288 RARRVSGEPQQSGDGSLSPQAEAIEVAAGESAGRSPGELAWDAAEEAEVPGVKGSEEAAP 3 47 



25 0 GAHGAGGEPQDLGAGSPQPRSEAVEVAAAENEGHSPGESVEDAAAEEAA-G- 

1 

1 

! 

1 



-TREP 3 03 
1 



370 



380 390 400 410 420 

. ... | .... | .... | .... | .... | .... | .... | .... | .... | .... | .... | .... | 
34 8 GDARADAGEDRVGDGPQQEPGEDEERRERSPEGPREEEAAGGEEESPDSSPHG- -EASRG 4 05 

x MAE 3 

3 04 EGSEDAAGEDGDQGRPQEETEQQAERQEPGPETQSEEE ER- PPDRSPDGEAAASTR 3 58 

! 1 



NOV24a 


406 


AAEPEAQ 


LS 


NOV24b 


4 


AAEPEAQ 


LS 


gi | 7592636 


359 


AAQPEAE 


LS 


gij 6685319 


1 


MA 


LS 


gi|7330335 


1 


MA 


LS 


gi|7304963 


1 


MA 


LS 


gi|4588524 


1 


MA 


LS 



430 440 

j_l I I • 

NHLAEEGPAEGSGEAAR? 

NHLAEEGPAEGSGEAAR; 
NHLAAEEGGQ - RGE 1 



460 



470 



480 




490 



500 



NOV24a 


466 


IGNCPFSQRLFMILWLKGVj 


F| 


NOV24b 


64 


IGNCPFSQRLFMILWLKGVj 


Fij 


gi |7592636 | 


417 


IGNCPFSQRLFMILWLKGVJ 


fS 


gi j 6685319 j 


32 


IGNCPFSQRLFMILWLKGVj 




gi|7330335 | 


32 


IGNCPFSQRLFMILWLKGVj 




gi|7304963| 


32 


IGNCPFSQRLFMILWLKGVj 


F l 


gi|4588524 | 


32 


IGNCPFSQRLFMILWLKGVJ 


if| 



510 
■ ■ I ■ ■ 



iGEASEPRALGg 
EASEPRALGQ 
EASEEGDPGp 
D 



520 
■ • I • ■ 




530 



540 



/TTVDLKRKPADLQNLAPGTgPPFI TF 

/ttvdlkrkpadlqnlapgt|ppf| TF! 

/TTVDLKRKPADLQNLAPGt|pPF| TF! 

^ttvdlkrkpadlqnlapgtSppfi TF 



jEVKTDVNKI 

Ievktdvnki 

KTDVNKI 

evktdvnki 
evktdvnki 
evktdvnki 
evktdvnki 



525 

123 

476 

91 

91 

91 

91 



600 



NOV24a 

NOV24b 

gi | 7592636 | 

gi|6685319| 

gi|7330335| 

gi|7304963 j 

gi|4588524 | 



NOV24a 
NOV24b 
gi|7592636| 
gi | 6685319 | 





YLNSPLPDEID^HSyED: 

ylnsplpdeid^sHed! 
ylnsplpde||dWsHed 
ylnsplpdeidBsBed 



RKFLDGgEgTLADCNLLPKLHI 

rkfldg geb tladcnllpklhi 
rkfldg^^tladcnllpklhi 
rkfldgiSeStladcnllpklhi 




VAKKYR 
VAKKYR 
VAKKYR 
VAKKYR 



645 
238 
591 
206 
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ii -\n n ? iP r 1 1 s? ... iLii t 



10 



gi|7330335 | 
gi|7304963 j 
gi | 4588524 | 



NOV24a 

NOV24b 

gi | 7592636 | 

gij 6685319 j 

gi|7330335| 

gi|7304963 j 

gi|4588524 j 



152 
152 
152 



YLNSPLPDEID 
YLNSPLPDEID 
YLNSPLPDEID 



670 



3l KF 



680 



R^FLDGl 




690 



700 




1TK 2 53 

]tk 253 

iTK 2 53 

?K 253 



1 5 The gene of invention encodes a homolog of parchorin, a new member of the 

intracellular chloride channel family. Parchorin was discovered as a 120 kDa phosphoprotein 
in gastric parietal cells (Urushidani et al., J Membr Biol. 1999 Apr l;168(3):209-20). 
Subsequent analysis revealed that this protein had significant homology to the family of 
intracellular chloride channels, especially in the C terminal domain (Nishizawa et al., J Biol 

20 Chem 2000 Apr 14;275(15):1 1 164-73). However, unlike other members of this family, 
parchorin exists mainly in the cytoplasm and translocated to the plasma membrane upon 
stimulation of chloride ion efflux. In addition, parchorin shows only two hydrophobic domains 
relative to the ten to twelve domains seen in other intracellular chloride channels. Tissue 
expression of parchorin in the rabbit is enhanced in cells that secrete water, like parietal cells, 

25 choroid plexus, salivary duct, lacrimal gland, kidney, airway epithelia, and chorioretinal 

epithelia. It is therefore thought that this protein plays a critical role in these tissues, possibly 
by modulating chloride ion transport. 

Intracellular chloride channels have diverse roles within cells, such as volume 
regulation, acidification of intracellular vesicles, vectorial transepithelial chloride transport 

30 and regulation of cellular excitability (Jentsch et al., Pflugers Arch 1999 May;437(6):783-95). 
Loss of function mutations affecting three different members of this family lead to three 
human inherited diseases: myotonia congenita, Dent's disease, and Bartter's syndrome. In 
addition, a mouse knockout model involving a member of this family has been generated that 
mimics diabetes insipidus (Matsumura et al., Nat Genet 1999 Jan;21(l):95-8). 

35 It is likely, therefore, that the protein of invention participates in physiological 

functions similar to those of other members of the intracellular chloride channel family, 
particularly parchorin. 

The disclosed NOV24 nucleic acid of the invention encoding a Parchorin-like protein 
includes the nucleic acids whose sequences are provided in Table 24A or 24C or a fragment 

40 thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 24A or 24C while still encoding a 
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protein that maintains its Adrenal secretory serine protease -like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
5 includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 

include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 

10 binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 37 percent of the bases may be so changed. 

The disclosed NOV24 protein of the invention includes the Parchorin -like protein 
whose sequence is provided in Table 24B or 24D. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 

15 in Table 24B or 24D while still encoding a protein that maintains its Adrenal secretory serine 
protease -like activities and physiological functions, or a functional fragment thereof. In the 
mutant or variant protein, up to about 40 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(F a b>2, that bind immunospecifically to any of the proteins of the invention. 

20 The above disclosed information suggests that this Parchorin -like protein (NOV24) is 

a member of a "Parchorin family". Therefore, the NOV24 nucleic acids and proteins identified 
here may be useful in potential therapeutic applications implicated in (but not limited to) 
various pathologies and disorders as indicated below. The potential therapeutic applications 
for this invention include, but are not limited to: protein therapeutic, small molecule drug 

25 target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic 
and/or prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue 
regeneration in vivo and in vitro of all tissues and cell types composing (but not limited to) 
those defined here. 

The NOV24 nucleic acids and proteins of the invention are useful in potential 
30 therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome, Alzheimer's 

disease, stroke, tuberous sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, 
cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, 
leukodystrophies, behavioral disorders, addiction, anxiety, pain, neurodegeneration, systemic 
lupus erythematosus, autoimmune disease, asthma, emphysema, scleroderma, allergy, ARDS, 
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diabetes, autoimmune disease, renal artery stenosis, interstitial nephritis, glomerulonephritis, 
polycystic kidney disease, systemic lupus erythematosus, renal tubular acidosis, IgA 
nephropathy, hypercalceimia, Lesch-Nyhan syndrome, cancer, trauma, bacterial/viral/parasitic 
infection, tissue degeneration, and/or other diseases and pathologies. 
5 NOV24 nucleic acids and polypeptides are further useful in the generation of 

antibodies that bind immuno-specifically to the novel NOV24 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobic ity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV24 protein has multiple hydrophilic 
10 regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV25 

15 A disclosed NOV25 nucleic acid of 1 123 nucleotides (also referred to as CG56457-01) 

encoding a novel protein phosphatase-like protein is shown in Table 25A. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 60-62 and ending 
with a TGA codon at nucleotides 768-770. The start and stop codons are shown in bold in 
Table 25A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 25A. NOV25 nucleotide sequence (SEQ ID NO: 107). 

TCCGGATCGCTTCCCGGGCGGCGAGCTGGGGGTGCACCCGGACCGCCGCCCCCGGGATC ATGGGCAATGGCA 
TGACCAAGGTACTTCCTGGACTCTACCTCGGAAACTTCATTGATGCCAAAGACCTGGATCGCCTGGGCCGAA 
ATAAGATCACACACATCATCTCTATCCATGAGTCACCCCAGCCTCTGCTGCAGGATATCACCTACCTTCGCA 
T C C CGGTCG CT G AT AC C C C TG AGGT ACC C AT C AAAAAG C ACTT C AAAG AATGTAT CAAC T TC AT C CACTG CT 
GCCGCCTTAATGGGGGGAACTGCCTTGTGCACTGCTTTGCAGGCATCTCTCGCAGCACCACGATTGTGACAG 
CGT ATGTG ATG ACT GTG ACGGGGCT AGG CTGGCGGGACGTG CTTG AAGC C AT C AAGGCC AC C AGG C C CAT CG 
CCAACCCCAACCCAGGCTTTAGGCAGCAGCTTGAAGAGTTTGGCTGGGCCAGTTCCCAGAAGCTTCGCCGGC 
AGCTGGAGGAGCGCTTCGGCGAGAGCCCCTTCCGCGACGAGGAGGACTTGCGCGCGCTGCTGCCTCTCTGCA 
GGCGCTGTCGCCAGGGTCCGGGGACTTCGGCCCCGTCGGCCACCACAGCGTCCTCGGCCGCTTCCGAGGGGA 
CCCTGCAGCGCCTGGTGCCGCGATCGCCGCGGGAATCACACCGGCCGCTGCCGCTGCTGGCGCGCGTCAAGC 
AGACTTTCTCTTGCCTCCCCCGGTGTCTGTCCCGCAAGGGCGGCAAGTGA GGATGCAGTCCAGCCGTGGCTC 
CCCACTTCCGACTGGCTCCCTTCGGGGGCTGTCTGCGCCTTCCACGCCCCCCAGGACGGGCCCAGAGGCTGG 
GGGAGCCCCGCGGCGGCCTGAACCCTGCCTCCCGCGCCCGCCCTGCTCGTCCGCGTCTGCAGTCAGCGTCCC 
CAACCTGTGCGTCTCTGTGTCCGGGCCGGCCTGCTGCAGCCACCTGGTGCCTTAGTCCTTGGGCTGGGGGAG 
GGGGCCCACCCTTAAAGGCGGCGGGAGGGGAGGGAGGGAGAGTGGAGGGTTTGACGGGCCTGGAGGGTATTA 
AAGAGACACAGAAGAAGCTGCCTGTCAAAAAAAAAAAAAAAAA 



20 ~ " ~ ^ ^ ~ ~ 

In a search of public sequence databases, the NOV25 nucleic acid sequence, located on 
chromosome 20, has 324 of 505 bases (64%) identical to a gb:GENBANK- 
ID:AF 1 655 19|acc:AF 1655 19.1 mRNA from Homo sapiens (mitogen-activated protein kinase 
phosphatase x (MKPX) mRNA, complete cds) (E = 2.3e 31 ). 
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A disclosed NOV25 polypeptide (SEQ ID NO: 108) encoded by SEQ ID NO: 107 has 
236 amino acid residues and is presented in Table 25B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV25 has no signal peptide and is 
likely to be localized to the cytoplasm with a certainty of 0.6500. Alternatively, NOV25 may 
5 also localize to the lysosome (lumen) with a certainty of 0.1 805, to the mitochondrial matrix 
space with a certainty of 0.1000, or to the plasma membrane with a certainty of 0.1000. 



Table 25B. Encoded NOV25 protein sequence (SEQ ID NO: 108). 

MGNGMTKVLPGLYLGNFIDAKDLDRLGRNKITHIISIHESPQPLLQDITYLRIPVADTPEVPIKKHFKECIN 
FIHCCRLNGGNCLVHCFAGISRSTTIVTAYVMTVTGLGWRDVLEAIKATRPIANPNPGFRQQLEEFGWASSQ 
KLRRQLEERFGESPFRDEEDLRALLPLCRRCRQGPGTSAPSATTASSAASEGTLQRLVPRSPRESHRPLPLL 
ARVKQTFSCLPRCLSRKGGK 



A search of sequence databases reveals that the NOV25 amino acid sequence has 91 of 
169 amino acid residues (53%) identical to, and 125 of 169 amino acid residues (73%) similar 
to, the 184 amino acid residue ptnr:SPTREMBL-ACC:Q9NRW4 protein from Homo sapiens 
(Human) (Mitogen-Activated Protein Kinase Phosphatase X) (E = 7.3e" 50 ). 

NOV25 is predicted to be expressed in at least brain, testis, exocrine pancreas, adipose, 
bone, peripheral blood, salivary glands, spinal cord, thyroid . This information was derived by 
determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, public EST sources and/or RACE sources. 

In addition, the sequence is predicted to be expressed in hematopoietic stem cells 
because of the expression pattern of (GENBANK-ID: gb:GENBANK- 
ID:AF165519|acc:AF165519.1) a closely related Homo sapiens mitogen-activated protein 
kinase phosphatase x (MKPX) mRNA, complete cds homolog. 

The disclosed NOV25 polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 25C. 



Table 25C. BLAST results for NOV25 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi|l7458347|ref |XP 
059288. l| 
(XM_059288) 


similar to 
bA243J16.6 (novel 
protein with a 
dual specificity 
phosphatase , 
catalytic domain) 
(H. sapiens) 
[Homo sapiens] 


235 


223/236 
(94%) 


229/236 
(96%) 


e-124 


gi 1 18104942 |ref |NP_ 
542178. l| 
(NM_080611) 


dual specificity 
phosphatase -like 
15 [Homo sapiens] 


243 


216/251 
(86%) 


222/251 
(88%) 


e-115 
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15 



ii fi ii' j J-' w 1 ' o *tl if* srs ii *i i? $ s 



gi | 9910432 | ref | NP_0 
64 570 . 1 | 
(NM_020185) 


mitogen- act ivated 

T~\ T~/""\ 1~ d "1 T1 l^ - ~l T"t .P» t3 

JL vJ L. fc2 X 11 Jv -LllctbC 

phosphatase x; 
homolog of mouse 
dual specificity 
phosphatase LMW- 
DSP2; JNK- 
stimulat ing 
phosphatase 1 
[ Homo sapi en s ] 


184 


91/169 

\ D-5 IT ) 


125/169 


4e-53 


gi | 13183069 | gb | AAK1 
5038 . 1 |AF237619_1 
(AF23 7619) 


dual specificity 
phosphatase TS- 
DSP2 [Mus 
musculus] 


184 


90/169 
(53%) 


125/169 
(73%) 


2e-52 


gi | 14726046 | ref |XP_ 
046543 . 1 | 
(XM_046543) 


mitogen- act ivated 
protein kinase 
phosphatase x 
[Homo sapiens] 


184 


89/169 
(52%) 


118/169 
(69%) 


2e-50 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 25D. In the ClustalW alignment of the NOV25 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 25D. ClustalW Analysis of NOV25 



15 



20 



25 



1) Novel NOV25 (SEQ ID NO: 108) 

2) gi | 17458347 | ref |XP_059288. 1 | (XM_059288) similar tobA243J16.6 (novel protein 
with a dual specificity phosphatase, catalytic domain) (H. sapiens) [Homo sapiens] 
(SEQ ID NO: 426) 

3) gi | 18104942 | ref |NP_542178 . 1 | (NM_080611) dual specif icity phosphatase-like 15 
[Homo sapiens] (SEQ ID NO:427) 

4) gi | 9910432 | ref |NP_064570 . 1 | (NM_020185) mitogen-activated protein kinase 
phosphatase x; homolog of mouse dual specificity phosphatase LMW-DSP2; JNK- 
stimulating phosphatase 1 [Homo sapiens] (SEQ ID NO: 428) 

5) gi|l3183069|gb|AAK15038.l|AF237619_l (AF237619) dual specificity phosphatase TS- 
DSP2 [Mus musculus] (SEQ ID NO:429) 

6) gi | 14726046 | ref |XP_046543 . 1 | (XM_046543) mitogen-activated protein kinase 
phosphatase x [Homo sapiens] (SEQ ID NO: 4 30) 
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NOV25 

gi | 17458347 | 
gi 1 18104942 j 
gi | 9910432 | 
gi|l3183069| 
gi j 14726046 j 




10 



20 
•I-- 



30 



LPGLYjgGNF 
LPGLYgGNF 

lpglyBgnf 

LPGLYgGNF; 

lpgly|gnf| 
lpglyhgnf 



ighpasqigssilfls g 
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NOV25 

gi | 17458347 | 
gi | 18104942 | 
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!) Is ifji 



10 



15 



20 



25 



30 



gi | 9910432 | 
gi| 13183069 | 
gi | 14726046 j 



NOV2 5 

gi | 17458347 | 
gi j 18104942 | 
gi j 9910432 | 
gi | 13183069 | 
gi | 14726046 | 



NOV25 

gi | 17458347 | 
gi j 18104942 | 
gi j 9910432 | 
gi j 13183069 | 
gi|l4726046| 



NOV25 

gi | 17458347 | 
gi | 18104942 | 
gi j 9910432 | 
gi | 13183069 | 
gi j 14726046 | 




YLSIP 

ylsip 




sqniJti? 

sq4|t]r 

;SQNSjTR 




CLVHC 
CLVHC 
CLVHC 



130 



140 



150 




160 

wassqJ|l 

WASSQ|fL 
WASSQ^L 



190 



200 



210 



220 




...|....|....|.... 
IPLCRRCRQGPGTSAPSATTASS. 
_ -—JpLCKRCRQGSATSASSAG- PHS 
PLCKRCRQGSATSAS SAG- PHS. 





LORLvprspreshrplpllarvkqtQsc 225 

^RLVPRTPREAHRPLPLLARVKQTfSsC 2 24 
•^RLVPRTPREAHRPLPLLARVKQtHsC 2 32 

i&| * @wa 

ift gWA 



179 
179 



IMK [§WA 17 9 



250 

2 26 LPgCfflSRKGGK 236 

225 LPgCfflSRKGGK 235 

233 LPgCfflSRKGGK 243 

180 FLgRffl 184 

180 FLyRjl 184 

180 FLaRH 184 
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Tables 25E-H list the domain descriptions from DOMAIN analysis results against 
NOV25. This indicates that the NOV25 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 25E Domain Analysis of NOV25 

gnl | Smart | smart00195 , DSPc, Dual specificity phosphatase, catalytic 
domain (SEQ ID NO: 816) 

CD-Length =13 9 residues, 97.8% aligned 
Score = 139 bits (349), Expect = 2e-34 



40 



45 



NOV25 : 


4 


Sbjct : 


1 


NOV25 : 


63 


Sbjct : 


61 


NOV25 : 


123 


Sbjct : 


121 



GMTKVLPGLYLGNFIDAKDLDRLGRNKITHIIS- IHESPQPLLQDITYLRIPVADTPEVP 62 

I —II 1111++ II +1 I + III+I+ II II III I I 

GPSEILPHLYLGSYSDASNLALLKKLGITHVINVTEEVPNSNKSGFLYLGIPVDDNTETK 60 

IKKHFKECINFIHCCRIJSrGGNCL VHCFAGISRSTTIVTAYVMTVTGLGWRDVLEAIKATR 12 2 

I + I + II II I I I I I l+l I I |++ I l + l + I + +1 I 

I S P YLPEAVE FI EDAEKKGGKVLVHCQAGVSRS ATL I IAYLMKYRNMSLNDAYDFVKERR 12 0 

PIANPNPGFRQQLEEF 13 8 

II +11 II +11 1+ 
PUS PNFGFLRQL I E Y 136 
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Table 25F Domain Analysis of NOV25 

gnl |Pfam|pfam00782, DSPc, Dual specificity phosphatase, catalytic 
domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary 
fold is highly similar to that of tyrosine-specif ic phosphatases, 
except for a "recognition" region. (SEQ ID NO: 817) 
CD-Length =13 9 residues, 97.8% aligned 
Score = 136 bits (342) , Expect = 2e-33 



5 



NOV2 5 : 


4 


GMTKVLPGLYLGNFIDAKDLDRLGRNKITHI IS - IHESPQPLLQDITYLRIPVADTPEVP 


62 






1 + + + M 1111++ 1 +1 1 + III+I+ II II III 1 1 




Sbjct : 


1 


GPSEILPHLYLGSYPTASNLAFLSKLGITHVINVTEEVPNSKNSGFLYLHIPVDDNHETD 


60 


NOV25: 


63 


I KKHFKEC INFI HCCRLNGGNCLVHCFAG I SRSTT I VTAYVMTVTGLGWRDVLEAI KATR 


122 






1 + 1 + II 1 II INI MINI I++ ll + l 1 + +|| 




Sbjct : 


61 


I SP YLDEAVE FI EDARQKGGKVLVHCQAGI SRSATLI I AYLMKTRNLSLNEAYSFVKERR 


120 



10 NOV25: 123 PIANPNPGFRQQLEEF 138 

11+111 |++| I |+ 
Sbjct: 121 PI ISPNFGFKRQLIEY 136 



Table 25G Domain Analysis of NOV25 

gnl | Smart | smart0 04 04 , PTPc_motif, Protein tyrosine phosphatase, 
catalytic domain motif (SEQ ID NO: 818) 
CD-Length = 105 residues, 53.3% aligned 
Score = 41.2 bits (95), Expect = 7e-05 



15 NOV25: 50 YLR I PVADTPEVPIK- KHFKEC INF I HCCRLNGGNCLVHCFAG IS RSTTIVTAYVM 104 

I I Ml I + II +111 II- 1+ I I ++ 

Sbjct: 7 YTGWPDHGVPESPDSILEFLRAVKKSLNKSANNGPVWHCSAGVGRTGTFVAIDIL 62 



Table 25H Domain Analysis of NOV25 

gnl |Pfam|pfam00102, Y_phosphatase , Protein-tyrosine phosphatase (SEQ 
ID NO: 819) 

CD-Length = 23 5 residues, 31.9% aligned 
Score = 38.5 bits (88), Expect = 4e-04 



20 NOV25: 50 YLR I PVADTPE VP I KKHFKEC INF I HCCRLNG - -GNCLVHCFAG I S RSTTIVTAYVM- -T 105 

I I I I I I + + + + 1+1111111+1+ + + 

Sbjct: 139 YTGWPDHGVPESP- - KSILDLLRKVRKSKGTPDDGPIWHCSAGIGRTGTFIAIDILLQQ 196 



NOV25 : 106 VTGLGWRDVLEAI KATR 12 2 

25 + | | | + +| | 

Sbjct: 197 LE KEG WD V FDT VKKL R 213 



The gene of invention is a member of the family of dual specificity protein 

phosphatases (DSPs; Martell et ah, Mol Cells 1998 Feb 28;8(1):2-1 1). DSPs recognize either 

30 Ser/Thr or Tyr moieties as targets for dephosphorylation. These enzymes regulate mitogenic 

signal transduction and can thereby regulate the cell cycle. Some members of this family are 

effective tumor suppressors, for example, PTEN. PTEN is required during embryonic 

development and later in life, and mutations in this gene give rise to different kinds of 
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inherited and sporadic cancers (Eng, Recent Prog Horm Res 1999;54i441-52; discussion 453). 
In Drosophila, members of the DSP family, such as puckered, have important roles in 
development (Martin-Blanco et al., Genes Dev 1998 Feb 15;12(4):557-70). The crystal 
structure of one member of the DSP family has been elucidated (Yuvaniyama at al., Science 
5 1996 May 3 1;272(5266): 1328-31) and this family has been successfully targeted for small 
molecule drug development (Ducruet et al., Bioorg Med Chem 2000 Jun;8(6):1451 -66). In 
addition, overexpression of a DSP has been demonstrated to be a potential therapy for cardiac 
hypertrophy (Bueno et al., Circ Res 2001 Jan 19;88(l):88-96). The gene of invention has 
greatest homology to a DSP identified in hematopoietic stem/progenitor cells from a patient 

10 with myelodysplastic syndromes. It shows the presence of a distinct domain present in all 

DSPs , which qualifies it as a bona fide member of this family. Its localization is predicted to 
be cytoplasmic, which makes it a good candidate to interact with members of the signal 
transduction cascade governing the cell cycle. 

The disclosed NOV25 nucleic acid of the invention encoding a Protein phosphatase - 

15 like protein includes the nucleic acid whose sequence is provided in Table 25A or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 25A while still encoding a protein 
that maintains its Protein phosphatase -like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 

20 are complementary to those just described, including nucleic acid fragments that are 

complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 

25 derivatized. These modifications are carried out at least in part to enhance the chemical 

stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 36 percent of the bases may be so changed. 

The disclosed NOV25 protein of the invention includes the Protein phosphatase -like 

30 protein whose sequence is provided in Table 25B. The invention also includes a mutant or 

variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 25B while still encoding a protein that maintains its Protein phosphatase -like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 48 percent of the residues may be so changed. 



225 



The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Protein phosphatase -like protein 
(NOV25) is a member of a "Protein phosphatase family". Therefore, the NOV25 nucleic acids 
and proteins identified here may be useful in potential therapeutic applications implicated in 
(but not limited to) various pathologies and disorders as indicated below. The potential 
therapeutic applications for this invention include, but are not limited to: protein therapeutic, 
small molecule drug target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 
research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing 
(but not limited to) those defined here. 

The NOV25 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in for example Von Hippel-Lindau (VHL) syndrome, 
pancreatitis, obesity, Alzheimer's disease, stroke, tuberous sclerosis, hypercalceimia, 
Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, 
multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, 
anxiety, pain, neurodegeneration, psychiatric disorders, metabolic disorders, fertility, 
hypogonadism, xerostomia, hyperthyroidism, hypothyroidism, cancer, trauma, tissue 
degeneration, viral/bacterial/parasitic infections, and/or other diseases and pathologies. 

NOV25 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV25 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV25 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV26 

NOV26 includes two novel GAGE-7-like proteins disclosed below. The disclosed 
sequences have been named NOV26a and NOV26b. 
NOV26a 

A disclosed NOV26a nucleic acid of 550 nucleotides (also referred to as CG56461-01) 
encoding a novel GAGE-7-like protein is shown in Table 26A. An open reading frame was 
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identified beginning with a ATG initiation codon at nucleotides 67-69 and ending with a TAA 
codon at nucleotides 400-402. The start and stop codons are shown in bold in Table 26A, and 
the 5' and 3' untranslated regions, if any, are underlined. 

Table 26A. NOV26a nucleotide sequence (SEQ ID NO: 109). 

~GTTCCTGCTGTCTGGACTTTTTCTGTCCCACTGAGACGCAGCTGTATTCTGTTTGCAGTGTGAAAT ATGATT 
TGGCGAGGAAGATCAACATATAGGCCTAGGCCGAGGAGAAGTGTACCACCTCCTGAGCTGATTGGGCCTATG 
CTGGAGCCCGGTGATGAGGAGCCTCAGCAAGAGGAACCCCCAACTGAAAGTCGGGATCCTGCACCTGGTCAG 
GAG AG AG AAGAAGATC AGGGTG CAG CT GAG ACT CAAGTGC C TG AC C TGG AAGCTGAT CT C CAGG AGCTGT CT 
CAGTCAAAGACTGGGGGTGAATGTGGAAATGGTCCTGATGACCAGGGGAAGATTCTGCCAAAATCAGAACAA 
T TT AAAATGC C AGAAGG AGGTG AC AGG CAAC CAC AGGTT TA AATGAAGAC AAGC TGAAAC AAC AC AAAAC TG 
TTTTTATCTAAGATATTTGACTTAAAAATATCGAAATAAACTTTTGCAGCTTTCTCCAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACCCCGC 



5 In a search of public sequence databases, the NOV26a nucleic acid sequence, located 

on the X chromosome, has 293 of 360 bases (81%) identical to a gb:GENBANK- 
ID:AF251237|acc:AF25 1237.1 mRNA from Homo sapiens (XAGE-1 mRNA, complete cds) 
(E-3.6e' 46 ). 

A disclosed NOV26a polypeptide (SEQ ID NO:l 10) encoded by SEQ ID NO: 109 has 
10 111 amino acid residues arid is presented in Table 26B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV26a has no signal peptide and is 
likely to be localized to the mitochondrial matrix space with a certainty of 0.4462. 
Alternatively, NOV26A may also localize to the nucleuswith a certainty of 0.3000, to the 
mitochondrial inner membrane with a certainty of 0.1347, or to the mitochondrial 
15 intermembrane space with a certainty of 0.1347. 



Table 26B. Encoded NOV26a protein sequence (SEQ ID NO: 110). 

MIWRGRSTYRPRPRRSVPPPELIGPMLEPGDEEPQQEEPPTESRDPAPGQEREEDQGAAETQVPDLEADLQE 
LSQSKTGGECGNGPDDQGKILPKSEQFKMPEGGDRQPQV 

A search of sequence databases reveals that the NOV26a amino acid sequence has 60 
of 1 15 amino acid residues (52%) identical to, and 72 of 1 15 amino acid residues (62%) 
20 similar to, the 1 16 amino acid residue ptnr:SPTREMBL-ACC:Q9UEU5 protein from Homo 
sapiens (Human) (GAGE-7) (E = 1.4e" 23 ). 

NOV26a is predicted to be expressed in at least placenta. This information was derived 
by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, Public EST sources, and/or RACE sources. 
25 NOV26b 
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In the present invention, the target sequence identified previously, NOV26a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 
5 examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, or by translated homology of the predicted exons to closely 

10 related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 

15 gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 

20 another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV26b. This is 100% identical to the 
previously identified sequence (NOV26a). 

A disclosed NOV26b nucleic acid of 494 nucleotides (also referred to as CG56461-02) 

25 encoding a novel GAGE-7-like protein is shown in Table 26C. An open reading frame was 

identified beginning with a ATG initiation codon at nucleotides 67-69 and ending with a TAA 
codon at nucleotides 400-402. The start and stop codons are shown in bold in Table 26C, and 
the 5' and 3' untranslated regions, if any, are underlined. 

Table 26C. NOV26b nucleotide sequence (SEQ ID NOrlll). 

"gttcctgctgtctggactttttctgtcccactgagacgcagctgtattctgtttgcagtgtgaaat atgatt 
tggcgaggaagatcaacatataggcctaggccgaggagaagtgtaccacctcctgagctgattgggcctatg 

CTGGAGCCCGGTGATGAGGAGCCTCAGCAAGAGGAACCACCAACTGAAAGTCGGGATCCTGCACCTGGTCAG 
GAGAGAGAAGAAGATCAGGGTGCAGCTGAGACTCAAGTGCCTGACCTGGAAGCTGATCTCCAGGAGCTGTCT 
CAGTCAAAGACTGGGGGTGAATGTGGAAATGGTCCTGATGACCAGGGGAAGATTCTGCCAAAATCAGAACAA 
TTTAAAATGCCAGAAGGAGGTGACAGGCAACCACAGGTTTAA ATGAAGACAAGCTGAAACAACACAAAACTG 
TTTTTATCTAAGATATTTGACTTAAAAATATCAAAATAAACTTTTGCAGCTTTCTCCAAAAA 
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In a search of public sequence databases, the NOV26b nucleic acid sequence, located 
on the X chromosome, has 346 of 426 bases (81%) identical to a gbiGENBANK- 
ID:HSL185E6A|acc:Z68274.1 mRNA from Homo sapiens (Human DNA sequence from 
cosmid L129H7, Huntington's Disease Region, chromosome 4pl6.3 contains Pseudogene and 
5 CpG island) (E = 5. 7e 53 ). 

The disclosed NOV26b polypeptide (SEQ ID NO:l 12) encoded by SEQ ID NO:l 1 1 
has 111 amino acid residues and is presented in Table 26D using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV26b has no signal peptide 
and is likely to be localized to the mitochondrial matrix space with a certainty of 0.4462. 
10 Alternatively, NOV26b may also localize to the nucleus with a certainty of 0.3000, to the 
mitochondrial inner membrane with a certainty of 0. 1347, or to the mitochondrial 
intermembrane space with a certainty of 0. 1 347. 



Table 26D. Encoded NOV26b protein sequence (SEQ ID NO:112). 

MIWRGRSTYRPRPRRSVPPPELIGPMLEPGDEEPQQEEPPTESRDPAPGQEREEDQGAAETQVPDLEADLQE 
LSQSKTGGECGNGPDDQGKILPKSEQFKMPEGGDRQPQV 

15 A search of sequence databases reveals that the NOV26b amino acid sequence has 60 

of 1 15 amino acid residues (52%) identical to, and 72 of 1 15 amino acid residues (62%) 
similar to, the 116 amino acid residue ptnr:SPTREMBL-ACC:Q9UEU5 protein from Homo 
sapiens (Human) (GAGE-7) (E = 1 .4e~ 23 ). 

NOV26b is predicted to be expressed in at least the following tissues: Placenta, Whole 

20 Organism. . 

The disclosed NOV26a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 26E. 



Table 26E. BLAST results for NOV26a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
<aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 174 863 97 | ref |XP__ 
060048. l| 
(XM_060048) 


similar to G 
antigen 3 (H. 
sapiens) [Homo 
sapiens] 


137 


84/84 
(100%) 


84/84 
(100%) 


2e-33 


gi | 18027836 | gb | AAL.5 
5879.1|AF318372_1 
(AF318372) 


unknown [Homo 
sapiens] 


111 


110/111 
(99%) 


110/111 
(99%) 


4e-33 


gi | 18157212 | emb|CAC 
83008. 1| (AJ318881) 


XAGE-3 protein 
[Homo sapiens] 


111 


111/111 
(100%) 


111/111 
(100%) 


2e-29 



229 



J 




gi | 174 863 94 | ref |XP_ 
066835. l| 
<XM_066835) 


similar to G 
antigen Bl; 
prostate 

associated gene 1 
(H . sapiens) 
[Homo sapiens] 


185 


64/78 
(82%) 


69/78 
(88%) 


2e-26 


gi | 14765261 | ref |XP_ 
0 32309.1] 
(XM 032309) 


hypothetical 
protein XP_032309 
[Homo sapiens] 


111 


80/111 
(72%) 


93/111 
(83%) 


4e-26 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 26F. In the ClustalW alignment of the NOV26 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 26F. ClustalW Analysis of NOV26 

1) Novel NOV26a (SEQ ID NO:110) 

2) Novel NOV26b (SEQ ID NO: 112) 

3) gi| 17486397 | ref |XP_060048.1 | (XM_060048) similar to G antigen 3 (H. sapiens) 
[Homo sapiens] (SEQ ID NO: 431) 

4) gi | 1802783 6 | gb | AAL55879 . 1 | AF318372_1 (AF3 183 72) unknown [Homo sapiens] (SEQ ID 
NO:131) gi | 18157212 | emb | CAC83008 . 1 | (AJ318881) (SEQ ID NO:432) 

6) gi|l8157212|emb|CAC83008.l| (AJ318881) XAGE-3 protein [Homo sapiens] (SEQ ID 
NO:433) 

7) gi | 17486394 | ref |XP_066835 . 1 | (XM_066835) similar to G antigen Bl; prostate 
associated gene 1 (H. sapiens) [Homo sapiens] (SEQ ID NO: 434) 

8) gi | 14765261 | ref |XP_032309 . 1 | (XM_032309) hypothetical protein XP_032309 [Homo 
sapiens] (SEQ ID NO: 435) 



NOV2 6a 

NOV2 6b 

gi | 17486397 | 

gi | 18027836 | 

gi | 18157212 j 

gi|l7486394| 

gi|l476526l| 




20 



|....| 



ITDEMCASSADISVCFAQSCG SK 




50 

| .... | . . 



18 

18 

^NIYVFCIFVLL^C^DNKT 3 7 

18 

18 

DFFLAN — YD : ITPGTP 5 8 
18 




NOV2 6a 

NOV2 6b 

gi | 17486397 | 

gi|l8027836| 

gij 18157212 | 

gi|l7486394| 

gi|l476526l| 




NOV2 6a 
NOV2 6b 
gi|l7486397| 



69 
69 
95 



130 



140 



150 



160 



170 



180 



DLQELSQSKTGGECGNGPDDQGKILPKSEQFKMPEGGDRQPQV 
DLQELSQSKTGGECGNGPDDQGKILPKSEQFKMPEGGDRQPQV 
DLQELSQSKTGGECGNGPDDQGKILPKSEQFKMPEGGDRQPQV 
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gi|l8027836| 69 

gi 1 18157212 I 69 

gi| 17486394 | 119 

gijl476526l| 69 




111 
111 

FLCS^CILSKLYTMFAIYFKPLL 17 8 

iGisjgg in 



10 



15 



20 



25 



30 



35 



40 



NOV26a 111 111 

NOV26b 111 111 

gi|l7486397| 137 137 

gi|l8027836| 111 111 

gi|l8157212| 111 111 

gi j 17486394 I 179 IDALDLM 185 

gi|l476526l| 111 111 

The gene of invention is a member of GAGE family of proteins. It belongs to the broad 
family of GAGE/MAGE/PAGE genes that are expressed in various human cancers. Many 
human tumors express antigens that are recognized in vitro by cytolyticT lymphocytes (CTLs) 
derived from the tumor-bearing patient. The MAGE(melanoma-specific antigen), PAGE 
(Prostate cancer antigen) and GAGE (G antigen) gene family members encode such antigens. 
Therefore these antigens can serve as therapeutic targets in cancer. 

The LNCaP progression model of human prostate cancer consists of lineage -related 
sublines that differ in their androgen sensitivity and metastatic potential. A differential display 
polymerase chain reaction was employed by Chen ME, et al. (J Biol Chem 1998 Jul 
10;273(28): 176 18-25) to evaluate mRNA expression differences between the LNCaP sublines 
in order to define the differences in gene expression between the androgen-sensitive, 
nontumorigenic LNCaP cell line and the androgen-insensitive, metastatic LNCaP sublines, 
C4-2 and C4-2B. An amplicon, BG16.21, was isolated that showed increased expression in the 
androgen-independent and metastatic LNCaP sublines, C4-2 and C4-2B. Hybridization 
screening of a lambda gtl 1 expression library with BG16.21 revealed two transcripts, both 
homologous to BG 16.21 at the 3' end. A GenBankTM data base search using the GCG 
Wisconsin software package revealed the shorter approximately 600-bp transcript (designated 
GAGE-7) to be a new member of the GAGE family. The second approximately 700-bp 
transcript was a novel gene (designated PAGE-1, "prostate associated gene") with only 45% 
homology to GAGE gene family members. RNA blot analysis demonstrated that GAGE-7 
mRNA was expressed at equal levels in all lineage related prostate cancer cell sublines, while 
PAGE-1 mRNA levels were elevated 5-fold in C4-2 and C4-2B as compared with LNCaP 
cells. Neither GAGE-7 nor PAGE-1 demonstrated any regulation by androgens in the prostate 
cancer cell lines used in this study. PAGE-1 and GAGE-7 expression was found to be 
restricted to testes (high) and placenta (low) on human multiple tissue Northern blots. As 
GAGE/MAGE antigens were reported previously to be targets for tumor-specific cytotoxic 
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lymphocytes in melanoma, these results suggest that PAGE-1 and GAGE-7 may be related to 
prostate cancer progression and may serve as potential targets for novel therapies. 

The GAGE-1 gene was identified previously as a gene that codes for an antigenic 
peptide, YRPRPRRY, which was presented on a human melanoma by HLA-Cw6 molecules 
5 and recognized by a clone of CTLs derived from the patient bearing the tumor. By screening a 
cDNA library from this melanoma, De Backer O, et al. (Cancer Res 1999 Jul 1 ;59(13):3157- 
65) identified five additional, closely related genes named GAGE-2-6. We report here that 
further screening of this library led to the identification of two more genes, GAGE-7B and -8. 
GAGE-1, -2, and -8 code for peptide YRPRPRRY. Using another antitumor CTL clone 

10 isolated from the same melanoma patient, they identified antigenic peptide, YYWPRPRRY, 
which is encoded by GAGE-3, -4, -5, -6, and -7B and which is presented by HLA-A29 
molecules. Genomic cloning of GAGE-7B showed that it is composed of five exons. Sequence 
alignment showed that an additional exon, which is present only in the mRNA of GAGE-1, 
has been disrupted in gene GAGE-7B by the insertion of a long interspersed repeated element 

1 5 retroposon. These GAGE genes are located in the pi 1 .2-pl 1 .4 region of chromosome X. They 
are not expressed in normal tissues, except in testis, but a large proportion of tumors of various 
histological origins express at least one of these genes. Treatment of normal and tumor 
cultured cells with a demethylating agent, azadeoxycytidine, resulted in the transcriptional 
activation of GAGE genes, suggesting that their expression in tumors results from a 

20 demethylation process. 

The disclosed NOV26 nucleic acid of the invention encoding a GAGE-7 -like protein 
includes the nucleic acid whose sequence is provided in Table 26A, 26C or a fragment thereof. 
The invention also includes a mutant or variant nucleic acid any of whose bases may be 
changed from the corresponding base shown in Table 26A or 26C while still encoding a 

25 protein that maintains its GAGE-7 -like activities and physiological functions, or a fragment of 
such a nucleic acid. The invention further includes nucleic acids whose sequences are 
complementary to those just described, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 

30 chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
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binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 19 percent of the bases may be so changed. 

The disclosed NOV26 protein of the invention includes the GAGE-7 -like protein 
whose sequence is provided in Table 26B or 26D. The invention also includes a mutant or 
5 variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 26B or 26D while still encoding a protein that maintains its GAGE-7 -like activities 
and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 
up to about 28 percent of the residues may be so changed. l ^ 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
1 0 (F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this GAGE-7 -like protein (NOV26) is a 
member of a "GAGE-7 family". Therefore, the NOV26 nucleic acids and proteins identified 
here may be useful in potential therapeutic applications implicated in (but not limited to) 
various pathologies and disorders as indicated below. The potential therapeutic applications 
15 for this invention include, but are not limited to: protein therapeutic, small molecule drug 

target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic 
and/or prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue 
regeneration in vivo and in vitro of all tissues and cell types composing (but not limited to) 
those defined here. 

20 The NOV26 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in fertility disorders, cancer, trauma, tissue degeneration, 
viral/bacterial/parasitic infections, and/or other diseases and pathologies. 

NOV26 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV26 substances for use in 

25 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV26 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 

30 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV27 

NOV27 includes three novel sodium - glucose cotransporter -like proteins disclosed 
below. The disclosed sequences have been named NOV27a, NOV27b, and NOV27c. 
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NOV27a 

A disclosed NOV27a nucleic acid of 1914 nucleotides (also referred to as CG56645- 
01) encoding a novel sodium - glucose cotransporter-like protein is shown in Table 27A. An 
open reading frame was identified beginning with a ATG initiation codon at nucleotides 51-53 
5 and ending with a TGA codon at nucleotides 1 839- 1 84 1 . The start and stop codons are shown 
in bold in Table 27A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 27A. NOV27a nucleotide sequence (SEQ ID NO: 113). 

TTGCCCCTCAGTCCCTCGGGCTCATACCTAGTGCCTGCGGCAGGACAGCC ATGGCCGCCAACTCCACCAGCG 
ACCTCCACACTCCCGGGACGCAGCTGAGCGTGGCTGACATCATCGTCATCACTGTGTATTTTGCTCTGAACG 
TGGCCGTGGGCATATGGTCCTCTTGTCGGGCCAGTAGGAACACGGTGAATGGCTACTTCCTGGCAGGCCGGG 
ACATGACGTGGTGGCCGATTGGAGCCTCCCTCTTCGCCAGCAGCGAGGGCTCTGGCCTCTTCATTGGACTGG 
CGGGCT C AGG CGCGGC AGG AGGTC TGG C CGTGG C AGGC TTCG AGTGGAATGC C ACGT ACGTG CTG CTGGC AC 
TGGCATGGGTGTTCGTGCCCATCTACATCTCCTCAGAGATCGTCACCTTACCTGAGTACATTCAGAAGCGCT 
ACGGGGGCCAGCGGATCCGCATGTACCTGTCTGTCCTGTCCCTGCTACTGTCTGTCTTCACCAAGATATCGC 
TGGACCTGTACGCGGGGGCTCTGTTTGTGCACATCTGCCTGGGCTGGAACTTCTACCTCTCCACCATCCTCA 
CGCTCGGCATCACAGCCCTGTACACCATCGCAGGGGGCCTGGCTGCTGTAATCTACACGGACGCCCTGCAGA 
CGCTCATCATGGTGGTGGGGGCTGTCATCCTGACAATCAAAGCTTTTGACCAGATCGGTGGTTACGGGCAGC 
TGGAGGCAGCCTACGCCCAGGCCATTCCCTCCAGGACCATTGCCAACACCACCTGCCACCTGCCACGTACAG 
ACGCCATGCACATGTTTCGAGACCCCCACACAGGGGACCTGCCGTGGACCGGGATGACCTTTGGCCTGACCA 
TCATGGCCACCTGGTACTGGTGCACCGACCAGGTGATCGTGCAGCGATCACTGTCAGCCCGGGACCTGAACC 
ATGCCAAGGCGGGCTCCATCCTGGCCAGCTACCTCAAGATGCTCCCCATGGGCCTGATCATAATGCCGGGCA 
TGATCAGCCGCGCATTGTTCCCAGATGATGTGGGCTGCGTGGTGCCGTCCGAGTGCCTGCGGGCCTGCGGGG 
CCGAGGTCGGCTGCTCCAACATCGCCTACCCCAAGCTGGTCATGGAACTGATGCCCATCGGTCTGCGGGGGC 
TGATGATCGCAGTGATGCTGGCGGCGCTCATGTCGTCGCTGACCTCCATCTTCAACAGCAGCAGCACCCTCT 
TCACTATGGACATCTGGAGGCGGCTGCGTCCCCGCTCCGGCGAGCGGGAGCTCCTGCTGGTGGGACGGCTGG 
TCATAGTGGCACTCATCGGCGTGAGTGTGGCCTGGATCCCCGTCCTGCAGGACTCCAACAGCGGGCAACTCT 
TCATCTACATGCAGTCAGTGACCAGCTCCCTGGCCCCACCAGTGACTGCAGTCTTTGTCCTGGGCGTCTTCT 
GGCGACGTGCCAACGAGCAGGGGGCCTTCTGGGGCCTGATAGCAGGGCTGGTGGTGGGGGCCACGAGGCTGG 
TCCTGGAATTCCTGAACCCAGCCCCACCGTGCGGAGAGCCAGACACGCGGCCAGCCGTCCTGGGGAGCATCC 
ACTACCTGCACTTCGCTGTCGCCCTCTTTGCACTCAGTGGTGCTGTTGTGGTGGCTGGAAGCCTGCTGACCC 
CACCCCCACAGAGTGTCCAGATTGAGAACCTTACCTGGTGGACCCTGGCTCAGGATGTGCCCTTGGGAACTA 
AAGCAGGTGATGGCCAAACACCCCAGAAACACGCCTTCTGGGCCCGTGTCTGTGGCTTCAATGCCATCCTCC 
TCATGTGTGTCAACATATTCTTTTATGCCTACTTCGCCTGA CACTGCCATCCTGGACAGAAAGGCAGGAGCT 
CTGAGTCCTCAGGTCCACCCATTTCCCTCATGGGGATCCCGA 



In a search of public sequence databases, the NOV27a nucleic acid sequence, located 
on chromosome 17, has 1598 of 1838 bases (86%) identical to a gb:GENBANK- 

10 lD:OCU08813|acc:U08813.1 mRNA from Oryctolagus cuniculus {Oryctolagus cuniculus 
Na+/glucose cotransporter-related protein mRNA, complete cds) (E = 2.6e~ 309 ). 

A disclosed NOV27a polypeptide (SEQ ID NO: 1 14) encoded by SEQ ID NO:l 13 has 
596 amino acid residues and is presented in Table 27B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV27A has a signal peptide and is 

15 likely to be localized to the plasma membrane with a certainty of 0.8200. Alternatively, 
NOV27a may also localize to the endoplasmic reticulum (membrane) with a certainty of 
0.6850, to the Golgi body with a certainty of 0.4600, or to the enoplasmic reticulum (lumen) 
with a certainty of 0.1000. The most likely cleavage site for NOV27A is between positions 42 
and43:CRA-SR. 
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Table 27B. Encoded NOV27a protein sequence (SEQ ID NO: 114). 

MAANSTSDLHTPGTQLSVADIIVITVYFAIJWAVGIWSSCRASRNTVNGYFLAGRDMTWWPIGASLFASSEG 
SGLFIGLAGSGAAGGLAVAGFEWNATYVLLALAWVFVPIYISSEIVTLPEYIQKRYGGQRIRMYLSVLSLLL 
SVFTKISLDLYAGALFVHICLGWNFYLSTILTLGITALYTIAGGLAAVIYTDALQTLIMWGAVILTIKAFD 
QIGGYGQLEAAYAQAIPSRTIANTTCHLPRTDAMHMFRDPHTGDLPWTGMTFGLTIMATWYWCTDQVIVQRS 
LSARDLNHAKAGSILASYLKMLPMGLIIMPGMISRALFPDDVGCVVPSECLRACGAEVGCSNIAYPKLVMEL 
MPIGLRGLMIAVMLAALMSSLTSIFNSSSTLFTMDIWRRLRPRSGERELLLVGRLVIVALIGVSVAWIPVLQ 
DSNSGQLFXYMQSVTSSLAPPVTAVFVLGVFWRRANEQGAFWGLIAGLWGATRLVLEFLNPAPPCGEPDTR 
PAVLGS I HYLHFAVALFALSGAVWAGSLLTPPPQSVQI ENLTWWTLAQDVPLGTKAGDGQTPQKHAFWARV 
CGFNAILLMCVNIFFYAYFA 

A search of sequence databases reveals that the NOV27a amino acid sequence has 531 
of 596 amino acid residues (89%) identical to, and 559 of 596 amino acid residues (93%) 
similar to, the 597 amino acid residue ptnr:SPTREMBL-ACC:Q28610 protein from 
Oryctolagus cuniculus (Rabbit) (Na+/Glucose Cotransporter-Related Protein) (E = l.le" 289 ). 

NOV27a is predicted to be expressed in at least heart and kidney. This information was 
derived by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, public EST sources, and/or RACE sources. 

In addition, the sequence is predicted to be expressed in the following tissues because 
of the expression pattern of (GENBANK-ID: gb:GENBANK-ID:OCU08813|acc:U08813.1) a 
closely related Oryctolagus cuniculus Na+/glucose cotrans porter-related protein mRNA, 
complete cds homolog. 

NOV27b 

In the present invention, the target sequence identified previously, NOV27a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 
examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, or by translated homology of the predicted exons to closely 
related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
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uterus. Usually the resulting ampl icons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
5 another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV27b. This differs from the previously 
identified sequence (NOV27a) in having 16 extra internal amino acids. 

A disclosed NOV27b nucleic acid of 1912 nucleotides (also referred to as CG56645-02 
10 encoding a novel sodium - glucose cotransporter-like protein is shown in Table 27C. An open 
reading frame was identified beginning with a ATG initiation codon at nucleotides 35-37 and 
ending with a TGA codon at nucleotides 1871-1873. The start and stop codons are shown in 
bold in Table 27C, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 27C. NOV27b nucleotide sequence (SEQ ID NO:115). 



CGGGCTCATACCTAGTGCCTGCGGCAGGACAGCC ATGGCCGCCAACTCCACCAGCGACCTCCACACTCCCGG 
GACGCAGCTGAGCGTGGCTGACATCATCGTCATCACTGTGTATTTTGCTCTGAACGTGGCCGTGGGCATATG 
GTC CT C TTGT C GGG C C AGT AGG AAC ACGGTG AATGGCT ACTTCCTGGCAGG CCGGG ACATG ACG TGGTGG C C 
GATTGGAGCCTCCCTCTTCGCCAGCAGCGAGGGCTCTGGCCTCTTCATTGGACTGGCGGGCTCAGGCGCGGC 
AGGAGGTCTGGCCGTGGCAGGCTTCGAGTGGAATGCCACGTACGTGCTGCTGGCACTGGCATGGGTGTTCGT 
GCCCATCTACATCTCCTCAGAGATCGTCACCTTACCTGAGTACATTCAGAAGCGCTACGGGGGCCAGCGGAT 
CCGCATGTACCTGTCTGTCCTGTCCCTGCTACTGTCTGTCTTCACCAAGATATCGCTGGACCTGTACGCGGG 
GGCTCTGTTTGTGCACATCTGCCTGGGCTGGAACTTCTACCTCTCCACCATCCTCACGCTCGGCATCACAGC 
CCTGTACACCATCGCAGGGGGCCTGGCTGCTGTAATCTACACGGACGCCCTGCAGACGCTCATCATGGTGGT 
GGGGGCTGTCATCCTGACAATCAAAGCTTTTGACCAGATCGGTGGTTACGGGCAGCTGGAGGCAGCCTACGC 
CCAGGCCATTCCCTCCAGGACCATTGCCAACACCACCTGCCACCTGCCACGTACAGACGCCATGCACATGTT 
TCGAGACCCCCACACAGGGGACCTGCCGTGGACCGGGATGACCTTTGGCCTGACCATCATGGCCACCTGGTA 
CTGGTGCACCGACCAGGTCATCGTGCAGCGATCACTGTCAGCCCGGGACCTGAACCATGCCAAGGCGGGCTC 
CATCCTGGCCAGCTACCTCAAGATGCTCCCCATGGGCCTGATCATCATGCCGGGCATGATCAGCCGCGCATT 
GTTCCCAGGTGCTCATGTCTATGAGGAGAGACACCAAGTGTCCGTCTCTCGAACAGATGATGTGGGCTGCGT 
GGTGCCGTCCGAGTGCCTGCGGGCCTGCGGGGCCGAGGTCGGCTGCTCCAACATCGCCTACCCCAAGCTGGT 
CATGGAACTGATGCCCATCGGTCTGCGGGGGCTGATGATCGCAGTGATGCTGGCGGCGCTCATGTCGTCGCT 
GACCTCCATCTTCAACAGCAGCAGCACCCTCTTCACTATGGACATCTGGAGGCGGCTGCGTCCCCGCTCCGG 
CGAGCGGGAGCTCCTGCTGGTGGGACGGCTGGTCATAGTGGCACTCATCGGCGTGAGTGTGGCCTGGATCCC 
CGTCCTGCAGGACTCCAACAGCGGGCAACTCTTCATCTACATGCAGTCAGTGACCAGCTCCCTGGCCCCACC 
AGTGACTGCAGTCTTTGTCCTGGGCGTCTTCTGGCGACGTGCCAACGAGCAGGGGGCCTTCTGGGGCCTGAT 
AGCAGGGCTGGTGGTGGGGGCCACGAGGCTGGTCCTGGAATTCCTGAACCCAGCCCCACCGTGCGGAGAGCC 
AGACACGCGGCCAGCCGTCCTGGGGAGCATCCACTACCTGCACTTCGCTGTCGCCCTCTTTGCACTCAGTGG 
TGCTGTTGTGGTGGCTGGAAGCCTGCTGACCCCACCCCCACAGAGTGTCCAGATTGAGAACCTTACCTGGTG 
GACCCTGGCTCAGGATGTGCCCTTGGGAACTAAAGCAGGTGATGGCCAAACACTCCAGAAACACGCCTTCTG 
GGCCCGTGTCTGTGGCTTCAATGCCATCCTCCTCATGTGTGTCAACATATTCTTTTATGCCTACTTCGCCTG 
ACACTGCCATCCTGGACAGAAAGGCAGGAGCTCTGAGTCC 



15 In a search of public sequence databases, the NOV27b nucleic acid sequence, located 

on chromosome 17, has 903 of 1017 bases (88%) identical to a gb:GENBANK- 
ID:OCU08813|acc:U08813.1 mRNA from Oryctolagus cuniculus {Oryctolagus cuniculus 
Na+/glucose cotransporter-related protein mRNA, complete cds) (E = 4Aq a 76 ). 

The disclosed NOV27b polypeptide (SEQ ID NO:l 16) encoded by SEQ ID NO:l 15 
20 has 612 amino acid residues and is presented in Table 27D using the one-letter amino acid 
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code. Signal P, Psort and/or Hydropathy results predict that NOV27b has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.8200. Alternatively, 
NOV27b may also localize to the endoplasmic reticulum (membrane) with a certainty of 
0.6850, to the Golgi body with a certainty of 0.4600, or to the endoplasmic reticulum (lumen) 
5 with a certainty of 0.1000. The most likely cleavage site for NOV27B is between positions 42 
and43:CRA-SR. 



Table 27D. Encoded NOV27b protein sequence (SEQ ID NO:116). 

MAANSTSDLHTPGTQLSVADIIVITVYFALNVAVGIWSSCRASRNTVNGYFLAGRDMTWWPIGASLFASSEG 
SGLFI GLAGS GAAGGLAVAGFEWNAT YVLLALAWVFVPI Y I S SE I VTLPE Y I QKR YGGQRI RMYLS VLSLLL 
S V FT K I S LDL Y AG AL FVH I CLGWNF Y L S T I LTLG I T AL YT I AGGL AAV I YTD ALQTL I MWGAVILTI KAFD 
QIGGYGQLEAAYAQAIPSRTIANTTCHLPRTDAMHMFRDPHTGDLPWTGMTFGLTIMATWYWCTDQVIVQRS 
LSARDLNHAKAGSILASYLKMLPMGLIIMPGMISRALFPGAHVYEERHQVSVSRTDDVGCWPSECLRACGA 
EVGCSNIAYPKLVMELMPIGLRGLMIAVMLAALMSSLTSIFNSSSTLFTMDIWRRLRPRSGERELLLVGRLV 
IVALIGVSVAWIPVLQDSNSGQLFIYMQSVTSSLAPPVTAVFVLGVFWRRANEQGAFWGLIAGLVVGATRLV 
LEFLNPAPPCGEPDTRPAVLGSIHYLHFAVALFALSGAVWAGSLLTPPPQSVQIENLTWWTLAQDVPLGTK 
AGDGQTLQKHAFWARVCGFNAILLMCVNIFFYAYFA 



A search of sequence databases reveals that the NOV27b amino acid sequence has 530 
10 of 612 amino acid residues (86%) identical to, and 558 of 612 amino acid residues (91%) 
similar to, the 597 amino acid residue ptnr:SPTREMBL-ACC:Q28610 protein from 
Oryctolagus cuniculus (Rabbit) (Na+/Glucose Cotransporter-Related Protein) (E = 1.9e" 284 ). 

NOV27b is predicted to be expressed in at least heart and kidney. .The sequence is 
predicted to be expressed in the following tissues because of the expression pattern of 
15 (GENB ANK-ID : gb:GENBANK-ID:OCU08813|acc:U08813.1) a closely related Oryctolagus 
cuniculus Na+/gIucose cotransporter-related protein mRNA, complete cds homolog. 
NOV27c 

A disclosed NOV27c nucleic acid of 1741 nucleotides (also referred to as 191828203) 
encoding a novel sodium - glucose cotransporter-like protein is shown in Table 27E. An open 
20 reading frame was identified beginning with a ATG initiation codon at nucleotides 5-7 and 
ending with a TGA codon at nucleotides 1688-1690. The start and stop codons are shown in 
bold in Table 27E, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 27E. NOV27c nucleotide sequence (SEQ ID NO: 117). 

AGCC ATGGCCGCCAACTCCACCAGCGACCTCCACACTCCCGGGACGCAGCTGAGCGTGGCTGACATCATCGT 
CATCACTGTGTATTTTGCTCTGAATGTGGCCGTGGGCATATGGTCCTCTTGTCGGGCCAGTAGGAACACGGT 
GAATGGCTACTTCCTGGCAGGCCGGGACATGACGTGGTGGCCGATTGGAGCCTCCCTCTTCGCCAGCAGCGA 
GGGCTCTGGCCTCTTCATTGGACTGGCGGGCTCAGGCGCGGCAGGAGGTCTGGCCGTGGCAGGCTTCGAGTG 
GAATGCCACGTACGTGCTGCTGGCACTGGCATGGGTGTTCGTGCCCATCTACATCTCCTCAGAGCTGGACCT 
GT ACGCGGGGG CTCTGT TT GTG C AC AT CTG C CTGGGC TGGAACTT CT ACCT CT C CAC C AT CCTCACG CTCGG 
CATCACAGCCCTGTACACCATCGCAGGGGGCCTGGCTGCTGTAATCTACACGGACGCCCTGCAGACGCTCAT 
CATGGTGGTGGGGGCTGTCATCCTGACAATCAAAGCTTTTGACCAGATCGGTGGTTACGGGCAGCTGGAGGC 
AGCCTACGCCCAGGCCATTCCCTCCAGGACCATTGCCAACACCACCTGCCACCTGCCACGTACAGACGCCAT 
GCACATGTTTCGAGACCCCCACACAGGGGACCTGCCGTGGACCGGGATGACCTTTGGCCTGACCATCATGGC 
CACCTGGTACTGGTGCACCGACCAGGTCATCGTGCAGCGATCACTGTCAGCCCGGGACCTGAACCATGCCAA 
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GGCGGGCTCCATCCTGGCCAGCTACCTCAAGATGCTCCCCATGGGCCTGATCATCATGCCGGGCATGATCAG 
CCGCGCATTGTTCCCAGATGATGTGGGCTGCGTGGTGCCGTCCGAGTGCCTGCGGGCCTGCGGGGCCGAGGT 
CGGCTGCTCCAACATCGCCTACCCCAAGCTGGTCATGGAACTGATGCCCATCGGTCTGCGGGGGCTGATGAT 
CACAGTGATGCTGGCGGCGCTCATGTCGTCGCTGACCTCCATCTTCAACAGCAGCAGCACCCTCTTCACTAT 
GGACATCTGGAGGCGGCTGCGTCCCCGCTCCGGCGAGCGGGAGCTCCTGCTGGTGGGACGGCTGGTCATAGT 
GGCACTCATCGGCGTGAGTGTGGCCTGGATCCCCGTCCTGCAGGGCTCCAACAGCGGGCAACTCTTCATCTA 
CATGCAGTCAGTGACCAGCTCCCTGGCCCCACCAGTGACTGCAGTCTTTGTCCTGGGCGTCTTCCGGCGACG 
TGCCAACGAGCAGGGGGCCTTCTGGGGCCTGATAGCAGGGCTGGTGGTGGGGGCCACGAGGCTGGTCCTGGA 
ATTCCTGAACCCAGCCCCACCGTGCGGAGAGCCAGACACGCGGCCAGCCGTCCTGGGGAGCATCCACTACCT 
GCACTTCGCTGTCGCCCTCTTTGCACTCAGTGGTGCTGTTGTGGTGGCTGGAAGCCTGCTGACCCCACCCCC 
ACAGAGTGTCCAGATTGAGAACCTTACCTGGTGGACCCTGGCTCAGGATGTGCCCTTGGGAACTAAAGCAGG 
TGATGGCCAAACACCCCAGAAACACGCCTTCTGGGCCCGCGTCTGTGGCTTCAATGCCATCCTCCTCATGTG 
TGTCAACATATTCTTTTATGCCTACTTCGCCTGA CACTGCCATCCTGGACAGAAAGGCAGGAGCTCTGAGTT 
GGCGGCCATGGCT 



In a search of public sequence databases, the NOV27c nucleic acid sequence, located 
on chromosome 17, has 1409 of 1445 bases (97%) identical to a gb:GENBANK- 
ID:AX1 9 1622|acc:AX19 1622.1 mRNA from Homo sapiens (Sequence 144 from Patent 
5 WO0149728)(E = 0.0). 

A disclosed NOV27c polypeptide (SEQ ID NO: 1 1 8) encoded by SEQ ID NO:l 17 has 
561 amino acid residues and is presented in Table 27F using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV27c has a signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.8200. Alternatively, 
10 NOV27c may also localize to the endoplasmic reticulum (membrane) with a certainty of 

0.6850, to the Golgi body with a certainty of 0.4600, or to the endoplasmic reticulum (lumen) 
with a certainty of 0.1000. The most likely cleavage site for NOV27C is between positions 42 
and 43: CRA-SR. 



15 



20 



Table 27F. Encoded NOV27c protein sequence (SEQ ID NO: 118). 



MAANSTSDLHTPGTQLSVADIIVITVYFALNVAVGIWSSCRASRNTVNGYFLAGRDMTWWPIGASLFASSEG 
SGLFIGLAGSGAAGGLAVAGFEWNATYVLLALAWVFVPI YISSELDLYAGALFVHI CLGWNFYLST I L.TLGI 
TALYT I AGGLAAV I YTDALQTLI MWGAV I LT I KAFDQI GGYGQLEAAYAQAI PS RT I ANTTCHLPRTDAMH 
MFRDPHTGDLPWTGMTFGLTIMATWYWCTDQ VI VQRSLSARDLNHAKAGS I LAS YLKMLPMGL 1 1 MPGM I SR 
ALFPDDVGCWPSECLRACGAEVGCSNIAYPKLVMELMPIGLRGLMITVMLAALMSSLTSIFNSSSTLFTMD 
IWRRLRPRSGERELLLVGRLVIVALIGVSVAWIPVLQGSNSGQLFIYMQSVTSSLAPPVTAVFVLGVFRRRA 
NEQGAFWGLIAGLWGATRLVLEFLNPAPPCGEPDTRPAVLGSIHYLHFAVALFALSGAVWAGSLLTPPPQ 
S VQI ENLTWWTLAQDVPLGTKAGDGQTPQKHAFW ARVCGFNAI LLMCVNI FFY AY FA 



A search of sequence databases reveals that the NOV27c amino acid sequence has 394 
of 460 amino acid residues (85%) identical to, and 423 of 460 amino acid residues (91%) 
similar to, the 597 amino acid residue ptnr:SPTREMBL-ACC:Q28610 protein from 
Oryctolagus cuniculus (Rabbit) (Na+/Glucose Cotransporter-Related Protein) (E = 2.6e" 125 ). 

NOV27c is predicted to be expressed in at least heart, kidney, and colon. Expression 
information was derived from the tissue sources of the sequences that were included in the 
derivation of the sequence of CuraGen Acc. No. 191828203. The sequence is predicted to be 
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expressed in kidney because of the expression pattern of (GENBANK-ID: gb:GENBANK- 
ID:OCU08813|acc:U08813.1) a closely related Oryctolagus cuniculus Na-H/glucose 
cotransporter-related protein mRNA, complete cds homolog. 

The disclosed NOV27a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 27G. 



Table 27G. BLAST results for NOV27a 




Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 52 04 6 9 |gb| AAA6 6 0 
65. l| (U08813) 


597 aa protein 
related to 
Na/glucose 
cotransporters 
[Oryctolagus 
cuniculus] 


597 


531/596 
(89%) 


559/596 
(93%) 


0.0 


gi | 16553 933 | db j | BAB 
71619.1] (AK057946) 


unnamed protein 
product [Homo 
sapiens] 


517 


440/456 
(96%) 


440/456 
(96%) 


0.0 


gi| 18203958 | gb | AAH2 
1357.l|AAH21357 
(BC021357) 


Unknown {protein 
for MGC:29197) 
[Mus musculus] 


678 


346/545 
(63%) 


435/545 
(79%) 


0.0 


gi|958 8428| emb | CACO 
0574. l| (AL109659) 


dJ1024N4.1 (novel 
Sodium : solute 
symporter family 
member similar to 
SLC5A1 (SGLTl) ) 
[Homo sapiens] 


552 


344/522 
(65%) 


425/522 
(80%) 


e-180 


gi | 2564063 | db j | BAA2 
2950. l| (AB008225) 


Na+-glucose 
cotransporter 
type 1 (SGLT-l) - 
like protein 
[Xenopus laevis] 


673 


315/539 
(58%) 


415/539 
(76%) 


e-174 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 27H. In the ClustalW alignment of the NOV27 protein, as 
10 well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence {i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 27H. ClustalW Analysis of NOV27 



1) Novel NOV27a (SEQ ID NO: 114) 

2) Novel NOV27b (SEQ ID NO: 116) 

3) Novel NOV27b (SEQ ID NO: 118) 

4) gi|520469|gb|AAA66065.l| (U08813) 597 aa protein related to Na/glucose 
cotransporters [Oryctolagus cuniculus] (SEQ ID NO: 436) 

5) gi | 16553933 |dbj |BAB71619.l| (AK057946) unnamed protein product [Homo sapiens] 
(SEQ ID NO:437) 
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6) gi|l8203958|gb|AAH21357.l|AAH21357 (BC021357) Unknown (protein for MGC:29197) 
[Mus musculus] (SEQ ID NO:438) 

7) gil 9588428 | emb | CAC00574 . 1 | (AL109659) dJ1024N4.l (novel Sodium : solute symporter 
family member similar to SLC5A1 (SGLT1) ) [Homo sapiens] (SEQ ID NO:439) 

8) gi|2564063 |dbj |BAA22950.l| (AB008225) Na+ -glucose cotransporter type 1 (SGLT-1) 
like protein [Xenopus laevis] (SEQ ID NO:440) 



10 



15 



20 



25 



30 



35 



40 



55 



60 



65 



NOV27a 
NOV27b 
NOV27c 
gi|520469| 
gi|l6553933 | 
gi|l8203958| 
gi|9588428| 
gi|2564063 | 



NOV27a 
NOV27b 
NOV27c 
gi|520469| 
gi | 16553933 | 
gi|l8203958| 
gi|9588428| 
gi|2564063 j 



NOV27a 
NOV27b 
NOV27C 
gi|520469| 
gi|l6553933 | 
gi j 18203958 j 
gi|9588428| 
gi|2564063 | 





NOV27a 


171 




NOV27b 


171 


45 


NOV27C 


136 




gi|520469| 


172 




gi|l6553933| 


76 




gi|18203958| 


177 




gi| 9588428 | 


181 


50 


gi|2564063 | 


178 



NOV27a 

NOV27b 

NOV27C 

gi | 520469 | 

gi|l6553933 | 

gi|l8203958| 

gi|9588428| 

gi|2564063 | 



NOV27a 


291 


NOV27b 


291 


NOV27C 


256 


gi|520469| 


292 


gi|l6553933 | 


196 




231 
231 
196 
232 
136 
237 
241 
238 



240 
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gi|18203958| 297 
gi|9588428| 301 
gi|2564063| 298 



NOV27a 
NOV27b 
NOV27C 
gi|520469| 
gi|l6553933 | 
gi|l8203958| 
gi|9588428| 
gi|2564063 j 



NOV27a 
NOV27b 
NOV27C 
gi|520469| 
gi|l6553933| 
gi|l8203958| 
gi j 9588428 | 
gi|2564063 j 



NOV27a 
NOV27b 
NOV27c 
gi|520469| 
gi | 16553933 | 
gi 1 18203958 j 
gi | 9588428 | 
gi j 2564063 j 



NOV27a 

NOV27b 

NOV27C 

gi | 520469 | 

gi|l6553933 | 

gi|l8203958| 

gi|9588428| 

gi | 2564063 j 




600 
|....| 

550 
566 
515 
551 
471 

SVSVN 573 
552 

FDAEVEDPVETNHRPAENGI S W 581 

660 



NOV27a 


550 


NOV27b 


566 


NOV27C 


515 


9i 


520469| 


551 


gi 


16553933 | 


471 


gi 


18203958 | 


574 


gi 


9588428 | 


552 


gi 


2564063 j 


582 




3 E ANTT QG P E Q P G ALHR S WG K WL WNVJ FCGL SG AP QQ A 63 3 

552 



582 EDISEEPHTT|TSAIYNDTTDiSNPlS-S SLLKKTILWLCGMDSRKGDK 628 



NOV27a 

NOV27b 

NOV27C 

gi | 520469 | 

gi|l6553933 | 

gijl8203958| 

gi|9588428| 

gi|2564063 | 




552 



629 HDQAPPAPLEPAEVLLYlRPLLK^LNTAVgjcSSAG|HL|^§G 67 3 
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Table 271 lists the domain description from DOMAIN analysis results against NOV27. 
This indicates that the NOV27 sequence has properties similar to those of other proteins 
5 known to contain this domain. 
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Table 271 Domain Analysis of NOV27 

gnl|Pfam|pfam004 74, SSF, Sodium : solute symporter family. (SEQ ID 
NO : 8 2 0 ) 

CD-Length = 406 residues, 100.0% aligned 
Score = 310 bits (793), Expect = 2e-85 



Y FLAGRDMTWWP I GAS L FAS S E GSGLFIGLAGS GAAGGLA T/AGFE WNAT Y VLLALAWVFV 10 9 

mill ii + inn + i+iiii+iii hi + i + i i + i 

10 Sbjct: 1 YFLAGRSMTGFVNGLSLAASYMSAASFVGLAGAGAASGLAGGLYAIGALVGVWLLLWLFA 60 



NOV2 7 : 


50 


Sbjct : 


1 


NOV2 7 : 


110 


Sbjct : 


61 


NOV2 7 : 


170 


Sbjct : 


121 


NOV2 7 : 


230 


Sbjct : 


181 


NOV2 7 : 


290 


Sbj Ct : 


227 


NOV27 : 


350 


Sbjct : 


278 


NOV2 7 : 


410 


Sbjct : 


338 


NOV2 7 : 


470 


Sbj Ct : 


397 



PIYISSE1 V f J-iFx!*! iyi\.KHj(ayKJ.lU , lI L/D VOj^JjJjJ-'O v l- j. j_ j_i -r*^ . — — — 

I + '* I + I + I++II + M + M +111 Mill II + l + + II + + II l + 

PRLRNLGAYTMPDYLRKRFGGKRILVYLSALSLLLYFFTYMSVQIVGGARLIEIALGLNY 12 0 
YLSTILTLGITALYTIAGGLAAVIYTDALQTLIMWGAVILTIKAFDQIGGYGQLEAAYA 22 9 

I + +| + || + || II 1 1 +1 I +1 ++I+ I + M I I ++1 1 1 I 



I I +1+ Ml II I 1+ I I + Ml I 

r T PDGLHILRDPLTGLSLWPGLVLGTTGL PHILQRCL 22 6 



, _ , t S ARDLNHAKAG S I LAS YL KML PMGL I IMPGMISRALFPDDVGCWPSECLRACGAEVGCS 34 9 

25 * +| + | II I + II +MIIIIII II + I MM MM 

nm a if T P PfiVT . T LT PMF 1 1 VM PGM I SRGLFAI ALAGANP RACGTWGCS 277 



IMM I ++I I II l + l + l I M I I + I I + I I I IM + M I+ + + +1 + + I 

NIAYPTLAVKLGPPGLAGIMLAVMLAAIMSTLTSQLLSSSSAFTHDLYKNIRRKASATEK 

LL VGRL V I VAL I G VS V AW I P VLQD SN S GQ L F I YMQ S VTS S L AP P VT AV FVLG V FWRRANE 

mi i+ i+ +i+i + +i + +i +i i +i +i i+i 1 1 

ELVGRSRI IVLWISLAILLAVQ- PAQMGIAFLVQLAFAGLGSAFLPVILLAI FWKRVNE 
QGAFWGL I AG 479 

III ll + l I 



40 The gene of invention codes for a human ortholog of a rabbit sodium-glucose 

cotransporter (SGLT) and belongs to the large family of SGLTs that has been described to 
date. The rabbit gene is expressed in the kidney (Pajor,Biochim Biophys Acta 1994 Sep 
14;1 194(2):349-51), and the novel gene described herein is expressed in the heart in addition 
to the kidney. It shows the characteristic sodium-solute symporter protein motif shared by 

45 members of the SGLT family. 

SGLTs are critical in the maintenance of glucose homeostasis in the body, in a variety 
of tissues. Inhibitors of SGLTs are being studied in the treatment of diabetes. Treatment of 
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Zucker diabetic fatty rats with the SGLT inhibitor T-1095 lowers both fed and fasted blood 
glucose levels to near-normal levels (Nawano et al., Am J Physiol Endocrinol Metab 2000 
Mar;278(3):E535-43). In streptozotocin-induced diabetic rats, T-1095 also exerts an 
antihyperglycemic effect which is nullified by nephrectomy, indicating that the drug acts 
through inhibition of renal SGLTs rather than intestinal ones (Oku et al., Biol Pharm Bull 
2000 Dec;23(12): 1434-7) In addition, SGLT-1 seems to have a role in mammalian renal 
tubulogenesis (Yang et al., Am J Physiol Renal Physiol 2000 Oct;279(4):F765-77). 

The disclosed NOV27 nucleic acid of the invention encoding a Sodium - Glucose 
Cotransporter -like protein includes the nucleic acid whose sequence is provided in Table 27A, 
27C, 27E or a fragment thereof. The invention also includes a mutant or variant nucleic acid 
any of whose bases may be changed from the corresponding base shown in Table 27A, 27C, 
or 27E while still encoding a protein that maintains its Sodium - Glucose Cotransporter -like 
activities and physiological functions, or a fragment of such a nucleic acid. The invention 
further includes nucleic acids whose sequences are complementary to those just described, 
including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 14 percent of the 
bases may be so changed. 

The disclosed NOV27 protein of the invention includes the Sodium - Glucose 
Cotransporter -like protein whose sequence is provided in Table 27B, 30D, or 30F. The 
invention also includes a mutant or variant protein any of whose residues may be changed 
from the corresponding residue shown in Table 27B, 27D, or 27F while still encoding a 
protein that maintains its Sodium - Glucose Cotransporter -like activities and physiological 
functions, or a functional fragment thereof. In the mutant or variant protein, up to about 42 
percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fab>2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Sodium - Glucose Cotransporter - 
like protein (NOV27) is a member of a "Sodium - Glucose Cotransporter family". Therefore, 
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the NOV27 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
5 targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV27 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in diabetes, obesity, hypertension, cardiomyopathy, 

10 atherosclerosis, congenital heart defects, aortic stenosis, atrial septal defect (ASD), 

atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis, subaortic stenosis, 
ventricular septal defect (VSD), valve diseases, tuberous sclerosis, scleroderma, 
transplantation, autoimmune disease, renal artery stenosis, interstitial nephritis, 
glomerulonephritis, polycystic kidney disease, systemic lupus erythematosus, renal tubular 

15 acidosis, IgA nephropathy, hypercalceimia, Lesch-Nyhan syndrome, cancer, tissue 

degeneration, diabetic nephropathy, microvascular and macrovascular disease, and/or other 
diseases and pathologies. 

NOV27 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV27 substances for use in 

20 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV27 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 

25 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV28 

A disclosed NOV28 nucleic acid of 1560 nucleotides (also referred to as CG561 85-01) 
encoding a MYD-l-like protein is shown in Table 28A. An open reading frame was identified 
30 beginning with a ATG initiation codon at nucleotides 31-33 and ending with a TGA codon at 
nucleotides 1537-1539. The start and stop codons are shown in bold in Table 28A, and the 5' 
and 3' untranslated regions, if any, are underlined. 
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Table 28A. NOV28 nucleotide sequence (SEQ ID NO:119). 

CAGCCCTCGCGGGCGGCGTAGCCGCGGCCC ATGGAGCCCGCGGGCCGGTCCCCGGCCGCCTCGGGCCGCTGC 
TCTGCCTGCTGCTCCCCGCGTCCTGCGCCTGGTGGAGTGGCGGGTGAGGAGGAGCTGCAGGTGATTCAGCCT 
GAGAAGTCTGTATCAGTTGCAGCTGGAGAGTCGGCCGCTCTGCAGTGCACTGTGACCTCCCTGAACCCTGTG 
GGGCCCATCCAACGGTTCAGAGGAGCTGGACCAGGCCGGAAATTAATCTACCATCAAAAAGAAGGCCACTTC 
C C C CGGGT AAC AACTGT TT C AG ATC TC AC AAAG AG AAC C AAC ATGG ACTTT T C CAT C TG CAT C AGT AAC AT C 
ACCCCAGCAGATGCCGGCACCTACTACTGTGTGAAGTTCCAGAAAGGGAGCCCTGACGTGGAGTTGAAGTCT 
GGAGCAGGCACTGAGCTGTCTGTGCGTGCCAAACCCTCTGCCCCCGTGGTATCGGGCCCCGCAGCGAGGGCC 
ACACCTGACCACACAGTGAGCTTCACCTGCGAGTCTCATGGCTTCTCACCCAGAGACATCAGCCTGAAATGG 
TTCAAAAATGGGAATCAGCTCTCAGACTTCCAGACCAACGTGGACCCCGCAAGAGAGAGCGTGTCCTACAGC 
ATCCACAGCACAGCCAATGTGGTGCTGACCCGCGGGGACATTCACTCTCAAGTCATCTGCGAGGTGGCCCAC 
GTCACCTTGCGGGGGGACTCTTTTCGTGGGACTGCCAACTTGTCTGAGACTATCCAAGTTCCACCCACCTTG 
GAGGTTACTCAACAGCCCATGAGGGCAGAGAACCAGGTGAATATCACCTGCCAGGTGACGAAATTCTACCCC 
CAGAGACTACAGTTGACCTGGTTGGAGAACGGCAATGTGTCCCGGACAGAAACGGCCTCAACTCTTACAGAG 
AACAAGGATGGCACCTACAACTGGATGAGCTGGCTCCTGGTGAATGTATCTGCCCACAGGGATGATGTGAAG 
CTCACCTGCCAGGTGGAGCATGACGGGCAGTCAGCGGTCAGCAAAAGCCATGACCTGAAGGTCTCAGCCCAC 
CTGAAGGAGCAGAGCTCAAATACCGCCGCTGAGAACACTGGACCTAATGAACAGAACATCTATATTGTGGTG 
GGCGTGGTGTGCACCTTGCTGGTGGCCCTACTGATGGAGGCTCTCTACCTCGTCCGAATCAGACAGAAGAAA 
GC C C AGGGC T C C AC TT C TT CT AC AAGGTTGC ATG AACC CG AGAAG AATG CC AGAAAAAT AAC CCAGG AC AC A 
AATGATATCACATATGCGGACCTGAACCTGCCCAAGGGGAAGAAGCCTGCTCCCCGGGCCGCGGAGCCCAAC 
AACCACACAGAGTATGCCAGCATTCAGACCAGCCTGCAGCCTGCGTCGGAGGACACCCTCACCTATGCTGAC 
CTGGACATGGTGCACCTCAACCGGACCCCCAAGCAGCTGGCCCCCAAGCCCGAGCTGTCCTTCTCAGAGTAT 
GCCAGCATCCAGGTCCCGAGGAAGTGAATGGGACCGTGGTTTGCTCTA 



In a search of public sequence databases, the NOV28 nucleic acid sequence, located on 
chromosome 22, has 1466 of 1544 bases (94%) identical to a gb:GENBANK- 
ID:HSSIRPALP|acc:Yl 0375.1 mRNA from Homo sapiens (Hsapiens mRNA for SIRP- 
5 alphal)(E-7.4e" 310 ). 

The disclosed NOV28 polypeptide (SEQ ID NO: 120) encoded by SEQ ID NO:l 19 has 
503 amino acid residues and is presented in Table 28B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV28 has a signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.4600. Alternatively, 
10 NOV28 may also localize to the endoplasmic reticulum (membrane) with a certainty of 

0.1000, to the endoplasmic reticulum (lumen) with a certainty of 0.1000, or extracellularly 
with a certainty of 0.1000. The most likely cleavage site for NOV28 is between positions 30 
and 31: V AG-EE. 



Table 28B. Encoded NOV28 protein sequence (SEQ ID NO: 120). 

MEPAGPVPGRLGPLLCLLLPASCAWSGVAGEEELQVIQPEKSVSVAAGESAALQCTVTSLNPVGPIQRFRGA 
GPGRKLIYHQKEGHFPRVTTVSDLTKRTNMDFSICISNITPADAGTYYCVKFQKGSPDVELKSGAGTELSVR 
AKPSAPWSGPAARATPDHTVSFTCESHGFSPRDISLKWFKNGNQLSDFQTNVDPARESVSYSIHSTANWL 
T RGD I HS Q V I CE V AH VT L. RGD S F RGT ANL S E T I Q VP P TL E VT QQ PMRAENQ VN I T C Q VT KF Y P QRL QLT WL E 
NGNVSRTETASTLTENKDGTYNWMSWLLVNVSAHRDDVKLTCQVEHDGQSAVSKSHDLKVSAHLKEQSSNTA 
AENTGPNEQNIYIWGWCTLLVALLMEALYLVRIRQKKAQGSTSSTRLHEPEKNARKITQDTNDITYADLN 
LPKGKKPAPRAAEPNNHTEYASIQTSLQPASEDTLTYADLDMVHLNRTPKQLAPKPELSFSEYASIQVPRK 

15 ~ ' " 

A search of sequence databases reveals that the NOV28 amino acid sequence has458 

of 503 amino acid residues (91%) identical to, and 475 of 503 amino acid residues (94%) 

similar to, the 503 amino acid residue ptnr:SPTREMBL-ACC:P78324 protein from Homo 
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sapiens (Human) (Protein Tyrosine Phosphatase, Non-Receptor Type Substrate 1 Precursor 
(Shp Substrate- 1) (Inhibitory Receptor Shps-1) (Shps-1) (Signal- Regulatory Protein Alpha- 1) 
(SIRP-Alphal) (MYD-1 Antigen)) (E = 5.7e 247 ). 

NOV28 is predicted to be expressed in at least myeloid, macrophages, Adrenal 
5 Gland/Suprarenal gland, Bone Marrow, Brain, Whole Organism. This information was derived 
by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or 
RACE sources. 

In addition, the sequence is predicted to be expressed in myeloid and macrophages 
10 because of the expression pattern of (GENBANK-ID: gb : GENB ANK-ID :HS SIRP ALP | acc: 
Y10375.1) a closely related H.sapiens mRNA for SIRP-alphal homolog. 

The disclosed NOV28 polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 28C. 



Table 28C. BLAST results for NOV28 




Gene Index/ 
Identifier 


Protein/ Organism 


Length 

(aa). 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 14771369 |ref |XP_ 
044897 . 1 | 
(XM_044897) 


hypothetical 
protein XP_044897 
[Homo sapiens] 


504 


458/504 
(90%) 


476/504 
(93%) 


0.0 


gi | 4758978 | ref |NP_0 
04639. l| 
(NM_004648) 


protein tyrosine 
phosphatase, non- 
receptor type 
substrate 1; SHP 
substrate- 1 [Homo 
sapiens] 


503 


458/503 
(91%) 


475/503 
(94%) 


0.0 


gi | 6624134 |gb| AAF19 
260 . 1 | AC004832_5 
(AC004832) 


similar to SHPS-1 
[Homo sapiens] ; 
similar to 
BAA12974 . 1 
(PID:gl864011) 


402 


402/402 
(100%) 


402/402 
(100%) 


0.0 


gi | 2 842392 | emb | CAA7 
1944.1] (Y11047) 


MyD-1 antigen 
[Homo sapiens] 


429 


391/429 
(91%) 


407/429 
(94%) 


0-0 


gi | 2842390 | emb | CAA7 
1942. 1| (Y11045) 


MyD-1 antigen 
[Bos taurus] 


506 


373/510 
(73%) 


415/510 
(81%) 


0.0 



15 

The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 28D. In the ClustalW alignment of the NOV28 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (z.e., regions that may be required to preserve structural or 
20 functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 28D. ClustalW Analysis of NOV28 



1) Novel NOV2 8 (SEQ ID NO: 12 0) 

2) gi | 14771369 |ref |XP_044897 . 1 | (XM_044897) hypothetical protein XP_044897 [Homo 
sapiens] (SEQ ID NO: 441) 

3) gi | 4758978 | ref |NP_004639 . 1 | (NM_004648) protein tyrosine phosphatase, non- 
receptor type substrate 1; SHP substrate- 1 [Homo sapiens] (SEQ ID NO: 44 2) 

4) gi|6624134|gb|AAF19260.l|AC004832_5 (AC004832) (SEQ IDNO:443) 

5) similar to SHPS-1 [Homo sapiens]; similar to BAA12974.1 (PID :gl86401l) (SEQ ID 
NO: 44 4) 

6) gi|2842392|emb|CAA71944.l| (Y11047) MyD-1 antigen [Homo sapiens] (SEQ ID NO:445) 

7) gi|2842390|emb|CAA71942.l| (Y11045) MyD-1 antigen [Bos taurus] (SEQ ID NO:446) 



NOV28 

gi 1 14771369 | 
gi | 4758978 | 
gi | 6624134 j 
gi | 2842392 j 
gi|2842390| 



NOV2 8 

gi | 14771369 | 
gi|4758978| 
gi j 6624134 j 
gi|2842392| 
gi|2842390 j 



NOV2 8 

gi|l4771369| 
gi|4758978| 
gi|6624134 j 
gi|2842392 j 
gi|2842390| 



NOV2 8 

gi | 14771369 | 
gi | 4758978 | 
gij 6624134 | 
gi | 2842392 j 
gi|2842390 j 




190 



200 



I 



I 



210 
. . I . . 



I 



220 



230 



240 



I 



NOV2 8 


180 


S 


gi|l4771369| 


181 




gij 4758978 | 


180 


? 


gij 6624134 j 


79 


P 


gi|2842392 j 


106 




gij2842390 j 


181 


b 

P. 



L K WF KNGNgL S D FQTNVD P 
LKWFKNGNELSDFQTNVDP 
LKWFKNGNELSDFQTNVDPl 
L K WF KNGNffiL S D FQTNVD P 



iLK WFKNGNEL £ 



QTgVDP 



ESVSYSIHSTAjjjVVLTR 
ESVSYSIHSTAKVVLTR 
ESVSYSIHSTAKWLTR 
ESVSYSIHSTAKVVLTR 

esvsysihstakvvltr 
vsysi!st!3kv3l! 



VG 
VG 
AR 
VG 

SDDN 



RD||HSQVIC EVAHVTLjgGD 

jdvhsqvicevahvtlqgdp 
sdvhsqvi cevahvtlqgdp 

|D@HSQVI CEVAHVTLj^GDl 

Sdvhsqvi cevahvtlqgdp 

dvhsqvi CEVAHVTLQG§ p 



- 238 

- 239 

- 238 

- 137 

- 164 
P 240 



270 



280 



290 



300 



NOV2 8 


299 


gi| 14771369 | 


300 


gi|4758978| 


299 


gi j 6624134 j 


198 


gi j 2842392 | 


225 


gi|2842390 j 


300 
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NOV2 8 

gi | 14771369 | 
gi |4758978 | 
gij 6624134 | 
gi j 2842392 j 
gi j 2842390 j 



NOV2 8 

gi | 14771369 | 
gij 4758978 | 
gi j 6624134 j 
gi|2842392 j 
gi j 2842390 j 



NOV28 

gi | 14771369 | 
gi | 4758978 | 
gij 6624134 | 
gi|2842392 j 
gi j 2842390 j 




477 
478 
477 
376 
403 
480 



I 



490 
I 



500 



J 



LNRTPKQjjAPKPEgs FSEYASgQVPRK 

lnrtpkqpapkpepsfseyasvqvprk 
lnrtpkqpapkpepsfseyasvqvprk 
lnrtpkq|Japkpe2sfseyas|3qvprk 
lnrtpkqpapkpepsfseyasvqvprk 
lnrtpkqpapkpepsfseyasvqvprk 



503 
504 
503 
402 
429 
506 



Tables 28E-F list the domain descriptions from DOMAIN analysis results against 
NOV28. This indicates that the NOV28 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 28E Domain Analysis of NOV28 

gnl | Smart | smart 0 04 07 , IGcl, Immunoglobulin C-Type (SEQ ID NO: 821) 
CD-Length = 75 residues, 94.7% aligned 
Score = 50.8 bits (120), Expect = 2e-07 



NOV2 8- 26 7 QVNITCQVTKFYPQRLQLTWLENGNVSRTETAST - LTENKDGTYNWMS WLLVNVSAHRDD 325 

+ I II Ml + +111+11 + +1 ++IIIII l + l 1+ I 

Sbjct : 1 PATLVCLVTGFYPPD I TVTWLKNGQEVTSGVKTTDPLKDKDGTYFLSSYLTVS AST WE SG 6 0 

NOV2 8: 32 6 VKLTCQVEHDG 33 6 

I I I I l + l 

Sbjct: 61 DVYTCQVTHEG 71 



Table 28F Domain Analysis of NOV28 

gnl | Smart | smart 0 04 07 , IGcl, Immunoglobulin C-Type (SEQ ID NO: 821) 
CD-Length = 75 residues, 96.0% aligned 
Score = 47.8 bits (112), Expect = 2e-06 



NOV28: 164 TV S FT CE S HG F S P RD I S L K W F KNGNQL S D FQTNVDP ARE S-VSYSIHS T ANWLTRGD I H 222 

+ I I I I I I++ I I I I + + + I I ++ +1 + I I + + 

Sbjct: 1 PATLVCLVTGFYPPD I TVTWLKNGQEVTSGVKTTDPLKDKDGTYFLSSYLTVSASTWESG 6 0 

NOV28: 223 SQVICEVAHVTL 234 
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Sbjct: 61 DVYTCQVTHEGL 72 

Protein tyrosine phosphatases (PTPases), such as SHP-1 and SHP-2; that contain Src 
5 homology 2 (SH2) domains play important roles in growth factor and cytokine signal 

transduction pathways. A protein of approximately 1 15 to 120 kDa that interacts with SHP-1 
and SHP-2 was purified from v-src-transformed rat fibroblasts (SR-3Y1 cells), and the 
corresponding cDNA was cloned. The predicted amino acid sequence of the encoded protein, 
termed SHPS-1 (SHP substrate 1), suggests that it is a glycosylated receptor-like protein with 

10 three immunoglobulin-like domains in its extracellular region and four YXX(L/V/I) motifs, 
potential tyrosine phosphorylation and SH2-domain binding sites, in its cytoplasmic region. 
Various mitogens, including serum, insulin, and lysophosphatidic acid, or cell adhesion 
induced tyrosine phosphorylation of SHPS-1 and its subsequent association with SHP-2 in 
cultured cells. Thus, SHPS-1 may be a direct substrate for both tyrosine kinases, such as the 

15 insulin receptor kinase or Src, and a specific docking protein for SH2-domain-containing 

PTPases. In addition, we suggest that SHPS-1 may be a potential substrate for SHP-2 and may 
function in both growth factor- and cell adhesion-induced cell signaling. (Fujioka et al. Mol 
Cell Biol. 1996 Dec;16(12):6887-99.) 

The rat OX41 antigen is a cell surface protein containing three immunoglobulin 

20 superfamily domains and intracellular immunoreceptor tyrosine-based inhibitory motifs 

(1TIM). It is a homologue of the human signal-regulatory protein (SIRP) also known as SHPS- 
1, BIT or MFR. Cell activation-induced phosphorylation of the intracellular ITIM motifs 
induces association with the tyrosine phosphatases SHP-1 and SHP-2. To identify the 
physiological OX41 ligand, recombinant OX41-CD4d3+4 fusion protein was coupled to 

25 fluorescent beads to produce a multivalent cell binding reagent. The OX41-CD4d3+4 beads 

bound to thymocytes and concanavalin A-stimulated splenocytes. This interaction was blocked 
by the monoclonal antibody (mAb) OX 101 . Affinity chromatography with OX 101 mAb and 
peptide sequencing revealed the rat SIRP ligand to be CD47 (integrin-associated protein). A 
direct interaction between human SIRP and human CD47 was demonstrated using purified 

30 recombinant proteins and surface plasmon resonance ruling out the involvement of other 

proteins known to be associated with CD47. The affinity of the SIRP/CD47 interaction was 
K(d) approximately 8 microM at 37 degrees C with a k(off )>/=2.1 s(-l). The membrane-distal 
SIRP V-like domain was sufficient for binding to CD47.(Vernon-Wilson EF, et al. Eur J 
Immunol. 2000 Aug;30(8):2130-7.) 
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The transmembrane glycoprotein SHPS-1 binds the protein tyrosine phosphatase SHP- 
2 and serves as its substrate. Although SHPS-1 has been implicated in growth factor- and cell 
adhesion-induced signaling, its biological role has remained unknown. Fibroblasts 
homozygous for expression of an SHPS-1 mutant lacking most of the cytoplasmic region of 
5 this protein exhibited increased formation of actin stress fibers and focal adhesions. They 
spread more quickly on fibronectin than did wild-type cells, but they were defective in 
subsequent polarized extension and migration. The extent of adhesion-induced activation of 
Rho, but not that of Rac, was also markedly reduced in the mutant cells. Activation of the Ras- 
extracellular signal-regulated kinase signaling pathway and of c-Jun N-terminal kinases by 

10 growth factors was either unaffected or enhanced in the mutant fibroblasts. These results 

demonstrate that SHPS-1 plays crucial roles in integrin-mediated cytoskeletal reorganization, 
cell motility and the regulation of Rho, and that it also negatively modulates growth factor- 
induced activation of mitogen-activated protein kinases. (Inagaki, A. et al., EMBO J. 2000 
Dec 15;19(24):6721-31.) 

15 Machida K. et al. (Oncogene. 2000 Mar 23; 19(1 3): 171 0-8.) investigated the effect of 

cell transformation by v-src on the expression and tyrosine phosphorylation of SHPS-1, a 
putative docking protein for SHP-1 and SHP-2. They found that transformation by v-src 
virtually inhibited the SHPS-1 expression at mRNA level. While nontransforming Src kinases 
including c-Src, nonmyristoylated forms of v-Src had no inhibitory effect on SHPS-1 

20 expression, transforming Src kinases including wild-type v-Src and chimeric mutant of c-Src 
bearing v-Src SH3 substantially suppressed the SHPS-1 expression. In cells expressing 
temperature sensitive mutant of v-Src, suppression of the SHPS-1 expression was temperature- 
dependent. In contrast, tyrosine phosphorylation of SHPS-1 was rather activated in cells 
expressing c-Src or nonmyristoylated forms of v-Src. SHPS-1 expression in SR3Y1 was 

25 restored by treatment with herbimycin A, a potent inhibitor of tyrosine kinase, or by the 
expression of dominant negative form of Ras. Contrary, active form of Mekl markedly 
suppressed SHPS-1 expression. Finally, overexpression of SHPS-1 in SR3Y1 led to the drastic 
reduction of anchorage independent growth of the cells. Taken together, their results suggest 
that the suppression of SHPS-1 expression is a pivotal event for cell transformation by v-src, 

30 and the Ras-MAP kinase cascade plays a critical role in the suppression. 

SHPS-1 (SH2-domain bearing protein tyrosine phosphatase (SHP) substrate- 1), a 
member of the inhibitory-receptor superfamily that is abundantly expressed in macrophages 
and neural tissue, appears to regulate intracellular signaling events downstream of receptor 
protein-tyrosine kinases and integrin-extracellular matrix molecule interactions. To investigate 
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the function of SHPS-1 in a hematopoietic cell line, SHPS-1 was expressed in Ba/F3 cells, an 
IL-3-dependent pro-B-cell line that lacks endogenous SHPS-1 protein. Interestingly, 
expression of either SHPS-1, or a mutant lacking the intracellular domain of SHPS-1 (DeltaCT 
SHPS-1), resulted in the rapid formation of macroscopic Ba/F3 cell aggregates. As the 
5 integrin-associated protein/CD47 was shown to be a SHPS-1 ligand in neural cells, Babic, J. et 
al. (J Immunol. 2000 Apr 1 ;164(7):3652-8.) investigated whether CD47 played a role in the 
aggregation of SHPS- 1 -expressing Ba/F3 cells. In support of this idea, aggregate formation 
was inhibited by an anti-CD47 Ab. Furthermore, erythrocytes from control, but not from 
CD47-deficient mice, were able to form rosettes on SHPS-1 -expressing Ba/F3 cells. Because 
10 erythrocytes do not express integrins, this result suggested that SHPS-1-CD47 interactions can 
take place in the absence of a CD47-integrin association. They also present evidence that the 
amino-terminal Ig domain of SHPS-1 mediates the interaction with CD47. Although SHPS-1- 
CD47 binding likely triggers bidirectional intracellular signaling processes, these results 
demonstrate that this interaction can also mediate cell-cell adhesion. 
1 5 Inhibitory immunoreceptors downregulate signaling by recruiting Src homology 2 

(SH2) domain-containing tyrosine and/or lipid phosphatases to activating receptor complexes 
[1]. There are indications that some inhibitory receptors might also perform other functions [2] 
[3]. In adherent macrophages, two inhibitory receptors, SHPS-1 and PIR-B, are the major 
proteins binding to the tyrosine phosphatase SHP-1. SHPS-1 also associates with two tyrosine- 
20 phosphorylated proteins (pp55 and ppl30) and a protein tyrosine kinase [4]. Here, Timms, JF. 
et al. (Curr Biol. 1999 Aug 26;9(16):927-30.) have identified pp55 and ppl30 as the adaptor 
molecules SKAP55hom/R (Src-kinase-associated protein of 55 kDa homologue) and 
FYB/SLAP-130 (Fyn-binding protein/SLP-76-associated protein of 130 kDa), respectively, 
and the tyrosine kinase activity as PYK2. Two distinct SHPS-1 complexes were formed, one 
25 containing SKAP55hom/R and FYB/SLAP-130, and the other containing PYK2. Recruitment 
of FYB/SLAP-130 to SHPS-1 required SKAP55hom/R, whereas PYK2 associated with 
SHPS-1 independently. Formation of both complexes was independent of SHP-1 and tyrosine 
phosphorylation of SHPS- 1. Finally, tyrosine phosphorylation of members of the SHPS-1 
complexes was regulated by integrin-mediated adhesion. Thus, SHPS-1 provides a scaffold for 
30 the assembly of multi-protein complexes that might both transmit adhesion-regulated signals 
and help terminate such signals through SHP-1 -directed dephosphorylation. Other inhibitory 
immunoreceptors might have similar scaffold-like functions. 

SHPS-1 (or SIRP) is a member of the immunoglobulin (Ig) superfamily abundantly 
expressed in neurons and other cell types. Within its cytoplasmic domain, it possesses at least 
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two immunoreceptor tyrosine-based inhibitory motifs, which are targets for tyrosine 
phosphorylation and mediate the recruitment of SHP-2, an Src homology 2 (SH2) domain- 
containing protein-tyrosine phosphatase. Since other immunoreceptor tyrosine-based 
inhibitory motifs-containing receptors have critical roles in the negative regulation of 
5 hemopoietic cell functions, the expression of SHPS-1 in cells of hematological lineages was 
examined. By analyzing a panel of hemopoietic cell lines, evidence was provided that SHPS-1 
is abundantly expressed in macrophages and, to a lesser extent, in myeloid cells. No 
expression was detected in T-cell or B-cell lines. Expression of SHPS-1 could also be 
documented in normal ex vivo peritoneal macrophages. Further studies showed that SHPS-1 

10 was an efficient tyrosine phosphorylation substrate in macrophages. However, unlike in non- 
hemopoietic cells, tyrosine-phosphorylated SHPS-1 in macrophages associated primarily with 
SHP-1 and not SHP-2. Finally, analyses allowed identification of several isoforms of SHPS-1 
in mouse cells. In part, this heterogeneity was due to differential glycosylation of SHPS-1. 
Additionally, it was caused by the production of at least two distinct shps-1 transcripts, coding 

15 for SHPS-1 polypeptides having different numbers of Ig-like domains in the extracellular 

region. Taken together, these findings indicate that SHPS-1 is likely to play a significant role 
in macrophages, at least partially as a consequence of its capacity to recruit SHP-1 . Veilette, 
A. et al. (J Biol Chem. 1998 Aug 28;273(35):22719-28.) 

SHPS-1 is a 120 kDa glycosylated receptor-like protein that contains immunoglobulin- 

20 like domains in its extracellular region and four potential tyrosine phosphorylation for SH2 

domain binding sites in its cytoplasmic region. Epidermal growth factor (EGF) stimulated the 
rapid tyrosine phosphorylation of SHPS-1 and subsequent association of SHPS-1 with SHP-2, 
a protein tyrosine phosphatase containing SH2 domains, in Chinese hamster ovary cells 
overexpressing human EGF receptors. In the cells overexpressing SHPS-1, the tyrosine 

25 phosphorylation of SHPS-1 was more evident than that observed in parent cells. However, 
overexpression of SHPS-1 alone did not affect the activation of MAP kinase in response to 
EGF. These results suggest that SHPS-1 may be involved in the recruitment of SHP-2 from 
the cytosol to the plasma membrane in response to EGF. Copyright 1997 Academic Press. 
Ochi, F. et al. (Biochem Biophys Res Commun. 1997 Oct 20;239(2):483-7.) 

30 The immune system recognizes invaders as foreign because they express determinants 

that are absent on host cells or because they lack 'markers of self that are normally present. 
Oldenborg et al. (2000) demonstrated that CD47 functions as a marker of self on murine red 
blood cells. Red blood cells that lack CD47 were rapidly cleared from the bloodstream by 
splenic red pulp macrophages. CD47 on normal red blood cells prevented this elimination by 
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binding to the inhibitory receptor signal regulatory protein alpha (STRP-alpha). Thus, 
Oldenborg et al. (2000) concluded that macrophages may use a number of nonspecific 
activating receptors and rely on the presence or absence of CD47 to distinguish self from 
foreign. Oldenborg et al. (2000) suggested that CD47-SIRP-alpha may represent a potential 
5 pathway for the control of hemolytic anemia. 

The disclosed NOV28 nucleic acid of the invention encoding a MYD-1 -like protein 
includes the nucleic acid whose sequence is provided in Table 28A or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 28A while still encoding a protein that maintains 
10 its MYD-1 -like activities and physiological functions, or a fragment of such a nucleic acid. 

The invention further includes nucleic acids whose sequences are complementary to those just 
described, including nucleic acid fragments that are complementary to any of the nucleic acids 
just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
15 include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 6 percent of the 
20 bases may be so changed. 

The disclosed NOV28 protein of the invention includes the MYD-1 -like protein 
whose sequence is provided in Table 28B. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
28B while still encoding a protein that maintains its MYD-1 -like activities and physiological 
25 functions, or a functional fragment thereof. In the mutant or variant protein, up to about 27 
percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(F a b>2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this MYD-1 -like protein (NOV28) is a 
30 member of a "MYD-1 family". Therefore, the NOV28 nucleic acids and proteins identified 
here may be useful in potential therapeutic applications implicated in (but not limited to) 
various pathologies and disorders as indicated below. The potential therapeutic applications 
for this invention include, but are not limited to: protein therapeutic, small molecule drug 
target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic 
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and/or prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue 
regeneration in vivo and in vitro of all tissues and cell types composing (but not limited to) 
those defined here. 

The NOV28 nucleic acids and proteins of the invention are useful in potential 
5 therapeutic applications implicated in epilepsy, eating disorders, schizophrenia, ADD, and 
cancer; heart disease; inflammation and autoimmune disorders including Crohn's disease, 
IBD, allergies, rheumatoid and osteoarthritis, inflammatory skin disorders, allergies, blood 
disorders; psoriasis colon cancer, leukemia AIDS; thalamus disorders; metabolic disorders 
including diabetes and obesity; lung diseases such as asthma, hemolytic anemia, emphysema, 
10 cystic fibrosis, and cancer; pancreatic disorders including pancreatic insufficiency and cancer; 
and prostate disorders including prostate cancer, and/or other diseases and pathologies. 

NOV28 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV28 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
15 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

NOVX Antibodies" section below. The disclosed NOV28 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
20 disorders. 

NOV29 

NOV29 includes three novel CRAL-TRIO -like proteins disclosed below. The 
disclosed sequences have been named NOV29a, NOV29b, and NOV29c. 
NOV29a 

25 A disclosed NOV29a nucleic acid of 1327 nucleotides (also referred to as CG56187- 

01) encoding a CRAL-TRIO-like protein is shown in Table 29A. An open reading frame was 
identified beginning with a ATG initiation codon at nucleotides 16-18 and ending with a TGA 
codon at nucleotides 1261-1263. The start and stop codons are shown in bold in Table 29A, 
and the 5' and 3' untranslated regions, if any, are underlined. 

Table 29A. NOV29a nucleotide sequence (SEQ ID NO:121). 

"ggagttgactggtggatgatgtgggaagggttaggggcggggttggtggcccccgaggtcatgagagctccg 
ccgaccatcagatcctcctccgctcagttccgggagaacctccaggacctgctgcccatactgcccaatgct 

GATGACTACTTCCTCCTGCGCTGGCTGGCAGCTCGAAACTTTGACCTGCAGAAATCCGAAGACATGCTCCGA 
AGGCACATGGAGTTCCGGAAGCAACAAGACCTGGACAACATTGTCACATGGCAGCCCCCTGAGGTGGTCATC 
CAGCTGTATGACTCGGGTGGTCTTTGTGGCTACGACTACGAAGGCTGCCCTGTGTACTTCAACATCATTGGG 
TCCCTCGACCCCAAGGGTCTCCTGCTGTCAGCCTCCAAGCAGGATATGATCCGGAAGCGCATCAAAGTCTGT 
GAGCTGCTGTTGCATGAGTGTGAGCTGCAAACTCAGAAGCTGGGCAGGAAGATCGAGATGGCGCTGATGGTG 
TTTGACATGGAGGGGCTGAGCCTGAAACACCTGTGGAAGCCAGCTGTGGAGGTCTACCAGCAGTTTTTTAGC 
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ATCCTGGAAGCAAATTATCCTGAGACCCTGAAGAATTTAATTGTTATTCGAGCCCCAAAACTGTTCCCCGTG 
GCCTTCAACTTGGTCAAGTCGTTCATGAGTGAGGAGACACGCAGGAAGATTGTGATTCTGGGAGACAACTGG 
AAG C AGG AG CTGAC AAAAT TC AT C AG C C C CGACC AGC T GC CTG TGG AGT TT GGGGGG AC C AT GAC TG AC C C C 
GATGGCAACCCCAAGTGCCTGACCAAGATCAACTATGGGGGTGAGGTGCCCAAGAGCTACTACCTGTGCGAG 
CAGGTGAGGCTGCAGTATGAGCACACGAGGTCCGTGGGCCGCGGCTCCTCCCTGCAGGTGGAGAACGAGATC 
CTGTTCCCGGGCTGTGTGCTCAGGTGGCAGTTTGCTTCAGATGGTGGGGACATCGGCTTTGGGGTTTTCCTG 
AAGACCAAGATGGGGGAGCAGCAGAGTGCTAGGGAGATGACGGAGGTGCTGCCCAGCCAGCGCTACAATGCC 
CACATGGTGCCTGAGGATGGGAGCCTCACCTGCCTCCAGGCTGGCGTCCTGCGCTTCGACAACACCTACAGC 
CGGATGCATGCCAAGAAGCTCAGCTACACTGTGGAGGTGCTGCTTCCCGACAAGGCCTCTGAGGAGACGCTG 
CAGAGTCTCAAGGCGATGAGACCCTCCCCAACACAGTG AAGACCCCAGCCACCTCTACCTGTGCACTCCAAC 
CCCTTCACACCCACCCCTCTGACCCCTGCCT _____ 



In a search of public sequence databases, the NOV29a nucleic acid sequence, located 
on chromosome 22, has 935 of 1263 bases (74%) identical to a gb:GENBANK- 
lD:RN0132352|acc:AJ132352.1 mRNA from Rattus norvegicus {Rattus norvegicus mRNA 
5 for 45 kDa secretory protein, partial) (E = 4.0e" ). 

A disclosed NOV29a polypeptide (SEQ ID NO: 122) encoded by SEQ ID NO: 121 has 
415 amino acid residues and is presented in Table 29B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV29a has no signal peptide and is 
likely to be localized extracellular ly with a certainty of 0.6500. Alternatively, NOV29a may 
10 also localize to the mitochondrial membrane spacewith a certainty of 0.1000, to the lysosome 
(lumen) with a certainty of 0.1000, or to the microbody (peroxisome) with a certainty of 
0.0348. 



Table 29B. Encoded NOV29a protein sequence (SEQ ID NO: 122). 

MMWEGLGAGLVAPEVMRAPPTIRSSSAQFRENLQDLLPILPNAIDDYFLLRWIAARNFDLQKSEDMLRRHMEF 
RKQQDLDNIVTWQPPEWIQLYDSGGLCGYDYEGCPVYFNIIGSLDPKGLLLSASKQDMIRKRIKVCELLLH 
ECELQTQKLGRKIEMALMVFDMEGLSLKHLWKPAVEVYQQFFSILEANYPETLKNLIVIRAPKLFPVAFNLV 
KSFMSEETRRKIVILGDNWKQELTKFISPDQLPVEFGGTMTDPDGNPKCLTKINYGGEVPKSYYLCEQVRLQ 
YEHTRSVGRGS SLQVENE I LFPGCVLRWQFASDGGDI GFGVFLKTKMGEQQSAREMTEVLP SQRYNAHMVPE 
DGSLTCLQAGVLRFDNTYSRMHAKKLSYTVEVLLPDKASEETLQSLKAMRPSPTQ 



1 5 A search of sequence databases reveals that the NOV29a amino acid sequence has 387 

of 397 amino acid residues (97%) identical to, and 390 of 397 amino acid residues (98%) 
similar to, the 406 amino acid residue ptnr:SPTREMBL-ACC:Q9UDX3 protein from Homo 
sapiens (Human) (WUGSC:H_DJ0539M06.4 PROTEIN) (E = 7.2e" 208 ). 

NOV29a is predicted to be expressed in at least Bone, liver, brain, and prostate. This 
20 information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 

In addition, the sequence is predicted to be expressed in Bone because of the 
expression pattern of (GENBANK-ID: gb:GENBANK-ID:RN0132352|acc: AJ132352.1) a 
25 closely related Rattus norvegicus mRNA for 45 kDa secretory protein, partial homolog. 
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NOV29b 

A disclosed NOV29b nucleic acid of 1305 nucleotides (also referred to as CG56187- 
03) encoding a CRAL-TRIO-like protein is shown in Table 29C. An open reading frame was 
identified beginning with a ATG initiation codon at nucleotides 14-16 and ending with a TGA 
5 codon at nucleotides 1262-1264. The start and stop codons are shown in bold in Table 29C, 
and the 5' and 3' untranslated regions, if any, are underlined. 



Table 29C. NOV29b nucleotide sequence (SEQ ID NO:123). 

AGTTGACTGGTGG ATGATGTGGGAAGGGTTAGGGGCGGGGTTGGTGGCCCCCGAGGTCATGAGAGCTCCGCC 
GACCATCAGATCCTCCTCCGCTCAGTTCCGGGAGAACCTCCAGGACCTGCTGCCCATACTGCCCAATGCTGA 
TGACTACTTCCTCCTGCGCTGGCTGCGAGCTCGAAACTTTGACCTGCAGAAATCCGAAGACATGCTCCGAAG 
GCACATGGAGTTCCGGAAGCAACAAGACCTGGACAACATTGTCACATGGCAGCCCCCTGAGGTCATCCAGCT 
GTATGACTCGGGTGGTCTTTGTGGCTACGACTACGAAGGCTGCCCTGTGTACTTCAACATCATTGGGTCCCT 
CGACCCCAAGGGTCTCCTGCTGTCAGCCTCCAAGCAGGATATGATCCGGAAGCGCATCAAAGTCTGTGAGCT 
GCTGTTGCATGAGTGTGAGCTGCAAACTCAGAAGCTGGGCAGGAAGATCGAGATGGCGCTGATGGTGTTTGA 
CATGGAGGGGCTGAGCCTGAAACACCTGTGGAAGCCAGCTGTGGAGGTCTACCAGCAGTTTTTTAGCATCCT 
GGAAGCAAATTATCCTGAGACCCTGAAGAATTTAATTGTTATTCGAGCCCCAAAACTGTTCCCCGTGGCCTT 
CAACTTGGTCAAGTCGTTCATGAGTGAGGAGACACGCAGGAAGATTGTGATTCTGGGAGACAACTGGAAGCA 
GGAGCTGACAAAATTCATCAGCCCCGACCAGCTGCCTGTGGAGTTTGGGGGGACCATGACTGACCCCGATGG 
CCACCCCAAGTGCCTGACCAAGATCAACTATGGGGGTGAGGTGCCCAAGAGCTACTACCTGTGCGAGCAGGT 
GAGGCTGCAGTATGAGCACACGAGGTCCGTGGGCCGCGGCTCCTCCCTGCAGGTGGAGAACGAGATCCTGTT 
CCCGGGCTGTGTGCTCAGGTGGCAGTTTGCTTCAGATGGTGGGGACATCGGCTTTGGGGTTTTCCTGAAGAC 
CAAGATGGGGGAGCAGCAGAGTGCTAGGGAGATGACGGAGGTGCTGCCCAGCCAGCGCTACAATGCCCACAT 
GGTGCCTGAGGATGGGAGCCTCACCTGCCTCCAGGCTGGCGTCTATGTCCTGCGCTTCGACAACACCTACAG 
CCGGATGCATGCCAAGAAGCTCAGCTACACTGTGGAGGTGCTGCTTCCCGACAAGGCCTCTGAGGAGACGCT 
GCAGAGTCTCAAGGCGATGAGACCCTCCCCAACACAGTQ AAGACCCCAGCCACCTCCACCTGTGCACTCCAA 

CCCCTTCAC 



In a search of public sequence databases, the NOV29b nucleic acid sequence, located 

on chromosome 22, has 906 of 1212 bases (74%) identical to a gb:GENBANK- 
10 ID:BC005759|acc:BC005759.1 mRNA from Mus musculus {Mus musculus, clone MGC:6302, 

mRNA, complete cds) (E = 2.0e~ 137 ). 

A disclosed NOV29b polypeptide (SEQ ID NO: 124) encoded by SEQ ID NO: 123 has 

416 amino acid residues and is presented in Table 29D using the one-letter amino acid code. 

Signal P, Psort and/or Hydropathy results predict that NOV29b has no signal peptide and is 
15 likely to be localized extracellularly with a certainty of 0.4500. Alternatively, NOV29b may 

also localize to the mitochondrial membrane spacewith a certainty of 0.1000, to the lysosome 

(lumen) with a certainty of 0.1000, or to the microbody (peroxisome) with a certainty of 

0.0779. 



Table 29D. Encoded NOV29b protein sequence (SEQ ID NO:124). 

MMWEGLGAGLVAPEVMRAPPTIRSSSAQFRENLQDLLPILPNADDYFLLRWLRARNFDLQKSEDMLRRHMEF 
RKQQDLDNIVTWQPPEVIQLYDSGGLCGYDYEGCPVYFNIIGSLDPKGLLL.SASKQDMIRKRIKVCELLLHE 
CELQTQKLGRKIEMALMVFDMEGLSLKHLWKPAVEVYQQFFSILEANYPETLKNLIVIRAPKLFPVAFNLVK 
S FM S E ET RRK I V I LGDNW KQE LT K F I S PDQL P VE FGGTMTDP DGH P K CLT K I N YGGE VP K S Y Y L CE Q VRL Q Y 
EHTRSVGRGSSLQVENEILFPGCVLRWQFASDGGDIGFGVFLKTKMGEQQSAREMTEVLPSQRYNAHMVPED 
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|~GSLTCLQAGVYVLRFDNTYSRMHAKKLSYTVEVLLPDKASEETLQSLKAMRPSPTQ ~1 

A search of sequence databases reveals that the NOV29b amino acid sequence has 906 
of 1212 amino acid residues (74%) identical to, and 906 of 1212 amino acid residues (74%) 
similar to, the 2529 amino acid residue gb:GENBANK-ID:BC005759|acc:BC005759.1 protein 
5 from Mus musculus {Mus musculus, clone MGC:6302, mRNA, complete cds) (E = 2.0e~ 137 ). 

NOV29b is predicted to be expressed in at least Bone, liver, brain, and prostate. . The 
sequence is predicted to be expressed inbone because of the expression pattern of 
(GENBANK-ID: gb:GENBANK-ID:BC005759|acc:BC005759.1) a closely related Mus 
musculus, clone MGC:6302, mRNA, complete cds homolog. 
10 NOV29c 

A disclosed NOV29c nucleic acid of 1218 nucleotides (also referred to as CG561 89- 
01) encoding a CRAL-TRIO-like protein is shown in Table 29E. An open reading frame was 
identified beginning with a ATG initiation codon at nucleotides 1-3 and ending with a TAG 
codon at nucleotides 1216-1218. The start and stop codons are shown in bold in Table 29E, 
15 and the 5' and 3' untranslated regions, if any, are underlined. 



Table 29E. NOV29c nucleotide sequence (SEQ ID NO: 125). 

ATGTTCCGGGAGAACATCCAAGATGTGCTATCTGCGCTGCCCAATCCTGATGACTACTTCCTCCTGCGCTGG 
CTCCAAGCTCGGAGCTTTGACCTGCAGAAATCAGAGGACATGCTGAGGAAGCATATGGAGTTCCGGAAGCAA 
CAAGACCTGGCCAACATCCTTGCCTGGCAGCCCCCAGAGGTGGTCAGGCTGTACAACGCTAACGGCATATGC 
GGCCACGACGGTGAGGGCAGCCCTGTCTGGTACCACATTGTGGGAAGCCTGGACCCCAAAGGCCTCTTGCTC 
TCAGCCTCCAAACAGGAGTTGCTCAGGGACAGCTTCCGGAGCTGCGAGCTGCTCCTGCGGGAGTGTGAGCTG 
CAGAGTCAGAAGCTGGGGAAGAGGGTGGAGAAAATCATAGCTATTTTTGGTCTCGAAGGGCTGGGCCTGAGG 
GATCTGTGGAAGCCAGGAATAGAGCTTCTCCAGGAGTTTTTCTCAGCACTTGAAGCAAATTACCCTGAGATC 
TTGAAGAGTTTAATTGTTGTGAGAGCCCCCAAGCTATTCGCCGTAGCCTTCAACCTGGTCAAGTCTTACATG 
AGTGAAGAGACACGCAGGAAGGTGGTGATTCTCGGAGATCTGATGGTTCCTGCATCCGAAGGTGTAGGGCAC 
CCAACTGGTGTTGAGGGCCCACTGCCTGGTGGGCTGCCAGACAACTGGAAGCAGGAGCTGACAAAATTCATC 
AGCCCCGACCAGCTGCCCGTGGAGTTTGGGGGGACCATGACTGACCCCGATGGCAACCCCAAGTGCCTGACC 
AAGATCAACTACGGGGGTGAGGTGCCCAAGAGCTACTACCTGTGCAAGCAGGTGAGGCTGCAGTATGAGCAC 
ACGAGGTCCGTGGGCCGCGGCTCCTCCCTGCAGGTGGAGAACGAGATCCTGTTCCCGGGCTGTGTGCTCAGG 
TGGCAGTTTGCTTCAGATGGTGGGGACATTGGCTTTGGGGTTTTCCTGAAGACCAAGATGGGGGAGCGGCAG 
AGGGCTAGGGAGATGACAGAGGTGCTGCCCAGCCAGCGCTACAATGCCCACATGGTGCCTGAAGATGGGATT 
CTCACCTGCCTCCAGGCCGGCAGCTATGTCCTGAGGTTTTACAACACCTACAGCCTGGTTCATTCTAAACGC 
ATCAGCTACACCGTGGAGGTACTGCTCCCAGACCAAACCTTCATGGAGAAGATGGAGAAATTCTAG 



In a search of public sequence databases, the NOV29c nucleic acid sequence, located 
on chromosome 22, has 418 of 532 bases (78%) identical to a gb:GENBANK- 
ID:HS130H16A|acc:AL09688Kl mRNA from Homo sapiens (Novel human mRNA similar to 
20 Rattus norvegicus 45 kDa secretory protein, AJ132352) (E = 4.9e" 129 ). 

The disclosed NOV29c polypeptide (SEQ ID NO: 126) encoded by SEQ ID NO: 125 
has 405 amino acid residues and is presented in Table 29F using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV29c has no signal peptide 
and is likely to be localized extracellularly with a certainty of 0.4500. Alternatively, NOV29c 
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may also localize to the microbody (peroxisome) with a certainty of 0.2010, to the 
mitochondrial matrix space with a certainty of 0.1000, or to the lysosome (lumen) with a 
certainty of 0.1000. 



Table 29F. Encoded NOV29e protein sequence (SEQ ID NO:126). 

MFRENIQDVLSALPNPDDYFLLRWLQARSFDLQKSEDMLRKHMEFRKQQDLANILAWQPPEVVRLYNANGIC 
GHDGEGSPVWYHIVGSLDPKGLLLSASKQELLRDSFRSCELLLRECELQSQKLGKRVEKIIAIFGLEGLGLR 
DLWKPGIELLQEFFSALEANYPEILKSLIWRAPKLFAVAFNLVKSYMSEETRRKWILGDLMVPASEGVGH 
PTGVEGPLPGGLPDNWKQELTKFISPDQLPVEFGGTMTDPDGNPKCLTKINYGGEVPKSYYLCKQVRLQYEH 
TRSVGRGSSLQVENEILFPGCVLRWQFASDGGDIGFGVFLKTKMGERQRAREMTEVLPSQRYNAHMVPEDGI 
LTCLQAGSYVLRFYNTYSLVHSKRISYTVEVLLPDQTFMEKMEKF 

5 " " 

A search of sequence databases reveals that the NOV29c amino acid sequence has 157 
of 176 amino acid residues (89%) identical to, and 166 of 176 amino acid residues (94%) 
similar to, the 406 amino acid residue ptnr:SPTREMBL-ACC:Q9UDX3 protein from Homo 
sapiens (Human) (WUGSC:H_DJ0539M06.4 PROTEIN) (E = 2.6e 167 ). 
10 NOV29c is predicted to be expressed in at least Bone, liver, brain, and prostate . This 

information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 

In addition, the sequence is predicted to be expressed in Bone because of the 
15 expression pattern of (GENBANK-ID: gb:GENBANK-ID:RN0132352|acc: AJ132352.1) a 
closely related Rattus norvegicus mRNA for 45 kDa secretory protein, partial homolog. 

The disclosed NOV29a polypeptide has homology to the amino acid sequences shown 
in the BLASTP data listed in Table 29G. 



Table 29G. BLAST results for NOV29a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
{%) 


Expect 


gi | 6 62413 3 | gb | AAF19 
259 . 1 | AC004832_4 
(AC0 04 832) 


similar to 45 kDa 

secretory protein 
[Rattus 

norvegicus] ; 

similar to 

CAA10644 . 1 
(PID:g4164418) 
[Homo sapiens] 


406 


387/398 
(97%) 


390/398 
(97%) 


0 . 0 


gi | 7110715 | ref |NP_0 
36561 . 1 | 
(NM_012429) 


SEC14 (S. 
cerevisiae) -like 
2; tocopherol- 
associated 
protein [Homo 
sapi ens] 


403 


269/394 
(68%) 


331/394 
(83%) 


e-165 
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gi | 16758646 | ref |NP_ 
446253 . 1 | 
(NM_053801) 


SEC14 (S. 
cerevisiae) -like 
2 [Rattus 
norvegicus] 


403 


271/394 
(68%) 


329/394 
(82%) 
■* 


e-164 


gi | 13 543184 | gb | AAHO 
5759 . 1 | AAH05759 
(BC005759) 


Unknown (protein 
for MGC:63 02) 
[Mus musculus] 


403 


273/394 
(69%) 


328/394 
(82%) 


e-164 


gi | 4164418 | emb | CAA1 
0644. l| (AJ132352) 


45 kDa secretory- 
protein [Rattus 
norvegicus] 


400 


267/384 
(69%) 


326/384 
(84%) 


e-163 



The homology between these and other sequences is shown graphically in the 
Clustal W analysis shown in Table 29H. In the Clustal W alignment of the NOV29 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 29H. ClustalW Analysis of NOV29 

1) Novel NOV29a (SEQ ID NO: 122) 

2) Novel NOV29b (SEQ ID NO:124) 

3) Novel NOV29C (SEQ ID NO: 12 6) 

4) gi | 6624133 | gb |AAF19259 . 1 | AC004832_4 (AC004832) similar to 45 kDa secretory 
protein [Rattus norvegicus]; similar to CAA10644.1 (PID : g4 1644 18 ) [Homo sapiens] 

(SEQ ID NO:447) 

5) gi | 7110715 | ref |NP_036561 . 1 | (NM_012429) SEC14 (S. cerevisiae) - like 2; 
tocopherol -associated protein [Homo sapiens] (SEQ ID NO: 44 8) 

6) gi | 16758646 | ref |NP_446253 . 1 | (NM_053801) SEC14 (S. cerevisiae) - like 2 [Rattus 
norvegicus] (SEQ ID NO: 44 9) 

7) gi | 13543184 |gb | AAH05759 . 1 | AAH05759 (BC005759) Unknown (protein for MGC:6302) 
[Mus musculus] (SEQ ID NO: 450) 

8) gi | 4164418 | emb | CAA10644 . 1 | (AJ132352) 45 kDa secretory protein [Rattus 
norvegicus] (SEQ ID NO: 451) 



NOV2 9a 
NOV2 9b 
NOV2 9C 
gi | 6624133 | 
gi |7110715 | 
gi j 16758646 | 
gi j 13543184 | 
gi j 4164418 | 



NOV2 9a 
NOV2 9b 
NOV2 9C 
gi | 6624133 | 
gi | 7110715 | 
gi j 16758646 | 
gi | 13543184 j 
gi | 4164418 | 
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130 



140 



150 



160 



170 



180 



NOV2 9a 
NOV2 9b 
NOV2 9c 
gi | 6624133 | 
gi j 7110715 | 
gi | 16758646 | 
gi 1 13543184 j 
gi | 4164418 | 



NOV2 9a 
NOV2 9b 
NOV2 9c 
gi | 6624133 | 
gi j 7110715 j 
gi 1 16758646 | 
gi j 13543184 | 
gi|4164418 | 



NOV2 9a 
NOV2 9b 
NOV2 9c 
gi | 6624133 | 
gi | 7110715 j 
gi|l6758646| 
gi|l3543184| 
gi |4164418 | 




250 
|....|....| 

232 pjj 

231 y 

213 GVGHPTGVEGPLPGGLPpi 

221 y 

221 1 

221 
221 
221 



310 



320 



330 



NOV2 9a 
NOV2 9b 
NOV2 9C 
gi | 6624133 | 
gi |7110715 | 
gi|l6758646| 
gi j 13543184 j 
gi | 4164418 | 




EILFPGCVLR 
E I LFPGCVLR 
EILFPGCVLR 
EILFPGCVLR 
EILFPGCVLR 
EILFPGCVLR 
EILFPGCVLR 
EILFPGCVLR 



NOV2 9a 


336 


NOV2 9b 


335 


NOV29c 


333 


gi| 6624133 | 


325 


gi|7110715 j 


325 


gi | 16758646 | 


325 


gi|l3543184| 


"325 


gi j 4164418 | 


325 


NOV29a 


394 


NOV2 9b 


395 


NOV2 9C 


393 


gi | 6624133 | 


385 


gi |7110715 | 


385 


gi j 16758646 | 


385 


gi| 13543184 | 


385 


gi | 4164418 | 


385 




370 
. - I . . 



I 



380 
. . I - . 



I 



EMTEVLPSQRYNAHMVPEDG 
EMTEVLPSQRYNAHMVPEDG 
EMTEVLPSQRYNAHMVPEDG 
EMTEVLPSQRYNAHMVPEDG 
E M T E VL PjSjQ R YnSh jlv P E DG 
EMTEVLPgQRYNgHMVPEDG 
EMTEVLPSQRYNAHMVPEDG 
EMTEVLPSQRYNAHMVPEDG 



440 

gSPTQ 415 
gSPTQ 416 

405 
ISPTQ 4 06 




390 
I . 
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Tables 29I-J list the domain descriptions from DOMAIN analysis results against 
NOV29. This indicates that the NOV29 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 291 Domain Analysis of NOV29 

gnl | Smart | smart 0 05 16 , SEC14 , Domain in homologues of a S . cerevisiae 
phosphatidyl inositol transfer protein (Secl4p) ; Domain in homologues 
of a S . cerevisiae phosphatidylinositol transfer protein (Secl4p) and 
in RhoGAPs, RhoGEFs and the RasGEF, neurof ibromin (NF1) . Lipid-binding 
domain. The SEC14 domain of Dbl is known to associate with G protein 
beta/gamma subunits. (SEQ ID NO: 8 22) 
CD-Length = 157 residues, 96.8% aligned 
Score = 131 bits (329), Expect = 9e-32 



10 



15 



NOV2 9 : 


90 


Sbjct : 


4 


NOV2 9: 


150 


Sbjct: 


54 


NOV2 9 : 


210 


Sbjct: 


110 



VIQLYDSGGLCGYDYEGCPVYFNI IGSLDPKGLLLSASKQDMIRKRIKVCELLLHECELQ 14 9 

I + I I I I I +| I I 111 I + + + + +11 II 

VGKAY I PGGR - - Y D KDGR P VL VFRAGRFDLK S VTLEELLRYLVYVLEKALQE 53 

TQKLGRKIEMAliMVFDMEGLSLKHLWKPAVEVYQQFFSILEANYPETLKNLIVIRAPKLF 20 9 

+1 II +II++III+ + I + I ++ 11+ +111 I + +1 I I 

- EKKTGGI EGFTT I FDLKGLSMSN PDLGVLRKILKILQDHYPERLGKVYI INPPWFF 109 

PVAFNLVKSFMSEETRRKIVILGDNWKQELTKFISPDQLPVEFGGT 2 55 

I + ++| l + ll + M II +1 + l + ll ++I l + lll I III 

RVLWKIIKPFLSEKTREKIRFVGPDSKEELLiEYIDPEQLPEELGGT 15 5 



Table 29 J Domain Analysis of NOV29 

gnl | Pfam|pf am00650, CRAL_TRIO, CRAL/TRIO domain.. The original profile 
has been extended to include the carboxyl domain from the known 
structure of Secl4 . { SEQ ID NO: 823) 
CD-Length = 185 residues, 98.9% aligned 
Score = 120 bits (300) , Expect = 2e-28 



20 



25 



30 



35 



NOV2 9 : 


73 


Sbj Ct : 


3 


NOV2 9 : 


132 


Sbjct : 


63 


NOV2 9 : 


190 


Sbjct : 


121 


NOV2 9 : 


250 


Sbjct : 


180 



RKQQDL.DNIV-TWQPPEWIQLYDSGGLCGYDYEGCPVYFNIIGSLDPKGLLLSASKQDM 13 X 

|++ +| |+ l+ll + I I +1 III I l + I + l + + I 

RREFGVDTILEEATYPKEVIAKLYPQFIHGSDKDGRPVYLERRGQLNLKKMLFITTVERM 62 

IRKRIKVCE-LLLHECELQTQKLGRKIEMALMVFDMEGLSL-KHLWKPAVEVYQQFFSIL 18 9 
+ | + | ||+ ++| + | | + IM++I + I+ Ml ++ +|| 

VRNLVYEMEQALLYLLPACSRKVGTLINGSCTVFDLKGVSVSSANWVPGVL- - KKVLNIL 12 0 



III 



III 



I II II + l+.l 1+ +11 



1+ III 



I II 



Vitamin E (alpha-tocopherol) is an essential dietary nutrient for humans and animals. 
The mechanisms involved in cellular regulation as well as in the preferential cellular and 
tissue accumulation of alpha-tocopherol are not yet well established. We previously reported 
(Stocker, A., Zimmer, S., Spycher, S. E., and Azzi, A. (1999) IUBMB Life 48, 49-55) the 
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identification of a novel 46-kDa tocopherol-associated protein (TAP) in the cytosol of bovine 
liver. Here, we describe the identification, the molecular cloning into Escherichia coli, and the 
in vitro expression of the human homologue of bovine TAP, hTAP. This protein appears to 
belong to a family of hydrophobic ligand binding proteins, which have the CRAL (cis-retinal 
5 binding motif) sequence in common. By using a biotinylated alpha-tocopherol derivative and 
the lASys resonant mirror biosensor, the purified recombinant protein was shown to bind 
tocopherol at a specific binding site with K(d) 4.6 x 10(-7) m. Northern analyses showed that 
hTAP mRNA has a size of approximately 2800 base pairs and is ubiquitously expressed. The 
highest amounts of hTAP message are found in liver, brain, and prostate. In conclusion, hTAP 

10 has sequence homology to proteins containing the CRAL_TRIO structural motif. TAP binds to 
alpha-tocopherol and biotinylated tocopherol, suggesting the existence of a hydrophobic 
pocket, possibly analogous to that of SEC 14. Zimmer S. et al. (J Biol Chem. 2000 Aug 
18;275(33):25672-80.) 

The disclosed NOV29 nucleic acid of the invention encoding a CRAL-TRIO -like 

15 protein includes the nucleic acid whose sequence is provided in Table 29A, 29C, 29E or a 

fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 29A, 29C, or 29E while 
still encoding a protein that maintains its CRAL-TRIO -like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 

20 whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 

25 derivatized. These modifications are carried out at least in part to enhance the chemical 

stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 26 percent of the bases may be so changed. 

The disclosed NOV29 protein of the invention includes the CRAL-TRIO -like protein 

30 whose sequence is provided in Table 29B, 29D, or 29F. The invention also includes a mutant 
or variant protein any of whose residues may be changed from the corresponding residue 
shown in Table 29B, 29D, or 29F while still encoding a protein that maintains its CRAL-TRIO 
-like activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 32 percent of the residues may be so changed. 
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The invention further encompasses antibodies and antibody fragments, such as F ab or 
(F ab ) 2 , that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this CRAL-TRIO -like protein 
(NOV29) is a member of a "CRAL-TRIO family". Therefore, the NOV29 nucleic acids and 
5 proteins identified here may be useful in potential therapeutic applications implicated in (but 
not limited to) various pathologies and disorders as indicated below. The potential therapeutic 
applications for this invention include, but are not limited to: protein therapeutic, small 
molecule drug target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 

10 research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing 
(but not limited to) those defined here. 

The NOV29 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in brain disorders including epilepsy, eating disorders, 
schizophrenia, ADD, and cancer; heart disease; inflammation and autoimmune disorders 

15 including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, inflammatory skin 
disorders, allergies, blood disorders; psoriasis colon cancer, leukemia AIDS; thalamus 
disorders; metabolic disorders including diabetes and obesity; lung diseases such as asthma, 
emphysema, cystic fibrosis, and cancer; pancreatic disorders including pancreatic 
insufficiency and cancer; and prostate disorders including prostate cancer, and/or other 

20 diseases and pathologies. 

NOV29 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV29 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

25 NOVX Antibodies" section below. The disclosed NOV29 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 



A disclosed NOV30 nucleic acid of 717 nucleotides (also referred to as CG56191-01) 
encoding a novel Ryudocan-like protein is shown in Table 30A. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 22-24 and ending with a 
TAG codon at nucleotides 658-660. Putative untranslated regions, if any, found upstream from 



30 



NOV30 
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the initiation codon and downstream from the termination codon are underlined in Table 30A, 
and the start and stop codons are in bold letters. 

Table 30A. NOV30 Nucleotide Sequence (SEQ ID NO:127) 

CAGGCTGTTCACCCTCTCTGGATGGCGGTACCCACTGCCCCCGCCCTCCTGCTCCTGCTGCTGCTGCTGCT 
TTTTGCAGGCACCCCCACCACCCCTGAGTCAATCCAAGAAACTGAGGTCATCAACCCAGGACCGCCTAGGG 
GCCCAAACTTCTCCAGATCCCTACTGGAAGACTCTGGATGTGGGTGTTGGGGGCAGGAACCTGATGACTCT 
GAGCTCTCTGGCTCTAGAGATATTGATGAGTCAAGGGACCCCAAGATCATCCCTGAAGTGATCCAACCCTT 
GGTGCTTCTAGATAACCACATCCCTGAGAGGGCAGGGCCTGGGAACCTGGTCCCCACTGAAACCAAGGAAC 
TGGAGGACAACGAGGTCATCCCCAGGAGGATCTCACTCTCTGCGGGGGACCAGGATGTGTCCAATAAGGCA 
CCCATGTCCAACACTGCCCAGGGCAGCAACATCTTTGAGAGAATGGAGGTCGTGGCAGTCCTGATTGTGGA 
CAGC ATCGCGGGC ATC CT CT CTGC TGTTT TCCTG ATC CTGC TT CT GGTG AACC AT ATG AAG AAGG ATG AAG 
GCAGAAACGACCTGAGCAGGAAGCCCATCTACAAAAAAGCCCCTAGCAAGGAGTTATTACGCTTCTTCTAT 
GAGCACTGGTTTGGACTTTAG GGGATAGGGAAGTCGGAGGATTTTGCAGAGTGGCCATTAGGATGCGGGAG 
GACAACC 

The NOV30 nucleic acid was identified on chromosome 22 and has 553 of 708 bases 
(78%) identical to a gb:GENBANK-ID:HUMRYUDO|acc:Dl 3292.1 mRNA from Homo 
sapiens (mRNA for ryudocan core protein) (E = 2.2e" 82 ). 

A disclosed NOV30 polypeptide (SEQ ID NO: 128) encoded by SEQ ID NO: 127 is 
212 amino acid residues and is presented using the one-letter code in Table 30B. Signal P, 
Psort and/or Hydropathy results predict that NOV30 contains a signal peptide and is likely to 
be localized in the plasma membrane with a certainty of 0.4600. The most likely cleavage site 
for NOV30 is between positions 23 and 24: TPT-TP. 

Table 30B. Encoded NOV30 protein sequence (SEQ ID NO: 128) 

MAVPTAPALLLLLLLLLFAGTPTTPESIQETEVINPGPPRGPNFSRSLLEDSGCGCWGQEPDDSELSGSRDI 
DESRDPKIIPEVIQPLVLLDNHIPERAGPGNLVPTETKELEDNEVIPRRISLSAGDQDVSNKAPMSNTAQGS 
NIFERMEWAVLIVDSIAGILSAVFLILIiLVNHMKKDEGRNDLSRKPIYKKAPSKELLRFFYEHWFGL 

The disclosed NOV30 amino acid sequence has 121 of 198 amino acid residues (61%) 
identical to, and 140 of 198 amino acid residues (70%) similar to, the 202 amino acid residue 
ptnr:SWISSPROT-ACC: P34901 protein from Rattus norvegicus (Rat) (Syndecan-4 Precursor 
(Ryudocan Core Protein)) (E = 1.9e' 51 ). 

NOV30 is predicted to be expressed in at least myeloid tissue, B-cell lymphoma, 
including B-cell precursor lymphoblastic leukemia, lymphoplasmacytoid, immunoblastic, 
lymphocytic/CLL, hairy cell leukemia, large B-cell, mantle-cell, marginal zone and follicular, 
lymphomas, endothelia, Lymphopoietic and bone marrow (BM) plasma cells (PCs). This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 

In addition, NOV30 is predicted to be expressed in the following tissues because of the 
expression pattern of (GENBANK-ID: gb : GENB ANK-ID : HUMR Y UDO | acc : D13292.1) a 
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closely related Human mRNA for ryudocan core protein homolog in species Homo sapiens: 
myeloid tissue. 

NOV30 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 30C. 

5 



Table 30C. BLAST results for NOV30 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 147 7114 0 | ref |XP_ 
009530.3 | 
(XM_009530) 


syndecan 4 
(amphiglycan, 
ryudocan) [Homo 
sapiens] 


198 


119/197 
(60%) 


136/197 
(68%) 


9e-49 


gi | 4506861 | ref | NPO 
02990. l| 
<NM_002999) 


syndecan 4 
(amphiglycan, 
ryudo can) [ Homo 
sapiens] 


198 


120/197 
(60%) 


137/197 
(68%) 


2e-45 


gi | 6981522 | ref |NP_0 
36781. l| 
(NM 012649) 


ryudo can /syndecan 
4 [Rattus 
norvegicus] 


202 


119/199 
(59%) 


139/199 
(69%) 


3e-45 


gi | 6755442 | ref | NP_0 
35651 . 1 | 
(NM 011521) 


syndecan 4 [Mus 
musculus] 


198 


117/199 
(58%) 


136/199 
(67%) 


6e-41 


gi | 13 51051 | sp| P4941 
6 | SDC4_CHICK 


SYNDECAN- 4 
PRECURSOR 


197 


80/216 
(37%) 


105/216 
(48%) 


le-14 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 30D. 



10 



15 



20 



Table 30D Clustal W Sequence Alignment 



1) NOV3 0 (SEQ ID NO: 128) 

2) gi | 14771140 | ref |XP_009530 . 3 | (XM_009530) syndecan 4 (amphiglycan, ryudocan) 
[Homo sapiens] (SEQ ID NO: 452) 

3) gi | 4 506861 | ref |NP_002990 . 1 | (NM_00299 9 ) syndecan 4 (amphiglycan, ryudocan) [Homo 
sapiens] (SEQ ID NO: 4 53) 

4) gi | 6981522 | ref |NP_036781 . 1 | (NM_012649) ryudo can/ syndecan 4 [Rattus norvegicus] 
(SEQ ID NO:454) 

5) gi | 6755442 | ref |NP_035651 . 1 | (NM_011521) syndecan 4 [Mus musculus] (SEQ ID 
NO: 45 5) 

6) gi | 1351051 | sp| P49416 |SDC4_CHICK SYNDECAN- 4 PRECURSOR (SEQ ID NO:456) 



25 



30 



35 



NOV30 

gi 
gi 
gi 
gi 
gi 



14771140 | 
4506861 | 
6981522 | 
6755442 | 
1351051 j 



NOV30 

gi | 14771140 | 
gi|450686l| 
gi | 6981522 j 
gi | 6755442 | 
gi | 1351051 j 
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130 



150 



160 



170 



180 



10 



15 



NOV30 

gi 1 14771140 | 
gi j 4506861 1 
gi j 6981522 j 
gi j 6755442 j 
gi j 1351051 j 



NOV3 0 

gi | 14771140 | 
gi|4506861| 
gi | 6981522 j 
gi | 6755442 | 
gi 1 1351051 1 




20 Table 30E lists the domain description from DOMAIN analysis results against 

NOV30. This indicates that the NOV30 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 30E Domain Analysis of NOV30 

gnl | Pf am | pf am01034 , Syndecan, Syndecan domain. Syndecans are 
transmembrane heparin sulfate proteoglycans which are implicated in 
the binding of extracellular matrix components and growth factors (SEQ 
ID NO: 824) 

CD-Length = 359 residues, 21.7% aligned 
Score = 41.6 bits (96), Expect = 5e-05 



25 NOV30: 115 NEVIPRRISLSAGDQDVSNKAPMSNTA QGSNIFERMEWAVLIVDSIAGILS 166 

III + + + + 1+ II I I II ll+l +1 +1+1 

Sbjct : 258 NETSPENTAAANPEPLGRGQRPIDNTVDSGSSGAQQSQKILERKEVLAAVIAGGWGLLF 317 

NOV3 0: 16 7 AVFLILLLVNHM- KKDEG 183 

30 I I I I++ ++ I I I I I I 

Sbjct: 318 AVFLVMFMLYRMKKKDEG 335 

Kininogens, the high molecular weight precursor of vasoactive kinins, bind to a wide 
variety of cells in a specific, reversible, and saturable manner. The cell docking sites have 

35 been mapped to domains D3 and D5(H) of kininogens; however, the corresponding cellular 
acceptor sites are not fully established. To characterize the major cell binding sites for 
kininogens exposed by the endothelial cell line EA.hy926, intact cells were digested with 
trypsin and other proteases and found a time- and concentration-dependent loss of (125)1- 
labeled high molecular weight kininogen (H-kininogen) binding capacity (up to 82%), 

40 indicating that proteins are crucially involved in kininogen cell attachment. Cell surface 

digestion with heparinases similarly reduced kininogen binding capacity (up to 78%), and the 
combined action of heparinases and trypsin almost eliminated kininogen binding (up to 85%), 
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suggesting that proteoglycans of the heparan sulfate type are intimately involved. Consistently, 
inhibitors such as p-nitrophenyl-beta-d-xylopyranoside and chlorate interfering with heparan 
sulfate proteoglycan biosynthesis reduced the total number of kininogen binding sites in a 
time- and concentration-dependent manner (up to 67%). In vitro binding studies demonstrated 
5 that biotinylated H-kininogen binds to heparan sulfate glycosaminoglycans via domains D3 
and D5(H) and that the presence of Zn(2+) promotes this association. Cloning and over- 
expression of the major endothelial heparan sulfate-type proteoglycans syndecan-1, syndecan- 
2, syndecan-4, and glypican in HEK293t cells significantly increased total heparan sulfate at 
the cell surface and thus the number of kininogen binding sites (up to 3. 3-fold). This gain in 

10 kininogen binding capacity was completely abolished by treating transfected cells with 

heparinases. It was concluded that heparan sulfate proteoglycans on the surface of endothelial 
cells provide a platform for the local accumulation of kininogens on the vascular lining. This 
accumulation may allow the circumscribed release of short-lived kinins from their precursor 
molecules in close proximity to their sites of action (Renne et al., J Biol Chem 2000, 

15 275(43):33688-96). 

Lymphopoietic cells require interactions with bone marrow stroma for normal 
maturation and show changes in adhesion to matrix during their differentiation. Syndecan, a 
heparan sulfate-rich integral membrane proteoglycan, functions as a matrix receptor by 
binding cells to interstitial collagens, fibronectin, and thrombospondin. Therefore, it was asked 

20 whether syndecan was present on the surface of lymphopoietic cells. In bone marrow, 

syndecan was only found on precursor B cells. Expression changes with pre-B cell maturation 
in the marrow and with B-lymphocyte differentiation to plasma cells in interstitial matrices. 
Syndecan on B cell precursors is more heterogeneous and slightly larger than on plasma cells. 
Syndecan 1) is lost immediately before maturation and release of B lymphocytes into the 

25 circulation, 2) is absent on circulating and peripheral B lymphocytes, and 3) is reexpressed 
upon their differentiation into immobilized plasma cells. Thus, syndecan is expressed only 
when and where B lymphocytes associate with extracellular matrix. These results indicate that 
B cells differentiating in vivo alter their matrix receptor expression and suggest a role for 
syndecan in B cell stage-specific adhesion (Sanderson et al., Cell Regul 1989,l(l):27-35). 

30 Detection of abnormal numbers and/or distribution of bone marrow (BM) plasma cells 

(PCs) on trephine biopsies can be important in the differential diagnosis of multiple myeloma 
(MM) and other PC disorders. A variety of immunohistochemical markers can potentially 
improve the specificity and sensitivity of PC detection on routine histological sections 
obtained from trephine BM biopsies, but most of them are not completely satisfactory. In one 
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study, the antibody CD138/B-B4, which is an optimal marker for PC detection on BM 
aspirates by flow cytometry, was investigated to determine whether it can be used successfully 
for the identification of PCs also on formalin-fixed, decalcified biopsies. A series of samples 
including normal BM, MM, monoclonal gammopathies of undetermined significance, and B- 
5 cell lymphoma of various types, including B-cell precursor lymphoblastic leukemia, 

lymphoplasmacytoid, immunoblastic, lymphocytic/CLL, hairy cell leukemia, large B-cell, 
mantle-cell, marginal zone and follicular lymphomas, have been investigated for CD138. 
expression using a sensitive immunohistochemical technique. Within the BM 
microenvironment, CD138 was characterized by excellent sensitivity and specificity. Virtually 

10 all normal and neoplastic PCs expressed clear-cut membrane CD 138 immunostaining, whereas 
all other cell types did not. All cases of MM, including plasmablastic and leukemic cases, 
showed strong immunoreactivity. Conversely, all B-cell lymphomas, including all cases 
characterized by secretive features, lymphoplasmacytoid, and immunoblastic lymphomas, 
were completely negative. These results demonstrate that CD138 is a highly sensitive and 

15 specific marker that is useful for the rapid and precise localization of normal and neoplastic 
PCs on routine BM sections. In addition, because of its clear-cut cell membrane localization, 
CD 138 can be used successfully in double-marker immunostaining reactions to evaluate 
precisely nuclear prognostic markers such as Ki67 and p53 in MMs (Chilosi et al., Mod Pathol 
1999, 12(12):1 101-6). 

20 Monoclonal antibody therapy has emerged as a viable treatment option for patients 

with lymphoma and some leukemias. It is now beginning to be investigated for treatment of 
multiple myeloma. There are relatively few surface antigens on the plasma cells that are 
suitable for antibody-directed treatment. Possible molecules include HM1.24, CD38, ICAM-1 
(CD54), CD40, CD45, CD20, and syndecan 1 . There is now some clinical experience with 

25 anti-CD38 antibody in lymphoma and myeloma. However, to date, there has been minimal 
clinical activity observed. Additional antibodies are entering clinical trials. A new approach 
involves the generation of an anti-CD38 single-chain variable fragment (scFv) construct that 
acts as the carrier of a toxin gene instead of being conjugated directly to the toxin itself It is 
hoped that expression of the toxin by CD38+ plasma cells will promote suicide of the 

30 malignant cells without affecting normal cells or generating an immunologic response to the 
toxin. Ongoing clinical trials are also attempting to target B-cell antigens such as CD20. 
Although CD20 is present only on 20% of myeloma cells, it may be present on myeloma 
precursor cells. This treatment has met with success in follicular lymphoma and is now being 
evaluated in clinical trials in both Europe and the United States for myeloma. Although these 
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clinical trials are in very early stages, researchers are beginning to understand that antibody 
therapy can be used not only as a carrier molecule of radioisotopes and toxins, but also as 
molecules that can trigger tumor cells and promote growth arrest or apoptosis (Maloney et al., 
Semin Hematol 1999, 36(1 Suppl 3):30-3). 
5 The NOV30 nucleic acid of the invention encoding a Ryudocan-like protein includes 

the nucleic acid whose sequence is provided in Table 30A, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 30A while still encoding a protein that maintains 
its Ryudocan-like activities and physiological functions, or a fragment of such a nucleic acid. 

10 The invention further includes nucleic acids whose sequences are complementary to those just 
described, including nucleic acid fragments that are complementary to any of the nucleic acids 
just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

15 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 22% of the residues 
may be so changed. 

20 The NOV30 protein of the invention includes the Ryudocan-like protein whose 

sequence is provided in Table 30B. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 30B while 
still encoding a protein that maintains its Ryudocan-like activities and physiological functions, 
or a functional fragment thereof. In the mutant or variant protein, up to about 63 of the bases 

25 may be so changed. 

The NOV30 nucleic acids and proteins of the invention are useful in potential 
diagnostic and therapeutic applications implicated in various diseases and disorders described 
below and/or other pathologies. For example, the compositions of the present invention will 
have efficacy for treatment of patients suffering from: brain disorders including epilepsy, 

30 eating disorders, schizophrenia, ADD, cancer, heart disease, inflammation and autoimmune 
disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 
inflammatory skin disorders, allergies, blood disorders, psoriasis, colon cancer, leukemia, 
AIDS, thalamus disorders, metabolic disorders including diabetes and obesity, lung diseases 
such as asthma, myelomas, emphysema, cystic fibrosis, and cancer, pancreatic disorders 
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including pancreatic insufficiency and cancer, and prostate disorders including prostate cancer 
and other diseases, disorders and conditions of the like. 

NOV30 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 
5 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. For example the disclosed NOV30 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
10 disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 

NOV31 

A disclosed NOV31 nucleic acid of 683 nucleotides (also referred to as CG56392-01) 
15 encoding a novel Sulfur-rich Keratin-like protein is shown in Table 31 A. An open reading 

frame was identified beginning with an ATG initiation codon at nucleotides 46-48 and ending 
with a TGA codon at nucleotides 652-654. Putative untranslated regions, if any, found 
upstream from the initiation codon and downstream from the termination codon are underlined 
in Table 31 A, and the start and stop codons are in bold letters. 



Table 31 A. NOV31 Nucleotide Sequence (SEQ ID NO: 129) 



GAGCTGTGTAACAGCAACCGGAAAGAGAAACAATGGTGTGTTCCT ATGTGGGATATAAAGAGCCGGGGCTC 
AGGGGGCTCCACACCTGCACCTCCTTCTCACCTGCTCCTCTACCTGCTCCACCCTCAATCCACCAGAACCA 
TGGGCTGCTGTGGCTGCTCCGGAGGCTGTGGCTCCAGCTGTGGAGGCTGTGACTCCAGCTGTGGGAGCTGT 
GGCTCTGGCTGCAGGGGCTGTGGCCCCAGCTGCTGTGCACCCGTCTACTGCTGCAAGCCCGTGTGCTGCTG 
TGTTCCAGCCTGTTCCTGCTCTAGCTGTGGCAAGCGGGGCTGTGGCTCCTGTGGGGGCTCCAAGGGAGGCT 
GTGGTTCTTGTGGCTGCTCCCAGTGCAGTTGCTGCAAGCCCTGCTGTTGCTCTTCAGGCTGTGGGTCATCC 
TGCTGCCAGTGCAGCTGCTGCAAGCCCTACTGCTCCCAGTCCAGCTGTTGTAAGCCCTGTTGCTGCTCCTC 
AGGCTGTGGATCATCCTGCTGCCAGTCCAGCTGCTGCAAGCCCTGCTGCTGCCAGTCCAGCTGCTGTGTCC 
CCGTGTGCTGCCAGTCCAGCTGCTGCAAGCCCTGTTGCTGCCAGTCCAACTGTTGTGTCCCTGTGTGCTGC 
CAGTGTAAGATCTGAGGCTCTAGTGGGAAACCTCAGGTAGCTCC 



20 

The NOV3 1 nucleic acid was identified on chromosome 1 1 and has 654 of 683 bases 
(95%) identical to a gb:GENBANK-ID:HSA6693|acc:AJ006693.1 mRNA from Homo sapiens 
(UHS KerA gene) (E = 3.3e ,36 ). 

A disclosed NOV31 polypeptide (SEQ ID NO: 130) encoded by SEQ ID NO: 129 is 

25 202 amino acid residues and is presented using the one-letter code in Table 3 IB. Signal P, 

Psort and/or Hydropathy results predict that NOV3 1 contains a signal peptide and is likely to 

be localized to the cytoplasm with a certainty of 0.4500. The most likely cleavage site for a 

NOV31 peptide is between amino acids 32 and 33: TRT-MG. 
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Table 31B. Encoded NOV31 protein sequence (SEQ ID NO:130) 

MWDIKSRGSGGSTPAPPSHLLLYLLHPQSTRTMGCCGCSGGCGSSCGGCDSSCGSCGSGCRGCGPSCCAPVY 
CCKPVCCCVPACSCSSCGKRGCGSCGGSKGGCGSCGCSQCSCCKPCCCSSGCGSSCCQCSCCKPYCSQSSCC 
KPCCCSSGCGSSCCQSSCCKPCCCQSSCCVPVCCQSSCCKPCCCQSNCCVPVCCQCKI 

The disclosed NOV31 amino acid sequence has 1 58 of 170 amino acid residues (92%) 
identical to, and 158 of 170 amino acid residues (92%) similar to, the 169 amino acid residue 
ptnr:SW!SSNEW-ACC:P26371 protein from Homo sapiens (Human) (Keratin, Ultra High- 
Sulfur Matrix Protein A (Uhs Keratin A) (Uhs Kera)) (E = 1 .8e" 101 ). 

NOV3 1 is predicted to be expressed in at least Kidney, Pancreas, Testis and Whole 
Organism. This information was derived by determining the tissue sources of the sequences 
that were included in the invention including but not limited to SeqCalling sources, Public 
EST sources, Literature sources, and/or RACE sources. 

In addition, NOV3 1 is predicted to be expressed in the following tissues because of the 
expression pattern of (GENBANK-ID:gb:GENBANK-ID:HSA6693|acc:AJ006693.1) a 
closely related Homo sapiens UHS KerA gene homolog in species Homo sapiens: Kidney, 
Pancreas and Testis. 

NOV31 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 31 C. 



Table 31C. BLAST results for NOV31 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 12 83 53 76 | dbj | BAB 
23238, l| (AK004258) 


data source :SPTR, 
source 
key:Q64526 , 
evidence : ISS-puta 
tive-similar to 
ULTRA- HIGH 
SULPHUR KERATIN 
[Mus musculus] 


195 


76/157 
(48%) 


91/157 
(57%) 


5e-13 


gi | 2136964 | pir | | 146 
489 


cysteine -rich 
hair keratin 
associated 
protein - rabbit 


126 


53/120 
(44%) 


72/120 
(59%) 


3e-ll 


gi | 12844600 | db j | BAB 
26426 . 1 | (AK009665) 


data source :SPTR, 
source 
key :Q28707, 
evidence : ISS~homo 
log to CYSTEINE 
RICH HAIR KERATIN 
ASSOCIATED 
PROTElN-putative 
[Mus musculus] 


168 


59/116 
(50%) 


70/116 
(59%) 


le-10 


gi | 15082220 | ref |NP_ 
14 9 04 8. l| 
(NM_033059) 


keratin 
associated 
protein 4 . 14 
[Homo sapiens] 


195 


56/122 
(45%) 


65/122 
(52%) 


2e-l0 
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gi|l3386198|ref | NP_ 


RIKEN CDNA 


165 


53/106 


61/106 


2e-10 


081363. 1| 


2300006N05 [Mus 




(50%) 


(57%) 




(NM 027087) 


musculus] 











The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 3 ID. 

Table 31D Clustal W Sequence Alignment 
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1) NOV31 (SEQ ID NO: 13 0) 

2) gi | 12835376 | dbj | BAB23238 . 1 | (AK004258) data source:SPTR, source key:Q64526, 
evidence : ISS-putative-similar to ULTRA-HIGH SULPHUR KERATIN [Mus musculus] (SEQ ID 
NO:457) 

3) gi | 2136964 | pir | | 146489 cysteine- rich hair keratin associated protein - rabbit 
(SEQ ID NO: 45 8) 

4) gi | 12844600 |dbj |BAB26426.l| (AK009665) data source:SPTR, source key:Q28707, 
evidence : ISS~homolog to CYSTEINE RICH HAIR KERATIN ASSOCIATED PROTEIN-putat ive [Mus 
musculus] (SEQ ID NO: 459) 

5) gi | 15082220 | ref |NP_149048 . 1 | (NM_033059) keratin associated protein 4.14 [Homo 
sapiens] ( SEQ ID NO: 460) 

6) gi | 13386198 | ref |NP 081363 . 1 | (NM_027087) RIKEN CDNA 2300006N05 [Mus musculus] 
(SEQ ID NO: 461) 



NOV31 

gi | 12835376 | 
gi j 2136964 | 
gi | 12844600 | 
gi | 15082220 j 
gi|l3386198| 



NOV31 

gi | 12835376 | 
gi | 2136964 | 
i gi | 12844600 | 
gi|l5082220| 
gij 13386198 j 



NOV31 

gi|l2835376| 
gi|2136964| 
gi|l2844600 | 
gij 15082220 j 
gi j 13386198 | 



NOV31 

gi|l2835376| 
gi | 2136964 | 
gi j 12844600 | 
gi|l5082220| 
gi|l3386198| 




190 



200 



210 



220 



230 



240 



150 


SSG 




GS 


141 


RPS 


cc 
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gi | 15082220 j 
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gi | 13386198 | 165 165 

Insulin-like growth factor 1 (IGF-1) mediates many of the actions of growth hormone. 
5 Overexpression of IGF-1 has been reported to have endocrine and paracrine/autocrine effects 
on somatic growth in transgenic mice. To study the paracrine/autocrine effects of IGF-1 in hair 
follicles, transgenic mice were produced by pronuclear microinjection of a construct 
containing a mouse ultra-high sulfur keratin (UHS-KER) promoter linked to an ovine IGF-1 
cDNA. This UHS-KER promoter has previously been shown to direct expression of a reporter 

10 gene to the hair follicles of transgenic mice. Four transgenic mouse lines were established as a 
result of microinjection of 435 embryos. Transgene expression was found in skin at day 8 and 
day 15 of age in three of the lines. Progeny tests were carried out by mating two of the 
transgenic expressing males to nontransgenic females. Mice from one line were all 
nonexpressors while four of the 12 mice from the other showed integration of the transgene 

15 and three expressed transgene IGF-1 mRNA in the skin. Vibrissa growth at 1 1-21 d of age was 
significantly greater in transgenic expressors than in their nontransgenic littermates. 
Specifically, the increase in vibrissa length for transgenics at days 1 1-16 (20.5%) is 
approximately 2-fold compared with days 16-21 (1 1.9%). These results demonstrate that local 
overexpression of IGF-1 in transgenic mice is capable of stimulating vibrissa growth during 

20 the first neonatal hair cycle (Su et al., J Invest Dermatol 1999, 1 12(2):245-8). 

The major histological components of the hair follicle are the hair cortex and cuticle. 
The hair cuticle cells encase and protect the cortex and undergo a different developmental 
program to that of the cortex. In one study, the molecular characterization of a set of 
evolutionarily conserved hair genes which are transcribed in the hair cuticle late in follicle 

25 development was reported. Two genes were isolated and characterized, one expressed in the 
human follicle and one in the sheep follicle. Each gene encodes a small protein of 16 kD, 
containing greater than 50 cysteine residues, ranging from 31 to 36 mol% cysteine. Their high 
cysteine content and in vitro expression data identify them as ultra-high-sulfur (UHS) keratin 
proteins. The predicted proteins are composed almost entirely of cysteine-rich and glycine-rich 

30 repeats. Genomic blots reveal that the UHS keratin proteins are encoded by related multigene 
families in both the human and sheep genomes. Tissue in situ hybridization demonstrates that 
the expression of both genes is localized to the hair fiber cuticle and occurs at a late stage in 
fiber morphogenesis (MacKinnon et al., J Cell Biol 1990, 1 1 1(6 Pt l):2587-600). 

The NOV31 nucleic acid of the invention encoding a Sulfur-rich Keratin-like protein 

35 includes the nucleic acid whose sequence is provided in Table 31 A, or a fragment thereof. The 
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invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
fromthe corresponding base shown in Table 3 1 A while still encoding a protein that maintains 
its Sulfur-rich Keratin-like activities and physiological functions, or a fragment of such a 
nucleic acid. The invention further includes nucleic acids whose sequences are complementary 
5 to those just described, including nucleic acid fragments that are complementary to any of the 
nucleic acids just described. The invention additionally includes nucleic acids or nucleic acid 
fragments, or complements thereto, whose structures include chemical modifications. Such 
modifications include, by way of non-limiting example, modified bases, and nucleic acids 
whosesugar phosphate backbones are modified or derivatized. These modifications are carried 
10 out at least in part to enhance the chemical stability of the modified nucleic acid, such that 

they may be used, for example, as antisense binding nucleic acids in therapeutic applications 
in a subject. In the mutant or variant nucleic acids, and their complements, up to about 5% of 
the residues may be so changed. 



15 whose sequence is provided in Table 3 IB. The invention also includes a mutant or variant 

protein any of whose residues may be changed from the corresponding residue shown in Table 
3 IB while still encoding a protein that maintains its Sulfur-rich Keratin-like activities and 
physiological functions, or a functional fragment thereof. In the mutant or variant protein, up 
to about 56% of the bases may be so changed. 

20 The NOV3 1 nucleic acids and proteins of the invention are useful in potential 

diagnostic and therapeutic applications implicated in various diseases and disorders described 
below and/or other pathologies. For example, the compositions of the present invention will 
have efficacy for treatment of patients suffering from: brain disorders including epilepsy, 
eating disorders, schizophrenia, ADD, cancer, heart disease, inflammation and autoimmune 

25 disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 

inflammatory skin disorders, allergies, blood disorders, psoriasis, colon cancer, leukemia, 
AIDS, thalamus disorders, metabolic disorders including diabetes and obesity, lung diseases 
such as asthma, emphysema, cystic fibrosis, and cancer, pancreatic disorders including 
pancreatic insufficiency and cancer, and prostate disorders including prostate cancer and other 

30 diseases, disorders and conditions of the like. 

NOV31 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 



The NOV31 protein of the invention includes the Sulfur-rich Keratin-like protein 
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NOVX Antibodies" section below. For example the disclosed NOV31 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 



NOV32 

A disclosed NOV32 nucleic acid of 1575 nucleotides (also referred to as CG56686-01) 
encoding a novel DNMT1 associated protein- 1 (DMAP)-like protein is shown in Table 32A. 
An open reading frame was identified beginning with an ATG initiation codon at nucleotides 
94-96 and ending with a TGA codon at nucleotides 1573-1575. Putative untranslated regions, 
if any, found upstream from the initiation codon and downstream from the termination codon 
are underlined in Table 32A, and the start and stop codons are in bold letters. 



Table 32 A. NOV32 Nucleotide Sequence (SEQ ID NO: 131) 

CTTGGAGGCTGCAGGTCCGGACCCAGGTGCGGAAGTGCGAGGGCCCAGGCACTGACCCTTGACCTCCGGTG 
GCTCCCCCATCTCTCAGGCGCGA TQGCTACGGGCGCGGATGTACGGGACATTCTAGAACTCGGGGGTCCAG 
AAGGGGATGCAGC CTCTGGGACCAT C AG C AAG AAGGAC AT T ATC AAC C CGGAC AAG AAAAAATC CAAGAAG 
TCCTCTGAGACACTGACTTTCAAGAGGCCCGAGGGCATGCACCGGGAAGTCTATGCCTTGCTCTACTCTGA 
CAAGAACAAGGGCTCCTGCTTGCTTAGCAGGATGCAGGAGGACCTGAAGTCTTTTGCTCCAGGACATGACT 
TT C TTGC T AT AGGGGATGC AC CC C CACTGCT ACC C AG TG AC ACTGG C C AGGG AT AC CGT AC AGT GAAGG CC 
AAGTTGGGCTCCAAGAAGGTGCGGCCTTGGAAGTGGATGCCATTCACCAACCCGGCCCGCAAGGACGGAGC 
AATGTTCTTCCACTGGCGACGTGCAGCGGAGGAGGGCAAGGACTACCCCTTTGCCAGGTTCAATAAGACTG 
TGCAGGTGCCTGTGTACTCGGAGCAGGAGTACCAGCTTTATCTCCACGATGATGCTTGGACTAAGGCAGAA 
ACTGACCACCTCTTTGACCTCAGCCGCCGCTTTGACCTGCGTTTTGTTGTTATCCATGACCGGTATGACCA 
CCAGCAGTTCAAGAAGCGTTCTGTGGAAGACCTGAAGGAGCGGTACTACCACATCTGTGCTAAGCTTGCCA 
ACGTGCGGGCTGTGCCAGGCACAGACCTTAAGATACCAGTATTTGATGCTGGGCACGAACGACGGCGGAAG 
GAACAGCTTGAGCGTCTCTACAACCGGACCCCAGAGCAGGTGGCAGAGGAGGAGTACCTGCTACAGGAGCT 
GCGCAAGATTGAGGCCCGGAAGAAGGAGCGGGAGAAACGCAGCCAGGACCTGCAGAAGCTGATCACAGCGG 
CAGACACCACTGCAGAGCAGCGGCGCACGGAACGCAAGGCCCCCAAAAAGAAGCTACCCCAGAAAAAGGAG 
GCTGAGAAGCCGGCTGTTCCTGAGACTGCAGGCATCAAGTTTCCAGACTTCAAGTCTGCAGGTGTCACGCT 
GCGGAGCCAACGGATGAAGCTGCCAAGCTCTGTGGGACAGAAGAAGATCAAGGCCCTGGAACAGATGCTGC 
TGGAGCTTGGTGTGGAGCTGAGCCCGACACCTACGGAGGAGCTGGTGCACATGTTCAATGAGCTGCGAAGC 
GACCTGGTGCTGCTCTACGAGCTCAAGCAGGCCTGTGCCAACTGCGAGTATGAGCTGCAGATGCTGCGGCA 
CCGTCATGAGGCACTGGCCCGGGCTGGTGTGCTAGGGGGCCCTGCCACACCAGCATCAGGCCCAGGCCCGG 
CCTCTGCTGAGCCGGCAGTGACTGAACCCGGACTTGGTCCTGACCCCAAGGACACCATCATTGATGTGGTG 
GGCGCACCCCTCACGCCCAATTCGAGAAAGCGACGGGAGTCGGCCTCCAGCTCATCTTCCGTGAAGAAAGC 
CAAGAAGCCGTGA 



The NOV32 nucleic acid was identified on chromosome lp34 and has 1244 of 1273 
bases (97%) identical to a gb:GENBANK-ID:AF265228|acc:AF265228.1 mRNA from Homo 
sapiens (DNMT1 associated protein-1 (DMAP1) mRNA, complete cds) (E = l.Oe" 309 ). 

A disclosed NOV32 polypeptide (SEQ ID NO: 132) encoded by SEQ ID NO: 131 is 
493 amino acid residues and is presented using the one-letter code in Table 32B. Signal P, 
Psort and/or Hydropathy results predict that NOV32 does not contain a signal peptide and is 
likely to be localized to the nucleus with a certainty of 0.9800. 
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Table 32B. Encoded NOV32 protein sequence (SEQ ID NO:132) 

MATGADVRDILELGGPEGDAASGTISKKDI INPDKKKSKKSSETLTFKRPEGMHREVYALLYSDKNKGSCLL 
SRMQEDLKSFAPGHDFLAIGDAPPLLPSDTGQGYRTVKAKLGSKKVRPWKWMPFTNPARKDGAMFFHWRRAA 
EEGKDYPFARFNKTVQVPVYSEQEYQLYLHDDAWTKAETDHLFDLSRRFDLRFWIHDRYDHQQFKKRSVED 
LKERYYHICAKLANVRAVPGTDLiKI PVFDAGHERRRKEQLERLYNRTPEQVAEEEYLLQELRKIEARKKERE 
KRSQDLQKLITAADTTAEQRRTERKAPKKKLPQKKEAEKPAVPETAGIKFPDFKSAGVTLRSQRMKLPSSVG 
QKKIKALEQMLLELGVELSPTPTEELVHMFNELRSDLVLLYELKQACANCEYELQMLRHRHEALARAGVLGG 
PATPASGPGPASAEPAVTEPGLGPDPKDTI IDWGAPLTPNSRKRRESASSSSSVKKAKKP 



The disclosed NOV32 amino acid sequence has 401 of 401 amino acid residues 
(100%) identical to, and 401 of 401 amino acid residues (100%) similar to, the 467 amino acid 
residue ptnr : SPTREM B L- ACC : Q9NPF5 protein from Homo sapiens (Human) (Hypothetical 
5 53.0 Kda Protein (Dnmtl Associated Protein-1) (E = 1.3e" 248 ). 

NOV32 is predicted to be expressed in at least Adrenal gland, bone marrow, brain - 
amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
10 small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea and uterus. This 

information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 

NOV32 also has homology to the amino acid sequences shown in the BLASTP data 
15 listed in Table 32C. 



Table 32C. BLAST results for NOV32 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 724323 l|dbj | BAA9 
2663. l| (AB037846) 


KIAA1425 protein 
[Homo sapiens] 


495 


446/473 
(94%) 


446/473 
(94%) 


0.0 


gi | 1312 3 77 6 | ref |NP 
061973. l| 
(NM_019100) 


DNA 

methyl transferase 
l-associated 
protein 1 [Homo 
sapiens] 


467 


446/473 
(94%) 


446/473 
(94%) 


0 . 0 


gi | 12052838 | emb | CAB 
66592. l| (AL136657) 


hypothetical 
protein [Homo 
sapiens] 


467 


443/473 
(93%) 


445/473 
(93%) 


0 . 0 


gi | 12 96 3 5 57 | ref | NP_ 
075667. l| 
(NM 023178) 


DNMTl associated 
protein-1 [Mus 
musculus] 


468 


437/474 
(92%) 


438/474 
(92%) 


0 . 0 


gi | 12 8 056 75 |gb | AAH0 
2321. 1 |AAH02321 
(BC002321) 


Unknown (protein 
for ^ 
IMAGE:3594236) 
[Mus musculus] 


451 


420/457 
(91%) 


421/457 
(91%) 


0 . 0 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 32D. 
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Table 32D Clustal W Sequence Alignment 



10 



1) NOV32 (SEQ ID NO: 132) 

2) gi|724323l|dbj |BAA92663.l| (AB037846) KIAA1425 protein [Homo sapiens] (SEQ ID 
NO; 4 62) 

3) gi | 13123776 | ref |NP_061973 . l | (NM019100) DNA methyl transferase 1-associated 
protein 1 [Homo sapiens] (SEQ ID NO: 4 63) 

4) gi | 12052838 | emb | CAB66592 . 1 | (AL136657) hypothetical protein [Homo sapiens] (SEQ 
ID NO: 464) 

5) gi| 12963557 | ref|NP_075667.1 | (NM__023178) DNMT1 associated protein-1 [Mus 
musculus] (SEQ ID NO: 46 5) 

6) gi|l2805675|gb|AAH02321.l|AAH02321 (BC002321) Unknown (protein for 
IMAGE:3594236) [Mus musculus] (SEQ ID NO:466) 



15 



20 



NOV32 

gi |7243231| 
gi | 13123776 | 
gi | 12052838 | 
gi | 12963557 j 
gi | 12805675 j 
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NOV32 


33 


gi 


7243231| 


61 


gi 


13123776| 


33 


gi 


12052838 j 


33 


gi 


12963557 | 


33 


gi 


12805675 | 
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IKGSCLLSRMQEDLKSFAPGHDFLAIG 92 

94 

6G 

66 
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130 
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40 



NOV32 

gi|724323l| 
gi | 13123776 | 
gi | 12052838 j 
gi j 12963557 | 
gi j 12805675 j 



93 
95 
67 
67 
67 
50 



I 



140 
. . I . . 



150 



160 



170 



I 



I 



I 



| 



180 
• * I 



DAPPLLPSDTGQGYRTVKAKLGS 
DAPPLLPSDTGQGYRTVKAKLGS 
DAPPLLPSDTGQGYRTVKAKLGS 
DAPPLLPSDTGQGYRTVKAKLGS 
DAPPLLPSDTGQGYRTVKAKLGS 
DAPPLLPSDTGQGYRTVKAKLGS 



KKVRPWKWMPFTNPARKDGAMFFHWRRAAEEGKDYPF 
KKVRPWKWMPFTNPARKDGAMFFHWRRAAEEGKDYPF 
KKVRPWKWMPFTNPARKDGAMFFHWRRAAEEGKDYPF 
KKVRPWKWMPFTNPARKDGAMFFHWRRAAEEGKDYPF 
KKVRPWKWMPFTNPARKDGAMFFHWRRAAEEGKDYPF 
KKVRPWKWMPFTNPARKDGAM F FPjWRRAAEEGKDYPF 



152 
154 
126 
126 
126 
109 



45 



50 



55 



60 



65 



NOV32 

gi|724323l| 
gi | 13123776 | 
gi|l2052838| 
gi j 12963557 j 
gi 1 12805675 | 



NOV32 

gi|724323l| 
gi j 13123776 | 
gi|l2052838| 
gijl2963557| 
gij 12805675 | 



NOV32 

gi |7243231| 
gi j 13123776 
gi j 12052838 
gi j 12963557 
gi j 12805675 




213 
215 
187 
187 
187 
170 



273 
275 
247 
247 
247 
230 



I 



250 
. . I . . 



260 



I 



270 
I 



280 
- - I - - 



290 
. . I . . 



300 
■ - I 



SVEDLKERYYHICAKLANVRAVPGTDLKIPVFDAGHERRRKEQLERLYNRTPEQVAEEEY 
SVEDLKERYYHICAKLANVRAVPGTDLKIPVFDAGHERRRKEQLERLYNRTPEQVAEEEY 
SVEDLKERYYHICAKLANVRAVPGTDLKIPVFDAGHERRRKEQLERLYNRTPEQVAEEEY 
SVEDLKERYYHICAKLANVRAVPGTDLKIPVFDAGHERRRKEQLERLYNRTPEQVAEEEY 
SVEDLKERYYHICAKLANVRAVPGTDLKIPVFDAGHERRRKEQLERLYNRTPEQVAEEEY 
SVEDLKERYYHICAKLANVRAVPGTDLKIPVFDAGHERRRKEQLERLYNRTPEQVAEEEY 



310 



320 



330 



340 



350 



360 



I 



LLQELRKIEARKKEREKRSQDLQKLITAADTTAEQRRTERKAPKKKLPQKKEAEKPAVPE 
LLQELRKIEARKKEREKRSQDLQKLITAADTTAEQRRTERKAPKKKLPQKKEAEKPAVPE 
LLQELRKIEARKKEREKRSQDLQKLITAADTTAEQRRTERKAPKKKLPQKKEAEKPAVPE 
LLQELRKIEARKKEREKRSQDLQKLITAADTTAEQRRTERKAPKKKLPQKKEAEKPAVPE 
LLQELRKIEARKKEREKRSQDLQKLITAADTTAEQRRTERKAPKKKLPQKKEAEKPAVPE 
LLQELRKIEARKKEREKRSQDLQKLITAADTTAEQRRTERKAPKKKLPQKKEAEKPAVPE 
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15 
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25 



NOV32 

gi|7243231 | 
gi|l3123776| 
gijl2052838| 
gi | 12963557 | 
gi j 12805675 | 



NOV32 

gi|724323l| 
gi | 13123776 | 
gi|l2052838| 
gi|l2963557| 
gi|l2805675| 



333 
335 
307 
307 
307 
290 



393 
395 
367 
367 
367 
350 



NOV32 


453 


gi | 7243231 1 


455 


gi j 13123776 | 


427 


gi|l2052838| 


427 


gi j 12963557 j 


427 


gi|l2805675| 


410 
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. • I I 
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390 



400 



I 



I 
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. . I - - 
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TAGIKFPDFKSAGVTLRSQRMKLPSSVGQKKIKALEQMLLELGVEL.SPTPTEELVHMFNE 
TAGIKFPDFKSAGVTLRSQRMKLPSSVGQKKIKALEQMLLELGVELSPTPTEELVHMFNE 
TAGIKFPDFKSAGVTLRSQRMKLPSSVGQKKIKALEQMLLELGVELSPTPTEELVHMFNE 
TAGIKFPDFKSAGVTLRSQRMKLPSSVGQKKIKALEQMLLELGVELSPTPTEELVHMFNE 
TAGIKFPDFKSAGVTLRSQRMKLPSSVGQKKIKALEQMLLELGVELSPTPTEELVHMFNE 
TAGIKFPDFKSAGVTLRSQRMKLPSSVGQKKIKALEQMLLELGVELSPTPTEELVHMFNE 



430 



I 



440 
..I 



450 



I 



I 



460 



470 



480 



l r s dl vl lye l kq ac anc e y e l qm l rhrh e al arag vlgg pa 
lrsdlvllyelkqacanceyelqmlrhrhealaragvlggpa' 
lrsdlvllyelkqacanceyelqmlrhrhealaragvlggpa' 
lrsdlvllyelkqacanceyelqmlrhrhealaragvlggpa 
lrsdlvllyelkqacanceyelqmlrhrhealaragvlggpaf 
lrsdlvllyelkqacanceyelqmlrhrhealaragvlgHpa 





490 



500 



510 



520 
. ■ I . . 



kdtiidwgapltpnsrkrresasssssvkkakkp 
kdtiidwgapltpnsrkrresasssssvkkakkp 
kdtiidwgapltpnsrkrresasssssvkkakkp 
kdtiidwgapltpnsrkrresasssssvkkakkp 
kdtiidwgapltpnsrkrresasssssvkkakkp 
kdtiidwgapltpnsrkrresasssssvkkakkp 



493 
495 
467 
467 
468 
451 



Methylation of CpG islands is associated with transcriptional silencing and the 

30 formation of nuclease-resistant chromatin structures enriched in hypoacetylated histones. 

Methyl-CpG-binding proteins, such as MeCP2, provide a link between methylated DNA and 
hypoacetylated histones by recruiting histone deacetylase, but the mechanisms establishing the 
methylation patterns themselves are unknown. Whether DNA methylation is always causal for 
the assembly of repressive chromatin or whether features of transcriptionally silent chromatin 

35 might target methyltransferase remains unresolved. Mammalian DNA methyltransferases 
(DNMT) show little sequence specificity in vitro, yet methylation can be targeted in vivo 
within chromosomes to repetitive elements, centromeres and imprinted loci. This targeting is 
frequently disrupted in tumour cells, resulting in the improper silencing of tumour-suppressor 
genes associated with CpG islands. Robertson et al. (Nat Genet 2000, 25:338-42) have shown 

40 that the predominant mammalian DNA methyltransferase, DNMT1, co-purifies with the 

retinoblastoma (Rb) tumour suppressor gene product, E2F1, and HDAC1 and that DNMT1 
cooperates with Rb to repress transcription from promoters containing E2F-binding sites. 
These results establish a link between DNA methylation, histone deacetylase and sequence- 
specific DNA binding activity, as well as a growth-regulatory pathway that is disrupted in 

45 nearly all cancer cells. Recently, Rountree et al. (Nat Genet, 2000, 25:269-77) have shown that 
the non-catalytic amino terminus of DNMT1 binds to HDAC2 and a new protein, DMAP1 (for 
DNMT1 associated protein), and can mediate transcriptional repression. DMAP1 has intrinsic 
transcription repressive activity, and binds to the transcriptional co-repressor TSG101. 
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DMAP1 is targeted to replication foci through interaction with the far N terminus of DNMT1 
throughout S phase, whereas HDAC2 joins DNMT1 and DMAP1 only during late S phase, 
providing a platform for how histones may become deacetylated in heterochromatin following 
replication. Thus, DNMT1 not only maintains DNA methylation, but also may directly target, 
5 in a heritable manner, transcriptionally repressive chromatin to the genome during DNA 
replication. 

The NOV32 nucleic acid of the invention encoding a DNMT1 associated protein- 1 
(DMAP)-like protein includes the nucleic acid whose sequence is provided in Table 32 A, or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 

10 bases may be changed from the corresponding base shown in Table 32A while still encoding a 
protein that maintains its DNMT1 associated protein- 1 (DMAP)-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 

15 invention additionally includes nucleic acids or nucleic acid fragments, or complements 

thereto, whose structures include chemical modifications. Such modifications include, by way 
of non-limiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 

20 antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or variant 
nucleic acids, and their complements, up to about 3% of the residues may be so changed. 

The NOV32 protein of the invention includes the DNMT1 associated protein- 1 
(DMAP)-like protein whose sequence is provided in Table 32B. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 

25 residue shown in Table 32B while still encoding a protein that maintains its DNMT1 

associated protein- 1 (DMAP)-like activities and physiological functions, or a functional 
fragment thereof. In the mutant or variant protein, up to about 9% of the bases may be so 
changed. 

The NOV32 nucleic acids and proteins of the invention are useful in potential 
30 diagnostic and therapeutic applications implicated in various diseases and disorders described 
below and/or other pathologies. For example, the compositions of the present invention will 
have efficacy for treatment of patients suffering from: cancers such as breast cancer, colorectal 
cancers, lung cancer, liver cancer, pancreatic cancer, prostate cancer, stomach cancers, 
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developmental syndromes, Fragile X and Rett and other diseases, disorders and conditions of 
the like. 

NOV32 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 
5 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobic ity charts, as described in the "Anti- 
NOVX Antibodies" section below. For example the disclosed NOV32 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
10 disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 



NOV33 

A disclosed NOV33 nucleic acid of 7693 nucleotides (also referred to as CG56688-01) 
15 encoding a novel Notchl-like protein is shown in Table 33A. An open reading frame was 

identified beginning with an ATG initiation codon at nucleotides 1-3 and ending with a TAA 
codon at nucleotides 7669-7671. Putative untranslated regions, if any, found upstream from 
the initiation codon and downstream from the termination codon are underlined in Table 33A, 
and the start and stop codons are in bold letters. 



Table 33 A. NOV33 Nucleotide Sequence (SEQ ID NO: 133) 

ATGCCGCCGCTCCTGGCGCCCCTGCTCTGCCTGGCGCTGCTGCCCGCGCTCGCCGCACGAGGCCCGCGATG 
CTCCCAGCCCGGTGAGACCTGCCTGAATGGCGGGAAGTGTGAAGCGGCCAATGGCACGGAGGCCTGCGTCT 
GTGGCGGGGCCTTCGTGGGCCCGCGATGCCAGGACCCCAACCCGTGCCTCAGCACCCCCTGCAAGAACGCC 
GGGACATGCCACGTGGTGGACCGCAGAGGCGTGGCAGACTATGCCTGCAGCTGTGCCCTGGGCTTCTCTGG 
GCCCCTCTGCCTGACACCCCTGGACAACGCCTGCCTCACCAACCCCTGCCGCAACGGGGGCACCTGCGACC 
TGCTCACGCTGACGGAGTACAAGTGCCGCTGCCCGCCCGGCTGGTCAGGGAAATCGTGCCAGCAGGCTGAC 
CCGTGCGCCTCCAACCCCTGCGCCAACGGTGGCCAGTGCCTGCCCTTCGAGGCCTCCTACATCTGCCACTG 
CCCACCCAGCTTCCATGGCCCCACCTGCCGGCAGGATGTCAACGAGTGTGGCCAGAAGCCCGGGCTTTGCC 
GCCACGGAGGCACCTGCCACAACGAGGTCGGCTCCTACCGCTGCGTCTGCCGCGCCACCCACACTGGCCCC 
AACTGCGAGCGGCCCTACGTGCCCTGCAGCCCCTCGCCCTGCCAGAACGGGGGCACCTGCCGCCCCACGGG 
CGACGTCACCCACGAGTGTGCCTGCCTGCCAGGCTTCACCGGCCAGAACTGTGAGGAAAATATCGACGATT 
GT C C AGG AAAC AAC TG CAAGAACGGGGGTGC C TGTGTGGACGGCGTGAAC AC C T AC AACTGCCCGTGC C CG 
CCAGAGTGGACAGGTCAGTACTGTACCGAGGATGTGGACGAGTGCCAGCTGATGCCAAATGCCTGCCAGAA 
CGGCGGGACCTGCCACAACACCCACGGTGGCTACAACTGCGTGTGTGTCAACGGCTGGACTGGTGAGGACT 
GCAGCGAGAACATTGATGACTGTGCCAGCGCCGCCTGCTTCCACGGCGCCACCTGCCATGACCGTGTGGCC 
TCCTTTTACTGCGAGTGTCCCCATGGCCGCACAGGTCTGCTGTGCCACCTCAACGACGCATGCATCAGCAA 
CCCCTGTAACGAGGGCTCCAACTGCGACACCAACCCTGTCAATGGCAAGGCCATCTGCACCTGCCCCTCGG 
GGTACACGGGCCCGGCCTGCAGCCAGGACGTGGATGAGTGCTCGCTGGGTGCCAACCCCTGCGAGCATGCG 
GGCAAGTGCATCAACACGCTGGGCTCCTTCGAGTGCCAGTGTCTGCAGGGCTACACGGGCCCCCGATGCGA 
GATCGACGTCAACGAGTGCGTCTCGAACCCGTGCCAGAACGACGCCACCTGCCTGGACCAGATTGGGGAGT 
TCCAGTGCATGTGCATGCCCGGCTACGAGGGTGTGCACTGCGAGGTCAACACAGACGAGTGTGCCAGCAGC 
CCCTGCCTGCACAATGGCCGCTGCCTGGACAAGATCAATGAGTTCCAGTGCGAGTGCCCCACGGGCTTCAC 
TGGGCATCTGTGCCAGTACGATGTGGACGAGTGTGCCAGCACCCCCTGCAAGAATGGTGCCAAGTGCCTGG 
ACGGACCCAACACTTACACCTGTGTGTGCACGGAAGGGTACACGGGGACGCACTGCGAGGTGGACATCGAT 
GAGTGCGACCCCGACCCCTGCCACTACGGCTCCTGCAAGGACGGCGTCGCCACCTTCACCTGCCTCTGCCG 
CCCAGGCTACACGGGCCACCACTGCGAGACCAACATCAACGAGTGCTCCAGCCAGCCCTGCCGCCACGGGG 
G C ACC TGC C AGGAC CG CG AC AACGC C T AC CTCTGCTT CTG C CTGAAGGGG ACC AC AGG ACCC AACTGCG AG 
ATCAACCTGGATGACTGTGCCAGCAGCCCCTGCGACTCGGGCACCTGTCTGGACAAGATCGATGGCTACGA 
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GTGTGCCTGTGAGCCGGGCTACACAGGGAGCATGTGTAACATCAACATCGATGAGTGTGCGGGCAACCCCT 
GCCACAACGGGGGCACCTGCGAGGACGGCATCAATGGCTTCACCTGCCGCTGCCCCGAGGGCTACCACGAC 
CCCACCTGCCTGTCTGAGGTCAATGAGTGCAACAGCAACCCCTGCGTCCACGGGGCCTGCCGGGACAGCCT 
CAACGGGTACAAGTGCGACTGTGACCCTGGGTGGAGTGGGACCAACTGTGACATCAACAACAACGAGTGTG 
AATCCAACCCTTGTGTCAACGGCGGCACCTGCAAAGACATGACCAGTGGCTACGTGTGCACCTGCCGGGAG 
GGCTTCAGCGGTCCCAACTGCCAGACCAACATCAACGAGTGTGCGTCCAACCCATGTCTGAACAAGGGCAC 
GTGTATTGACGACGTTGCCGGGTACAAGTGCAACTGCCTGCTGCCCTACACAGGTGCCACGTGTGAGGTGG 
TGCTGGCCCCGTGTGCCCCCAGCCCCTGCAGAAACGGCGGGGAGTGCAGGCAATCCGAGGACTATGAGAGC 
TTCTCCTGTGTCTGCCCCACGGCTGGGGCCAAAGGGCAGACCTGTGAGGTCGACATCAACGAGTGCGTTCT 
GAGCCCGTGCCGGCACGGCGCATCCTGCCAGAACACCCACGGCGGCTACCGCTGCCACTGCCAGGCCGGCT 
ACAGTGGGCGCAACTGCGAGACCGACATCGACGACTGCCGGCCCAACCCGTGTCACAACGGGGGCTCCTGC 
ACAGACGGCATCAACACGGCCTTCTGCGACTGCCTGCCCGGCTTCCGGGGCACTTTCTGTGAGGAGGACAT 
CAACGAGTGTGCCAGTGACCCCTGCCGCAACGGGGCCAACTGCACGGACTGCGTGGACAGCTACACGTGCA 
CCTGCCCCGCAGGCTTCAGCGGGATCCACTGTGAGAACAACACGCCTGACTGCACAGAGAGCTCCTGCTTC 
AACGGTGGCACCTGCGTGGACGGCATCAACTCGTTCACCTGCCTGTGTCCACCCGGCTTCACGGGCAGCTA 
CTGCCAGCACGATGTCAATGAGTGCGACTCACAGCCCTGCCTGCATGGCGGCACCTGTCAGGACGGCTGCG 
GCTCCTACAGGTGCACCTGCCCCCAGGGCTACACTGGCCCCAACTGCCAGAACCTTGTGCACTGGTGTGAC 
TCCTCGCCCTGCAAGAACGGCGGCAAATGCTGGCAGACCCACACCCAGTACCGCTGCGAGTGCCCCAGCGG 
CTGGACCGGCCTTTACTGCGACGTGCCCAGCGTGTCCTGTGAGGTGGCTGCGCAGCGACAAGGTGTTGACG 
TTGCCCGCCTGTGCCAGCATGGAGGGCTCTGTGTGGACGCGGGCAACACGCACCACTGCCGCTGCCAGGCG 
GGCTACACAGGCAGCTACTGTGAGGACCTGGTGGACGAGTGCTCACCCAGCCCCTGCCAGAACGGGGCCAC 
CTGCACGGACTACCTGGGCGGCTACTCCTGCAAGTGCGTGGCCGGCTACCACGGGGTGAACTGCTCTGAGG 
AGATCGACGAGTGCCTCTCCCACCCCTGCCAGAACGGGGGCACCTGCCTCGACCTCCCCAACACCTACAAG 
TGCTCCTGCCCACGGGGCACTCAGGGTGTGCACTGTGAGATCAACGTGGACGACTGCAATCCCCCCGTTGA 
CCCCGTGTCCCGGAGCCCCAAGTGCTTTAACAACGGCACCTGCGTGGACCAGGTGGGCGGCTACAGCTGCA 
CCTGCCCGCCGGGCTTCGTGGGTGAGCGCTGTGAGGGGGATGTCAACGAGTGCCTGTCCAATCCCTGCGAC 
GCCCGTGGCACCCAGAACTGCGTGCAGCGCGTCAATGACTTCCACTGCGAGTGCCGTGCTGGTCACACCGG 
GCGCCGCTGCGAGTCCGTCATCAATGGCTGCAAAGGCAAGCCCTGCAAGAATGGGGGCACCTGCGCCGTGG 
CCTCCAACACCGCCCGCGGGTTCATCTGCAAGTGCCCTGCGGGCTTCGAGGGCGCCACGTGTGAGAATGAC 
GCTCGTACCTGCGGCAGCCTGCGCTGCCTCAACGGCGGCACATGCATCTCCGGCCCGCGCAGCCCCACCTG 
CCTGTGCCTGGGCCCCTTCACGGGCCCCGAATGCCAGTTCCCGGCCAGCAGCCCCTGCCTGGGCGGCAACC 
CCTGCTACAACCAGGGGACCTGTGAGCCCACATCCGAGAGCCCCTTCTACCGTTGCCTGTGCCCCGCCAAA 
TTCAACGGGCTCTTGTGCCACATCCTGGACTACAGCTTCGGGGGTGGGGCCGGGCGCGACATCCCCCCGCC 
GCTGATCGAGGAGGCGTGCGAGCTGCCCGAGTGCCAGGAGGACGCGGGCAACAAGGTCTGCAGCCTGCAGT 
GC AAC AAC C ACGC GTG CGGCT GGG ACGG CGGTGAC TG CT C C CT C AACTTC AATGAC C C CTGG AAG AAC TGC 
ACGCAGTCTCTGCAGTGCTGGAAGTACTTCAGTGACGGCCACTGTGACAGCCAGTGCAACTCAGCCGGCTG 
CCTCTTCGACGGCTTTGACTGCCAGCGTGCGGAAGGCCAGTGCAACCCCCTGTACGACCAGTACTGCAAGG 
ACCACTTCAGCGACGGGCACTGCGACCAGGGCTGCAACAGCGCGGAGTGCGAGTGGGACGGGCTGGACTGT 
GCGGAGCATGTACCCGAGAGGCTGGCGGCCGGCACGCTGGTGGTGGTGGTGCTGATGCCGCCGGAGCAGCT 
GCGCAACAGCTCCTTCCACTTCCTGCGGGAGCTCAGCCGCGTGCTGCACACCAACGTGGTCTTCAAGCGTG 
ACGCACACGGCCAGCAGATGATCTTCCCCTACTACGGCCGCGAGGAGGAGCTGCGCAAGCACCCCATCAAG 
CGTGCCGCCGAGGGCTGGGCCGCACCTGACGCCCTGCTGGGCCAGGTGAAGGCCTCGCTGCTCCCTGGTGG 
CAGCGAGGGTGGGCGGCGGCGGAGGGAGCTGGACCCCATGGACGTCCGCGGCTCCATCGTCTACCTGGAGA 
TTGACAACCGGCAGTGTGTGCAGGCCTCCTCGCAGTGCTTCCAGAGTGCCACCGATGTGGCCGCATTCCTG 
GGAGCGCTCGCCTCGCTGGGCAGCCTCAACATCCCCTACAAGATCGAGGCCGTGCAGAGTGAGACCGTGGA 
GCCGCCCCCGCCGGCGCAGCTGCACTTCATGTACGTGGCGGCGGCCGCCTTTGTGCTTCTGTTCTTCGTGG 
GCTGCGGGGTGCTGCTGTCCCGCAAGCGCCGGCGGCAGCATGGCCAGCTCTGGTTCCCTGAGGGCTTCAAA 
GTGTCTGAGGCCAGCAAGAAGAAGCGGCGGGAGCCCCTCGGCGAGGACTCCGTGGGCCTCAAGCCCCTGAA 
GAACGCTTCAGACGGTGCCCTCATGGACGACAACCAGAATGAGTGGGGGGACGAGGACCTGGAGACCAAGA 
AGTTCCGGTTCGAGGAGCCCGTGGTTCTGCCTGACCTGGACGACCAGACAGACCACCGGCAGTGGACTCAG 
CAGCACCTGGATGCCGCTGACCTGCGCATGTCTGCCATGGCCCCCACACCGCCCCAGGGTGAGGTTGACGC 
CGACTGCATGGACGTCAATGTCCGCGGGCCTGATGGCTTCACCCCGCTCATGATCGCCTCCTGCAGCGGGG 
GCGGCCTGGAGACGGGCAACAGCGAGGAAGAGGAGGACGCGCCGGCCGTCATCTCCGACTTCATCTACCAG 
GGCGCCAGCCTGCACAACCAGACAGACCGCACGGGCGAGACCGCCTTGCACCTGGCCGCCCGCTACTCACG 
CTCTGATGCCGCCAAGCGCCTGCTGGAGGCCAGCGCAGATGCCAACATCCAGGACAACATGGGCCGCACCC 
CGCTGCATGCGGCTGTGTCTGCCGACGCACAAGGTGTCTTCCAGATCCTGATCCGGAACAGGGCCACAGAC 
CTGGATGCCCGCATGCATGATGGCACAACTCCACTGATCCTGGCTGCCCGCCTGGCCGTGGAGGGCATGCT 
GGAGGACCTCATCAACTCACACGCCGACGTCAACGCCGTAGATGACCTGGGCAAGTCCGCCCTGCACTGGG 
CCGCCGCCGTGAACAATGTGGATGCCGCAGTTGTGCTCCTGAAGAACGGGGCTAACAAAGATATGCAGAAC 
AACAGGGAGGAGACACCCCTGTTTCTGGCCGCCCGGGAGGGCAGCTACGAGACCGCCAAGGTGCTGCTGGA 
CCACTTTGCCAACCGGGACATCACGGATCATATGGACCGCCTGCCGCGCGACATCGCACAGGAGCGCATGC 
ATCACGACATCGTGAGGCTGCTGGACGAGTACAACCTGGTGCGCAGCCCGCAGCTGCACGGAGCCCCGCTG 
GGGGGCACGCCCACCCTGTCGCCCCCGCTCTGCTCGCCCAACGGCTACCTGGGCAGCCTCAAGCCCGGCGT 
GCAGGGCAAGAAGGTCCGCAAGCCCAGCAGCAAAGGCCTGGCCTGTGGAAGCAAGGAGGCCAAGGACCTCA 
AGGCACGGAGGAAGAAGTCCCAGGATGGCAAGGGCTGCCTGCTGGACAGCTCCGGCATGCTCTCGCCCGTG 
GACTCCCTGGAGTCACCCCATGGCTACCTGTCAGACGTGGCCTCGCCGCCACTGCTGCCCTCCCCGTTCCA 
GCAGTCTCCGTCCGTGCCCCTCAACCACCTGCCTGGGATGCCCGACACCCACCTGGGCATCGGGCACCTGA 
ACGTGGCGGCCAAGCCCGAGATGGCGGCGCTGGGTGGGGGCGGCCGGCTGGCCTTTGAGACTGGCCCACCT 
CGTCTCTCCCACCTGCCTGTGGCCTCTGGCACCAGCACCGTCCTGGGCTCCAGCAGCGGAGGGGCCCTGAA 
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TTTCACTGTGGGCGGGTCCACCAGTTTGAATGGTCAATGCGAGTGGCTGTCCCGGCTGCAGAGCGGCATGG 
TGCCGAACCAATACAACCCTCTGCGGGGGAGTGTGGCACCAGGCCCCCTGAGCACACAGGCCCCCTCCCTG 
CAGCATGGCATGGTAGGCCCGCTGCACAGTAGCCTTGCTGCCAGCGCCCTGTCCCAGATGATGAGCTACCA 
GGGCCTGCCCAGCACCCGGCTGGCCACCCAGCCTCACCTGGTGCAGACCCAGCAGGTGCAGCCACAAAACT 
TACAGATGCAGCAGCAGAACCTGCAGCCAGCAAACATCCAGCAGCAGCAAAGCCTGCAGCCGCCACCACCA 
CCACCACAGCCGCACCTTGGCGTGAGCTCAGCAGCCAGCGGCCACCTGGGCCGGAGCTTCCTGAGTGGAGA 
GCCGAGCCAGGCAGACGTGCAGCCACTGGGCCCCAGCAGCCTGGCGGTGCACACTATTCTGCCCCAGGAGA 
GCCCCGCCCTGCCCACGTCGCTGCCATCCTCGCTGGTCCCACCCGTGACCGCAGCCCAGTTCCTGACGCCC 
CCCTCGCAGCACAGCTACTCCTCGCCTGTGGACAACACCCCCAGCCACCAGCTACAGGTGCCTGAGCACCC 
CTTCCTGACCCCTTCGCCGGAGTCGCCCGACCAATGGTCGTCCTCGTCGCCGCACTCTAATGTGTCTGACT 
GGTCTGAGGGCGTGTCGTCGCCCCCGACCTCCATGCAGTCCCAGATCGCGCGCATCCCGGAGGCGTTCAAG 
TAATAGCTCGAGGTGCCAGCAGCTC 



The NOV33 nucleic acid has 7670 of 7693 bases (99%) identical to a gb:GENBANK- 
ID:AF308602|acc:AF308602.1 mRNA from Homo sapiens (NOTCH 1 (Nl) mRNA, complete 
cds) (E = 0.0). 

5 A disclosed NOV33 polypeptide (SEQ ID NO: 1 34) encoded by SEQ ID NO: 1 33 is 

2556 amino acid residues and is presented using the one-letter code in Table 33B. Signal P, 
Psort and/or Hydropathy results predict that NOV33 contains a signal peptide and is likely to 
be localized to the plasma membrane with a certainty of 0.4600. The most likely cleavage site 
for a NOV33 peptide is between amino acids 18 and 19: ALA-AR. 



Table 33B. Encoded NOV33 protein sequence (SEQ ID NO: 134) 

MPPLLAPLLCLALLPALAARGPRCSQPGETCLNGGKCEAANGTEACVCGGAFVGPRCQDPNPCLSTPCKNAG 
TCHWDRRGVADYACSCALGFSGPLCLTPLDNACLTNPCRNGGTCDLLTLTEYKCRCPPGWSGKSCQQADPC 
ASNPCANGGQCLPFEASYICHCPPSFHGPTCRQDVNECGQKPGLCRHGGTCHNEVGSYRCVCRATHTGPNCE 
RPYVPCSPSPCQNGGTCRPTGDVTHECACLPGFTGQNCEENIDDCPGNNCKNGGACVDGVNTYNCPCPPEWT 
GQ Y CTED VD E C QLM PNACQNGGTCHNTHGG YNC VC VNG WTG E DC S EN I DDC AS AAC FHG AT C HDR VAS F YC E 
CPHGRTGLLCHLNDACISNPCNEGSNCDTNPVNGKAICTCPSGYTGPACSQDVDECSLGANPCEHAGKCINT 
LGS FECQCLQGYTGPRCE IDVNECVSNPCQNDATCLDQ IGE FQCMCMPGYEGVHCEVNTDECASS PCLHNGR 
CLDKINEFQCECPTGFTGHLCQYDVDECASTPCKNGAKCLDGPNTYTCVCTEGYTGTHCEVDIDECDPDPCH 
YGSCKDGVATFTCLCRPGYTGHHCETNINECSSQPCRHGGTCQDRDNAYLCFCLKGTTGPNCEINLDDCASS 
PCDSGTCLDKIDGYECACEPGYTGSMCNINIDECAGNPCHNGGTCEDGINGFTCRCPEGYHDPTCLSEVNEC 
NSNPCVHGACRDSLNGYKCDCDPGWSGTNCDINNNECESNPCVNGGTCKDMTSGYVCTCREGFSGPNCQTNI 
NECASNPCLNKGTCIDDVAGYKCNCLLPYTGATCEWLAPCAPSPCRNGGECRQSEDYESFSCVCPTAGAKG 
QTCEVDINECVLSPCRHGASCQNTHGGYRCHCQAGYSGRNCETDIDDCRPNPCHNGGSCTDGINTAFCDCLP 
GFRGTFCEEDINECASDPCRNGANCTDCVDSYTCTCPAGFSGIHCENNTPDCTESSCFNGGTCVDGINSFTC 
LCPPGFTGSYCQHDVNECDSQPCLHGGTCQDGCGSYRCTCPQGYTGPNCQNLVHWCDSSPCKNGGKCWQTHT 
QYRCECPSGWTGLYCDVPSVSCEVAAQRQGVDVARLCQHGGLCVDAGNTHHCRCQAGYTGSYCEDLVDECSP 
SPCQNGATCTDYLGGYSCKCVAGYHGVNCSEEIDECLSHPCQNGGTCLDLPNTYKCSCPRGTQGVHCEINVD 
DCNPPVDPVSRSPKCFNNGTCVDQVGGYSCTCPPGFVGERCEGDVNECLSNPCDARGTQNCVQRVNDFHCEC 
RAGHTGRRCESVINGCKGKPCKNGGTCAVASNTARGFICKCPAGFEGATCENDARTCGSLRCLNGGTCISGP 
RSPTCLCLGPFTGPECQFPASSPCLGGNPCYNQGTCEPTSESPFYRCLCPAKFNGLLCHILDYSFGGGAGRD 
IPPPLIEEACELPECQEDAGNKVCSLQCmrHACGWDGGDCSLNFNDPWKNCTQSLQCWKYFSDGHCDSQCNS 
AGCLFDGFDCQPJVEGQCNPLYDQYCKDHFSDGHCDQGCNSAECEWDGLDCAEHVPERLAAGTLVVVVLMPPE 
QLRNS S FHFLRELS RVLHTNWFKRDAHGQQM I FP Y YGREE ELRKHP I KRAAEGWAAPDALLGQVKASLL PG 
GSEGGRRRRELDPMDVRGSIVYLEIDNRQCVQASSQCFQSATDVAAFLGALASLGSLNIPYKIEAVQSETVE 
PPPPAQLHFMYVAAAAFVLLFFVGCGVLLSRKRRRQHGQLWFPEGFKVSEASKKKRREPLGEDSVGLKPLKN 
ASDGALMDDNQNEWGDEDLETKKFRFEEPWLPDLDDQTDHRQWTQQHLDAADLRMSAMAPTPPQGEVDADC 
MDVNVRGPDGFTPLMIASCSGGGLETGNSEEEEDAPAVISDFIYQGASLHNQTDRTGETALHLAARYSRSDA 
AKRLLEASADANIQDNMGRTPLHAAVSADAQGVFQILIRNRATDLDARMHDGTTPLILAARLAVEGMLEDLI 
NSHADVNAVDDLGKSALHWAAAVNNVDAAVVIiLKNGANKDMQNNR 

DITDHMDRLPRDIAQERMHHDIVRLLDEYNLVRSPQLHGAPLGGTPTLSPPLCSPNGYLGSLKPGVQGKKVR 
KPSSKGLACGSKEAKDLKARRKKSQDGKGCLLDSSGMLSPVDSLESPHGYLSDVASPPLLPSPFQQSPSVPL. 
NHLPGMPDTHLGIGHLNVAAKPEMAALGGGGRLAFETGPPRLSHLPVASGTSTVLGSSSGGALNFTVGGSTS 
LNGQCEWLSRLQSGMVPNQYNPLRGSVAPGPLSTQAPSLQHGMVGPLHSSLAASALSQMMSYQGLPSTRLAT 
QPHLVQTQQVQPQNLQMQQQNLQPANIQQQQSLQPPPPPPQPHLGVSSAASGHLGRSFLSGEPSQADVQPLG 
PSSLAVHTIIiPQESPALPTSLPSSLVPPVTAAQFLTPPSQHSYSSPVDNTPSHQLQVPEHPFLTPSPESPDQ 
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rwSSSSPHSNVSDWSEGVSSPPTSMQSQIARIPEAFK | 

The disclosed NOV33 amino acid sequence has 2543 of 2556 amino acid residues 
(99%) identical to, and 2545 of 2556 amino acid residues (99%) similar to, the 2556 amino 
acid residue ptnr:TREMBLNEW-ACC:AAG33848 protein from Homo sapiens (Human) 
5 (Notch 1)(E = 0.0). 

NOV33 is predicted to be expressed in at least Adrenal gland, bone marrow, brain - 
amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
10 small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea and uterus. This 

information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 

In addition, NOV33 is predicted to be expressed in the following tissues because of the 
15 expression pattern of (GENBANK-ID: gb:GENBANK-ID:AF308602|acc:AF308602.1) a 

closely related Homo sapiens NOTCH 1 (Nl) mRNA, complete cds homolog in species Homo 
sapiens: brain. 

NOV33 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 33C. 

20 



Table 33C. BLAST results for NOV33 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 11275980 | gb | AAG3 
3848 . 1 | AF308602_1 
(AF308602) 


NOTCH 1 [Homo 
sapiens] 


2556 


2543/255 
6 (99%) 


2545/2556 
(99%) 


0 . 0 


gi | 107215 |pir | |A400 
43 


notch protein 
homolog TAN-1 
precursor - human 


2555 


2537/255 
6 (99%) 


2541/2556 
(99%) 


0 . 0 


gi j 1171777 | sp| P4653 
1 |NTC1_HUMAN 


NEUROGENIC LOCUS 
NOTCH PROTEIN 
HOMOLOG 1 
PRECURSOR 
( TRANSLOCATION - 
ASSOCIATED NOTCH 
PROTEIN TAN-1) 


2444 


2429/244 
4 (99%) 


2431/2444 


0 . 0 


gi | 6093 54 2 | sp|Q0700 
8 | NTC1_RAT 


NEUROGENIC LOCUS 
NOTCH HOMOLOG 
PROTEIN 1 
PRECURSOR 


2531 


2301/255 
7 (89%) 


2401/2557 
(92%) 


0 . 0 


gi | 112074 |pir | | S181 
88 


notch protein 
homolog - rat 


2531 


2300/255 
7 (89%) 


2399/2557 
(92%) 


0 . 0 
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The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 33D. 

Table 33D Clustal W Sequence Alignment 



10 



15 



20 



25 



30 



35 



40 



45 



50 



1) NOV33 (SEQ ID NO: 134) 

2) gi| 11275980 |gb|AAG33848.l|AF308602_l (AF308602) NOTCH 1 [Homo sapiens) (SEQ ID 
NO : 4 6 1 ) 

3) gi | 107215 | pir | | A40043 notch protein homolog TAN- l precursor - human (SEQ ID 
NO: 46 8) 

4) gi | 1171777 | sp| P46531 |NTC1_HUMAN NEUROGENIC LOCUS NOTCH PROTEIN HOMOLOG 1 
PRECURSOR (TRANSLOCATION -ASSOCIATED NOTCH PROTEIN TAN- 1 ) (SEQ ID NO: 4 69) 

5) gi | 6093542 | sp|Q07008 |NTC1_RAT NEUROGENIC LOCUS NOTCH HOMOLOG PROTEIN 1 PRECURSOR 
(SEQ ID NO:470) 

6) gi | 112074 |pir | | S18188 notch protein homolog - rat (SEQ ID NO:471) 



NOV3 3 

gi | 11275980 | 
gi j 107215 | 
gi | 1171777 | 
gi j 6093542 | 
gi | 112074 | 



NOV33 

gi | 11275980 | 
gi | 107215 | 
gi | 1171777 | 
gi | 6093542 | 
gi | 112074 | 



NOV33 

gi | 11275980 | 
gi j 107215 | 
gi | 1171777 | 
gi | 6093542 | 
gi | 112074 | 



61 
61 
61 
61 
61 
61 



121 
121 
121 
121 
121 
121 



NOV33 


181 


gi 


11275980 | 


181 


gi 


107215 | 


181 


gi 


1171777 | 


181 


gi 


6093542 j 


181 


gi 


112074 | 


181 




TLTEYKCRCPPG 
TLTEYKCRCPPG 
TLTEYKCRCPPG 
TLTEYKCRCPPG 
TLTEYKCRCPPG 
TLTEYKCRCPPG 




55 



60 



65 



NOV33 

gi | 11275980 | 
gi j 107215 | 
gi j 1171777 | 
gi|6093542| 
gij 112074 | 



NOV3 3 

gi|H275980| 
gij 107215 | 
gi j 1171777 | 



241 
241 
241 
241 
241 
241 



301 
301 
301 
301 



250 



260 



I 



270 



280 



I 



290 



300 



hecaclpgftgqnceeniddcpgnncknggacvdgvntyncpcppewtgqyctedvdecq 
hecaclpgftgqnceeniddcpgnncknggacvdgvntyncpcppewtgqyctedvdecq 
hecaclpgftgqnceeniddcpgnncknggacvdgvntyncpcppewtgqyctedvdecq 
hecaclpgftgqnceeniddcpgnncknggacvdgvntyncpcppewtgqyctedvdecq 
hecaclpgfggqnceensddcpgnncknggacvdgvntyncgcppewtgqyctedvdecq 
hecaclpgfBgqnceenBddcpgnncknggacvdgvntyncHcppewtgqyctedvdecq 



J_ 



310 



320 
•1^ 



330 
■ ■ I ■ • 



340 



350 
. . I . • 



360 
■ • I 



lmpnacqnggtchnthggyncvcvngwtgedcseniddcasaacfhgatchdrvasfyce 
lmpnacqnggtchnthggyncvcvngwtgedcseniddcasaacfhgatchdrvasfyce 
lmpnacqnggtc[2nthggyncvcvngwtgedcseniddcasaacfhgatchdrvasfyce 
lmpnacqnggtchnthggyncvcvngwtgedcseniddcasaacfhgatchdrvasfyce 
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gi|6093542| 
gi j 112074 | 



hggyncvcvngwtgedcsgniddcasaacfffigatchdrvasfyce 
hggyncvcvngwtgedcsKniddcasaacfSgatchdrvasfyce 



360 
360 



370 



10 



NOV33 


361 


CPHGRTG 


gi | 11275980 | 


361 


CPHGRTG 


gij 107215 | 


361 


CPHGRTG 


gij 1171777 | 


361 


CPHGRTG. 


gi|6093542| 


361 


CPHGRTG 


gij 112074 | 


361 


CPHGRTG 




430 



440 



450 



460 



15 



20 



NOV33 

gi|H275980| 
gij 107215 | 
gi | 1171777 | 
gij 6093542 j 
gij 112074 | 




I 



TLGSFECQCLQGY' 
TLGSFECQCLQGY r 
TLGSFECQCLQGY' 
TLGSFECQCLQGY' 
TLGSFECQCLQGY' 
TLGSFECQCLQGY' 



ii 



I 



ECVSNPCQ 
ECVSNPCQ 
ECVSNPCQ 
ECVSNPCQ 
ECfjSNPCQ 
ECMSNPCQ 




490 



25 



30 



35 



40 



45 



NOV33 

gi | 11275980 | 
gij 107215 | 
gi j 1171777 | 
gi j 6093542 | 
gij 112074 | 



NOV33 

gi|H275980| 
gi j 107215 | 
gi j 1171777 | 
gi j 6093542 j 
gij 112074 | 



NOV33 

gi | 11275980 | 
gij 107215 | 
gij 1171777 | 
gi j 6093542 j 
gij 112074 | 




550 




560 
■ . I . 



TYTCVCTEGYTG' 
TYTCVCTEGYTG' 
TYTCVCTEGYTG' 
TYTCVCTEGYTG' 
TYTCVCTEGYTG' 
TYTCVCTEGYTG' 



610 



620 




GTCQD 
GTCQD 
GTCQD 
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I 



1450 

. . I . . . 



1460 

. . I . . . 



1470 



1480 



1490 



I 



I 



I 



1500 
. I 



ippplieeacelpecqedagnkvcslqcnnhacgwdggdcslnfndpwknctqslqcwky 
ippplieeacelpecqedagnkvcslqcnnhacgwdggdcslnfndpwknctqslqcwky 
ippplieeacelpecqedagnkvcslqcnnhacgwdggdcslnfndpwknctqslqcwky 
ippplieeacelpecqedagnkvcslqcnnhacgwdggdcslnfndpwknctqslqcwky 

IPPpffllEEACELPECQEDAGNKVClLQCNNHACGWDGGDCSLNFNDPWKNCTQSLQCWKY 



[ PPP®IEEACELPECQEDAGNKV( 



rGWDGGDCSLNFNDPWKNCTQSLQCWKY 



1510 



1520 



1530 
. . I . . . 



| 



1540 
. . I . . . 



1550 
• - I - - - 



I 



I 



fsdghcdsqcnsagclfdgfdcqraegqcnplydqyckdhfsdghcdqgcnsaecewdgl 
fsdghcdsqcnsagclfdgfdcqraegqcnplydqyckdhfsdghcdqgcnsaecewdgl 
fsdghcdsqcnsagclfdgfdcqraegqcnplydqyckdhfsdghcdqgcnsaecewdgl 
fsdghcdsqcnsagclfdgfdcqraegqcnplydqyckdhfsdghcdqgcnsaecewdgl 
fsdghcdsqcnsagclfdgfdcqTOegqcnplydqyckdhfsdghcdqgcnsaecewdgl 
fsdghcdsqcnsagclfdgfdcqEegqcnp'lydqyckdhfsdghcdqgcnsaecewdgl 



1500 
1500 
1499 
1500 
1499 
1499 



1560 
1560 
1559 
1560 
1559 
1559 



I 



1570 
■ • | . ■ . 



1580 
|... 



1590 



1600 



1610 



1620 



I 



1 



dcaehvperlaagtlvwvlmppeqlrnssfhflrelsrvlhtnwfkrdahgqqmifpy 
dcaehvperlaagtlvwvlmppeqlrnssfhflrelsrvlhtnwfkrdahgqqmifpy 
dcaehvperlaagtlvwvlmppeqlrnssfhflrelsrvlhtnwfkrdahgqqmifpy 
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DCAEHVPERLAAGTLVj?VVLgpp|QLRN|SFHFLR|SsTlVLHTNVVFKRDAHGQQMIFPY 
DCAEHVPERLAAGTLVlWVLSppiQLP^lsFHFLR|BsSvLHTNWFKRDAS 



I 



1630 
. - I I 



1640 



1650 



I 



I 



1660 
. . I . . . 



1670 
. . I... J 



1680 



ygreeelrkhpi kraaegwaapdallgqvkasllpggseggrrrreldpmdvrgs ivyle 
ygreeelrkhpi kraaegwaapdallgqvkasllpggseggrrrreldpmdvrgs ivyle 
ygreeelrkhpi kraaegwaapdallgqvkasllpggseggrrrreldpmdvrgs ivyle 
ygreeelrkhpi kraaegwa apdallgqvka sllpggseggrrrreldpmdvrgs ivyle 

YGREEELRKHPI KRlAgGWAjg^^^^^MSLLPGg^GGRgRRELDPMDgjgGS IVYLE 

ygreeelrkhpi krBaBgwai^^^^^^HsllpgHhggrhreldpmdHgs IVYLE 



1690 



1700 

I 



1710 

..I... 



I 



1720 

. . 1 . . ■ 



1730 



I 



idnrqcvqas 
idnrqcvqas 
idnrqcvqas 
idnrqcvqas 
idnrqcvqIs 
idnrqcvqIs 



S Q C FQ s at d VAA FLG al AS LG S LN I 
SQCFQSATDVAAFLGALASLGSLNI 
SQCFQSATDVAAFLGALASLGSLNI 
SQCFQSATDVAAFLGALASLGSLNI 
SQCFQSATDVAAFLGALASLGSLNI 
SQCFQSATDVAAFLGALASLGSLNI 
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Tables 33E-I list the domain descriptions from DOMAIN analysis results against 
NOV33. This indicates that the NOV33 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 33E Domain Analysis of NOV33 

gnl | Smart | smart 000 04 , NL, Domain found in Notch and Lin- 12; The Notch 
protein is essential for the proper differentiation of the Drosophila 
ectoderm. This protein contains 3 NL domains. {SEQ ID NO: 82 5) 
CD-Length =39 residues, 100.0% aligned 
Score = 45.1 bits (105), Expect = 5e-05 



NOV3 3 : 1443 PPLIEEACELPECQEDAGNKVCSLQCNNHACGWDGGDCS 14 81 

I II +1 + |+ II +111 I lllllll 

Sbjct: 1 PQDPWSRCEDAQCWDKFGDGVCDEECNNAECLWDGGDCS 3 9 

5 



Table 33F Domain Analysis of NOV33 

gnl | Smart | smart 00004 , NL, Domain found in Notch and Lin- 12; The Notch 
protein is essential for the proper differentiation of the Drosophila 
ectoderm. This protein contains 3 NL domains. (SEQ ID NO: 82 5) 
CD-Length = 39 residues, 94.9% aligned 
Score = 44.7 bits (104), Expect = 7e-05 



NOV33: 1486 DP WKNCTQS LQC WKY FSDGHCDS QCNS AGCL FDGFDCQ 1523 

Ml I III I II II +11+1 ll+ll II 

Sbjct: 3 DPWSRCE-DAQCWDKFGDGVCDEECNNAECLWDGGDCS 3 9 

10 



Table 33G Domain Analysis of NOV33 

gnl | Smart | smart0 00 04 , NL, Domain found in Notch and Lin- 12; The Notch 
protein is essential for the proper differentiation of the Drosophila 
ectoderm. This protein contains 3 NL domains. (SEQ ID NO: 825) 
CD-Length =39 residues, 97.4% aligned 
Score ~ 41.2 bits (95), Expect = 7e-04 



NOV3 3 : 1522 CQRAEGQCNPLYDQYCKDHFSDGHCDQGCNSAECEWDGLDC 15 62 

I +1 I I I I II 11+ I 1 + I I I ill II 

Sbjct: 1 PQDPWSRCE DAQ CWDK FGDG VCDE E CNNAE CL WDGGD C 3 8 

15 



Table 33H Domain Analysis of NOV33 

gnl | Pf am | pf am00023 , ank, Ank repeat. Ankyrin repeats generally consist 
of a beta, alpha, alpha, beta order of secondary structures. The 
repeats associate to form a higher order structure. (SEQ ID NO: 826) 
CD-Length - 33 residues, 97.0% aligned 
Score = 42.7 bits (99), Expect = 3e-04 



NOV3 3 : 192 9 GETALHLAARYSRSDAAKRLLEASADANIQDN 19 6 0 

I I llllll + I I I I I II I + l 

Sbjct: 2 GNT P LHL AARNGHLE WKLL L E AGAD VN ARD K 3 3 

20 
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Table 331 Domain Analysis of NOV33 

gnl | Pf am | pf am00066 , notch, Notch (DSL) domain. The Notch domain is 
also called the 'DSL' domain. The notch proteins are transmembrane 
proteins with extracellular domains of repeated EGF domains and the 
notch (or DSL) domain N-terminal to that. These proteins are generally- 
involved in lateral inhibition in developmental processes. (SEQ ID 
NO: 826) 

CD-Length = 38 residues, 81.6% aligned 
Score = 42.0 bits (91), Expect = 4e-04 



NOV3 3: 1533 YDQYCKDHFSDGHCDQGCNSAECEWDGLDCA 1563 

I + +I + I++I M IM I +11 11+ 
Sbjct: 8 YRRHCAERFANGVCNQECNNAACGFDGGDCS 3 8 

Notch is a surface receptor. It transmits signals received from outside the cell to the 
cell's interior. Notch ligands, such as Delta, Serrate and Scabrous interact with epidermal 
growth factor repeats contained in Notch's extracellular domain. Notch plays an active role in 
the differentiation of glial cellsand Notch influences the length and organisation of neuronal 

10 processes. Several homologs of the Drosophila Notch receptor and its ligands, Delta/Serrate, 
have been cloned in man. Three human disorders including a neoplasia (a T-cell acute 
lymphoblastic leukemia/lymphoma), a late onset neurological disease (CADASIL) and a 
developmental disorder (the Alagille syndrome) are associated with mutations in, respectively, 
the Notch 1, Notch3 and Jagged 1 genes, pointing out the broad spectrum of Notch activity in 

15 humans (Joutel A, and Tournier-Lasserve E, 1998, Semin Cell Dev Biol, 9:619-25; Frisen J, 
and Lendahl U, 2001, Bioessays 23:3-7). 

In Drosophila, the intracellular domain of Notch binds Suppressor of hairless, a 
multifunction transcription factor that acts as a signal transducing molecule shuttling between 
the cytoplasm and the nucleus. A nuclear function has been documented for the mammalian 

20 Notch homolog (Lu, 1996), as well as for Drosophila Notch (Struhl and Adachi, 1998, Cell 
93:649-60). When Notch is bound by a ligand, a signal is passed across the cell membrane 
releasing the Suppressor of Hairless protein, freeing this protein to enter the nucleus and 
assume its role in activating transcription of enhancer of Split complex genes. E(spl)-C 
proteins act in turn to repress the adoption of neural and other differentiated states. Deltex, an 

25 intracellular docking protein, replaces Suppressor of Hairless as Su(H) leaves the site of 
interaction with the intracellular tail of Notch. 

The Notch receptor's function is called neurogenic, but this confusing nomenclature 
refers to the phenotype established in the absence of functional Notch. Notch's function is to 
repress the adoption of differentiation by cells that carry the Notch protein. A look at the 

30 principle ligand of Notch (Delta) and its function makes the anti-neural function of Notch 
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more easily understood. Delta is not secreted, but is cell bound. The Delta-Notch interaction 
serves a cell adhesive function between ligand and receptor bearing cells. The receptor bearing 
cell is inhibited in assuming a differentitated state, while the ligand bearing cell is free to do 
so. During neurogenesis, this latter cell delaminates, that is, it migrates out of the epithelial 
5 cell layer in which it formerly resided, and assumes the differentiated state of a neuroblast in 
its new physical location within the developing nervous system. Thus Notch is involved in 
neurogenesis with respect to cells that bears the ligands for Notch: Delta, Serrate and 
Scabrous. 

Lateral inhibition is one of the major themes of development. The process of lateral 

10 inhibition and cell selection is repeated hundreds of times in Drosophila, with differentiation 
that takes place in nearly every kind of tissue. For example, Notch is required to limit the 
number of neuronal precursors, limit the number of muscle precursors, limit the growth of 
malphigian tubules, and regulate the growth of the ovary. Notch also functions as receptor for 
both Serrate and Delta in organizing the dorsal-ventral boundary of the wing. One important 

15 target of Serrate and Notch in this context is wingless (Diaz-Benjumea and Cohen, 1995, 

Development 121:4215-25). Two extreme models can be envisioned for lateral inhibition. The 
first implicates the Notch pathway in the choice of a single precursor via a negative feedback 
loop. This process could be random in some cases. The second model postulates that the 
precursor is pre-determined by some mechanism other than Notch signaling, and that Notch 

20 signaling then serves only to mediate mutual, uniform repression of other cells and ensure 

development of a single precursor. Studies concerning the physical spacing of precursors for 
the microchaetes of the peripheral nervous system suggest the existence of a regulatory loop 
under transcriptional control between Notch and its ligand Delta. Activation of Notch leads to 
repression of the achaete-scute genes, which are themselves known to regulate transcription of 

25 Delta; this regulation may perhaps be direct (Seugnet et al., 1997, Dev Biol. 192:585-98). 
Neuroblast segregation was studied in embryos lacking both the maternal and the zygotic 
forms of either Notch or Delta. A seven-up-LacZ marker was used to follow neuralization of 
5-2 and 7-4 neuroblast groups. In the absence of Notch signaling, the cells with an equivalence 
group do not enter the neural differentiation pathway simultaneously. Neuralization within a 

30 group is progressive with two or three cells segregating early and several more later. This 
suggests that neural potential is not evenly distributed among these cells. A requirement for 
transcriptional regulation of Notch and/or Delta during neuroblast segregation in embryos was 
tested by providing Notch and Delta ubiquitously at uniform levels. Neuroblast segregation 
occurs normally under conditions of uniform Notch expression, suggesting that transcriptional 
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regulation of Notch is not necessary for many aspects of development of the larval CNS and 
PNS. In particular, it is dispensable both before and after neuroblast segregation, implying that 
it is not a necessary component of neuroblast segregation, per se. Under conditions of uniform 
Delta expression, a single neuroblast segregates from each proneural group in 80% of the 
5 cases; in the remaining 20%, more than one neuroblast segregates from a single group of cells. 
Thus transcriptional regulation of Delta is largely dispensable, with only a small percentage of 
multiple neurons segregating in each cluster. Genes such as achaete, scute, extramacrochaete, 
and wingless could be responsible for local differences in proneural activity. Notch signaling 
would enable all cells to mutually repress one another; only a cell with an elevated neural 

10 potential could overcome this repression (Seugnet et al., 1997, Development 124:2015-25). 

The development and patterning of the wing in Drosophila relies on a sequence of cell 
interactions molecularly driven by a number of ligands and receptors. Genetic analysis 
indicates that a receptor encoded by the Notch gene and a signal encoded by the wingless gene 
play a number of interdependent roles in this process and display very strong functional 

15 interactions. At certain times and places, during wing development, the expression of wingless 
requires Notch activity and that of its ligands Delta and Serrate. This has led to the proposal 
that all the interactions between Notch and wingless can be understood in terms of this 
regulatory relationship. This proposal has been tested by analyzing interactions between Delta- 
and Serrate-activated Notch signaling and Wingless signaling during wing development and 

20 patterning. Cell death caused by expressing dominant negative Notch molecules during wing 
development cannot be rescued by coexpressing Nintra. This suggests that the dominant 
negative Notch molecules cannot only disrupt Delta and Serrate signaling but can also disrupt 
signaling through another pathway. One possibility is the Wingless signaling pathway, since 
the cell death caused by expressing dominant negative Notch molecules can be rescued by 

25 activating Wingless signaling. Furthermore, the outcome of the interactions between Notch 
and Wingless signaling differs when Wingless signaling is activated by expressing either 
Wingless itself or an activated form of the Armadillo. For example, the effect of expressing 
the activated form of Armadillo with a dominant negative Notch on the patterning of sense 
organ precursors in the wing resembles the effects of expressing Wingless alone. This result 

30 suggests that signaling activated by Wingless leads to two effects: a reduction of Notch 
signaling and an activation of Armadillo (Brennan, 1999, Curr Biol, 9:707-10). 

Expression of a dominant negative Notch molecule (Extracellular Notch or ECN) 
throughout the developing wing mimics the effects of loss of Notch function. However, Nintra 
cannot rescue the cell death caused by overexpressing ECN. Since Nintra provides constitutive 
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signaling for Delta and Serrate during wing development and the effects of ECN are mediated 
by the sequestration of extracellular molecules that can interact with Notch, this suggests that 
the ECN molecule is sequestering extracellular molecules other than Delta and Serrate and 
attenuating signaling through another pathway. One candidate pathway is the Wingless 
5 signaling pathway, since the cell death caused by expressing the ECN can be rescued by 

activating Wingless signaling. Therefore, it is possible that the ECN molecule is sequestering 
the Wingless protein. The possibility that Wingless can bind the extracellular domain of Notch 
is supported by the following results, in particular, by two observations: first, that some of the 
deleterious effects of ECN can be suppressed by Wingless, but not Wingless signaling in the 

10 form of a constitutive ly active Armadillo molecule; and second, that this interaction requires 
specific EGF-like repeats of Notch, namely repeats 17-19 and 24-26 but not 10-12. Evidence 
for a physical interaction between Notch and Wingless has also been provided recently by 
Wesley (1999, Mol Cell Biol. 19:5743-58) who finds that the Wingless protein is enriched in a 
biopanning assay designed to identify proteins that interact with the extracellular domain of 

15 the Notch protein and that Wingless can be immunoprecipitated with Notch from embryo 

extracts and cultured cells. These experiments also show that the association of Wingless with 
Notch requires the integrity of a region of Notch centered around EGF-like repeats 24-26 
(Wesley, 1999, Mol Cell Biol. 19:5743-58) which these experiments indicate are essential for 
the interactions that are described between Wingless and ECN during wing development and 

20 patterning (Brennan et al., 1999, Curr Biol. 9:707-10). The interaction of Wingless and Notch 
signaling that has been observed might also be important during normal neural development. 
Wingless and Delta have opposite effects during neurogenesis; Wingless promotes while Delta 
suppresses the development of sense organs. Various experiments suggest that during the 
segregation of neural precursors a reduction of Notch signaling in the precursors themselves is 

25 as important as the Delta-mediated activation of Notch signaling in the surrounding cells. It is 
possible that, like the activation of Notch by Delta, the suppression of Notch signaling is an 
active process mediated by the interaction of Wingless and Dishevelled with Notch. If this 
were the case, since both Delta and Wingless have spatially and temporally regulated patterns 
of gene expression, their interactions with Notch could contribute to the well-documented bias 

30 in the appearance of precursors from clusters of cells with neural potential. This competitive 

interaction could also account for the observed increases in Wingless signaling associated with 
reductions in Notch signaling during lateral inhibition (Brennan et al., 1999, Curr Biol. 9:707- 
10; Brennan et al, 1999, Dev Biol. 216:210-29). 
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The NOV33 nucleic acid of the invention encoding a Notch 1 -like protein includes the 
nucleic acid whose sequence is provided in Table 33A, or a fragment thereof. The invention 
also includes a mutant or variant nucleic acid any of whose bases may be changed from the 
corresponding base shown in Table 33A while still encoding a protein that maintains its 
5 Notch 1 -like activities and physiological functions, or a fragment of such a nucleic acid. The 
invention further includes nucleic acids whose sequences are complementary to those just 
described, including nucleic acid fragments that are complementary to any of the nucleic acids 
just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 

10 include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1% of the residues 

15 may be so changed. 

The NOV33 protein of the invention includes the Notchl-like protein whose sequence 
is provided in Table 33B. The invention also includes a mutant or variant protein any of whose 
residues may be changed from the corresponding residue shown in Table 33B while still 
encoding a protein that maintains its Notchl-like activities and physiological functions, or a 

20 functional fragment thereof. In the mutant or variant protein, up to about 1 1% of the bases 
may be so changed. 

The NOV33 nucleic acids and proteins of the invention are useful in potential 
diagnostic and therapeutic applications implicated in various diseases and disorders described 
below and/or other pathologies. For example, the compositions of the present invention will 

25 have efficacy for treatment of patients suffering from: neoplasia such as T-cell acute 

lymphoblastic leukemia/lymphoma and mammary carcinomas, a late onset neurological 
disease like CADASIL and a developmental disorder such as the Alagille syndrome, familial 
and congenital cholestatic diseases, Hereditary vascular dementia, neurological diseases and 
other diseases, disorders and conditions of the like. 

30 NOV33 nucleic acids and polypeptides are further useful in the generation of 

antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. For example the disclosed NOV33 protein have multiple 



295 



:t .o o '.7 Ft n a r? ... o. 3 "=s j .o 



hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 
5 NOV34 

A disclosed NOV34 nucleic acid of 935 nucleotides (also referred to as CG56715-01) 
encoding a novel Olfactory Receptor-like protein is shown in Table 34A. An open reading 
frame was identified beginning with an ACA codon, which codes for the amino acid 
Threonine, at nucleotides 2-4 and ending with a TGA codon at nucleotides 932-934. Putative 
10 untranslated regions, if any, found upstream from the initiation codon and downstream from 
the termination codon are underlined in Table 34A, and the start and stop codons are in bold 
letters. 

Table 34A. NOV34 Nucleotide Sequence (SEQ ID NO: 135) 

CACACAAGAAGGCATCTACTTCATCCTCACGGACATCCCTGGATTTGAGGCCTCCCACATCTGGATCTCCA 
TCCCCGTCTGCTGTCTCTACACCATCTCCATCATGGGCAATACCACCATCCTCACTGTCATTCGCACAGAG 
CCATCTGTCCACCAGCGCATGTATCTGTTTCTCTCCATGCTGGCCCTGACGGACCTGGGTCTCACCCTCAC 
CACCCTACCCACAGTCATGCAGCTTCTCTGGTTCAACGTTCGTAGAATCAGCTCTGAGGCCCGTTTTGCTC 
AGTTTTTCTTCCTTCATGGATTCTCCTTTATGGAGTCTTCTGTCCTCCTGGCTATGTCCGTTGACTGCTAT 
GTGGCCATCTGCTGTCCCCTCCATTATGCCTCCATCCTCACCAATGAAGTCATTGGTAGAACTGGGTTAGC 
CATCATTTGCTGCTGTGTTCTGGCGGTTCTTCCCTCCCTTTTCTTACTCAAGCGACTGCCTTTCTGCCACT 
CCCACCTTCTCTCTCGCTCCTATTGCCTCCACCAGGATATGATCCGCCTGGTCTGTGCTGACATCAGGCTC 
AACAGCTGGTATGGATTTGCTCTTGCCTTGTTCATTATTATCGTGGATCCTCTGCTCATTGTGATCTCCTA 
T AC AC T T ATT CTG AAAAAT AT C T TGGG C AC AGC C AC CTGGGC TG AGCG ACT C CGTGC C CTC AAT AACTGC C 
TGTCCCACATTCTGGCTGTCCTGGTCCTCTACATTCCCATGGTTGGTGTATCTATGACTCATCGCTTTGCC 
AAGCATGCCTCTCCACTGGTCCATGTTATCATGGCCAATATCTACCTGCTGGCACCCCCGGTGATGAACCC 
CATCATTTACAGTGTAAAGAACAAGCAGATCCAATGGGGAATGTTAAATTTCCTTTCCCTCAAAAATATGC 
ATTCAAGATGAG 



The NOV34 nucleic acid has 578 of 903 bases (64%) identical to a gb : GENB ANK- 

15 ID:AF137396|acc:AF137396.2 mRNA from Homo sapiens (ubiquilin 3, HOR 5'Betal4, 

HOR5'Betal3, HOR5'Betal2, and HOR5*Betal 1 genes, complete cds; HOR 5'BetalO and 

HOR5 f Beta9 pseudogenes, complete sequence; HOR5 f Beta8 and HOR5'Beta7 genes, complete 

cds; CHR1 lORFl and amphiphysin pseudogenes, complete sequence; HOR5'Beta6 and 

HOR5'Beta5 genes, complete cds; HOR5'Beta4 pseudogene, complete sequence; HOR 5'Beta3 

20 genes, complete cds; HOR5'Beta2 pseudogene, complete sequence; and HOR 5'Betal gene, 

complete cds) (E = 6.1e~ 50 ). 

A disclosed NOV34 polypeptide (SEQ ID NO: 136) encoded by SEQ ID NO: 135 is 

310 amino acid residues and is presented using the one-letter code in Table 34B. Signal P, 

Psort and/or Hydropathy results predict that NOV34 contains a signal peptide and is likely to 

25 be localized to the plasma membrane with a certainty of 0.6000. The most likely cleavage site 

for a NOV34 peptide is between amino acids 36 and 37: IMG-NT. 
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Table 34B. Encoded NOV34 protein sequence (SEQ ID NO: 136) 

TQEGIYFILTDIPGFEASHIWISIPVCCLYTISIMGNTTILTVIRTEPSVHQRMYLFLSMLALTDLGLTLTT 
LPTVMQLLWFISTVRRISSEARFAQFFFLHGFSFMESSVLLT^SVDCYVAICCPLHYASILTNEVIGRTGLAII 
' CCCVLAVLPSLFLLKRLPFCHSHLLSRSYCLHQDMIRLVCADIRLNSWYGFALALFIIIVDPLLIVISYTLI 
LKNILGTATWAERLRALNNCLSHILAVLVLYIPMVGVSMTHRFAKHASPLVHVIMANIYLIAPPVMNPII 
VKNKQIQWGMLNFLSLKNMHSR 



The disclosed NOV34 amino acid sequence has 160 of 296 amino acid residues (54%) 
identical to, and 210 of 296 amino acid residues (70%) similar to, the 319 amino acid residue 
ptnr:TREMBLNEW-ACC:AAG41684 protein from Mus musculus (Mouse) (MOR 3'BETA4) 
5 (E = 6.3e' 83 ). 

NOV34 is predicted to be expressed in at least Apical microvilli of the retinal pigment 
epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma cell lines, corpus 
callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral tissue, 
cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary artery and 

10 umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal hematopoietic 
cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, lung lymphoma 
cell lines, fetal lymphoid tissue, adult lymphoid tissue, those that express MHC II and III 
nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, pons, prostate, 
putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary artery in aortic) 

15 spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, thalamus and thymus 
tissue. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 

NOV34 also has homology to the amino acid sequences shown in the BLASTP data 

20 listed in Table 34C. 



Table 34C. BLAST results for NOV34 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 17472775 |ref |XP_ 
061808 . 1 | 
(XM_061808) 


similar to MOR 
3Beta4 (H. 
sapiens) [Homo 
sapiens] 


317 


298/300 
(99%) 


298/300 
(99%) 


e-136 


gi | 17456767 |ref | XP_ 
061618 . 1 | 
{XM_061618> 


similar to 
prostate specific 
G-protein coupled 
receptor (H. 
sapiens) [Homo 
sapiens] 


879 


168/289 
(58%) 


215/289 
(74%) 


6e-8l 


gi | 174 5 67 53 |ref |XP_ 
061614.1] 
(XM_061614) 


similar to MOR 
3Beta4 (H. 
sapiens) [Homo 
sapi ens] 


315 


154/294 
(52%) 


214/294 
(72%) 


5e-75 
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gi | 174 72781 | ref | XP_ 
061811 . 1 | 
(XM_061811) 


similar to 
OLFACTORY 
RECEPTOR 5112 
(HOR5BETA12) (H. 
sapiens) [Homo 
sapiens] 


312 


150/290 
(51%) 


204/290 
(69%) 


6e-74 


gi | 119 0822 0 | gb | AAG4 
1684. l| (AF133300) 


MOR 3 1 Beta4 [Mus 
musculus] 


319 


159/294 
(54%) 


209/294 
(71%) 


2e-72 



The homology of these sequences is shown graphically in the ClustalW analysis shown 



in Table 34D. 



Table 34D Clustal W Sequence Alignment 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



1) NOV34 (SEQ ID NO: 136) 

2) gi | 17472775 |ref | XP06 18 08 . 1 | (XM_06 1808 ) similar to MOR 3Beta4 (H. sapiens) 
[Homo sapiens] (SEQ ID NO: 472) 

3) gi| 17456767 | ref |XP_061618.1 | (XM_061618) similar to prostate specific G-protem 
coupled receptor (H. sapiens) [Homo sapiens] (SEQ ID NO: 473) 

4) gi | 17456753 |ref|XP_06i614.1 | (XM_061614) similar to MOR 3Beta4 (H. sapiens) 
[Homo sapiens] (SEQ ID NO: 4 74) 

5) gi|l747278l|ref |XP_061811.1| (XM_061811) similar to OLFACTORY RECEPTOR 5112 
(HOR5BETA12) (H. sapiens) [Homo sapiens] (SEQ ID NO: 4 75) 

6) gi|H908220|gb|AAG41684.l| (AF133300) MOR 3 ' Beta4 [Mus musculus] (SEQ ID NO:476) 

10 20 30 40 50 60 

NOV34 1 1 

gi I 17472775 | 1 1 

gi | 17456767 j 1 MSLALDLCPLSQRLEAFPSSIVLFFQTAPAVRHPKGLLELHKTVPTSIKEELKGFFPTSD 60 

gi 1 17456753 j 1 1 

gi | 17472781 1 1 " " 1 

gi | 11908220 j 1 1 

70 80 90 100 110 120 

— I — I — I — I — I — I — I — I — I — I — I — ' , 

NOV34 1 

gi I 17472775 | 1 1 

gi | 17456767 | 61 HFIITDFI AK YHT DLK W AVLG I AT PRQ Q F KALNT C I S H I C AVL I F Y VP T L S AAMLHQ F AR 120 

gi|l7456753| 1 ~~ 1 

gi 1 17472781 j 1 ~ 1 

gi j 11908220 j 1 1 

130 140 150 160 170 180 

NOV34 1 

gi I 17472775 | 1 1 

gi j 17456767 j 121 DVSPMIHVLMADIFLLVPPLLNPIVYCVKTHQIREKVVGKLCPKNCFLKSKILPRCSFVP 180 

gi | 17456753 | 1 1 

gi | 17472781 j 1 " 1 

gi j 11908220 j 1 " ** 1 

190 200 210 220 230 240 

I I I I I I I I I I I I , 

NOV34 1 " "**"" 1 

gi I 17472775 | 1 1 

gi j 17456767 I 181 GFRLAYYYLPPHPSKVSFLDPVEKANRSAPTQFSPMPSADASLLADLGTFSSLQRATFFL 240 

gi|l7456753| 1 " X 

gi 1 17472781 j 1 " 

gi | 11908220 j 1 1 

250 260 270 280 290 300 

NOV34 1 ~ 1 

gi | 17472775 | 1 X 

gi 17456767 241 TGFQGLEGLHGWISIPFCFIYLTVILGNLTILHVICTDATLHGPMYYFLGMLAVTDLGLC 300 
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15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



gi | 17456753 | 1 
gi 1 17472781 1 1 
gi j 11908220 j 1 



310 320 
..|....|....|.--.| 



330 



340 
|....|....| 



350 



360 



NOV34 

gi | 17472775 | 
gi j 17456767 j 
gi | 17456753 j 
gi|l747278l| 
gi 1 11908220 | 



NOV34 

gi|l7472775 | 
gi|l7456767| 
gij 17456753 j 
gi 1 17472781 1 
gijll908220 j 



NOV34 

gi | 17472775 | 
gi | 17456767 j 
gi j 17456753 | 
gi|l747278l| 
gi 1 11908220 j 



NOV34 

gi | 17472775 | 
gi j 17456767 j 
gij 17456753 j 
gi j 17472781 j 
gij 11908220 j 



NOV34 

gi | 17472775 | 
gi |17456767 j 
gi | 17456753 j 
gij 17472781 j 
gij 11908220 j 



NOV34 

gi | 17472775 | 
gi j 17456767 j 
gi j 17456753 j 
gij 17472781 j 
gijll908220| 



NOV34 

gi | 17472775 | 
gij 17456767 j 
gi j 17456753 j 
gi|l747278l| 
gij 11908220 j 



NOV34 



3 01 LSTLPTVLGI FWFDTREIG I PACFTQLFFIHTLSSMESS VLLSMS IDRYVAVCNPLHDST 3 6 0 
1 



370 



380 



390 



400 



410 



420 



3 61 VLTPACIVKMGLSSVLRSALLILPLPFLLKRFQYCHSHVLAHAYCLHLEIMKLACSSIIV 420 



1 
1 

421 
1 
1 
1 



1 
1 

481 
1 
1 
1 



1 
1 

541 
1 
1 
1 



430 



460 



470 
..|.. 



480 



NH I YGLFWACTVGVD S L L I F LS Y AL I LRT VL S I AS HQERLRALNTC V SHI C A VLL FY I P 480 



490 



500 



510 



520 



530 
..|.. 



540 



MIGLSLVHRFGEHLPRWHLFMSYVYLLVPPLMNPI I YS I KTKQIRQRI IKKFQFI KSLR 540 
1 



550 



560 



570 



580 



590 



600 



I 



CQEGI 

SJs QVrgj - T^QEG I Y 

cnhqyclnllqdfgghppsplsphtmt^gsligSss^svsat 
ISIgdwniS, 



610 



620 



jDWNifflSD - AVE P I 

Bglf-S--vthpa] 
Jats-Sjss|ivsst 

630 640 




650 



660 




205 



299 



i orr:? t-n i p* 



S i! * o 



10 



15 



20 



25 



gi | 17472775 | 
gi | 17456767 | 
gi | 17456753 j 
gi|l747278l| 
gi|H908220 j 



NOV34 

gi | 17472775 | 
gi | 17456767 j 
gi j 17456753 j 
gi | 17472781 | 
gi | 11908220 j 



153 CC^dA^ 
721 ^^AL|F 
153 S|GVgL 
151 ffiFgTpF 

153 ? ~- 



'AlApLIplSg 212 

! F|ft^STj&glffl 78 0 

jSTFgMffl 210 

ITLLwBmB 212 



NOV34 


206 




gi| 17472775 | 


213 




gij 17456767 | 


781 


SgL 


gi j 17456753) 


213 


SXF 


gi |17472781| 


211 


LFF 


gi j 11908220 j 


213 







870 880 

— I — I • • 

300 
317 
879 

ijjHKFVLRRRF 315 

' frmfhhiki 312 

llcspkissitm 319 



FLSLKNMHSR- 
RV f TQAFCY 



Table 34E list the domain descriptions from DOMAIN analysis results against 
NOV34. This indicates that the NOV34 sequence has properties similar to those of other 
proteins known to contain this domain. 



30 



35 



40 



45 



50 



Table 34E Domain Analysis of NOV34 

gnl | Pfam|pf amOOOOl , 7tm_l, 7 transmembrane receptor (rhodopsin 
family). (SEQ ID NO: 810) 

CD-Length = 254 residues, 100.0% aligned 
Score = 41.6 bits (96) , Expect = 7e-05 



NOV4 0 : 


36 


Sbjct : 


1 


NOV4 0 : 


96 


Sbjct : 


61 


NOV4 0 : 


156 


Sbjct : 


117 


NOV4 0 : 


216 


Sbjct : 


172 


NOV40 : 


268 


Sbjct : 


232 



GNTTILTVIRTEPSVHQRMYLFLSMI.AI-TDLGI-TLTTI-PTVMQLLWFNVRRISSEARFAQ 
|| ++ || ♦ +11 11+ II II I + I 

GNLLVILVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 



I I 



+ 1 



+ + ++ ++ 



I I 



95 
60 



F F FLHG F S FME S S VLL AM S VD C Y V A ICC P LH Y AS I LTNE V I GRTG LA 1 1 C CC VL AVL P S L 155 

+ | 1 + 1 + 1 l + ll Mill 1+1+ NM I 

GALF WNGYAS I LLLTAI S IDRYLAI VHPLRYRRI RT PRRAKVLILLVWVLALLLS- ■ 116 



G-Protein Coupled Receptor (GPCRs) have been identified as an extremely large 
family of protein receptors in a number of species. At the phylogenetic level they can be 
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classified into four major subfamilies. These receptors share a seven transmembrane domain 
structure with many neurotransmitter and hormone receptors. They are likely to be involved in 
the recognition and transduction of various signals mediated by G-Protetns, hence their name 
G-Protein Coupled Receptors. The human GPCR genes are generally intron-less and belong to 
5 four gene subfamilies, displaying great sequence variability. These genes are dominantly 
expressed in olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large family of GPCRs in 
a number of species. As members of the GPCR family, these receptors share a seven 
transmembrane domain structure with many neurotransmitter and hormone receptors, and are 

10 likely to underlie the recognition and G-protein-mediated transduction of odorant signals. Like 
GPCRs, the ORs they can be expressed in a variety of tissues where they are thought to be 
involved in recognition and transmission of a variety of signals. The human OR genes are 
typically intron-less and belong to four different gene subfamilies, displaying great sequence 
variability. These genes are dominantly expressed in olfactory epithelium. 

1 5 The NOV34 nucleic acid of the invention encoding a Olfactory Receptor-like protein 

includes the nucleic acid whose sequence is provided in Table 34A, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 34A while still encoding a protein that maintains 
its Olfactory Receptor-like activities and physiological functions, or a fragment of such a 

20 nucleic acid. The invention further includes nucleic acids whose sequences are complementary 
to those just described, including nucleic acid fragments that are complementary to any of the 
nucleic acids just described. The invention additionally includes nucleic acids or nucleic acid 
fragments, or complements thereto, whose structures include chemical modifications. Such 
modifications include, by way of non-limiting example, modified bases, and nucleic acids 

25 whose sugar phosphate backbones are modified or derivatized. These modifications are carried 
out at least in part to enhance the chemical stability of the modified nucleic acid, such that 
they may be used,- for example, as antisense binding nucleic acids in therapeutic applications 
in a subject. In the mutant or variant nucleic acids, and their complements, up to about 36% of 
the residues may be so changed. 

30 The NOV34 protein of the invention includes the Olfactory Receptor-like protein 

whose sequence is provided in Table 34B. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
34B while still encoding a protein that maintains its Olfactory Receptor-like activities and 
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physiological functions, or a functional fragment thereof. In the mutant or variant protein, up 
to about 49% of the bases may be so changed. 

The NOV34 nucleic acids and proteins of the invention are useful in potential 
diagnostic and therapeutic applications implicated in various diseases and disorders described 
below and/or other pathologies. For example, the compositions of the present invention will 
have efficacy for treatment of patients suffering from: developmental diseases, MHCII and III 
diseases (immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders, Cell Shape disorders, Feeding 
disorders, control of feeding, potential obesity due to over-eating, potential disorders due to 
starvation (lack of apetite), noninsul in-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm, adenocarcinoma, lymphoma, prostate cancer, 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease, multiple sclerosis, treatment of 
Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, ulcers, asthma, 
allergies, benign prostatic hypertrophy, psychotic and neurological disorders, including 
anxiety, schizophrenia, manic depression, delirium, dementia, severe mental retardation, 
Dentatorubro-pallidoluysian atrophy (DRPLA), Hypophosphatemic rickets, autosomal 
dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease or Gilles 
de la Tourette syndrome and other diseases, disorders and conditions of the like. 

NOV34 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. For example the disclosed NOV34 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 

NOV35 

A disclosed NOV35 nucleic acid of 1 102 nucleotides (also referred to as CG567 18-01) 
encoding a novel Olfactory Receptor-like protein is shown in Table 35A. An open reading 
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frame was identified beginning with an ATG initiation codon at nucleotides 92-94 and ending 
with a TGA codon at nucleotides 1049-1051 . Putative untranslated regions, if any, found 
upstream from the initiation codon and downstream from the termination codon are underlined 
in Table 35A, and the start and stop codons are in bold letters. 



Table 35 A. NOV35 Nucleotide Sequence (SEQ ID NO: 137) 

ataactctactaactaaaagaaaatgtatctaaatattttacataacacttaatgtatcacttctatagga " 

TGGAAGCCTTTTGAGGACAA ATGCTGCAAAACCAGGACACCATGGAAATCCTAAGCAACTCAACATCTAAA 
TTTCCAACCTTCTTGTTGACCGGCATTCCTGGCCTAGAGTCTGCCCATGTCTGGATCTCCATTCCTTTCTG 
TTGTTTTTATGCCATTGCCCTCTCTGGGAACAGCGTGATCCTGTTTGTCATCATTACCCAGCAGAGTCTCC 
ATGAACCCATGTATTATTTCCTCTTCAGGCTATCAGCCACTGATCTGGGCTTGACTGTTTCTTCATTGTCA 
ACAACATTAGGTATCCTCTGGTTTGAGGCACGTGAAATCAGTCTATATAGCTGCATTGTCCAGATGTTTTT 
TCTTCATGGATTCACTTTTATGGAATCTGGAGTGCTGGTGGCTACAGCCTTTGACCGTTATGTGGCCATCT 
GTGACCCTCTGAGGCACACTACCATTCTCACTAATTCCAGAATCATTCAAATGGGTCTTCTGATGATTACA 
CGTGCTATAGTACTAATATTGCCACTACTTTTGCTCCTTAAGCCTCTCTATTTCTGTAGAATGAATGCCCT 
TTCTCACTCCTATTGTTACCATCCAGATGTGATTCAATTAGCATGTTCAGACATTCGGGCAAATAGCATCT 
GTGGATTAATTGATCTCATCCTGACCACTGGAATAGATACACCATGCATTGTCCTGTCATATATCTTAATT 
ATTCACTCTGTCCTCAGAATTGCCTCCCCTGAAGAATGGCACAAGGTCTTCAGCACCTGTGTCTCCCATGT 
GGGAGCAGTTGCTTTCTTCTACATCCACATGCTGAGCCTGTCCTTGGTGTATCGCTATGGTCGGTCAGCCC 
CCAGAGTAGTCCATTCAGTGATGGCTAATGTATACCTGCTTTTACCCCCTGTGCTCAACCCCATCATCGAC 
ACTOTAAAAACAAfl ar<A AATCCGCAAGXSCTATGCTCAGTCTGCTGCTTACAAAATGA ACAGACAT AGTTTT 
ATTTGATACAAACCTGGCATGAATGACTTGCACTGTA 



The NOV35 nucleic acid, located on chromosome 1 1, has 636 of 1006 bases (63%) 
identical to a gb:GENBANK-ID:AF133300|acc:AF133300.2 mRNA from Mus musculus 
(MOR 3'Betal, MOR 3'Beta2, MOR 3'Beta3, and MOR 3'Beta4 genes, complete cds; Cbx3 
pseudogene, complete sequence; and MOR 3'Beta5 and MOR 3'Beta6 genes, complete cds) (E 
10 =1.2e 51 ). 

A disclosed NOV35 polypeptide (SEQ ID NO: 138) encoded by SEQ ID NO: 137 is 
319 amino acid residues and is presented using the one-letter code in Table 35B. Signal P, 
Psort and/or Hydropathy results predict that NOV35 contains a signal peptide and is likely to 
be localized to the plasma membrane with a certainty of 0.6000. 



Table 35B. Encoded NOV35 protein sequence (SEQ ID NO: 138) 

MLQNQDTMEILSNSTSKFPTFLLTGIPGLESAHVWISIPFCCFYAIALSGNSVILFVIITQQSLHEPMYYFL 
FRLSATDLGLTVSSLSTTLGILWFEAREISLYSCIVQMFFLHGFTFMESGVLVATAFDRYVAICDPLRHTTI 
LTNSRIIQMGLLMITRAIVLILPLLLLLKPLYFCRMNALSHSYCYHPDVIQLACSDIRANSICGLIDLILTT 
GIDTPCIVLSYILIIHSVLRIASPEEWHKVFSTCVSHVGAVAFFYIHMLSLSLVYRYGRSAPRWHSVMANV 
YLLLPPVLNPI IDSVKTKQIRKAMLSLLLTK ; . 

15 

The disclosed NOV35 amino acid sequence has 160 of 309 amino acid residues (51%) 
identical to, and 219 of 309 amino acid residues (70%) similar to, the 321 amino acid residue 
ptnr :TREMBLNE W- ACC : A AG42 3 64 protein from Homo sapiens (Human) (odorant receptor 
HOR3'BETAl) (E = 2.5e" 81 ). 
20 NOV35 is predicted to be expressed in at least Apical microvilli of the retinal pigment 

epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma cell lines, corpus 
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callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral tissue, 
cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary artery and 
umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal hematopoietic 
cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, lung lymphoma 
5 cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express MHC II and III 
nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, pons, prostate, 
putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary artery in aortic) 
spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, thalamus and thymus 
tissue. This information was derived by determining the tissue sources of the sequences that 
10 were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 

NOV35 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 35C. 



Table 35C. BLAST results for NOV35 




Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%> 


Expect 


gi | 17456801 | ref |XP_ 
061626 . 1 | 
(XM_061626) 


similar to 
OLFACTORY 
RECEPTOR 5112 
(HOR5BETA12) (H. 
sapiens) [Homo 
sapiens] 


342 


196/312 
(62%) 


251/312 
(79%) 


le-95 


gi | 17456777 | ref |XP_ 
061621 . 1 | 
(XM_061621) 


similar to 
olfactory- 
receptor- like 
protein COR3beta 
(H. sapiens) 
[Homo sapiens] 


327 


165/312 
(52%) 


218/312 
(68%) 


6e-77 


gi | 174 567 6 7 |ref |XP_ 
061618 . 1 | 
(XM_061618) 


similar to 
prostate specific 
G-protein coupled 
receptor (H. 
sapiens ) [Homo 
sapiens] 


879 


158/304 
(51%) 


218/304 
(70%) 


8e-74 

t 


gi | 174727 81 | ref |XP_ 
061811 . 1 | 
(XM_061811) 


similar to 
OLFACTORY 
RECEPTOR 5112 
(HOR5BETA12) (H. 
sapiens) [Homo 
sapiens] 


312 


155/295 
(52%) 


209/295 
(70%) 


le-73 


gi | 182 02242 | sp |08 86 
2 8 j OXE2_RAT 


Olfactory- 
receptor 51E2 <G- 
protein coupled 
receptor RAlc) 


320 


147/305 
(48%) 


206/305 
(67%) 


4e-73 



The homology of these sequences is shown graphically in the ClustalW analysis shown 



in Table 35D. 
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Table 35D Clustal W Sequence Alignment 



1) NOV3 5 (SEQ ID NO: 13 8) 

2) gi| 174S680l|ref | XP__06 1626 . 1 | (XM_061626) similar to OLFACTORY RECEPTOR 5112 
(HOR5BETA12) (H. sapiens) [Homo sapiens] (SEQ ID NO:477) 

3) gi| 17456777 | ref | XP_06 1621 . 1 | (XM_061621) similar to olfactory receptor- like 
protein COR3beta (H. sapiens) [Homo sapiens] (SEQ ID NO:478) 

4) gi | 17456767 | ref |XP_061618 . 1 | (XM_061618) similar to prostate specific G-protein 
coupled receptor (H. sapiens) [Homo sapiens] (SEQ ID NO:479) 

5) gi| 17472781 | ref | XP_06 1811 . 1 | (XM_061811) similar to OLFACTORY RECEPTOR 5112 
(H0R5BETA12) (H. sapiens) [Homo sapiens] (SEQ ID NO: 4 80) 

6) gi | 18202242 | sp 1 08 8 62 8 | OXE2__RAT Olfactory receptor 51E2 (G-protein coupled 
receptor RAlc) (SEQ ID NO: 481) 

10 20 30 40 50 60 

NOV35 1 " 

gi | 17456801 1 1 ] " 1 

gi | 17456767 J 1 MSLALDLCPLSQRLEAFPSSIVLFFQTAPAVRHPKGLLELHKTVPTSIKEELKGFFPTSD 60 

gi|l747278l| 1 " " "t 

gi|l8202242| 1 " 

70 80 90 100 110 120 

NOV35 1 

gi | 17456801 1 1 " " " J 

g± | 17456767 J 61 HFIITDFIAKYHTDLKWAVLGIATPRQQFKALNTCISHICAVLIFYVPTLSAAMLHQFAR 120 

gi|1747278l| 1 """" i 

gijl8202242| 1 

130 140 150 160 170 180 

NOV35 1 

gi 1 17456801 1 1 I 1 

|i | 17456767 | 121 DVSPMIHVLMADIFLLVPPLLNPIVYCVKTHQIREKWGKLCPKNCFLKSKILPRCSFVP 180 

gi 1 17472781 1 1 ~ ~ ~ * 

gi j 18202242 1 1 

190 200 210 220 230 240 

NOV35 1 

gi 1 17456801 1 1 " 

gi | 1745 6777 | 1 

gi | 17456767 | 181 GFRLAYYYLPPHPSKVSFLDPVEKANRSAPTQFSPMPSADASLLADLGTFSSLQRATFFL 240 

gi 1 17472781 1 1 ~~ * 

gi 1 18202242 | 1 

250 260 270 280 290 300 

NOV35 1 

gi|l745680l| 1 ~ * 

|i| 1745676?! 241 TGFQGLEGLHGWISIPFCFIYLTVILGNLTILHVICTDATLHGPMYYFLGMLAVTDLGLC 300 

gi | 17472781 | 1 * 

gi|l8202242| 1 

310 320 330 340 350 360 

NOV35 1 

gi | 17456801 | 1 _ * 

g±\ 17456767 | 301 LSTLPTVLGIFWFDTREIGIPACFTQLFFIHTLSSMESSVLLSMSIDRYVAVCNPLHDST 360 

gi | 17472781 | 1 " * 

gi|l8202242| 1 
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370 380 390 400 



410 420 
,.|....|....| 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



NOV3 5 1 

gi | 17456801 | 1 

gi|l7456777| 1 

gijl7456767 I 361 

gi|l747278l| 1 

gi 1 18202242 j 1 



NOV35 

gi | 17456801 
gi j 17456777 
gi j 17456767 
gi|l7472781 
gi j 18202242 



NOV35 

gi|l745680l| 
gi j 17456777 j 
gi | 17456767 | 
gi|l747278l| 
gi|l8202242| 



NOV3 5 

gi|l745680l| 
gi | 17456777 j 
gi|l7456767| 
gi 1 17472781 1 
gi | 18202242 | 



NOV3 5 

gi | 17456801 | 
gi | 17456777 | 
gi | 17456767 | 
gi | 17472781 j 
gi|l8202242| 



NOV35 

gi|l745680l| 
gi | 17456777 j 
gij 17456767 j 
gi|l747278l| 
gij 18202242 j 



VLTPACIVKMGLSSVLRSALLILPLPFLLKRFQYCHSHVLAHAYCLHLEIMKLACSSIIV 420 
1 



430 
..|.. 



440 



450 460 470 480 



1 
1 
1 

421 

1 

1 



1 
1 
1 

481 

1 

1 



NHIYGLFWACTVGVDSLLIFLSYALILRTVLSIASHQERLRALNTCVSHICAVLLFYIP 480 
1 



490 



500 



510 520 530 540 



MIGLSLVHRFGEHLPRWHLFMSYVYLLVPPLMNPIIYSIKTKQIRQRIIKKFQFIKSLR 540 

1 

1 



580 



590 



NOV35 


160 


gi 


17456801 


165 


gi 


17456777 


153 


gi 


17456767 


721 


gi 


17472781 


151 


gi 


18202242 


150 



NOV3 5 

gi | 17456801 | 
gij 17456777 j 
gij 17456767 j 
gi|1747278l| 
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45 



gl 1 18202242 | 210 VMFBsBS*15l@&9lQl p l$ S S A iS FC! 

850 860 870 



LDV 269 



NOV3 5 


280 






g±l 


17456801 | 


285 






gi| 17456777 | 


273 




CSTlJ 


gi |17456767 j 


841 


LE 




gi|l747278l| 


271 




gi|l8202242 j 


270 






NOV35 


319 




319 


gi 


17456801| 


342 


E 


342 


gi 


17456777 j 


327 




327 


gi 


17456767 j 


879 




879 


gi 


17472781| 


312 




312 


gi 


18202242 j 


320 




320 



KTKQI 
K0KQl| 
KTK0I 

KTKQI 
KTK^I 
KTKQI 



-Hi: 



890 900 

|....|....|....| 

319 

iSKSNHQLFLIRDKAIY 341 

^SWGFNVRGLRGRWD- - 3 27 

879 

g|FSFHHIKI 312 

RVL^f|KsCD|3dIEAGGNT 3 20 



880 

lsBllt] 

|q ^FpHLQS| 

SDR V'L'IIAFCY 



Table 35E list the domain descriptions from DOMAIN analysis results against 
NOV35. This indicates that the NOV35 sequence has properties similar to those of other 
proteins known to contain this domain. 



gnl |Pfam|pfam00001, 7tm_l, 
(SEQ ID NO: 810) 

CD-Length = 254 residues, 4 0.9% aligned 
Score = 56.2 bits (134), Expect = 3e-09 



Table 35E Domain Analysis of NOV35 

7 transmembrane receptor (rhodopsin family) 



NOV35 * 50 GNSVILFVIITQQSLHEPMYYFLFRLSATDLGLTVSSIjSTTIiGILWFEAREISLYSCIVQ 109 

| | +++ ||+ +| | || |+ II II I + 

Sbjct : 1 GNLLVILVILRTKKLRTPTNI FLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 6 0 

NOV35- 110 M F FLHGFT FME S G VL V AT AFDR Y V A I CD P LRHTT I LTNS R 1 1 QM 153 

HI - II Ml IM+ I I I + 

Sbjct: 61 GALFWNGYASILLLTAI SIDRYLAIVHPLRYRRIRTPRRAKVL 104 

G-Protein Coupled Receptor (GPCRs) have been identified as an extremely large 
family of protein receptors in a number of species. At the phylogenetic level they can be 
classified into four major subfamilies. These receptors share a seven transmembrane domain 
structure with many neurotransmitter and hormone receptors. They are likely to be involved in 
the recognition and transduction of various signals mediated by G-Proteins, hence their name 
G-Protein Coupled Receptors. The human GPCR genes are generally intron-less and belong to 
four gene subfamilies, displaying great sequence variability. These genes are dominantly 
expressed in olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large family of GPCRs in 

a number of species. As members of the GPCR family, these receptors share a seven 

transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
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likely to underlie the recognition and G-protein-mediated transduction of odorant signals. Like 
GPCRs, the ORs they can be expressed in a variety of tissues where they are thought to be 
involved in recognition and transmission of a variety of signals. The human OR genes are 
typically intron-less and belong to four different gene subfamilies, displaying great sequence 
5 variability. These genes are dominantly expressed in olfactory epithelium. 

The NOV35 nucleic acid of the invention encoding a Olfactory Receptor-like protein 
includes the nucleic acid whose sequence is provided in Table 35 A, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 35A while still encoding a protein that maintains 
1 0 its Olfactory Receptor-like activities and physiological functions, or a fragment of such a 

nucleic acid. The invention further includes nucleic acids whose sequences are complementary 
to those just described, including nucleic acid fragments that are complementary to any of the 
nucleic acids just described. The invention additionally includes nucleic acids or nucleic acid 
fragments, or complements thereto, whose structures include chemical modifications. Such 
15 modifications include, by way of non-limiting example, modified bases, and nucleic acids 

whose sugar phosphate backbones are modified or derivatized. These modifications are carried 
out at least in part to enhance the chemical stability of the modified nucleic acid, such that 
they may be used, for example, as antisense binding nucleic acids in therapeutic applications 
in a subject. In the mutant or variant nucleic acids, and their complements, up to about 37% of 
20 the residues may be so changed. 

The NOV35 protein of the invention includes the Olfactory Receptor-like protein 
whose sequence is provided in Table 35B. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
35B while still encoding a protein that maintains its Olfactory Receptor-like activities and 
25 physiological functions, or a functional fragment thereof. In the mutant or variant protein, up 
to about 52% of the bases may be so changed. 

The NOV35 nucleic acids and proteins of the invention are useful in potential 
diagnostic and therapeutic applications implicated in various diseases and disorders described 
below and/or other pathologies. For example, the compositions of the present invention will 
30 have efficacy for treatment of patients suffering from: developmental diseases, MHCII and III 
diseases (immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders, Cell Shape disorders, Feeding 
disorders, control of feeding, potential obesity due to over-eating, potential disorders due to 



starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV-1 or H1V-2), pain, 
cancer (including but not limited to Neoplasm, adenocarcinoma, lymphoma, prostate cancer, 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 
hypertension, urinary retention, osteoporosis, Crohn's disease, multiple sclerosis, treatment of 
Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, ulcers, asthma, 
allergies, benign prostatic hypertrophy, psychotic and neurological disorders, including 
anxiety, schizophrenia, manic depression, delirium, dementia, severe mental retardation, 
Dentatorubro-pallidoluysian atrophy (DRPLA), Hypophosphatemic rickets, autosomal 
dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease or Gilles 
de la Tourette syndrome and other diseases, disorders and conditions of the like. 

NOV35 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. For example the disclosed NOV35 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 

NOV36 

NOV36 includes two novel cadherin 1 1-like proteins disclosed below. The disclosed 
sequences have been named NOV36a and NOV36b. 

NOV36a 

A disclosed NOV36a nucleic acid of 2476 nucleotides (also referred to as CG56729- 
01) encoding a novel cadherin 1 1-like protein is shown in Table 36A. An open reading frame 
was identified beginning with an ATG initiation codon at nucleotides 46-48 and ending with a 
TGA codon at nucleotides 2389-2391. Putative untranslated regions, if any, found upstream 
from the initiation codon and downstream from the termination codon are underlined in Table 
36A, and the start and stop codons are in bold letters. 

Table 36A. NOV36a Nucleotide Sequence (SEQ ID NO: 139) 
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TCAGCACCCAGGGCCAGTGAACAGAGCCCTGGCTGGAGTCCAAAC ATQTGGGGCCTGGTGAGGCTCCTGC 
TGGCCTGGCTGGGTGGCTGGGGCTGCATGGGGCGTCTGGCAGCCCCAGCCCGGGCCTGGGCAGGGTCCCG 
GGAACACCCAGGGCCTGCTCTGCTGCGGACTCGAAGGAGCTGGGTCTGGAACCAGTTCTTTGTCATTGAG 
GAATATGCTGGTCCAGAGCCTGTTCTCATTGGCAAGCTGCACTCGGATGTTGACCGGGGAGAGGGCCGCA 
CCAAGTACCTGTTGACCGGGGAGGGGGCAGGCACCGTATTTGTGATTGATGAGGCCACAGGCAATATTCA 
TGTTACCAAGAGCCTTGACCGGGAGGAAAAGGCGCAATATGTGCTACTGGCCCAAGCCGTGGACCGAGCC 
TCCAACCGGCCCCTGGAGCCCCCATCAGAGTTCATCATCAAAGTGCAAGACATCAACGACAATCCACCCA 
TTTTTCCCCTTGGGCCCTACCATGCCACCGTGCCCGAGATGTCCAATGTCGGTACATCAGTGATCCAGGT 
GACTGCTCACGATGCTGATGACCCCAGCTATGGGAACAGTGCCAAGCTGGTGTACACTGTTCTGGATGGA 
CTGCCTTTCTTCTCTGTGGACCCCCAGACTGGTGTGGTGCGTACAGCCATCCCCAACATGGACCGGGAGA 
CACAGGAGGAGTTCTTGGTGGTGATCCAGGCCAAGGACATGGGCGGCCACATGGGGGGGCTGTCAGGCAG 
CACTACGGTGACTGTCACGCTCAGCGATGTCAACGACAACCCCCCCAAGTTCCCACAGAGTCTATACCAG 
T T CT C CGTGGTGG AG AC AGCTGG AC C TGG C AC AC TGGTGGGC CGGCT CCGGGC CC AGG AC C C AG AC CTGG 
GGGACAACGCCCTGATGGCATACAGCATCCTGGATGGGGAGGGGTCTGAGGCCTTCAGCATCAGCACAGA 
CTTGCAGGGTCGAGACGGGCTCCTCACTGTCCGCAAGGTTCTAGACTTTGAGAGCCAGCGCTCCTACTCC 
TTCCGTGTCGAGGCCACCAACACGCTCATTGACCCAGCCTATCTGCGGCGAGGGCCCTTCAAGGATGTGG 
CCTCTGTGCGTGTGGCAGTGCAAGATGCCCCAGAGCCACCTGCCTTCACCCAGGCTGCCTACCACCTGAC 
AGTGCCTGAGAACAAGGCCCCGGGGACCCTGGTAGGCCAGATCTCCGCGGCTGACCTGGACTCCCCTGCC 
AGCCCAATCAGGTACTCCATCCTCCCCCACTCAGATCCGGAGCGTTGCTTCTCTATCCAGCCCGAGGAAG 
GCACCATCCATACAGCAGCACCCCTGGATCGCGAGGCTCGCGCCTGGCACAACCTCACTGTGCTGGCTAC 
AGAGCTCGGTGAGGACTCCCAGGCCTCGCGCGTGCAAGTGGCCATCCAGACCCTGGATGAGAATGACAAT 
GCTCCCCAGCTGGCTGAGCCCTACGATACTTTTGTGTGTGACTCTGCAGCTCCTGGCCAGCTGATTCAGG 
TCATCCGGGCCCTGGACAGAGATGAAGTTGGCAACAGTAGCCATGTCTCCTTTCAAGGTCCTCTGGGCCC 
TGATGCCAACTTTACTGTCCAGGACAACCGAGATGGCTCCGCCAGCCTGCTGCTGCCCTCCCGCCCTGCT 
CCACCCCGCCATGCCCCCTACTTGGTTCCCATAGAACTGTGGGACTGGGGGCAGCCGGCGCTGAGCAGCA 
CTGCCACAGTGACTGTTAGTGTGTGCCGCTGCCAGCCTGACGGCTCTGTGGCATCCTGCTGGCCTGAGGC 
TCACCTCTCAGCTGCTGGGCTCAGCACCGGCGCCCTGCTTGCCATCATCACCTGTGTGGGTGCCCTGCTT 
GCCCTGGTGGTGCTCTTCGTGGCCCTGCGGCGGCAGAAGCAAGAAGCACTGATGGTACTGGAGGAGGAGG 
ACGTCCGAGAGAACATCATCACCTACGACGACGAGGGCGGCGGCGAGGAGGACACCGAGGCCTTCGACAT 
CACGGCCTTGCAGAACCCGGACGGGGCGGCCCCCCCGGCGCCCGGCCCTCCCGCGCGCCGAGACGTGTTG 
CCCCGGGCCCGGGTGTCGCGCCAGCCCAGACCCCCCGGCCCCGCCGACGTGGCGCAGCTCCTGGCGCTGC 
GGCTCCGCGAGGCGGACGAGGACCCCGGCGTACCCCCGTACGACTCGGTGCAGGTGTACGGCTACGAGGG 
CCGCGGCTCCTCTTGCGGCTCCCTCAGCTCCCTGGGCTCCGGCAGCGAAGCCGGCGGCGCCCCCGGCCCC 
GCGGAGCCGCTGGACGACTGGGGTCCGCTCTTCCGCACCCTGGCCGAGCTGTATGGGGCCAAGGAGCCCC 
CGGC C C C CTG AGCG C C CGGGC TGG C C CGG CC C AC CG CGGGGGGGGGG C AGCGGG C AC AGGC C CT CT GAG T 
GAGCCCCACGGGGTCCAGGCGGGCGG 



The disclosed NOV36a nucleic acid sequence, located on chromosome 14, has 992 of 
1514 bases (65%) identical to a gb:GENBANK-ID:HUMCAl lA|acc:L34056.1 mRNA from 
Homo sapiens (cadherin-1 1 mRNA, complete cds) (E = 7.3e~ 145 ) . 

A disclosed NOV36a polypeptide (SEQ ID NO: 140) encoded by SEQ ID NO: 139 is 
781 amino acid residues and is presented using the one-letter amino acid code in Table 36B. 
Signal P, Psort and/or Hydropathy results predict that NOV36a contains a signal peptide and is 
likely to be localized in the mitochondrial inner membrane with a certainty of 0.8227 in one 
embodiment and to the plasma membrane with a certainty of 0.4400 in an additional 
embodiment. The most likely cleavage site for a NOV36a polypeptide is between amino acids 
16 and 17: GWG-CM. 



Table 36B. Encoded NOV36a protein sequence (SEQ ID NO:140). 

MWGLVRLLI^WLGGWGCMGRLAAPARAWAGSREHPGPALLRTRRSWVWNQFFVIEEYAGPEPVLIGKLHSDVDRG 
EGRTK YL LTGE GAGT VF V I DE ATGN I HVT K S LDRE E KAQ YVXi L AQ AVDRASNR P LE P P S E F 1 1 KVQD I NDN P P I F 
PLGPYHATVPEMSNVGTSVIQVTAHDADDPSYGNSAKLVYTVLDGLPFFSVDPQTGWRTAIPNMDRETQEEFLV 
V I Q AKDMGGHMGGL S GST T VT VTL S D VNDNP PKFPQSLYQFS WE TAG PGTL VGRLRAQD PDLGDN ALM AY S I LD 
GEGSEAFSISTDLQGRDGLLTVRKVLDFESQRSYSFRVEATNTLIDPAYLRRGPFKDVASVRVAVQDAPEPPAFT 
QAAYHLTVPENKAPGTLVGQISAADLDSPASPIRYSILPHSDPERCFSIQPEEGTIHTAAPLDREARAWHNLTVL 
ATELGEDSQASRVQVAIQTLDENDNAPQLAEPYDTFVCDSAAPGQLIQVIRALDRDEVGNSSHVSFQGPLGPDAN 
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FTVQDNRDGSASLLLPSRPAPPRHAPYLVPIELWDWGQPALSSTATVTVSVCRCQPDGSVASCWPEAHLSAAGLS 
TGALLAIITCVGALLALWLFVALRRQKQEALMVLEEEDVRENIITYDDEGGGEEDTEAFDITALQNPDGAAPPA 
PGPPARRDVLPRARVSRQPRPPGPADVAQLLALRLREADEDPGVPPYDSVQVYGYEGRGSSCGSLSSLGSGSEAG 
GAPGPAEPLDDWGPLFRTLAELYGAKEPPAP 



The disclosed NOV36a amino acid sequence has 434 of 746 amino acid residues 
(58%) identical to, and 552 of 746 amino acid residues (73%) similar to, the 796 amino acid 
residue ptnr:SWISSPROT-ACC:P55287 protein from Homo sapiens (Human) (CADHERIN- 
5 1 1 precursor (osteoblast-cadherin) (OB-cadherin) (OSF-4)) (E = 2.3e -229 ). 

NOV36a is predicted to be expressed in at least Cerebral Medulla/Cerebral white 
matter, Gall Bladder, Retina, Temporal Lobe and Uterus. This information was derived by 
determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or RACE 
10 sources. 

In addition, NOV36a is predicted to be expressed in the following tissues because of 
the expression pattern of (GENBANK-ID: gb : G EN BANK-ID : HUMC A 1 1 A|acc:L34056.1) a 
closely related Homo sapiens cadherin-1 1 mRNA, complete cds homolog in species Homo 
sapiens: osteoblasts. 

15 

NOV36b 

A disclosed NOV36b nucleic acid of '2476 nucleotides (also referred to as CG56729- 
02) encoding a novel Cadherin 1 1-like protein is shown in Table 36C. An open reading frame 
was identified beginning with an ATG initiation codon at nucleotides 46-48 and ending with a 
20 TGA codon at nucleotides 2389-2391. Putative untranslated regions, if any, found upstream 
from the initiation codon and downstream from the termination codon are underlined in Table 
36C, and the start and stop codons are in bold letters. 



Table 36C. NOV36b Nucleotide Sequence (SEQ ID NO:141) 

TCAGCACCCAGGGCCAGTGAACAGAGCCCTGGCTGGAGTCCAAAC ATQTGGGGCCTGGTGAGGCTCCTGC 
TGGCCTGGCTGGGTGGCTGGGGCTGCATGGGGCGTCTGGCAGCCCCAGCCCGGGCCTGGGCAGGGTCCCG 
GGAACACCCAGGGCCTGCTCTGCTGCGGACTCGAAGGAGCTGGGTCTGGAACCAGTTCTTTGTCATTGAG 
GAATATGCTGGTCCAGAGCCTGTTCTCATTGGCAAGCTGCACTCGGATGTTGACCGGGGAGAGGGCCGCA 
CCAAGTACCTGTTGACCGGGGAGGGGGCAGGCACCGTATTTGTGATTGATGAGGCCACAGGCAATATTCA 
TGTTACCAAGAGCCTTGACCGGGAGGAAAAGGCGCAATATGTGCTACTGGCCCAAGCCGTGGACCGAGCC 
TCCAACCGGCCCCTGGAGCCCCCATCAGAGTTCATCATCAAAGTGCAAGACATCAACGACAATCCACCCA 
TTTTTCCCCTTGGGCCCTACCATGCCACCGTGCCCGAGATGTCCAATGTCGGTACATCAGTGATCCAGGT 
GACTGCTCACGATGCTGATGACCCCAGCTATGGGAACAGTGCCAAGCTGGTGTACACTGTTCTGGATGGA 
CTGCCTTTCTTCTCTGTGGACCCCCAGACTGGTGTGGTGCGTACAGCCATCCCCAACATGGACCGGGAGA 
CACAGGAGGAGTTCTTGGTGGTGATCCAGGCCAAGGACATGGGCGGCCACATGGGGGGGCTGTCAGGCAG 
CACTACGGTGACTGTCACGCTCAGCGATGTCAACGACAACCCCCCCAAGTTCCCACAGAGTCTATACCAG 
TTCTCCGTGGTGGAGACAGCTGGACCTGGCACACTGGTGGGCCGGCTCCGGGCCCAGGACCCAGACCTGG 
GGGACAACGCCCTGATGGCATACAGCATCCTGGATGGGGAGGGGTCTGAGGCCTTCAGCATCAGCACAGA 
CTTGCAGGGTCGAGACGGGCTCCTCACTGTCCGTAAGCCCCTAGACTTTGAGAGCCAGCGCTCCTACTCC 
TTCCGTGTCGAGGCCACCAACACGCTCATTGACCCAGCCTATCTGCGGCGAGGGCCCTTCAAGGATGTGG 
CCTCTGTGCGTGTGGCAGTGCAAGATGCCCCAGAGCCACCTGCCTTCACCCAGGCTGCCTACCACCTGAC 
AGTGCCTGAGAACAAGGCCCCGGGGACCCTGGTAGGCCAGATCTCCGCGGCTGACCTGGACTCCCCTGCC 
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AGCCCAATCAGGTACTCCATCCTCCCCCACTCAGATCCGGAGCGTTGCTTCTCTATCCAGCCCGAGGAAG 
GCACCATCCATACAGCAGCACCCCTGGATCGCGAGGCTCGCGCCTGGCACAACCTCACTGTGCTGGCTAC 
AGAGCTCGGTGAGGACTCCCAGGCCTCGCGCGTGCAAGTGGCCATCCAGACCCTGGATGAGAATGACAAT 
GCTCCCCAGCTGGCTGAGCCCTACGATACTTTTGTGTGTGACTCTGCAGCTCCTGGCCAGCTGATTCAGG 
TCATCCGGGCCCTGGACAGAGATGAAGTTGGCAACAGTAGCCATGTCTCCTTTCAAGGTCCTCTGGGCCC 
TGATGCCAACTTTACTGTCCAGGACAACCGAGATGGCTCCGCCAGCCTGCTGCTGCCCTCCCGCCCTGCT 
CCACCCCGCCATGCCCCCTACTTGGTTCCCATAGAACTGTGGGACTGGGGGCAGCCGGCGCTGAGCAGCA 
CTGCCACAGTGACTGTTAGTGTGTGCCGCTGCCAGCCTGACGGCTCTGTGGCATCCTGCTGGCCTGAGGC 
TCACCTCTCAGCTGCTGGGCTCAGCACCGGCGCCCTGCTTGCCATCATCACCTGTGTGGGTGCCCTGCTT 
GCCCTGGTGGTGCTCTTCGTGGCCCTGCGGCGGCAGAAGCAAGAAGCACTGATGGTACTGGAGGAGGAGG 
ACGTCCGAGAGAACATCATCACCTACGACGACGAGGGCGGCGGCGAGGAGGACACCGAGGCCTTCGACAT 
CACGGCCTTGCAGAACCCGGACGGGGCGGCCCCCCCGGCGCCCGGCCCTCCCGCGCGCCGAGACGTGTTG 
CCCCGGGCCCGGGTGTCGCGCCAGCCCAGACCCCCCGGCCCCGCCGACGTGGCGCAGCTCCTGGCGCTGC 
GGCTCCGCGAGGCGGACGAGGACCCCGGCGTACCCCCGTACGACTCGGTGCAGGTGTACGGCTACGAGGG 
CCGCGGCTCCTCTTGCGGCTCCCTCAGCTCCCTGGGCTCCGGCAGCGAAGCCGGCGGCGCCCCCGGCCCC 
GCGGAGCCGCTGGACGACTGGGGTCCGCTCTTCCGCACCCTGGCCGAGCTGTATGGGGCCAAGGAGCCCC 
CGGCCCCCTG AGCGCCCGGGCTGGCCCGGCCCACCGCGGGGGGGGGGCAGCGGGCACAGGCCCTCTGAGT 
GAGCCCCACGGGGTCCAGGCGGGCGG 



The disclosed NOV36b nucleic acid sequence, located on chromosome 1 1, has 1 100 of 
1 109 bases (99%) identical to a gb:GENBANK-ID:AK025342|acc:AK025342.1 mRNA from 
Homo sapiens (cDNA: FLJ21689 fis, clone COL09459) (E = 2.3e' 240 ) . 
5 A disclosed NOV36b polypeptide (SEQ ID NO: 142) encoded by SEQ ID NO: 141 is 

781 amino acid residues and is presented using the one-letter amino acid code in Table 36D. 
Signal P, Psort and/or Hydropathy results predict that NOV36b contains a signal peptide and is 
likely to be localized to the mitochondrial inner membrane with a certainty of 0.8227 in one 
embodiment and to the plasma membrane with a certainty of 0.4400 in an additional 
10 embodiment. The most likely cleavage site for a NOV36b peptide is between amino acids 16 
and 17:GWG-CM. 



Table 36D. Encoded NOV36b protein sequence (SEQ ID NO: 142). 

MWGLVRLLLAWLGGWGCMGRLAAPARAWAGSREHPGPALLRTRRSWVWNQFFVIEEYAGPEPVLIGKLHSDVDRG 
EG RT K YL LTG EGAGT VFV I DE ATGN I HVT KS LDREE KAQ Y VL L AQ AVDRA SNR PLEPPSEFII K VQD I NDN P P I F 
PLGPYHATVPEMSNVGTSVIQVTAHDADDPSYGNSAKLVYTVLDGLPFFSVDPQTGWRTAIPNMDRETQEEFLV 

V I Q AKDMGGHMGG L S GS T T VT VT L S D VNDNP PKFPQSLYQFS WE T AG PGTL VGR LRAQD P DLGDN ALM AY S I LD 
GEGSEAFSISTDLQGRDGLLTVRKPLDFESQRSYSFRVEATNTLIDPAYLRRGPFKDVASVRVAVQDAPEPPAFT 
QAAYHLTVPENKAPGTLVGQISAADLDSPASPIRYSILPHSDPERCFSIQPEEGTIHTAAPLDREARAWHNLTVL 
ATELGEDSQASRVQVAIQTLDENDNAPQLAEPYDTFVCDSAAPGQLIQVIRALDRDEVGNSSHVSFQGPLGPDAN 
FTVQDNRDGSASLLLPSRPAPPRHAPYLVPIELWDWGQPALSSTATVTVSVCRCQPDGSVASCWPEAHLSAAGLS 
TGALLAI ITCVGALLALWLFVALRRQKQEALMVLEEEDVRENI I TYDDEGGGEEDTEAFDITALQNPDGAAPPA 
PGPPARRDVLPRARVSRQPRPPGPADVAQLLALRLREADEDPGVPPYDSVQVYGYEGRGSSCGSLSSLGSGSEAG 
GAPGPAEPLDDWGPLFRTLAELYGAKEPPAP . , 



The disclosed NOV36b amino acid sequence has 435 of 746 amino acid residues 
(58%) identical to, and 553 of 746 amino acid residues (74%) similar to, the 796 amino acid 
15 residue ptnr:SWISSNEW-ACC:P55287 protein from Homo sapiens (Human) (cadherin-1 1 
precursor (osteoblast-cadherin) (OB-cadherin) (OSF-4)) (E = 2.5e" 230 ). 

NOV36b is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
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nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raj i, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. Expression information was derived from the tissue sources of the 
5 sequences that were included in the derivation of the NOV36b sequence. 

In addition, NOV36b is predicted to be expressed in the following tissues because of 
the expression pattern of (GENBANK-ID: gb:GENBANK-ID:AK025342|acc:AK025342.1) a 
closely related Homo sapiens cDNA: FLJ21689 fis, clone COL09459 homolog in species 
Homo sapiens: ostoeblasts. 
10 NOV36a also has homology to the amino acid sequences shown in the BLASTP data 

listed in Table 36E. 



Table 36E. BLAST results for NOV36a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 16553903 | dbj | BAB 
71613 . 1 | (AK057922) 


unnamed protein 
product [Homo 
sapiens] 


493 


459/466 
(98%) 


461/466 
(98%) 


0.0 


gi 

19 


13626134 | sp|0933 
CADB CHICK 


CADHERIN-11 
PRECURSOR 


792 


430/749 
(57%) 


552/749 
(73%) 


0 . 0 


07: 


3377485 |gb|AAC28 
i.l| (AF002983) 


cadherin 
precursor 
[Xenopus laevis] 


794 


429/751 
(57%) 


547/751 
(72%) 


0 . 0 


gi | 1705549 | sp|P5528 
8 | CADB_MOUSE 


CADHERIN-11 
PRECURSOR 
( OSTEOBLAST - 
CADHERIN) (OB- 
CADHERIN) (OSF-4) 


796 


432/753 
(57%) 


552/753 
(72%) 


0.0 


gi | 1377894 | dbj | BAAO 
4798.1] (D21254) 


OB-cadherin- 1 
[Homo sapiens] 


796 


434/753 
(57%) 


552/753 
(72%) 


0.0 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
15 in Table 36F. 

Table 36F Information for the ClustalW proteins 

1) NOV36a (SEQ ID NO: 140) 

2) NOV36b (SEQ ID NO: 142) 

3) gi | 16553903 | dbj | BAB71613 - 1 | (AK057922) unnamed protein product [Homo sapiens] 
20 (SEQ ID NO: 482) 

4) gi| 13626134 |sp|093319|CADB_CHICK CADHERIN-11 PRECURSOR (SEQ ID NO:483) 

5) gi | 3377485 |gb| AAC28073 . 1 | (AF002983) cadherin precursor [Xenopus laevis] (SEQ ID 
NO:484) 

6) gi| 1705549 |sp|P55288|CADB_MOUSE CADHERIN-11 PRECURSOR ( OSTEOBLAST - CADHERIN ) (OB- 
25 CADHERIN) (OSF-4) (SEQ ID NO:485) 

7) gi | 1377894 | dbj |BAA04798.l| (D21254) OB-cadherin- 1 [Homo sapiens] (SEQ IDNO:486) 
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NOV36a 
NOV36b 
gi|l6553903| 




20 

|... 
LGGWG 1 
LGGWGCMG 
LGGWGCMG 




60 
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RgjWVWNQF 
WVWNQF 
RfiSWVWNQF 



51 
51 
51 



gi | 13626134 | 
gi j 3377485 | 
gi j 1705549 | 
gi | 1377894 j 



i mkedncl: 

1 MKKDFCLHGL 

1 MKENYCLQ, 

1 MKENYCLQ, 




ffl^YSHAI TTEKjjNHVRP^L: 

||lch shafapeHrghlrp; 




NOV3 6a 

NOV3 6b 

gi 1 16553903 | 

gi|l3626134| 

gi | 3377485 | 

gi j 1705549 j 

gi j 1377894 j 




120 



TKgLDREE 

tkSldree 
tkIldree 
tkBldree 
tkjjldree 

TKHLDREE 
TKMLDREE 



111 
111 
111 
120 
120 
120 
120 



130 



140 



150 



NOV3 6a 

NOV3 6 b 

gi | 16553903 | 

gi j 13626134 | 

gi|3377485| 

gi j 1705549 | 

gi j 1377894 j 



NOV3 6a 
NOV3 6b 
gi|l6553903| 
gi | 13626134 j 
gi|3377485| 
gi j 1705549 | 
gi | 1377894 j 



KVQDINDNPP 
KVQDINDNPP 
KVQDINDNPP 
KVQDINDNPP 
KVQDINDNPP 
KVQDINDNPP 
KVQDINDNPP 





170 
. . I . . 



180 



| 



pemsnvgtsviq 
pemsnvgtsviq 
yhaHvpemsnvgtsviq 
^yhaBvpeEsnvgtsviq 
yhaSvpemsnvgtsvi Q 
yhaSvpeHsnvgtsviq 
yhaSvpeBsnvgtsviq 



171 
171 
171 
180 
180 
180 
180 
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PNMDREgaEE 
PNMDReHSeE 

pnmdreBiee 
pnmdre^ee 
pnmpre SS ee 

PNMPRe HI eE 
PNMDReHIeE 



240 




250 



260 



NOV3 6a 
NOV3 6b 
gi | 16553903 | 
gi j 13626134 | 
gi j 3377485 | 
gi j 1705549 | 
gi | 1377894 j 



NOV3 6a 
NOV3 6b 

gi 
gi 
gi 
gi 
gi 



16553903 | 
13626134 | 
3377485 | 
1705549 | 
1377894 



NOV3 6a 
NOV3 6b 
gi | 16553903 | 
gi | 13626134 | 
gi | 3377485 | 
gi j 1705549 j 
gi | 1377894 | 



NOV3 6a 
NOV3 6b 
gi 1 16553903 | 




|AYfflR 3 51 
JAYgR 351 
jKFlS 3 60 
HjN 360 
|KFa|S 360 
jKF^S 3 60 



370 



380 



390 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



gi | 13626134 | 
gi|3377485 | 
gi j 1705549 j 
gi j 1377894 j 



NOV36a 

NOV36b 

gi | 16553903 | 

gi | 13626134 j 

gi|3377485 | 

gi | 1705549 j 

gi j 1377894 j 



NOV3 6a 

NOV36b 

gi | 16553903 | 

gi|l3626134| 

gi | 3377485 | 

gi j 1705549 I 

gi j 1377894 j 



NOV3 6a 

NOV36b 

gi | 16553903 | 

gi | 13626134 j 

gi|3377485 | 

gi | 1705549 | 

gi | 1377894 | 



NOV3 6a 
NOV3 6b 

gi 
gi 
gi 
gi 
gi 



16553903 | 
13626134 | 
3377485 | 
1705549 | 
1377894 | 



NOV3 6a 

NOV3 6b 

gi | 16553903 | 

gi 1 13626134 j 

gi|3377485| 

gi 1 1705549 j 

gi | 1377894 j 



NOV3 6a 
NOV3 6b 
gi|l6553903 | 
gi j 13626134 | 
gi | 3377485 
gi | 1705549 
gi | 1377894 




8 

PEIgfH-tijP 535 
3PEipiHP||p 536 
IpElflH-IlP 539 
PEllfe-fciP 53 9 




560 

| | . 1 

iLS- PSRP^PPRHAPl 
,|-PSRP§PPRHAP| 

__T I'-pkscp 

yLlRREGF"|RQ|CQDL 
~*R|jGRGVFSRQiCQDL 
ARRGGFSRQKQDF 
f^YARRGGF§RQKQDL 



580 



590 



600 




YLgPI 

ylSpi 



660 





700 



710 



LQNPDG 
LQNPDG 



Pl ARRljvLPRAgVSRQPRPraGPA^A 7C 
'§GPA^A 7C 



P PARRl}vLPRAgVSRQPRp| 



EAFDig LQNPDG 
EAFDIgC LQNPDG 
EAFDIg LQNPDG 
EAFDlW LQNPDG 



690 

• I 

AAPR 

AAPR 
AIAPSl 

INGFIgRKDIpPlYQYMPg- PGLRP. 

ingfmHrkdikpefqynpHdigirr 

INGFi||RKDI£p|IyQYMpH- PGLRP. 

ingfiHrkdi j|p|;yqympS- pglrp. 



730 



740 



704 QLpALj^LJ 
7 04 QL|jALg 
489 




750 



PPYDSffiQjjYGYEGRGS 

ppydsSqBygyegrgs 



4 



760 



SLSSLgS 
SLSSLSS 



770 

..|....|. 

g|eaggapgp. 
g|eaggapgp. 



DPjgPPYDSOQgYGYEGRGS 

dp^ppydsBq* YGYEGRGS 

DpHgPPYDSgQj YGYEGRGS 
DpBSpPYDSMoI YGYEGRGS 






781 
493 
FDDDS 7 92 
ITCEDDS 794 
liTFDDDS 7 96 
FDDDS 7 96 



315 



II ii ji U. (I «H" i- 1) Ij il" 5 



STi k 



Tables 36G-P list the domain description from DOMAIN analysis results against 
NOV36. This indicates that the NOV36 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 36G. Domain Analysis of NOV36 

gnl | Pfam|pfam01049 / Cadherin_C_term, Cadherin cytoplasmic region. 
Cadherins are vital in cell -cell adhesion during tissue 

differentiation. Cadherins are linked to the cytoskeleton by catenins . 
Catenins bind to the cytoplasmic tail of the cadherin. Cadherins 
cluster to form foci of homophilic binding units. A key determinant to 
the strength of the binding that it is mediated by cadherins is the 
juxtamembrane region of the cadherin. This region induces clustering 
and also binds to the protein pl20ctn. (SEQ ID NO:827) 
CD-Length = 150 residues, 98.7% aligned 
Score = 99.8 bits (247), Expect = 5e-22 



10 



15 



20 



25 



NOV36 : 
Sbjct : 


625 
1 


RRQKQEALMVLEEEDVRENI I TY DDEGGGEEHTRAFD1TALQNPDGAAPPAPGPPARRDV 

1 | + | + | |++ l + l l + l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l + l 1 M + l I++ 1 1 

RRRKKEPLIIDEDEDIRENI INYDDEGGGEEDTDAFDISALRSGGNPKPIEELKLRRDIK 


684 
60 


NOV3 6 : 
Sbjct : 


685 
61 


LPPJ\RVSRQPRPPGPADVAQLLALRLREADEDPGVPPYDSVQVYGY£GJ?GSSCGSLSSLG 

+ 1 III 1 l + l - II llllkl M II + 1 

PELQSLPRPRRPPAPDDIADFINEKLKEADNDPTAPPYDSLQTYAY- -EGSGSVAGSLSS 


744 
118 


NOV3 6 : 
Sbjct : 


745 
119 


SGSE'AGGAPGPABPLDDWGPLFRTLAELYG 774 

1 + + \+\\\\ 1+ II++II 
LNS S TTD S DQD YD YLNDWG P R F KKLADM YG 14 8 








Table 36H. Domain Analysis of NOV36 






gnl | Pfam|pfam00028, cadherin, Cadherin domain (SEQ ID NO:828) 
CD-Length = 92 residues, 97.8% aligned 
Score = 81.3 bits (199), Expect = 2e-16 




NOV36 : 


379 


YHLTVPENKAPGTLVGQ I S AADLDS P A - S P I R YS I LPHSD PERCFS I QPEEGT I HTAAPL 

I +MII II 1 ++I 1 1 1 MM 1 1 1 1+ 1 + 1 M 

YSASVPENAPVGTEVLTVTATDADLGPNGRI FYS I LGG - GPGGWFRI DPDTGDLSTTKPL 


437 


Sbjct : 


1 


59 


NOV36: 


438 


DREARAWHNLTVLATELGEDSQASRVQVAIQ 4 68 

111+ + IIIMI+ 1 + M 

DRESIGEYELTVLATDSGGPPLSGTTTVTIT 90 




Sbjct : 


60 





Table 36L Domain Analysis of NOV36 

gnl|Pfam|pfam00028, cadherin, Cadherin domain (SEQ ID NO:828) 
CD-Length = 92 residues, 100.0% aligned 
Score =72.8 bits (177), Expect = 7e-14 



30 



NOV3 6 : 
Sbjct: 



264 YQFS WETAGPGTLVGRLRAQDPDLGDNALMAYS I LDGEGSEAFS I STDLQGRDGLLTVR 323 

I || I I III + I I Ml I + MM I Ml 11+ 
1 YSASVPENAPVGTEVLTVTATDADLGPNGRI FYS I LGGGPGGWFRIDPD TGDLSTT 5 6 



NOV36 : 324 KVLDFESQRS YS FRVEATNTL I DPAYLRRGPFKDVASVRVAVQ 366 



316 



Sbjct: 57 



I II II I I II+ + I + l + I 

KPLDRES I GEYELTVLATDSGGPPLS GTTTVT I TVL 



92 



Table 36 J. Domain Analysis of NOV36 

gnl | Pfam| pf am00028 , cadherin, Cadherin domain (SEQ ID NO -.828) 
CD-Length = 92 residues, 85.9% aligned 
Score = 63.9 bits (154), Expect = 3e-ll 



10 



NOV36 : 
Sbjct : 
NOV36 : 
Sbjct: 



58 



YHATVPEMSNVGTSVIQVTAHDADDPSYGNSAKLVYTVLDGLP- - FFSVDPQTGWRTAI 212 

I MM + III 1+ III III I + ++ I++I II +1 +11 II + I 

YSASVPENAPVGTEVLTVTATDAD LGPNGRIFYSILGGGPGGWFRIDPDTGDLSTTK 57 

PNMDRETQEE FL W I QAKDMGGH 23 5 
I +| I |+ |+ + + | | | | 
P - LDRES I GE YELTVLATDSGGP 7 9 



Table 36K. Domain Analysis of NOV36 

gnl | Pfam|pf am0 0 02 8 , cadherin, Cadherin domain (SEQ ID NO: 82 8) 
CD-Length = 92 residues, 97.8% aligned 
Score = 59.3 bits (142), Expect = 8e-10 



15 



20 



NOV3 6 : 
Sbjct : 
NOV3 6 : 
Sbj Ct : 



52 



111 



63 



FVIEEYAGPEPVLIGKLHSDVDRG- EGRTKYLLTGEGAGTVFVIDEATGNIHVTKSLDRE 

♦ | | + + +1 I I II I + I I I I II II++ M I I I I 

ASVPENAPVGTEVLTVTATDADLGPNGRI FYS ILGGGPGGWFRIDPDTGDLSTTKPLDRE 



110 



62 



EKAQYVLLAQAVDRASNRPLEPPSEFI IKVQ 

+11 II II + II 

SIGEYELTVLATDSGG- PPLSGTTTVTITVL 



141 



92 



Table 36L. Domain Analysis of NOV36 



gnl |Pfam|pfam00028, cadherin, Cadherin domain (SEQ ID NO:828) 
CD-Length = 92 residues, 100.0% aligned 
Score = 44.7 bits (104), Expect = 2e-05 





NOV42 : 


483 


25 


Sbjct : 


1 




NOV42 : 


543 




Sbjct : 


58 



30 



YDTFVCDSAAPGQLIQVIRALDRDEVGNSSHVSFQGPLGPDANFTVQDNRDGSASLLLPS 

| | ++ | I + + I I I I II I + + + I 

YSASVPENAPVGTEVLTVTATDADLGPNGRI FYS ILGGGPGGWFRIDPD TGDLSTTK 



542 



57 



RPAPPRHAFYUJV I ELWDWGQ PAL S S TAT VTVS VC 

I + + I I I II I III++I 
PLDRE SIGEYELTVLATDSGG PPLSGTTTVTITVL 



577 



92 



Table 36M. Domain Analysis of NOV36 

gnl | Smart | smart00112 , CA, Cadherin repeats.; Cadherins are 
glycoproteins involved in Ca2+-mediated cell -cell adhesion. Cadherin 
domains occur as repeats in the extracellular regions which are 
thought to mediate cell -cell contact when bound to calcium. (SEQ ID 
NO: 82 9) 

CD-Length = 82 residues, 100.0% aligned 
Score = 79.7 bits (195), Expect = 6e-16 



317 
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i nnypm . firs 3.1 np 

• • 





NOV3 6 : 


396 




Sbj Ct : 


1 


5 


NOV3 6 : 


455 




Sbjct : 


60 



ISAADLDSPA- SPIRYSILPHSDPERCFS IQPEEGTIHTAAPLDREARAWHNLTVLATEL 

+ 11 I II + 1 1 1 1 + MINIM Mill +++ III 11 + 

VSATDADSGENGKVTYSILS-GNDGGLFSIDPETGI ITTTKPLDREEQSEYTLTVEATDG 



GEDSQASRVQVAIQTLDENDNAP 

I +11+11 Mill 

GGP PL S ST ATVT VTVLDVNDNAP 



477 



82 



454 



59 



Table 36N. Domain Analysis of NOV36 

gnl I Smart | smart 00 112 , CA, Cadherin repeats.; Cadherins are 
glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin 
domains occur as repeats in the extracellular regions which are 
thought to mediate cell-cell v contact when bound to calcium. (SEQ ID 
NO: 82 9) 

CD-Length = 82 residues, 96.3% aligned 
Score = 73.9 bits (180), Expect = 3e-14 



10 NOV3 6: 7 0 SDVDRGE-GRTKYLLTGEGAGTVFVIDEATGNIHVTKSLDREEKAQYVLLAQAVDRASNR 

+| I M |+ I + I +111 II I II IIIII+++I I +1 I 

Sbjct : 4 TDADSGENGKVTYS I LSGNDGGLFS I DPETGI I TTTKPLDREEQSEYTLTVEATDGGGP - 



128 



62 



15 



NOV3 6: 12 9 



Sbjct: 63 



PLEPPSEFIIKVQDINDNPP 

II + + I l + MI I 
PLS S T ATVT VTVLDVNDNAP 



148 



82 



Table 360. Domain Analysis of NOV36 

gnl I Smart | smart00112, CA, Cadherin repeats.; Cadherins are 
glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin 
domains occur as repeats in the extracellular regions which are 
thought to mediate cell -cell contact when bound to calcium. (SEQ ID 
NO: 82 9) 

CD-Length = 82 residues, 98.8% aligned 
Score = 63.5 bits (153), Expect = 4e-ll 



20 



25 



N0V3 6 : 


281 


Sbjct : 


1 


NOV3 6 : 


341 


Sbjct : 


57 



LRAQDPDLGDNALMAYS ILDGEGSEAFS I STDLQGRDGLLTVRKVLDFESQRSYSFRVEA 340 

+ I I I 1+1 + Mil I III + I++I I II I I 1+ III 

VSATDADSGENGKVTYS I LSGNDGGLFS I DPE TGI ITTTKPLDREEQSEYTLTVEA 5 6 



TNTLIDPAYLRRGPFKDVASVRVAVQDAPEPP 

1+ I l+l I I I + 

TDGGGP PL S S T AT VT VT VL D VNDNA 



372 



81 



Table 36P. Domain Analysis of NOV36 

gnl | Smart | smart00112, CA, Cadherin repeats.,- Cadherins are 
glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin 
domains occur as repeats in the extracellular regions which are 
thought to mediate cell- cell contact when bound to calcium. (SEQ ID 
NO: 829) 

CD-Length = 82 residues, 74.4% aligned 

Score = 55.8 bits (133), Expect = 9e-09 



30 



NOV3 6: 172 



Sbjct : 



VTAHDADDPSYGNSAKLVYTVLDGLPF- - FS VDPQTG WRTAI PNMDRETQEE FLWI QA 22 9 

|+| IN | |+ I++I I II+II+II++ I I +111 I 1+ + ++I 

VSATDADSGENG KVTYS I LSGNDGGLFS I DPETG I ITTTKP - LDREEQSEYTLTVEA 5 6 



NOV36: 230 KDMGG 234 
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I II 

Sbjct: 57 TDGGG 61 
Sbjct: 57 TDGGG €1 

5 Cadherins, first discovered in mouse teratocarcinoma cells, are a family of animal 

glycoproteins responsible for calcium-dependent cell-cell adhesion. Cadherins preferentially 
interact with themselves in a homophilic manner in connecting cells; thus acting as both 
receptor and ligand. There are a number of different isoforms distributed in a tissue-specific 
manner in a wide variety of organisms. Cells containing different cadherins tend to segregate 

10 in vitro, while those that contain the same cadherins tend to preferentially aggregate together. 
This observation is linked to the finding that cadherin expression causes morphological 
changes involving the positional segregation of cells into layers, suggesting they may play an 
important role in the sorting of different cell types during morphogenesis, histogenesis and 
regeneration. They may also be involved in the regulation of tight and gap junctions, and in 

1 5 the control of intercellular spacing. Cadherins are evolutionary related to the desmogleins 
which are component of intercellular desmosome junctions involved in the interaction of 
plaque proteins. The first three cadherins to be described were E-cadherin is present on many 
types of epithelial cells; N-cadherin on nerve, muscle, and lens cells; and P-cadherin on cells 
in the placenta and epidermis. 

20 The NOV36 proteins bear close resemblance to cadherin- 1 1 , a member of the cadherin 

family of proteins, expressed in osteoblasts. The tissue expression in brain, uterus and retina of 
these NOV36 proteins indicate they might play an important role during organogenesis and 
development. 

The disclosed NOV36 nucleic acid of the invention encoding a cadherin 1 1-like 
25 protein includes the nucleic acid whose sequence is provided in Table 36A, 36C or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 36A or 36C while still encoding a 
protein that maintains its cadherin 1 1-like activities and physiological functions, or a fragment 
of such a nucleic acid. The invention further includes nucleic acids whose sequences are 
30 complementary to those just described, including nucleic acid fragments that are 

complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of non-limiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
35 derivatized. These modifications are carried out at least in part to enhance the chemical 

stability of the modified nucleic acid, such that they may be used, for example, as antisense 

319 




binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, in one embodiment up to about 35% of the NOV36a residues 
may be so changed and in an additional embodiment up to about 1% of the NOV36b residues 
may be so changed. 

5 The disclosed NOV36 protein of the invention includes the cadherin 1 1 -like protein 

whose sequence is provided in Table 36B or 36D. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 36B or 36D while still encoding a protein that maintains its cadherin 1 1-like activities 
and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 

10 up to about 43% of the NOV36a and NOV36b bases may be so changed. 

The above defined information for this invention suggests that these cadherin 1 1-like 
proteins (NOV36) is a member of a "cadherin 1 1 family". Therefore, the NOV36 nucleic acids 
and proteins identified here may be useful in potential therapeutic applications implicated in 
(but not limited to) various pathologies and disorders as indicated below. The potential 

15 therapeutic applications for this invention include, but are not limited to: protein therapeutic, 
small molecule drug target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 
research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing 
(but not limited to) those defined here. 

20 The nucleic acids and proteins of NOV36 are useful in Von Hippel-Lindau (VHL) 

syndrome, cirrhosis, transplantation, endometriosis, fertility, anemia, ataxia-telangiectasia, 
autoimmune disease, hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, 
allergies, immunodeficiencies, graft versus host disease (GVHD), lymphaedema, muscular 
dystrophy, Lesch-Nyhan syndrome, myasthenia gravis, and/or other pathologies and disorders. 

25 NOV36 nucleic acids and polypeptides are further useful in the generation of 

antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. For example the disclosed NOV36 protein have multiple 

30 hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 
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NOV37 



A disclosed NOV37 nucleic acid of 8575 nucleotides (also referred to as CG56733-01) 
encoding a novel Ten-M2-like protein is shown in Table 37A. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 199-201 and ending with a 
5 TAA codon at nucleotides 8476-8478. Putative untranslated regions, if any, found upstream 
from the initiation codon and downstream from the termination codon are underlined in Table 
37A, and the start and stop codons are in bold letters. 



TAAAGTACCTGTCATCTTGACAAGTGGCGGAGCGGAGGAGTCAAGGATTATAAATGATCACAGCCAGGTCC 

AGCTCGCCCCGTGATTGGGCTCTCCCGCGATCTGCACCGGGGGAAGCGCATGAGAGGCCAATGAGACTTGA 

ACCCTGAGCCTAAGTTGTCACCAGCAGGACTGATGTGCACACAGAAGGAATGAAGT ATGGATGTGAAAGAA 

CGCAGGCCTTACTGCTCCCTGACCAAGAGCAGACGAGAGAAGGAACGGCGCTACACAAATTCCTCCGCAGA 

CAATGAGGAGTGCCGGGTACCCACACAGAAGTCCTACAGTTCCAGCGAGACATTGAAAGCTTTTGATCATG 

ATTCCTCGCGGCTGCTTTACGGCAACAGAGTGAAGGATTTGGTTCACAGAGAAGCAGACGAGTTCACTAGA 

C AAAG C AGGATGC ACT AT GGAAAC CG AGT CAC AG AC C T C ATC C AC CGGGAGTC AGATG AGTT T C CT AGAC A 

AGGTATCCTTCACCAGGGCTACTCCCTTAGCACAGGGTCTGACGCCGACTCCGACACCGAGGGAGGGATGT 

CTCCAGAACACGCCATCAGACTGTGGGGCAGAGGGATAAAATCCAGGCGCAGTTCCGGCCTGTCCAGTCGT 

GAAAACTCGGCCCTTACCCTGACTGACTCTGACAACGAAAACAAATCAGATGATGAGAACGGTCGTCCCAT 

TCCACGTACATCCTCGCGTAGTCTCCTCCCATTTGTTCAGCTGCCTAGCTCCCATAATCCTCCACCAGTTA 

GCTGCCAGATGCCATTGCTAGACAGCAACACCTCCCATCAAATCATGGACACCAACCCTGATGAGGAATTC 

TCCCCCAATTCATACCTGCTCAGAGCATGCTCAGGGCCCCAGCAAGCCTCCAGCAGTGGTCCTCCGAACCA 

CCACAGCCAGTCGACTCTGAGGCCCCCTCTCCCACCCCCTCACAACCACACGCTGTCCCATCACCACTCGT 

CCGCCAACTCCCTCAACAGGAACTCACTGACCAATCGGCGGAGTCAGATCCACGCCCCGGCCCCAGCGCCC 

AATGACCTGGCCACCACACCAGAGTCCGTTCAGCTTCAGGACAGCTGGGTGCTAAACAGCAACGTGCCACT 

GGAGACCCGGCACTTCCTCTTCAAGACCTCCTCGGGGAGCACACCCTTGTTCAGCAGCTCTTCCCCGGGAT 

ACCCTTTGACCTCAGGAACGGTTTACACGCCCCCGCCCCGCCTGCTGCCCAGGAATACTTTCTCCAGGAAG 

GCTTTCAAGCTGAAGAAGCCCTCCAAATACTGCAGCTGGAAATGTGCTGCCCTCTCCGCCATTGCCGCGGC 

CCTCCTCTTGGCTATTTTGCTGGCGTATTTCGCAGCAATGCATCTGCTCGGACTCAATTGGCAACTCCAGC 

CTGCAGATGGGCACACCTTTAACAATGGGATAAGGACCGGCTTACCAGGAAACGATGATGTGGCAACAATG 

CCATCTGGAGGCAAAGTGCCCTGGTCGTTGAAAAACAGCAGCATAGACAGTGGTGAAGCAGAAGTTGGTCG 

GCGGGTAACACAAGAAGTCCCACCAGGGGTGTTTTGGAGGTCACAAATTCACATCAGTCAGCCCCAGTTCT 

TAAAGTTCAACATCTCCCTCGGGAAGGACGCTCTCTTTGGTGTTTACATAAGAAGAGGACTTCCACCATCT 

CATGCCCAGTATGACTTCATGGAACGTCTGGACGGGAAGGAGAAGTGGAGTGTGGTTGAGTCTCCCAGGGA 

ACGCCGGAGCATACAGACCTTGGTTCAGAATGAAGCCGTGTTTGTGCAGTACCTGGATGTGGGCCTGTGGC 

ATCTGGCCTTCTACAATGATGGAAAAGACAAAGAGATGGTTTCCTTCAATACTGTTGTCCTAGATTCAGTG 

CAGGACTGTCCACGTAACTGCCATGGGAATGGTGAATGTGTGTCCGGGGTGTGTCACTGTTTCCCAGGATT 

TCTAGGAGCAGACTGTGCTAAAGCTGCCTGCCCTGTCCTGTGCAGTGGGAATGGACAATATTCTAAAGGGA 

CGTGCCAGTGCTACAGCGGCTGGAAAGGTGCAGAGTGCGACGTGCCCATGAATCAGTGCATCGATCCTTCC 

TGCGGGGGCCACGGCTCCTGCATTGATGGGAACTGTGTCTGCTCTGCTGGCTACAAAGGCGAGCACTGTGA 

GGAAGTTGATTGCTTGGATCCCACCTGCTCCAGCCACGGAGTCTGTGTGAATGGAGAATGCCTGTGCAGCC 

CTGGCTGGGGTGGTCTGAACTGTGAGCTGGCGAGGGTCCAGTGCCCAGACCAGTGCAGTGGGCATGGCACG 

TACCTGCCTGACACGGGCCTCTGCAGCTGCGATCCCAACTGGATGGGTCCCGACTGCTCTGTTGTGTGCTC 

AGTAGACTGTGGCACTCACGGCGTCTGCATCGGGGGAGCCTGCCGCTGTGAAGAGGGCTGGACAGGCGCAG 

CGTGTGACCAGCGCGTGTGCCACCCCCGCTGCATTGAGCACGGGACCTGTAAAGATGGCAAATGTGAATGC 

CGAGAGGGCTGGAATGGTGAACACTGCACCATTGATGGCTGCCCTGACTTGTGCAACGGTAACGGGAGATG 

CACACTGGGTCAGAACAGCTGGCAGTGTGTCTGCCAGACCGGCTGGAGAGGGCCCGGATGCAACGTTGCCA 

TGGAAACTTCCTGTGCTGATAACAAGGATAATGAGGGAGATGGACTCATTGACTGCATGGATCCCGATTGC 

TGCCTACAGAGTTCCTGCCAGAATCAGCCCTATTGTCGGGGACTGCCGGATCCTCAGGACATCATTAGCCA 

AAGCCTTCAATCGCCTTCTCAGCAAGCTGCCAAATCCTTTTATGATCGAATCAGTTTCCTTATAGGATCTG 

ATAGCACCCATGTTATACCTGGAGAAAGTCCTTTCAATAGCTTGGTTTCTCTCATCCGAGGCCAAGTAGTA 

ACTACAGATGGAACTCCCCTGGTCGGTGTGAACGTGTCTTTTGTCAAGTACCCAAAATACGGCTACACCAT 

CACCCGCCAGGATGGCACGTACTCCCTCTCCAGGTTCGACCTGATCGCAAATGGAGGTGCTTCCTTGACTC 

TACACTTTGAGCGAGCCCCGTTCATGAGCCAGGAGCGCACTGTGTGGCTGCCGTGGAACAGCTTTTACGCC 

ATGGACACCCTGGTGATGAAGACCGAGGAGAACTCCATCCCCAGTTGTGACCTCAGTGGTTTTTGTCGGCT 

TGATCCCATCATCATCTCCTCCCCTCTGTCCACTTTCTTTAGTGCTGCCCCTGGGCAGAATCCCATCGTGC 

CTGAGACCCAGGTACTTCATGAAGAAATCGAGCTCCCTGGTTCCAATGTGAAACTTCGCTATCTGAGCTCT 

AGAACTGCAGGGTACAAGTCACTGCTGAAGATCACCATGACCCAGTCCACAGTGCCCCTGAACCTCATTAG 

GGTTCACCTGATGGTGGCTGTCGAGGGGCATCTCTTCCAGAAGTCATTCCAGGCTTCTCCCAACCTGGCCT 

ACACCTTCATCTGGGACAAGACAGATGCGTATGGCCAAAGGGTGTATGGACTCTCAGATGCTGTTGGTATG 

TTTTGGTTTCAAAGGACAGCCCTCCTTCAGGGATTCGAGCTGGACCCCTCCAACCTCGGTGGCTGGTCCCT 

AG AC AAAC AC CAC AT C CT C AATGTT AAAAGTGGT AT C CT AC AC AAAGGC ACTGGGG AAAAC C AGT T C CT G A 



Table 37A. NOV37 Nucleotide Sequence (SEQ ID NO:143) 
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CCCAGCAGCCTGCCATCATCACCAGCATCATGGGCAATGGTCGCCGCCGGAGCATTTCCTGTCCCAGCTGC 
AACGGCCTTGCTGAAGGCAACAAGCTGCTGGCCCCAGTGGCTCTGGCTGTTGGAATCGATGGGAGCCTCTA 
TGTGGGTGACTTCAATTACATCCGACGCATCTTTCCCTCTCGAAATGTGACCAGCATCTTGGAGTTACGGA 
GAAATAAAGAGTTTAAACATAGCAACAACCCAGCACACAAGTACTACTTGGCAGTGGACCCCGTGTCCGGC 
TCGCTCTACGTGTCCGACACCAACAGCAGGAGAATCTACCGCGTCAAGTCTCTGAGTGGAACCAAAGACCT 
GGCTGGGAATTCGGAAGTTGTGGCAGGGACGGGAGAGCAGTGTCTACCCTTTGATGAAGCCCGCTGCGGGG 
ATGGAGGGAAGGCCATAGATGCAACCCTGATGAGCCCGAGAGGTATTGCAGTAGACAAGAATGGGCTCATG 
TACTTTGTCGATGCCACCATGATCCGGAAGGTTGACCAGAATGGAATCATCTCCACCCTGCTGGGCTCCAA 
TGACCTCACTGCCGTCCGGCCGCTGAGCTGTGATTCCAGCATGGATGTAGCCCAAGTTCGTCTGGAGTGGC 
CAACAGACCTTGCTGTCAATCCCATGGATAACTCCTTGTATGTTCTAGAGAACAATGTCATCCTTCGAATC 
ACCGAGAACCACCAAGTCAGCATCATTGCGGGACGCCCCATGCACTGCCAAGTTCCTGGCATTGACTACTC 
ACTCAGCAAACTAGCCATTCACTCTGCCCTGGAGTCAGCCAGTGCCATTGCCATTTCTCACACTGGGGTCC 
TCTACATCACTGAGACAGATGAGAAGAAGATTAACCGTCTACGCCAGGTAACAACCAACGGGGAGATCTGC 
CTTTTAGCTGGGGCAGCCTCGGACTGCGACTGCAAAAACGATGTCAATTGCAACTGCTATTCAGGAGATGA 
TGCCTACGCGACTGATGCCATCTTGAATTCCCCATCATCCTTAGCTGTAGCTCCAGATGGTACCATTTACA 
TTGC AGACCT TGG AAAT ATT CGG AT C AGGGCGGT C AGC AAG AAC AAGC CTGTT CTT AATGC CTT C AACC AG 
TATGAGGCTGCATCCCCCGGAGAGCAGGAGTTATATGTTTTCAACGCTGATGGCATCCACCAATACACTGT 
GAGCCTGGTGACAGGGGAGTACTTGTACAATTTCACATATAGTACTGACAATGATGTCACTGAATTGATTG 
ACAATAATGGGAATTCCCTGAAGATCCGTCGGGACAGCAGTGGCATGCCCCGTCACCTGCTCATGCCTGAC 
AACCAGATCATCACCCTCACCGTGGGCACCAATGGAGGCCTCAAAGTCGTGTCCACACAGAACCTGGAGCT 
TGGTCTCATGACCTATGATGGCAACACTGGGCTCCTGGCCACCAAGAGCGATGAAACAGGATGGACGACTT 
TCTATAGCTATGACCACGAAGGCCGCCTGACCAACGTGACGCGCCCCACGGGGGTGGTAACCAGTCTGCAC 
CGGGAAATGGAGAAATCTATTACCATTGACATTGAGAACTCCAACCGTGATGATGACGTCACTGTCATCAC 
CAACCTCTCTTCAGTAGAGGCCTCCTACACAGTGGTACAAGATCAAGTTCGGAACAGCTACCAGCTCTGTA 
ATAATGGTACCCTGAGGGTGATGTATGCTAATGGGATGGGTATCAGCTTCCACAGCGAGCCCCATGTCCTA 
GCGGGCACCATCACCCCCACCATTGGACGCTGCAACATCTCCCTGCCTATGGAGAATGGCTTAAACTCCAT 
TGAGTGGCGCCTAAGAAAGGAACAGATTAAAGGCAAAGTCACCATCTTTGGCAGGAAGCTCGAGGTCCATG 
GAAGAAATCTCTTGTCCATTGACTATGATCGAAATATTCGGACTGAAAAGATCTATGATGACCACCGGAAG 
TTCACCCTGAGGATCATTTATGACCAGGTGGGCCGCCCCTTCCTCTGGCTGCCCAGCAGCGGGCTGGCAGC 
TGTCAACGTGTCATACTTCTTCAATGGGCGCCTGGCTGGGCTTCAGCGTGGGGCCATGAGCGAGAGGACAG 
ACATCGACAAGCAAGGCCGCATCGTGTCCCGCATGTTCGCTGACGGGAAAGTGTGGAGCTACTCCTACCTT 
GACAAGATGGTCCTCCTGCTTCAGAGCCAACGTCAGTATATATTTGAGTATGACTCCTCTGACCGCCTCCT 
TGCCGTCACCATGCCCAGCGTGGCCCGGCACAGCATGTCCACACACACCTCCATCGGCTACATCCGTAATA 
TTTACAACCCGCCTGAAAGCAATGCTTCGGTCATCTTTGACTACAGTGATGACGGCCGCATCCTGAAGACC 
TCCTTTTTGGGCACCGGACGCCAGGTGTTCTACAAGTATGGGAAACTCTCCAAGTTATCAGAGATTGTCTA 
CG AC AGT AC C GC CGTC AC C T T CGGGT ATG ACG AG AC C ACTGGTGT CTTGAAG ATGGTC AAC CT C C AAAGTG 
GGGGCTTCTCCTGCACCATCAGGTACCGGAAGATTGGCCCCCTGGTGGACAAGCAGATCTACAGGTTCTCC 
GAGGAAGGCATGGTCAATGCCAGGTTTGACTACACCTATCATGACAACAGCTTCCGCATCGCAAGCATCAA 
GCCCGTCATAAGTGAGACTCCCCTCCCCGTTGACCTCTACCGCTATGATGAGATTTCTGGCAAGGTGGAAC 
ACTTTGGTAAGTTTGGAGTCATCTATTATGACATCAACCAGATCATCACCACTGCCGTGATGACCCTCAGC 
AAACACTTCGACACCCATGGGCGGATCAAGGAGGTCCAGTATGAGATGTTCCGGTCCCTCATGTACTGGAT 
GACGGTGCAATATGACAGCATGGGCAGGGTGATCAAGAGGGAGCTAAAACTGGGGCCCTATGCCAATACCA 
CGAAGTACACCTATGACTACGATGGGGACGGGCAGCTCCAGAGCGTGCCGGCCGTCAATGACCGCCCGACC 
TGGCGCTACAGCTATGACCTTAATGGGAATCTCCACTTACTGAACCCAGGCAACAGTGTGCGCCTCATGCC 
CTTGCGCTATGACCTCCGGGATCGGATAACCAGACTCGGGGATGTGCAGTACAAAATTGACGACGATGGCT 
ATCTGTGCCAGAGAGGGTCTGACATCTTCGAATACAATTCCAAGGGCCTCCTAACAAGAGCCTACAACAAG 
GCCAGCGGGTGGAGTGTCCAGTACCGCTATGATGGCGTAGGACGGCGGGCTTCCTACAAGACCAACCTGGG 
C C ACC AC CTG C AG T ACT T CT ACTCTG ACCTCC AC AAC C CG ACGCGC AT C AC CC ATGTCT AC AAT C ACTC CA 
ACTCGGAGATTACCTCACTGTACTACGACCTCCAGGGCCACCTCTTTGCCATGGAGAGCAGCAGTGGGGAG 
GAGTACTATGTTGCCTCTGATAACACAGGGACTCCTCTGGCTGTGTTCAGCATCAACGGCCTCATGATCAA 
ACAGCTGCAGTACACGGCCTATGGGGAGATTTATTATGACTCCAACCCCGACTTCCAGATGGTCATTGGCT 
TCCATGGGGGACTCTATGACCCCCTGACCAAGCTGGTCCACTTCACTCAGCGTGATTATGATGTGCTGGCA 
GGACGATGGACCTCCCCAGACTATACCATGTGGAAAAACGTGGGCAAGGAGCCGGCCCCCTTTAACCTGTA 
TATGTTCAAGAGCAACAATCCTCTCAGCAGTGAGCTAGATTTGAAGAACTACGTGACAGATGTGAAAAGCT 
GGCTTGTGATGTTTGGATTTCAGCTTAGCAACATCATTCCTGGCTTCCCGAGAGCCAAAATGTATTTCGTG 
CCTCCTCCCTATGAATTGTCAGAGAGTCAAGCAAGTGAGAATGGACAGCTCATCACAGGTGCCCACCAGAC 
AACAGAGAGACATAACCAGGCTTTCATGGCTCTGGAAGGACAGGTCATTACTAAAAAGCTCCACGCCAGCA 
TCCGAGAGAAAGCAGGTCATTGGTTTGCCACCACCACGCCCATCATTGGCAAAGGCATCATGTTTGCCATC 
AAAGAAGGGCGGGTGACCACGGGCGTGTCCAGCATCGCCAGCGAAGATAGCCGCAAGGTGGCATCTGTGCT 
GAACAACGCCTACTACCTGGACAAGATGCACTACAGCATCGAGGGCAAGGACACCCACTACTTTGTGAAGA 
TTGGCTCAGCCGATGGCGACCTGGTCACACTAGGCACCACCATCGGCCGCAAGGTGCTAGAGAGCGGGGTG 
AACGTGACCGTGTCCCAGCCCACGCTGCTGGTCAACGGCAGGACTCGAAGGTTCACGAACATTGAGTTCCA 
GTACTCCACGCTGCTGCTCAGCATCCGCTATGGCCTCACCCCCGACACCCTGGACGAAGAGAAGGCCCGCG 
TCCTGGACCAGGCGAGACAGAGGGCCCTGGGCACGGCCTGGGCCAAGGAGCAGCAGAAAGCCAGGGACGGG 
AGAGAGGGGAGCCGCCTGTGGACTGAGGGCGAGAAGCAGCAGCTTCTGAGCACCGGGCGCGTGCAAGGGTA 
CGAGGGATATTACGTGCTTCCCGTGGAGCAATACCCAGAGCTTGCAGACAGTAGCAGCAACATCCAGTTTT 
TAAGACAGAATGAGATGGGAAAGAGGTAA CAAAATAATCTGCTGCCATTCCTTGTCTGAATGGCTCAGCAG 
GAGTAACTGTTATCTCCTCTCCTAAGGAGATGAAGACCTAACAGGGGCACTGCGG 
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The NOV37 nucleic acid, located on chromosome 5, has 4965 of 5004 bases (99%) 
identical to a gb:GENBANK-ID:AB032953|acc:AB032953.1 mRNA from Homo sapiens 
(mRNA for KIAA1 127 protein, partial cds) (E = 0.0). 

A disclosed NOV37 polypeptide (SEQ ID NO: 144) encoded by SEQ ID NO: 143 is 
5 2759 amino acid residues and is presented using the one-letter code in Table 37B. Signal P, 
Psort and/or Hydropathy results predict that NOV37 does not contain a signal peptide and is 
likely to be localized extracellularly with a certainty of 0.7900 in one embodiment, to the 
plasma membrane with a certainty of 0.7900 in another embodiment and to the nucleus with a 
certainty of 0.6000 in an additional embodiment. 



MDVKERRPYCSLTKSRREKERRYTNSSADNEECRVPTQKSYSSSETLKAFDHDSSRLLYGNRVKDLVHREAD 
EFTRQSRMHYGNRVTDLIHRESDEFPRQGILHQGYSLSTGSDADSDTEGGMSPEHAIRLWGRGIKSRRSSGL 
SSRENSALTLTDSDNENKSDDENGRPIPRTSSRSLIiPFVQLPSSHNPPPVSCQMPLLDSNTSHQIMDTNPDE 
EFSPNSYLLRACSGPQQASSSGPPNHHSQSTLRPPLPPPHNHTLSHHHSSANSLNRNSLTNRRSQIHAPAPA 
PNDLATTPESVQLQDSWVLNSNVPLETRHFLFKTSSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRK 
AFKLKKPS K YCS WKCAAL S AI AAALLLAI LLAYFAAMHLLGLNWQLQP ADGHT FNNG I RTGL PGNDDVATMP 
SGGKVPWSLKNSSIDSGEAEVGRRVTQEVPPGVFWRSQIHISQPQFLKFNISLGKDALFGVYIRRGLPPSHA 
QYDFMERLDGKEKWSWESPRERRSIQTLVQNEAVFVQYLDVGLWHLAFYTSTOGKDKEMVSFNTVVLDSVQDC 
PRNCHGNGECVSGVCHCFPGFLGADCAKAACPVLCSGNGQYSKGTCQCYSGWKGAECDVPMNQCIDPSCGGH 
GSCIDGNCVCSAGYKGEHCEEVDCLDPTCSSHGVCVNGECLCSPGWGGLNCELARVQCPDQCSGHGTYLPDT 
GLCSCDPNWMGPDCSWCSVDCGTHGVCIGGACRCEEGWTGAACDQRVCHPRCIEHGTCKDGKCECREGWNG 
EHCTIDGCPDLCNGNGRCTLGQNSWQCVCQTGWRGPGCNVAMETSCADNKDNEGDGLIDCMDPDCCLQSSCQ 
NQPYCRGLPDPQDIISQSLQSPSQQAAKSFYDRISFLIGSDSTHVIPGESPFNSLVSLIRGQWTTDGTPLV 
GVNVSFVKYPKYGYTITRQDGTYSLSRFDLIANGGASLTLHFERAPFMSQERTWLPWNSFYAMDTLVMKTE 
ENSIPSCDLSGFCRLDPIIISSPLSTFFSAAPGQNPIVPETQVLHEEIELPGSNVKLRYLSSRTAGYKSLLK 
I TMTQSTVPLNL I RVHLMVAVEGHLFQKS FQAS PNL A YT F I WD KT D AYG QR VYG L S D AVGM F W FQRT ALL QG 
FELDPSNLGGWSLDKHHILNVKSGILHKGTGENQFLTQQPAIITSIMGNGRRRSISCPSCNGLAEGNKLLAP 
VALAVGIDGSLYVGDFNYIRRIFPSRNVTSILELRRNKEFKHSNNPAHKYYLAVDPVSGSLYVSDTNSRRIY 
RVKSLSGTKDLAGNSEWAGTGEQCLPFDEARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDATMIRKVDQN 
G 1 1 STLLGSNDLTAVRPLSCDS S MD VAQ VR LEW PTDL A VNPMDNS L YVL ENNV I LRI TENHQVS 1 1 AGRPMH 
CQVPGIDYSLSKLAIHSALESASAIAISHTGVLYITETDEKKINRLRQVTTNGEICLLAGAASDCDCKNDVN 
CNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPGEQELYVFNAD 
GIHQYTVSLVTGEYLYNFTYSTDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKWS 
TQNLELGLMTYDGNTGLLATKSDETGWTTFYSYDHEGRLTNVTRPTGWTSLHREMEKSITIDIENSNRDDD 
VT V I TNL S S VE AS Y TWQDQVRNS YQL CNNGT L R VM Y ANGMG I S FH S E P H V L AGT I T PT I GR CN I S L PMENG 
LNSIEWRLRKEQIKGKVTIFGRKLEVHGRNLLSIDYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSG 
LAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFADGKVWSYSYLDKMVLLLQSQRQYIFEYDSSDR 
LLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIV 
YDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKIGPLVDKQIYRFSEEGMVNARFDYTYHDNSFRIASIK 
PVISETPLPVDLYRYDEISGKVEHFGKFGVI YYDINQI ITTAVMTLSKHFDTHGRIKEVQYEMFRSLMYWMT 
VQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVPAVNDRPTWRYSYDLNGNLHLLNPGNSVRLMPLR 
YDLRDRITRLGDVQYKIDDDGYLCQRGSDIFEYNSKGLLTRAYNKASGWSVQYRYDGVGRRASYKTNLGHHL 
QYFYSDLHNPTRITHVYNHSNSEITSLYYDLQGHLFAMESSSGEEYYVASDNTGTPLAVFSINGLMIKQLQY 
TAYGEIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDYTMWKNVGKEPAPFNLYMFKSN 
NPLSSELDLKNYVTDVKSWLVMFGFQLSNIIPGFPRAKMYFVPPPYELSESQASENGQLITGAHQTTERHNQ 
AFMALEGQVITKKLHASIREKAGHWFATTTPI IGKGIMFAI KEGRVTTGVSSIASEDSRKVASVLNNAYYLD 
KMHYSIEGKDTHYFVKIGSADGDLVTLGTTIGRKVLESGVNVTVSQPTLLVNGRTRRFTNIEFQYSTLLLSI 
RYGLTPDTLDEEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVE 
QYPELADSSSNIQFLRQNEMGKR 



The disclosed NOV37 amino acid sequence has 2634 of 2764 amino acid residues 
(95%) identical to, and 2679 of 2764 amino acid residues (96%) similar to, the 2764 amino 
acid residue ptnr:SPTREMBL-ACC:Q9WTS5 protein from Mus musculus (Mouse) (TEN- 
M2) (E = 0.0). 



Table 37B. Encoded NOV37 protein sequence (SEQ ID NO:144) 
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NOV37 is predicted to be expressed in at least Amygdala, Brain, Bronchus, Cerebral 
Medulla/Cerebral white matter, Cochlea, Coronary Artery, Epidermis, Hair Follicles, 
Hippocampus, Hypothalamus, Kidney, Left cerebellum, Lung, Lymph node, Parietal Lobe, 
Pineal Gland, Retina, Right Cerebellum, Substantia Nigra, Vulva and Whole Organism. This 
5 information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 

In addition, NOV37 is predicted to be expressed in the following tissues because of the 
expression pattern of (GENBANK-ID: gb:GENBANK-ID:AB032953|acc: AB032953.1) a 
10 closely related Homo sapiens mRNA for KIAA1 127 protein, partial cds homolog in species 

Homo sapiens: Amygdala, Brain, Bronchus, Cerebral Medulla/Cerebral white matter, Cochlea, 
Coronary Artery, Epidermis, Hair Follicles, Hippocampus, Hypothalamus, Kidney, Left 
cerebellum, Lung, Lymph node, Parietal Lobe, Pineal Gland, Retina, Right Cerebellum and 
Substantia Nigra. 

15 NOV37 also has homology to the amino acid sequences shown in the BLASTP data 

listed in Table 37C. 



Table 37C. BLAST results for NOV37 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 7657415 | ref |NP 0 
35986. 2| 
<NM_011856) 


odd Oz /ten-m 
homolog 2 

(Drosophila) ; odd 
Oz/ten-m homolog 
3 (Drosophila) 

[Mus musculus] 


2764 


2633/277 
7 (94%) 


2677/2777 
(95%) 


0 . 0 


gi | 9910320 | ref |NP_0 
644 73.l| 
(NM_020088) 


neurestin alpha 
[Rattus 
norvegicus] 


2765 


2625/277 
8 (94%) 


2676/2778 
(95%) 


0 . 0 


gi| 10241574 | emb | CAC 
09416 . 1 | (AJ279031) 


teneurin-2 
[Gallus gallus] 


2802 


2525/281 
8 (89%) 


2639/2818 
(93%) 


0 . 0 


gi | 5307761 | dbj | BAA8 
1892. l| (AB026979) 


ten-m3 [Danio 
rerio] 


2590 


1707/260 
2 (65%) 


2097/2602 
(79%) 


0 . 0 


gi | 676036 9 | gb | AAF2 8 
316 . 1 | AF195418 1 
(AF195418) 


ODZ3 [MUS 
musculus] 


2346 


1665/236 
0 (70%) 


1993/2360 
(83%) 


0 . 0 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
20 in Table 37D. 

Table 37D Clustal W Sequence Alignment 

1) NOV37 (SEQ ID NO: 144) 

2) gi | 7657415 | ref | NP_035986 . 2 | (NM_011856) odd Oz/ten-m homolog 2 (Drosophila); odd 
25 Oz/ten-m homolog 3 (Drosophila) [Mus musculus] (SEQ ID NO: 487) 

3) gi | 9910320 | ref | NP_0644 73 . 1 | (NM_020088) neurestin alpha [Rattus norvegicus] (SEQ 
ID NO: 4 88) 
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4) gi | 10241574 | emb | CAC09416 . 1 | (AJ279031) teneurin-2 [Gallus gallus] (SEQ ID 
NO : 4 8 9 ) 

5) gi| 53 07761 |dbj |BAA818 92.l| (AB026979) ten-m3 [Danio rerio] (SEQ ID NO:490) 

6) gi j 6760369 | gb|AAF28316 . 1 |AF195418_1 (AF195418) ODZ3 [Mus musculus] (SEQ ID 
NO : 4 91) 
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NOV37 



gi 
gi 
gi 
gi 



7657415 | 
9910320 j 
10241574 | 
5307761 | 
6760369 | 



NOV37 

gi|7657415| 
gi | 9910320 | 
gi j 10241574 | 
gi | 5307761 | 
gi | 6760369 | 



NOV37 

gi | 7657415 | 
gi | 9910320 | 
gi | 10241574 | 
gi | 5307761 | 
gi | 6760369 | 



NOV37 

gi | 7657415 | 
gi | 9910320 j 
gi j 1024157,4 | 
gi | 5307761 | 
gi | 6760369 | 



NOV37 

gi|7657415| 
gi|9910320| 
gi j 10241574 | 
gi j 5307761 | 
gi | 6760369 | 



61 

59 

59 

59 

1 

1 
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119 

119 

119 

1 

1 
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QKSYSSSETL 
QKSYSSSETL 
QKSYSSSETL 
QKSYSSSETL 
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- S RMHYGNR - gTBL I HR] 
3TN FTLAE LGicgP S PH: 
^ANFTLAELGlcjlPSPH: 
GTN FTLAELGjfc §P S PH! 



100 




110 
. . I . . 



120 



GILHQGYSLSTGSDADSD 
GILHQGYSLSTGSDADSD 
(GILHQGYSLSTGSDADSD 
GILHQGYSLSTGSDADSD 



118 

118 

118 

118 

1 

1 



130 
. . I . . 



140 



150 



teggms pehai rlwgrgi ksgrs sg 
teggmspehairlwgrgiksgrssg 
teggmspehairlwgrgiksHrssg 
teggms pehai rlwgrgi kshrssg 




160 170 180 

|....L...^|....|....| 

167 

167 

167 

FHTHLSEKLK 17 8 

1 

1 



DSDNENKSD 
DSDNENKSD 
DSDNENKSD 
'DSDNENKSD 
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200 



210 



230 



240 
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NOV37 

gi|7657415| 
gi|9910320| 
gi j 10241574 | 
gi|530776l| 
gi j 6760369 | 



NOV37 

gi | 7657415 | 
gi|9910320 j 
gi j 10241574 | 
gi|530776l| 
gi|6760369| 



310 



320 



330 



340 



350 



360 




370 



380 



390 



400 



410 



420 




430 



440 



450 



460 



470 
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NOV37 

gi|7657415 | 
gi | 9910320 j 
gi j 10241574 | 
gi|530776l| 
gi|6760369| 



NOV37 

gi|7657415| 
gi|9910320| 
gi (10241574 | 
gi|530776l| 
gi|6760369| 




, NGKLGGJ^HEjj|IT< 29 



490 



500 



510 



520 



530 



540 




QEVPPG 
QEVPPG 
QEVPPG 
QEVPPG 
QgVPPGj 
QEVPPG] 
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NOV37 

gi |7657415 | 
gi | 9910320 j 
gi j 10241574 | 
gi j 5307761 | 
gi j 6760369 j 
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NOV37 

gi [7657415 | 
gi | 9910320 | 
gi | 10241574 | 
gi | 5307761 | 
gij 6760369 j 



NOV3 7 

gi |7657415 | 
gi | 9910320 | 
gi | 10241574 | 
gi|530776l| 
gi|6760369| 



NOV37 

gi | 7657415 | 
gi | 9910320 j 
gij 10241574 | 
gi|530776l| 
gij 6760369 j 



561 
561 
561 
589 
393 
150 



660 
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EWSFNTWLDSVQDCPRNCHGNGECVSGffiCHCFPGFLGADCAKAACPVIiCSGNGQYSKG 
EivSFNTWLDSVQDCPRNCHGNGECVSGBcHCFPGFLGADCAKAACPVLCSGNGQYSKG 
E|VSFNTWLDSVQDCPRNCHGNGECVSGgCHCFPGFLGADCAKAACPVLCSGNGQYSKG 
EgVSFgT y^L D SVQD CPRNCHGNGECVSG|jcHCFPGFg!GADCAKAACPVLCSGNGQYSKG 
EBvsjfoTlilgj^ 



iCPRNCHGNGECVSG0CHCFPGFLGraDC| 



\CPVLCSGNGQYSKG 
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620 
620 
648 
452 
209 
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NOV37 

gi | 7657415 | 
gi | 9910320 | 
gij 10241574 | 
gi | 5307761 | 
gi j 6760369 j 



NOV37 

gi (7657415 | 
gij 9910320 | 
gij 10241574 | 
gij 5307761 | 
gij6760369 j 
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vdcgthgvc i gg acrceegwtggacdqr vchprc 
vdcgthgvciggacrceegwtgBacdqrvchprc 
vdcgthgvciggacrceegwtggacdqrvchprc 
vdcgthgvc i ggacrceegwtgsacdqrvchprc 
vdcg|hgvc i gggcrceegwtg^cd[j^chprci 
vdcgBhgvc^ggScrceegwtg@ac3jqr5chprc| 
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ehgtckdgkcecregwngehcti 
ehgtckdgkcecregwngehcti 
ehgtckdgkcecregwngehcti 
ehgtckdgkcecregwngehcti 

EHGTCKDGKCECEjgGWjjGEHCTl 
IeHGTCKDGKCEcSIgWNGEHCTI 
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NOV37 

gi [7657415 | 
gi | 9910320 | 
gi | 10241574 | 
gi|530776l| 
gi | 6760369 | 



NOV37 

gi | 7657415 | 
gi | 9910320 | 
gi j 10241574 | 
gi|530776l| 
gi j 6760369 j 



NOV37 


970 


gi 


7657415 | 


964 


gi 


9910320 | 


965 


gi 


10241574 | 


1002 


gi 


5307761 | 


799 


gi 


6760369| 


556 



NOV37 


1030 


gi [7657415 | 


1024 


gi | 9910320 | 


1025 


gi | 10241574 | 


1062 


gi | 5307761 1 


859 


gi | 6760369 | 


616 


NOV37 


1090 


gi | 7657415 | 


1084 


gi | 9910320 | 


1085 


gi j 10241574 | 


1122 


gi|530776l| 


919 


gi|6760369| 


676 


NOV37 


1140 


gi [^7657415 | 


1144 


gi|9910320| 


1145 


gi| 10241574 | 


1182 


gi|530776l| 


979 


gi | 6760369 | 


736 


NOV37 


1198 


gi | 7657415 | 


1204 


gi | 9910320 | 


1205 


gi | 10241574 | 


1242 


gi|530776l| 


1039 


gi|6760369| 


796 


NOV37 


1258 


gi |7657415 | 


1264 


gi j 9910320 | 


1265 


gi | 10241574 | 


1302 
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LNLlgVHLMVAVEGHLFQKSFgASPNLAYTFIWDKTDAYGQRVYGLSDAV: 
LNL I SvHLMVAVEGHLFQKS fSaS PNLAYTF I WDKTDAYGQRVYGLSDAW 



LNLIgVHLMVAVEGHLFQKSF j ASPNLAYTFIWDKTDAYGQj]jVYGLSDAVVSVGFEYE||C 
L|gVHLMVAVgGgLFQKfflF A^PNL^YTFIWDKTDAYSjjQRVYGLS^AVVSVGFEYEgC 

lSSvhlmvavSgSlfqkEf ! as pnlaytf i wd ktdaySqUvygls Saws vgSeyeSc 
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IMGNGRRRSISCPSCNGLAEGNKLLAPVALAVGIDGSLSJVGDFNYIRRIFPSRNVTSILE 
IMGNGRRRS I SCPSCNGLAEGNKLLAPVALAVGI DGSLFVGDFNYI RRI FPSRNVTS ILE 
IMGNGRRRSISCPSCNGLAEGNKLLAPVALAVGIDGSLFVGDFNYIRRIFPSRNVTSILE 
IMGNGRRRS I SCPSCNGLAEGNKLLAPVALAVGI DGSLFVGDFNYI RRI FPSRNVTS ILE 
IMGNGRRRS I SCPSCNGfflA|GNKLLAPVALAgG§DGSLFVGDFNY I RRI FPSgNVTSraE 
I M GNG RRRSISCPS CNgBa^GNK L L A P V AL AHG I DG S L?| VGD FN y[$R R I F P S SnVT S ML E 
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NKEFKHSN 
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LR 
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NKEFKHSN 
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LR 
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NKEFKHSN 
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LR 
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NKEFKHSN 


i 
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1317 
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gi I 5307761 I 
gi|6760369| 
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NOV37 

gi|7657415 | 
gi j 9910320 | 
gi | 10241574 | 
gi | 5307761 | 
gij 6760369 j 
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gi 
gi 
gi 
gi 
gi 



7657415 | 
9910320 j 
10241574 | 
5307761 | 
6760369 | 



NOV37 

gi|7657415 | 
gij 9910320 | 
gi j 10241574 | 
gij 5307761 | 
gij6760369j 



NOV37 

gi|7657415 | 
gi j 9910320 j 
gi j 10241574 | 
gi j 5307761 | 
gi j 6760369 j 



NOV37 

gi | 7657415 | 
gi j 9910320 j 
gi j 10241574 | 
gij530776l| 
gij6760369 j 



NOV37 

gi|7657415| 
gij 9910320 j 
gi j 10241574 | 
gij 5307761 | 
gi j 6760369 j 



NOV37 

gi|7657415| 
gij9910320| 
gij 10241574 | 
gij 5307761 | 
gi|6760369j 
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1323 
1324 
1361 
1151 
908 



1378 
1383 



GEQCLPFDEARCGDGGKAgDATLMSPRGIAVDKNGLMYFVDATMIRKVDQNGIISTLLGS 
GEQCLPFDEARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDATMIRKVDQNGIISTLLGS 
GEQCLPFDEARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDATMIRKVDQNGIISTLLGS 
GEQCLPFDEARCGDGGKAVDATLMSPRGIAVDK0GLMYFVDATMIRKVDQNGIISTLLGS 
GEOCLPFDEARCGDGGKAB|AgL^P|GIAVDKNG[gYFVDgTMIRKVDiSNGIISTLLGS 

geqclpfdearcgdggkavbatlmsp§g3!a0dknglhyfvd§tm irkvdqngiistllgs 



1440 

1377 
1382 
1383 
1420 
1210 
967 



1470 



1480 



1490 




70 



NOV37 

gi|7657415| 



1810 
. . I . . . 



1820 



1830 



1840 



1850 



veasytvvqdqvrnsyqlcSngtlrvmyangm^sfhsephvlagt|tptigrcnislpm 
veasytvvqdqvrnsyqlc^gtlrvmyangm^sfhseph vlagtgtptigrcisr islpm 
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I860 

1797 
1802 



,{" 4 tS '"V 1 ***"*! V'"h "r ""Ms t?*V ~ '~" lt ■*{» 4 ' ~ '*"'•; 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



gi | 9910320 | 
gi | 10241574 | 
gi|5307761 
gij 6760369 



NOV37 

gi | 7657415 
gi | 9910320 
gi | 10241574 | 
gij 5307761 
gi | 6760369 



NOV37 

gi | 7657415 | 
gij 9910320 j 
gi j 10241574 | 
gi j 5307761 | 
gi j 6760369 j 



NOV37 

gi|7657415 | 
gi j 9910320 j 
gi j 10241574 | 
gi|530776l| 
gi j 6760369 j 



NOV37 

gi | 7657415 | 
gi j 9910320 j 
gi j 10241574 | 
gi j 5307761 | 
gi j 6760369 j 



NOV37 

gi | 7657415 | 
gij 9910320 j 
gij 10241574 | 
gi j 5307761 | 
gi j 6760369 j 



NOV37 



gi 
gi 
gi 
gi 
gi 



7657415 j 
9910320 j 
10241574 | 
5307761 | 
6760369 



NOV37 

gi | 7657415 | 
gi j 9910320 j 
gi j 10241574 | 
gi j 5307761 | 
gi j 6760369 j 



1744 
1781 

1571 1DSF 
1328 kDiF 



VEASYTVVQDQVRNSYQLCgNGTLRVMYANGM' 
VEASYTVVQDQVRNSYQ LC||NGTL RVMYANGM 
YT^ODO|iRNSYO^M p^Sl RVflYANGMa 



XGYDEsfeU 




|ds: 



1870 



1880 



1890 



1900 



1910 



1920 




I 



2170 
. . I . . . 



2180 
I I ... J 



2190 



2200 



2210 



2097 
2103 
2104 
2141 
1930 
1687 



pvdlyrydeisgkvehfgkfgviyydinqiittavmtlskhfdthgrikevqyemfrslm 
pvdlyrydeisgkvehfgkfgviyydinqiittavmtlskhfdthgrikevqyemfrslm 
pvdlyrydeisgkvehfgkfgviyydinqiittavmtlskhfdthgrikevqyemfrslm 
pvdlyrydeisgkvehfgkfgviyydinqiittavmtlskhfdthgrikevqyemfrslm 

p|DLYSraDglSGKVEfflFGKFGVIYYDINQIl|TAVMTrojKHFDffiHGRIKEgQYEgFRSLM 

podlys|!d|isgkveH^^ 



2036 
2042 
2043 
2080 
1870 
1627 



2096 
2102 
2103 
2140 
1929 
1686 



2220 

A 

2156 
2162 
2163 
2200 
1989 
1746 



2230 



2240 



2250 



2260 



2270 



2280 




70 



2290 
..|... 



2300 



2310 
..|... 



2320 
..|... 



2330 
..|... 



2340 
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NOV37 

gi|7657415 | 
gi | 9910320 j 
gi | 10241574 | 
gi|530776l| 
gi j 6760369 j 




PLRYDLRDRITRLGDVQYKIDDDG 
PLRYDLRDRITRLGDVQYKIDDDG 
PLRYDLRDRITRLGDVQYKIDDDG 
PLRYDLRDRITRLGDVQYKIDDDG 
PLRYDLRDRITRLGDVQYgjD^DG 
PLRYDLRDRI TRLGDVQyBSdBdG 




2360 



2370 



2380 



2390 



2400 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



NOV37 

7657415 | 
9910320 | 
10241574 | 
5307761 | 
6760369 j 



gi 
gi 
gi 
gi 



NOV37 

gi|7657415 | 
gij 9910320 j 
gi | 10241574 | 
gi | 5307761 | 
gi | 6760369 j 



NOV37 

gi | 7657415 | 
gi | 9910320 j 
gi | 10241574 | 
gi|530776l| 
gij 6760369 j 



NOV37 

gi|7657415 | 
gij 9910320 j 
gi j 10241574 | 
gi j 5307761 | 
gi j 6760369 j 



NOV37 

gi 
gi 
gi 
gi 
gi 



7657415 | 
9910320 j 
10241574 | 
5307761 | 
6760369 j 



NOV37 

gi|7657415 | 
gij9910320 j 
gi j 10241574 | 
gij530776l| 
gi j 6760369 j 



NOV37 

gi | 7657415 | 
gi j 9910320 j 
gi j 10241574 | 
gi j 5307761 | 
gij6760369j 




2635 
2640 
2641 
2678 
2469 
2225 



2694 
2699 
2700 
2737 
2525 
2281 



330 



;t n n y p n 31 P. - o t " > .1 o P 




2770 



10 



15 



NOV37 

gi|7657415| 
gi j 9910320 j 
gi 1 10241574 I 
gi|530776l| 
gi|6760369| 



NOV37 

gi|7657415| 
gi j 9910320 j 
gi j 10241574 I 
gi j 5307761 | 
gi|6760369| 



I 



I 



2780 
■ •I... 



2790 
■ ■ I ■ ■ ■ 



2800 



2810 



2820 



2695 
2700 
2701 
2738 
2526 
2282 J 



KEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLRQN 
KEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLRQN 
KEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLRQN 
KEQQKARDGREGSRgWTlgGEKQQLL^TGRVQGYEGYYVLPVEQYPELADSSSNIQFLRQN 
nEOOj^RDGgEGBRLWTEGEK^QLLSlGivtiGYgGYYVL^EQYPELADS ^ SNSQFLRQr 

1eoo|Hrdg^g3rlwtegek3olls^ 



2754 
2759 
2760 
2797 
2585 
S{ 2341 




20 Table 37E lists the domain description from DOMAIN analysis results against 

NOV37. This indicates that the NOV37 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 37E. Domain Analysis of NOV37 

gnl I Pf am I pf am02 0 68 , Metallothio_PEC , Plant PEC family metallothionein. 
(SEQ ID NO: 83 0) 

CD-Length = 77 residues, 97.4% aligned 
Score = 41.6 bits (96), Expect = 6e-04 



25 



30 



35 



40 



NOV3 7: 73 8 



Sbjct: 2 



NOV3 7: 79 7 



Sbjct: 43 



CSVDCGTHGVCIGG-ACRCEEGWTGAACDQRVCHPRCIEHGTCKDGKCECREGWNGEHCT 7 96 



III III -III I I II 

CDDKCGCPSPCPGGNSCRCTSGGEAGAGDQ- 



II II I 

-EHTTC PC- - 



I I I I 

GEHC- 



42 



IDGC-PDLCNGNGRCTLGQNSWQCVCQTGWRGPGCNVAMETSCA 83 9 

M I I I + II I II I III 

- -GCNPCTCPKTQTPTGRKGRANCSC GAGCTCA SCA 7 6 



Neurestin shows homology to a neuregulin gene product, human gamma-heregulin, a 

Drosophila receptor-type pair-rule gene product, Odd Oz (Odz) / Ten(m), and Ten(a), 

suggesting a possible function in synapse formation and morphogenesis. A mouse neurestin 

homolog has independently been cloned as DOC4 from the NIH-3T3 cell line. Northern blot 

analysis showed that neurestin is highly expressed in the brain and also in other tissues at 

much lower levels. In situ hybridization studies showed that neurestin is expressed in many 

types of neurons, including pyramidal cells in the cerebral cortex and tufted cells in the 

olfactory bulb during development. In adults, neurestin is mainly expressed in olfactory and 

hippocampal granule cells, which are known to be generated throughout adulthood. 

Nonetheless, in adults the expression of neurestin was experimentally induced in external 

tufted cells during regeneration of olfactory sensory neurons. These results suggest a role for 

neurestin in neuronal development and regeneration in the central nervous system 
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Neurestin is a putative transmembrane protein whose expression is developmentally 
regulated in neurons. Neurestin expression pattern were examined in mitral/tufted cells in the 
developing rat olfactory bulb. In the main olfactory bulb, neurestin expression was segregated 
in the dorso-rostral area and in the ventro-caudal area, but not in between. In the accessory 
5 olfactory bulb, neurestin expression was found only in the far caudal area. This area did not 
completely correspond to a caudal half of the vomeronasal nerve and glomerular layers 
positive for a G-protein Go alpha. These spatio-temporal expression patterns suggest that 
neurestin functions as a target recognition molecule that specifies zonal projection patterns of 
olfactory and vomeronasal sensory neurons 

10 The NOV37 nucleic acid of the invention encoding a Ten-M2-like protein includes the 

nucleic acid whose sequence is provided in Table 37A, or a fragment thereof. The invention 
also includes a mutant or variant nucleic acid any of whose bases may be changed from the 
corresponding base shown in Table 37 A while still encoding a protein that maintains its Ten- 
M2-like activities and physiological functions, or a fragment of such a nucleic acid. The 

15 invention further includes nucleic acids whose sequences are complementary to those just 

described, including nucleic acid fragments that are complementary to any of the nucleic acids 
just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

20 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1% of the residues 
may be so changed. 

25 The NOV37 protein of the invention includes the Ten-M2-like protein whose sequence 

is provided in Table 37B. The invention also includes a mutant or variant protein any of whose 
residues may be changed from the corresponding residue shown in Table 37B while still 
encoding a protein that maintains its Ten-M2-like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 35% of the bases 

30 may be so changed. 

The NOV37 nucleic acids and proteins of the invention are useful in potential 
diagnostic and therapeutic applications implicated in various diseases and disorders described 
below and/or other pathologies. For example, the compositions of the present invention will 
have efficacy for treatment of patients suffering from: brain disorders including epilepsy, 
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eating disorders, schizophrenia, ADD, cancer, heart disease, inflammation and autoimmune 
disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 
inflammatory skin disorders, allergies, blood disorders, psoriasis, colon cancer, leukemia, 
AIDS, thalamus disorders, metabolic disorders including diabetes and obesity, lung diseases 
5 such as asthma, emphysema, cystic fibrosis, and cancer, pancreatic disorders including 

pancreatic insufficiency and cancer, and prostate disorders including prostate cancer and other 
diseases, disorders and conditions of the like. 

NOV37 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 

10 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. For example the disclosed NOV37 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 

15 disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 



NOV38 

A disclosed NOV38 nucleic acid of 1090 nucleotides (also referred to as CG56737-01) 
encoding a novel Activin Beta C Chain-like protein is shown in Table 38A. An open reading 
20 frame was identified beginning with an ATG initiation codon at nucleotides 3-5 and ending 
with a TAG codon at nucleotides 1068-1070. Putative untranslated regions, if any, found 
upstream from the initiation codon and downstream from the termination codon are underlined 
in Table 3 8 A, and the start and stop codons are in bold letters. 



Table 38A. NOV38 Nucleotide Sequence (SEQ ID NO: 145) 

CAATGACCTCCTCATTGCTTCTGGCCTTTCTCCTCCTGGCTCCAACCACAGTGGCCACTCCCAGAGCTGG 
CGGTCAGTGTCCAGCATGTGGGGGGCCCACCTTGGAACTGGAGAGCCAGCGGGAGCTGCTTCTTGATCTG 
GCCAAGAGAAGCATCTTGGACAAGCTGCACCTCACCCAGCGCCCAACACTGAACCGCCCTGTGTCCAGAG 
CTGCTTTGAGGACTGCACTGCAGCACCTCCACGGGGTCCCACAGGGGGCACTTCTAGAGGACAACAGGGA 
ACAGGAATGTGAAATCATCAGCTTTGCTGAGACAGACTCCACTTCAGCCTACAGCTCCCTGCTCACTTTT 
CACCTGTCCACTCCTCGGTCCCACCACCTGTACCATGCCCGCCTGTGGCTGCACGTGCTCCCCACCCTTC 
CTGGCACTCTTTGCTTGAGGATCTTCCGATGGGGACCAAGGAGGAGGCGCCAAGGGTCCCGCACTCTCCT 
GGCTGAGCACCACATCACCAACCTGGGCTGGCATACCTTAACTCTGCCCTCTAGTGGCTTGAGGGGTGAG 
AAGTCCGGTGTCCTGAAACTGCAACTAGACTGCAGACCCCTAGAAGGCAACAGCACAGTTACTGGACAAC 
CGAGGCGGCTCTTGGACACAGCAGGACACCAGCAGCCCTTCCTAGAGCTTAAGATCCGAGCCAATGAGCC 
TGGAGCAGGCCGGGCCAGGAGGAGGACCCCCACCTGTGAGCCTGCGACCCCCTTATGTTGCAGGCGAGAC 
C ATT ACGT AG AC TT C CAGGAACT GGG ATGGCGGG AC TGG AT ACTG CAG C C CGAGGGGT AC C AG CTG AAT T 
ACTGCAGTGGGCAGTGCCCTCCCCACCTGGCTGGCAGCCCAGGCATTGCTGCCTCTTTCCATTCTGCCGT 
CTTCAGCCTCCTCAAAGCCAACAATCCTTGGCCTGCCAGTACCTCCTGTTGTGTCCCTACTGCCCGAAGG 
CCCCTCTCTCTCCTCTACCTGGATCATAGTGGCAATGTGGTCAAGACGGATGTGCCAGATATGGTGGTGG 
AGGCCTGTGGCTGCAGCTAGCAAGAGGACCTGGGGCTTTG 
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The disclosed NOV38 nucleic acid sequence, located on chromosome 12ql3.1, has 
748 of 935 bases (80%) identical to a gb:GENBANK-ID:MMU96386|acc:U96386.1 mRNA 
from Mus musculus (activin beta E subunit mRNA, complete cds) (E = 5.2e" 120 ) . 

A disclosed NOV38 polypeptide (SEQ ID NO: 146) encoded by SEQ ID NO: 145 is 
5 355 amino acid residues and is presented using the one-letter amino acid code in Table 38B. 
Signal P, Psort and/or Hydropathy results predict that NOV38 contains a signal peptide and is 
likely to be localized extracellularly with a certainty of 0.5135. The most likely cleavage site 
for a NOV38 peptide is between amino acids 18 and 19: TVA-TP . 

Table 38B. Encoded NOV38 protein sequence (SEQ ID NO:146). 

MTSSLLLAFLLLAPTTVATPRAGGQCPACGGPTLELESQRELLLDLAKRSILDKLHLTQRPTLNRPVSRAALRTA 
LQHLHGVPQGALLEDNREQECEIISFAETDSTSAYSSLLTFHLSTPRSHHLYHARLWLHVLPTLPGTLCLRIFRW 
GPRRRRQGSRTLI^EHHITNLGWHTLTLPSSGLRGEKSGVLKLQLDCRPLEGNSTVTGQPRRLLDTAGHQQPFLE 
LKIRANEPGAGRARRRTPTCEPATPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHLAGSPGIAASFH 
SAVFSLLKANNPWPASTSCCVPTARRPLSLLYLDHSGNWKTDVPDMWEACGCS 

10 The disclosed NOV38 amino acid sequence has 217 of 355 amino acid residues (61%) 

identical to, and 253 of 355 amino acid residues (71%) similar to, the 352 amino acid residue 
ptnr:SWISSPROT-ACC:P55103 protein from Homo sapiens (Human) (inhibin beta C chain 
precursor (activin beta-C chain)) (E = 7.6e" 103 ). 

NOV38 is predicted to be expressed in at least ovary and liver. This information was 

15 derived by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or 
RACE sources. 

Possible small nucleotide polymorphisms (SNPs) found for NOV38 are listed in Table 

20 38C. 



Table 38 


!C: SNPs 


Consensus 
Position 


Depth 


Base 
Change 


PAF 


95 


14 


T>C 


N/A 



NOV38 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 38D. 



Table 38D. BLAST results for NOV38 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 13899338 | ref | NP_ 
113667. 1| 
(NM 031479) 


hypothetical 
protein MGC4638 
[Homo sapiens] 


350 


291/348 
(83%) 


307/348 
(87%) 


e-153 
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gi | 14714 53 9 |gb | AAH1 


inhibin beta E 


350 


253/352 


280/352 


e-129 


0404 . 1 | AAH10404 


[Mus musculus] 




(71%) 


(78%) 




(BC010404) 












gi | 6680453 | ref | NP_0 


inhibin beta E 


350 


253/352 


280/352 


e-129 


32408 . 1 | 


[Mus musculus] 




(71%) 


(78%) 




(NM 008382) 












gi | 13 92 916 0 | ref |NP_ 


activin beta E 


350 


250/352 


279/352 


e-125 


114003. l| 


[Rattus 




(71%) 


(79%) 




(NM 031815) 


norvegicus] 










gi|4809189| gb | AAD3 0 


activin beta E 


350 


248/352 


279/352 


e-124 


133.l|AF140032_l 


[Rattus 




(70%) 


(78%) 




(AF140032) 


norvegicus] 











The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 38E. 

Table 38E Information for the ClustalW proteins 

5 1) NOV3 8 (SEQ ID NO: 14 6) 

2) gi | 13899338 | ref |NP__113667 . 1 1 (NM_031479) hypothetical protein MGC4 63 8 [Homo 
sapiens] {SEQ ID NO: 4 92) 

3) gi | 14714539 | gb| AAH10404 . 1 | AAH10404 (BC010404) inhibin beta E [Mus musculus] (SEQ 
ID NO: 4 93) 

10 4) gi | 6680453 | ref |NP_032408 . 1 | (NM_008382) inhibin beta E [Mus musculus] (SEQ ID 

NO: 4 94) 

5) gi | 13929160 | ref |NP_114003 . 1 | (NM_031815) activin beta E [Rattus norvegicus] (SEQ 
ID NO: 4 95) 

6) gi [4809189 |gb|AAD30133 . 1 | AF140032_1 (AF140032) activin beta E [Rattus norvegicus] 
15 (SEQ ID NO:496) 



20 



25 



30 



35 



40 



NOV3 8 


1 


gi 


13899338 | 


1 


gi 


14714539] 


1 


gi 


6680453 | 


1 


gi 


13929160 | 


1 


gi 


4809189 | 


1 



NOV3 8 

gi|l3899338| 
gi | 14714539 | 
gij 6680453 | 
gi|l3929160| 
gi|4809189 | 





45 



50 



NOV38 

gi|l3899338 | 
gij 14714539 | 
gi j 6680453 | 
gi|l3929160 | 
gi|4809189| 



177 
172 
172 
172 
172 
172 



TLPSSGLR 
TLPSSGLR 
TLPSSGLR 
TLPSSGLR 
TLPSSGLR 
TLPSSGLR 




220 



230 
. . I . . 



240 
- - I 



PRgLLDTAGgQgPFLELKIRANEPGAG 

pr SlldtagSqSpflelkiranepgag 
pr 1lldtagmqspflelkiranepgag 
pr JlldtagBq|pflelkiranepgag 
pr illdtagBqSpflelkiranepgag 
pr IlldtagHqIpflelkiranepgag 



236 
231 
231 
231 
231 
231 
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NOV3 8 

gi | 13899338 | 
gi j 14714539 j 
gi j 6680453 | 
gi | 13929160 | 
gi|4809189| 



250 



260 



270 



280 



290 




310 



320 



330 



NOV3 8 

gi | 13899338 | 
gi j 14714539 j 
gi j 6680453 | 
gi j 13929160 | 
gi|4809189| 




I 



340 



I 



350 
• - I . - 



SCCVPTARRPLSLLYLDHgGNVVKTDVPDMWEACGCS 
SCCVPTARRPLSLLYLDHNGNVVKTDVPDMWEACGCS 
SCCVPTARRPLSLLYLDHNGNVVKTDVPDMWEACGCS 
SCCVPTARRPLSLLYLDHNGNWKTDVPDMWEACGCS 
SCCVPTARRPLSLLYLDHNGNVVKTDVPDMWEACGCS 
S CCVPTARRPL S LL YLDHNGNVVKTDVPDM WEACGCS 



355 
350 
350 
350 
350 
350 



Tables 38F-G list the domain description from DOMAIN analysis results against 
NOV38. This indicates that the NOV38 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 38F Domain Analysis of NOV38 

gnl | Smart | smart 0 02 04 , TGFB, Transforming growth factor-beta (TGF-beta) 
family; Family members are active as di sulphide- linked- homo- or 
heterodimers . TGFB is a multifunctional peptide that controls 
proliferation, differentiation, and other functions in many cell types 
(SEQ ID NO: 831) 

CD-Length = 102 residues, 100.0% aligned 
Score = 134 bits (336) , Expect = le-32 



NOV38: 252 CCRRDHYTOFQELGWRDWILQPEGYQLOTCSGQCPPHLAGSPGIAASFHSAVFSLLKANN 

I I I Mil-Ill III- Ml II Ml I- I- I- MI+ I + 

Sbjct: 1 CRRHDLYVDFKDLGWDDWI IAPKGYNAYYCEGECPFPLSERLN- - ATNHAIVQSLVHALD 



311 



58 



NOV3 8 : 



312 P W P AS T S C C V PT ARRP L S L L Y LDH S GNWKTD VP DM WE ACGC S 3 55 



Sbjct: 59 



Mill MMI I MM 



Mill III 



PGAVPKPCCVPTKLSPLSMLYYDDDGNWLRNYPNMWEECGCR 102 



Table 38G. Domain Analysis of NOV38 

gnl | Pf am |pf am00019 , TGF-beta, Transforming growth factor beta like 
domain (SEQ ID NO: 83 2) 

CD-Length = 105 residues, 97.1% aligned 
Score = 114 bits (286) , Expect = 7e-27 



NOV3 8: 252 CCRRDHYVDFQELGWRDW I LQPEG YQLNYCSGQC PPHLAGS PG I AAS FHS AVFSLLKANN 311 

I I IMI++MI 111+ MM Mill II I I 1+ + +I++ I 

Sbjct: 4 CRLRSLYVDFRDLGWGDWI IAPEGYIANYCSGSCPFPLRDDLN — LSNHAILQTLVRLRN 61 

NOV3 8 : 312 PWPASTSCCVPTARRPLSLLYLDHSGNWKTDVPDMWEACGCS 3 55 

I Mill MMIM + III M 1+ III 

Sbjct: 62 PRAVPQPCCVPTKLSPLSMLYLDDNSNWLRLYPNMSVKECGCR 105 
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Activins are homo- or heterodimers of related beta subunits (see 147290) while 
inhibins are dimers composed of an alpha subunit (147380) and an activin beta subunit 
(summarized in Schmitt et ah, Genomics 1996, 32(3):358-66). Activin proteins belong to the 
TGF-beta superfamily (see 190180), the members of which have important roles in cell 
5 determination, differentiation, and growth. 

TGFB is a multifunctional peptide that controls proliferation, differentiation, and other 
functions in many cell types. It was first identified by its ability to cause phenotypic 
transformation of rat fibroblasts. TGFB is chemically distinct from TGFA. It has essentially no 
sequence homology with TGFA or with epidermal growth factor, of which TGFA is an 

10 analog. Members of the same gene family as TGFB include inhibin, which inhibits pituitary 
secretion of follicle stimulating hormone, and Mullerian inhibitory substance, which is 
produced by the testis and is responsible for regression of the Mullerian ducts (anlagen of the 
female reproductive system) in the male embryo. Many cells synthesize TGFB and almost all 
of them have specific receptors for this peptide. Alpha and beta TGFs are classes of 

15 transforming growth factors. TGFB acts synergistically with TGFA in inducing 
transformation. It also acts as a negative autocrine growth factor. 

TGF-beta plays an important role in wound healing. A number of pathologic 
conditions, such as idiopathic pulmonary fibrosis, scleroderma, and keloids, which share the 
characteristic of fibrosis, are associated with increased TGF-beta- 1 expression. 

20 The disclosed NOV38 nucleic acid of the invention encoding a Activin Beta C Chain- 

like protein includes the nucleic acid whose sequence is provided in Table 38A or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 38A while still encoding a protein 
that maintains its Activin Beta C Chain-like activities and physiological functions, or a 

25 fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to those just described, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of non-limiting example, 

30 modified bases, and nucleic acids whose sugar phosphate backbones are modified or 

derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
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acids, and their complements, in one embodiment up to about 20% of the NOV38 residues 
may be so changed. 

The disclosed NOV38 protein of the invention includes the Activin Beta C Chain-like 
protein whose sequence is provided in Table 38B. The invention also includes a mutant or 
5 variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 38B while still encoding a protein that maintains its Activin Beta C Chain-like 
activities and physiological functions, or a functional fragment thereof. In one embodiment a 
mutant or variant protein of NOV38, up to about 39% of the bases may be so changed. 

The above defined information for this invention suggests that these Activin Beta C 

10 Chain-like proteins (NOV38) is a member of a "Activin Beta C Chain family". Therefore, the 
NOV38 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 

15 targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The nucleic acids and proteins of NOV38 are useful in Alzheimer disease-5, Myxoid 
liposarcoma, Stickler syndrome, type I (3), SED, Alpha-ketoglutarate dehydrogenase 

20 deficiency, Cerebral cavernous malformations-2, Greig cephalopolysyndactyly syndrome, 
Hyperinsulinism, familial, MODY, type 2, Pallister-Hall syndrome, Polydactyly, postaxial, 
types Al and B, Polydactyly, postaxial, type IV, Retinitis pigmentosa-9, Charcot-Marie-Tooth 
neuropathy-2D, Colton blood group, Deafness, autosomal dominant 5, Macular dystrophy, 
dominant cystoid, Radioulnar synostosis with amegakaryocytic thrombocytopenia, Von 

25 Hippel-Lindau (VHL) syndrome, cirrhosis, transplantation, fertility and/or other pathologies 
and disorders. 

NOV38 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
30 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

NOVX Antibodies" section below. For example the disclosed NOV38 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
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disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 

NOV39 

NOV39 includes novel Activin Beta C Chain-like and Inhibin Beta E Chain Precursor- 
5 like proteins disclosed below. The disclosed sequences have been named NOV39a and 
NOV39b. 

NOV39a 

A disclosed NOV39a nucleic acid of 1 1 12 nucleotides (also referred to as CG56737- 
10 02) encoding a novel Inhibin Beta E Chain Precursor-like protein is shown in Table 39A. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 40- 
42 and ending with a TAG codon at nucleotides 1090-1092. Putative untranslated regions, if 
any, found upstream from the initiation codon and downstream from the termination codon are 
underlined in Table 39A, and the start and stop codons are in bold letters. 

Table 39A. NOV39a Nucleotide Sequence (SEQ ID NO: 147) 

" CCATCCGAGGCTCCTGAACCGGGGCCATTCACCAGGAGC ATQCGGCTCCCTGATGTCCAGCTCTGGCTQG 
TGCTGCTGTGGGCACTGGTGCGAGCACAGGGGACAGGGTCTGTGTGTCCCTCCTGTGGGGGCTCCAAACT 
GG CACCCCAAGCAGAACGAGCTCTGGTGCTGGAGCTAGCCAAGCAGCAAATCCTGGATGGGTTGCACCTG 
ACCAGTCGTCCCAGAATAACTCATCCTCCACCCCAGGCAGCGCTGACCAGAGCCCTCCGGAGACTACAGC 
CAGGGAGTGTGGCTCCAGGGAATGGGGAGGAGGTCATCAGCTTTGCTACTGTCACAGACTCCACTTCAGC 
C T AC AG CT C C CTG CT C AC TTTT C AC C TGT CC ACT C CT CGGTC C C ACC ACC TGT AC C ATGCC CGCCTGTGG 
CTGCACGTGCTCCCCACCCTTCCTGGCACTCTTTGCTTGAGGATCTTCCGATGGGGACCAAGGAGGAGGC 
GC CAAGGGT C C CG CACTC T C CTGGCT G AG C ACC AC ATC AC CAACC TGGGC TGGC AT ACCT T AACTCTGC C 
CTCTAGTGGCTTGAGGGGTGAGAAGTCTGGTGTCCTGAAACTGCAACTAGACTGCAGACCCCTAGAAGGC 
AACAGCACAGTTACTGGACAACCGAGGCGGCTCTTGGACACAGCAGGACACCAGCAGCCCTTCCTAGAGC 
TTAAGATCCGAGCCAATGAGCCTGGAGCAGGCCGGGCCAGGAGGAGGACCCCCACCTGTGAGCCTGCGAC 
CCCCTTATGTTGCAGGCGAGACCATTACGTAGACTTCCAGGAACTGGGATGGCGGGACTGGATACTGCAG 
CCCGAGGGGTACCAGCTGAATTACTGCAGTGGGCAGTGCCCTCCCCACCCGGCTGGCAGCCCAGGCATTG 
CTGCCTCTTTCCATTCTGCCGTCTTCAGCCTCCTCAAAGCCAACAATCCTTGGCCTGCCAGTACCTCCTG 
TTGTGTCCCTACTGCCCGAAGGCCCCTCTCTCTCCTCTACCTGGATCATAATGGCAATGTGGTCAAGACG 
GATGTGCCAGATATGGTGGTGGAGGCCTGTGGCTGCAGCTAG CAAGAGGACCTGGGGCTTTG 

15 ' ~ ~ ~~ ~ 

The disclosed NOV39a nucleic acid sequence, located on chromosome 7pl3-15, has 
923 of 1 1 10 bases (83%) identical to a gb:GENBANK-lD:MMU96386|acc:U96386.1 mRNA 
from Mus musculus (activin beta E subunit mRNA, complete cds) (E = 2.1e" 165 ) . 

A disclosed NOV39a polypeptide (SEQ ID NO: 148) encoded by SEQ ID NO: 147 is 
20 350 amino acid residues and is presented using the one-letter amino acid code in Table 39B. 

Signal P, Psort and/or Hydropathy results predict that NOV39a contains no signal peptide and 
is likely to be localized extracellularly with a certainty of 0.3700. The most likely cleavage site 
for a NOV39a peptide is between amino acids 19 and 20: VRA-QG. 
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Table 39B. Encoded NOV39a protein sequence (SEQ ID NO: 148). 

MRLPDVQLWLVLLWALVRAQGTGSVCPSCGGSKLAPQAERALVLELAKQQILDGLHLTSRPRITHPPPQAALTRA 
LRRLQPGSVAPGNGEEVISFATVTDSTSAYSSLLTFHLSTPRSHHLYHARLWLHVLPTLPGTLCLRIFRWGPRRR 
RQGSRTLLAEHHITNLGWHTLTLPSSGLRGEKSGVLKLQLDCRPLEGNSTVTGQPRRLLDTAGHQQPFLELKIRA 
NEPGAGRARRRTPTCEPATPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHPAGSPGIAASFHSAVFS 
LLKANNPWPASTSCCVPTARRPLSLLYLDHNGNVVKTDVPDMVVEACGCS 



The disclosed NOV39a amino acid sequence has 287 of 350 amino acid residues 
(82%) identical to, and 301 of 350 amino acid residues (86%) similar to, the 350 amino acid 
residue ptnr:SWISSPROT-ACC:O08717 protein from Mus musculus (Mouse) (Inhibin Beta E 
Chain Precursor (Activin Beta-E Chain)) (E = 5.0e~ 154 ). 

NOV39a is predicted to be expressed in at least the following tissues: ovary and liver. 
Expression information was derived from the tissue sources of the sequences that were 
included in the derivation of the NOV39a sequence. 



NOV39b 



A disclosed NOV39b nucleic acid of 1 1 12 nucleotides (also referred to as CG56647- 
03) encoding a novel Inhibin Beta E Chain-like protein is shown in Table 39C. An open 
reading frame was identified beginning with an ATG initiation codon at nucleotides 40-42 and 
ending with a TAG codon at nucleotides 1090-1092. Putative untranslated regions, if any, 
found upstream from the initiation codon and downstream from the termination codon are 
underlined in Table 39C, and the start and stop codons are in bold letters. 



Table 39C. NOV39b Nucleotide Sequence (SEQ ID NO:149) 

CCATCCGAGGCTCCTGAACCGGGGCCATTCACCAGGAGC ATQCGGCTCCCTGATGTCCAGCTCTGGCTGG 
TGCTGCTGTGGGCACTGGTGCGAGCACAGGGGACAGGGTCTGTGTGTCCCTCCTGTGGGGGCTCCAAACT 
GGCACCCCAAGCAGAACGAGCTCTGGTGCTGGAGCTAGCCAAGCAGCAAATCCTGGATGGGTTGCACCTG 
ACCAGTCGTCCCAGAATAACTCATCTTCCACCCCAGGCAGCGCTGACCAGAGCCCTCCGGAGACTACAGC 
CAGGGAGTGTGGCTCCAGGGAATGGGGAGGAGGTCATCAGCTTTGCTACTGTCACAGACTCCACTTCAGC 
CTACAGCTCCCTGCTCACTTTTCACCTGTCCACTCCTCGGTCCCACCACCTGTACCATGCCCGCCTGTGG 
CTGCACGTGCTCCCCACCCTTCCTGGCACTCTTTGCTTGAGGATCTTCCGATGGGGACCAAGGAGGAGGC 
GCCAAGGGTCCCGCACTCTCCTGGCTGAGCACCACATCACCAACCTGGGCTGGCATACCTTAACTCTGCC 
CTCTAGTGGCTTGAGGGGTGAGAAGTCTGGTGTCCTGAAACTGCAACTAGACTGCAGACCCCTAGAAGGC 
AACAGCACAGTTACTGGACAACCGAGGCGGCTCTTGGACACAGCAGGACACCAGCAGCCCTTCCTAGAGC 
TTAAGATCCGAGCCAATGAGCCTGGAGCAGGCCGGGCCAGGAGGGGGACCCCCACCTGTGAGCCCGCGAC 
CCCCTTATGTTGCAGGCGAGACCATTACGTAGACTTCCAGGAACTGGGATGGCGGGACTGGATACTGCAG 
CCCGAGGGGTACCAGCTGAATTACTGCAGTGGGCAGTGCCCTCCCCACCTGGCTGGCAGCCCAGGCATTG 
CTGTCTCTTTCCATTCTGCCGTCTTCAGCCTCCTCAAAGCCAACAATCCTTGGCCTGCCAGTACCTCCTG 
TTGTGTCCCTACTGCCCGAAGGCCCCTCTCTCTCCTCTACCTGGATCATAATGGCAATGTGGTCAAGACG 
GATGTGCCAGATATGGTGGTGGAGGCCTGTGGCTGCAGCTAGCAAGAGGACCTGGGGCTTTG 



The disclosed NOV39b nucleic acid sequence, located on chromosome 7pl3-15, has 
920 of 1 1 10 bases (82%) identical to a gb:GENBANK-ID:MMU96386|acc:U96386.1 mRNA 
from Mus musculus (activin beta E subunit mRNA, complete cds) (E - 3.7e" 164 ) . 



340 



A disclosed NOV39b polypeptide (SEQ ID NO: 150) encoded by SEQ ID NO: 149 is 
350 amino acid residues and is presented using the one-letter amino acid code in Table 39D. 
Signal P, Psort and/or Hydropathy results predict that NOV39b contains a signal peptide and is 
likely to be localized extracellularly with a certainty of 0.3700. The most likely cleavage site 
for a NOV39b peptide is between amino acids 19 and 20: VRA-QG. 

Table 39D. Encoded NOV39b protein sequence (SEQ ID NO:150). 

MRLPDVQLWLVLLWALVRAQGTGSVCPSCGGSKLAPQAERALVLELAKQQILDGLHLTSRPRITHLPPQAALTRA 
LRRLQPGSVAPGNGEEVISFATVTDSTSAYSSLLTFHLSTPRSHHLYHARLWLHVLPTLPGTLCLRIFRWGPRRR 
RQGSRTLLAEHHITNLGWHTLTLPSSGLRGEKSGVLKLQLDCRPLEGNSTVTGQPRRLLDTAGHQQPFLELKIRA 
NEPGAGRARRGTPTCEPATPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHLAGSPGIAVSFHSAVFS 
LLKANNPWPASTSCCVPTARRPLSLLYLDHNGNWKTDVPDMWEACGCS 

The disclosed NOV39b amino acid sequence has 285 of 350 amino acid residues 
(81%) identical to, and 299 of 350 amino acid residues (85%) similar to, the 350 amino acid 
residue ptnr:SWISSPROT-ACC:O08717 protein from Mus musculus (Mouse) (Inhibin Beta E 
Chain Precursor (Activin Beta-E Chain)) (E - 1 .5e" 152 ). 

NOV39b is predicted to be expressed in at least the following tissues: ovary and liver. 
Expression information was derived from the tissue sources of the sequences that were 
included in the derivation of the NOV39b sequence. 

Possible small nucleotide polymorphisms (SNPs) found for NOV39a are listed in 
Table 39E. 



Table 39E: SNPs 


Consensus 
Position 


Depth 


Base 
Change 


PAF 


1095 


11 


T>C 


N/A 



Possible small nucleotide polymorphisms (SNPs) found for NOV39b are listed in 
Table 39F. 



Table 39F: SNPs 


Consensus 
Position 


Depth 


Base 
Change 


PAF 


933 


9 


C>T 


0.222 



NOV39a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 39G. 



Table 39G. BLAST results for NOV39a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 
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gi | 13899338 | ref | NP 
113667. 1| 
(NM 031479) 


hypothetical 
protein MGC463 8 
[Homo sapiens] 


350 


34 9/350 
(99%) 


34 9/3 50 
(99%) 




gi|l4714539|gb|AAHl 
0404 . 1 | AAH10404 
(BC010404) 


inhibin beta E 
[Mus musculus] 


350 


288/351 
(82%) 


302/351 
(85%) 


ti JL. O .5 


gi | 6680453 | ref | NP_0 
32408.1 | 
(NM 008382) 


inhibin beta E 
[Mus musculus] 


350 


287/351 
(81%) 


301/351 
(84%) 


e-152 


gi | 13929160 | ref | NP 

114003.1] 

(NM 031815) 


activin beta E 
[Rattus 
norvegicus] 


350 


281/351 
(80%) 


294/351 
(83%) 


e-146 


gi | 4 8 0 9189 | gb | AAD3 0 
133 .1 |AF140032 1 
(AF140032) 


activin beta E 
[Rattus 
norvegicus] 


350 


279/351 
(79%) 


294/351 
(83%) 


e-146 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 39H. 



10 
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Table 39H Information for the ClustalW proteins 



1) NOV39a (SEQ ID NO: 148) 

2) NOV3 9b (SEQ ID NO: 150) 

3) gi | 13899338 | ref | NP_1 13 667 . 1 | (NM_031479) hypothetical protein MGC4 63 8 [Homo 
sapiens] (SEQ ID NO: 497) 

4) gi|l4714539|gb|AAH10404.l|AAH10404 (BC0104.04) inhibin beta E [Mus musculus] (SEQ 
ID NO: 4 98) 

5) gi | 6680453 | ref |NP_032408 . 1 | (NM_008382) inhibin beta E [Mus musculus] (SEQ ID 
NO: 4 99) 

6) gi | 13929160 | ref | NP_114003 . 1 | (NM_031815) activin beta E [Rattus norvegicus] (SEQ 
ID NO: 500) 

7) . gi|4809189|gb|AAD30133.l|AF140032_l (AF140032) activin beta E [Rattus 
norvegicus] (SEQ ID NO: 501) 



20 



25 



30 



35 



40 



45 



NOV39a 

NOV39b 

gi | 13899338 | 

gi j 14714539 j 

gij 6680453 | 

gi [13929160 | 

gi|4809189| 



NOV3 9a 
NOV39b 
gi|l3899338| 
gi|l4714539| 
gi j 6680453 | 
gi |13929160| 
gi |4809189 | 



NOV39a 
NOV39b 
gi|l3899338| 
gij 14714539 j 
gi|6680453 | 
gijl3929160| 
gi|4809189 I 



1 
1 
1 
1 
1 
1 
1 





70 
•I-- 



80 



PQAALTRALRRLQP 
PQAALTRALRRLQP 
PQAALTRALRRLQP 
PQAALTRALRRLQP 
PQAALTRALRRLQP 
PQAALTRALRRLQP 
PQAALTRALRRLQP 




130 



140 



150 



160 



170 




180 
I 



GWHgLTLPSSGL 

gwhHltlpssglr 
gwhHltlpssglr 
gwhBltlpssglr 
gwhBltlpssglr 
gwhHltlpssglr 
gwhBltlpssglr 



179 
179 
179 
179 
179 
179 
179 



190 



200 



210 
I - - 
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220 



230 



240 
••I 



10 



15 



20 



25 



NOV39a 


180 


G 


NOV3 9b 


180 


G 


gi|l3899338 | 


180 


G 


gi| 14714539] 


180 


S 


gij 6680453 | 


180 


S 


gij 13929160 | 


180 


S 


gi|4809189 | 


180 


S 



NOV3 9a 
NOV3 9b 
gi|l3899338 J 
gij 14714539 | 
gi|6680453 | 
gi|l3929160| 
gi|4809189| 



240 
240 
240 
240 
240 
240 
240 



NOV3 9a 


300 


NOV3 9b 


300 


gi|l3899338| 


300 


gij 14714539 j 


300 


gi|6680453| 


300 


gi|l3929160| 


300 


gi|4809189| 


300 




GgPRgLLDTAGjgQgPFLELKIRANEPGAGRARRRTPT 

g3pr 

GBPR 



prhlldtaggqgpflelkiranepgagrarrrtpt 
prhlldtaggggpflelkiranepgagrarrrtpt 
prjjlldtaggqspflelkiranepgagrarrrtpt 
prJlldtagSqIpflelkiranepgagrarrrtpt 



239 
239 
239 
239 
239 
239 
239 



250 
■ - I - - 



260 
. - I . . 



270 
- - I - - 



280 
. . I . . 



290 



300 



CEPgTPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHjgAGSPGIAASFHSAVF 
CEPgTPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHLAGSPGIAjJSFHSAVF 
CEPgTPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHLAGSPGIAASFHSAVF 
CEPgTPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHLAGSPGIAASFHSAVF 
CEPraTPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHLAGSPGIAASFHSAVF 
CE^TPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHLAGSPGIAASFHSAVF 
CEggTPLCCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHLAGS PGIAAS FHSAVF 



310 

_L 




I 



320 
. . I . . 



330 



I 
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350 



ISCCVPTARRPLSLLYLDHNGNVVKTDVPDMWEACGCS 
jSCCVPTARRPLSLLYLDHNGNVVKTDVPDMVVEACGCS 
jSCCVPTARRPLSLLYLDHNGNVVKTDVPDMWEACGCS 
SCCVPTARRPLSLLYLDHNGNWKTDVPDMWEACGCS 
SCCVPTARRPLSLLYLDHNGNVVKTDVPDMWEACGCS 
SCCVPTARRPLSLLYLDHNGNWKTDVPDMWEACGCS 
iSCCVPTARRPLSLLYLDHNGNVVKTDVPDMWEACGCS 



350 
350 
350 
350 
350 
350 
350 



299 
299 
299 
299 
299 
299 
299 
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Tables 39I-J list the domain description from DOMAIN analysis results against 
NOV39. This indicates that the NOV39 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 391 Domain Analysis of NOV39 

gnl|Smart|smart00204, TGFB, Transforming growth factor-beta (TGF-beta) 
family; Family members are active as disulphide- linked homo- or 
heterodimers. TGFB is a multifunctional peptide that controls 
proliferation, differentiation, and other functions in many cell 
types. (SEQ ID NO: 83 3) 

CD-Length = 102 residues, 100.0% aligned 
Score = 133 bits (335), Expect = ie-32 
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NOV3 9 : 24 7 CCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHPAGSPGIAASFHSAVFSLLKANN 3 06 

III MM-III III- Ml II Ml + |+ |+ | || + | + 

Sbj ct : 1 CRRHDLYVDFKDLGWDDWI IAPKGYNAYYCEGECPFPLSERLN- - ATNHAIVQSLVHALD 5 8 

NOV3 9 : 3 0 7 PWPASTSCCVPTARRPLSLLYLDHNGNWKTDVPDMWEACGCS 3 5 0 

_ _ r I MMI MMI I -MM + Mill III 

Sbjct : 59 PGAVPKPCCVPTKLSPLSMLYYDDDGNWLRNYPNMWEECGCR 102 



Table 39 J. Domain Analysis of NOV39 

gnl | Pfam|pfam00019, TGF-beta, Transforming growth factor beta like 
domain. (SEQ ID NO: 83 4) 

CD-Length = 105 residues, 97.1% aligned 
Score = 114 bits (286), Expect = 7e-27 
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NOV46 : 247 CCRRDHYVDFQELGWRDWILQPEGYQLNYCSGQCPPHPAGSPGIAASFHSAVFSLLKANN 306 

I I Mil-Ill 111+ I I I I Mill II | |+ + + | ++ | 

Sbjct : 4 CRLRSLYVDFRDLGWGDWI I APEGY I ANYCSGSCPFPLRDDLN- - LSNHAILQTLVRLRN 61 

NOV4 6 : 3 07 PWPASTSCCVPTARRPLSLLYLDHNGNWKTDVPDMWEACGCS 3 5 0 

I Mill MMIM I III M 1+ III 

Sbjct : 62 PRAVPQPCCVPTKLSPLSMLYLDDNSNWLRLYPNMSVKECGCR 105 

Activins are homo- or heterodimers of related beta subunits (see 147290) while 
inhibins are dimers composed of an alpha subunit (147380) and an activin beta subunit 
(summarized in Schmitt et al., Genomics 1996, 32(3):358-66). Activin proteins belong to the 
TGF-beta superfamily (see 190180), the members of which have important roles in cell 
determination, differentiation, and growth. 

TGFB is a multifunctional peptide that controls proliferation, differentiation, and other 
functions in many cell types. It was first identified by its ability to cause phenotypic 
transformation of rat fibroblasts. TGFB is chemically distinct from TGFA. It has essentially no 
sequence homology with TGFA or with epidermal growth factor, of which TGFA is an 
analog. Members of the same gene family as TGFB include inhibin, which inhibits pituitary 
secretion of follicle stimulating hormone, and Mullerian inhibitory substance, which is 
produced by the testis and is responsible for regression of the Mullerian ducts (anlagen of the 
female reproductive system) in the male embryo. Many cells synthesize TGFB and almost all 
of them have specific receptors for this peptide. Alpha and beta TGFs are classes of 
transforming growth factors. TGFB acts synergistically with TGFA in inducing 
transformation. It also acts as a negative autocrine growth factor. 

TGF-beta plays an important role in wound healing. A number of pathologic 
conditions, such as idiopathic pulmonary fibrosis, scleroderma, and keloids, which share the 
characteristic of fibrosis, are associated with increased TGF-beta- 1 expression. 

The disclosed NOV39 nucleic acid of the invention encoding a Activin Beta C Chain- 
like protein includes the nucleic acid whose sequence is provided in Table 39A, 39C or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 39A, or 39C while still 
encoding a protein that maintains its Activin Beta C Chain-like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of non-limiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
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derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, in one embodiment up to about 17% of the NOV39a residues 
may be so changed, and in an additional embodiment up to about 1 8% of the NOV39b 
residues may be so changed. 

The disclosed NOV39 protein of the invention includes the Activin Beta C Chain-like 
protein whose sequence is provided in Table 39B, or 39D. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 39B, or 39D while still encoding a protein that maintains its Activin 
Beta C Chain-like activities and physiological functions, or a functional fragment thereof. In 
one embodiment a mutant or variant protein of NOV39a, up to about 39% of the bases may be 
so changed. 

The above defined information for this invention suggests that these Activin Beta C 
Chain-like proteins (NOV39) is a member of a "Activin Beta C Chain family". Therefore, the 
NOV39 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The nucleic acids and proteins of NOV39 are useful in Alzheimer disease-5, Myxoid 
liposarcoma, Stickler syndrome, type I (3), SED, Alpha-ketoglutarate dehydrogenase 
deficiency, Cerebral cavernous malformations-2, Greig cephalopolysyndactyly syndrome, 
Hyperinsulinism, familial, MODY, type 2, Pallister-Hall syndrome, Polydactyly, postaxial, 
types Al and B, Polydactyly, postaxial, type IV, Retinitis pigmentosa-9, Charcot-Marie-Tooth 
neuropathy-2D, Colton blood group, Deafness, autosomal dominant 5, Macular dystrophy, 
dominant cystoid, Radioulnar synostosis with amegakaryocytic thrombocytopenia, Von 
Hippel-Lindau (VHL) syndrome, cirrhosis, transplantation, fertility and/or other pathologies 
and disorders. 

NOV39 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
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known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. For example the disclosed NOV39 protein have multiple 
hydrophilic regions, each of which can be used as an immunogen. This novel protein also has 
value in development of powerful assay system for functional analysis of various human 
disorders, which will help in understanding of pathology of the disease and development of 
new drug targets for various disorders. 



NOV40 

A disclosed NOV40 nucleic acid of 1606 nucleotides (also referred to as CG56097-01) 
encoding a UDP glycosyltransferase-like protein is shown in Table 40A. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 1-3 and ending 
with a TAG codon at nucleotides 1600-1602. The start and stop codons are shown in bold in 
Table 40A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 40 A. NOV40 nucleotide sequence (SEQ ID NO: 151). 

ATGGCTATGAAATGGACTTCAGTCCTTCTGTTGATACAGCTGAGCTATTACTCTAGCTCTGGGAGTTGTGGA 
AAT GTGCCG CTGTGGC C CATGG AAT AT AGT C CT TGGAT G AAT AT AAAG AC AAT C CTGGAT AAACT T ATGC AG 
ATAAGTCATGAGGTGACTGTTCTAACATTGTCAGCTTCCATTCTTGTTGATCCCAACATAACATCTGTTACT 
AAATTTGAGGTTTATTCTATATCTGTAATTAAAGATGATTTTGCAGGGTTTTTTTTCACACAACAGATTACT 
AAATGGATACATGATCTTCCAAAACATATATTTTGGTTTAAATGTGTTCCCTTCAAGAATATTCTTTGGGAA 
TATTCTGGTTATACTGAGAAGTTCTTTAAAGATGTAGTTTTGAACAAGAAACTTATGACAAACCTACAAGAA 
TCAAGGTCTGATGTCGTTCATGCAAATGCCATTGGTCCCTTTGGAGAGCTGCTGGCTGAGCTATTAAAAATA 
TCCTTTGTGTACAGTCTCCACTTCTCTCCTGGCTACACATTTGAGAAATACAGTGGAGGATTTCTACTTCCA 
CCTTCCTATGGAGCTGTTATTCTGTCAGAATTAAGTGGTTCGATGACATTCATGGAGACAGTAAGAAATATT 
ATATATGTGTTTTATTTTGACTTTTGGTTCCAAACATTTGATATGAAGAAGGGAGACCAGTTTTACAGTGAA 
GTTCTAGGTAAGTCATGTTTTTTATCTGAGATAATGGGAAAAGCTGAAATGTGGCTCATTCGAAACTACTGG 
TATTTGGAATTTCCTCGCCCACTCTTACCTAATTTTGAATTTGTTGTAAGACTCTACTGCAAACCTGTCAAC 
CCCCTGCCTAAGGAGAAAATGGAAGAATTTGCCCAGAGCTCTGATGAAGACGGTGTTGTGTTTTCTCTGGAG 
TCAGCTGTGCAAAACCTTACAGAAGAAAAAGCTGATCTTATCACTTCGGCCCTGGCTCAGATTCCACAAAAA 
GTCATGAAGTTCGGAAGGAAACCAAATACCTTAAGATCCAATACTCAGTGGCATAGGTGGATCCCACAGAAT 
GAATGTCTTATCCTAGATCATCCCCAAACCAAAGCCTTTATAACTTATGGTGGAACAAATAGCATCTATGAG 
ATG ATC TAC CGTGG AGT C C CTT C C ATGGGC ATT C C TTTGTT TGCGGAC C AAC AT GAT AAC AT TGCTC ACATG 
AAGGCCAAGGGAGCAGCTGTTATATTGGACTTGAGCACAAAGTCAAGTACAGATTTGCTCGATATATCTGTG 
TTCGTATCTTTATTTTTATCCTTCAGATATAAAGAGAGTGTTATGAAATTATCAAGAATTCAACATGATCAA 
CCAGTGAAGCCCCTGGATCGAGCAGTCTTCTGGATTGAATTTGTCATGCGCCACAAAGGAGCCAAACACCTT 
CGAGTTGCAGCCCGTGACCTCACCTGGTTCCAGTACCACTCTTTGGATGTGATTGGGTTTCTGCTGGCCTGT 

GTGGCAACTGTGACATTTATCATCACAAAGTGTTGTCTGTTTTGTTTCTGGAAGTTTACTAGAAAAGTGAAG 
AAGGAAAAAAGGGATTAGTTAT 



In a search of public sequence databases, the NOV40 nucleic acid sequence, located on 
chromosome 4, has 1305 of 1606 bases (81%) identical to a gb:GENBANK- 
ID:HUMUDPGTA|acc:J05428.1 mRNA from Homo sapiens (Human 3,4-catechol estrogen 
UDP-glucuronosyltransferase mRNA, complete cds) (E = 6.4e 217 ). 

The disclosed NOV^O polypeptide (SEQ ID NO: 152) encoded by SEQ ID NO: 151 has 
533 amino acid residues and is presented in Table 40B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV40 has no signal peptide and is 
likely to be localized to the endoplasmic reticulum (membrane) with a certainty of 0.8200. 
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Alternatively, NOV40 may also localize to the plasma membrane with a certainty of 0.4600, to 
the microbody (peroxisome) with a certainty of 0.3012, or to the Iysosome (membrane) with a 
certainty of 0.2000. The most likely cleavage site for NOV40 is between positions 20 and 21: 
SSS-GS. 

5 

Table 40B. Encoded NOV40 protein sequence (SEQ ID NO:152). 

MAMKWTSVLLLIQLSYYSSSGSCGNVPLWPMEYSPWMNIKTILDKLMQISHEVTVLTLSASILVDPNITSVT 
KFEVYS I SVI KDDFAGFFFTQQI TKWIHDLPKHI FWFKCVPFKNI LWE YSGYTEKFFKDWLNKKLMTNLQE 
SRSDWHANAIGPFGELLAELLKISFVYSLHFSPGYTFEKYSGGFLLPPSYGAVILSELSGSMTFMETVRNI 
IYVFYFDFWFQTFDMKKGDQFYSEVLGKSCFLSEIMGKAEMWLIPJSTYWYLEFPRPLLPNFEFVVRLYCKPVN 
PLPKEKMEEFAQSSDEDGWFSLESAVQNLTEEKADLITSALAQIPQKVMKFGRKPNTLRSNTQWHRWIPQN 
ECLILDHPQTKAFITYGGTNSI YEMI YRGVPSMGIPLFADQHDNIAHMKAKGAAVILDLiSTKSSTDLLDISV 
FVSLFLSFRYKESVMKLSRIQHDQPVKPLDRAVFWIEFVMRHKGAKHLRVAARDLTWFQYHSLDVIGFLLAC 
VATVTFI ITKCCLFCFWKFTRKVKKEKRP * 

A search of sequence databases reveals that the NOV40 amino acid sequence has 353 
of 533 amino acid residues (66%) identical to, and 412 of 533 amino acid residues (77%) 
similar to, the 529 amino acid residue ptnr:SWISSPROT-ACC:P16662 protein from Homo 
1 0 sapiens (Human) (UDP-Glucuronosyltransferase 2b7 Precursor, Microsomal (EC 2.4.1.17) 
(UDPGT) (3,4-Catechol Estrogen Specific) (UDPGTH-2)) (E = 7.2e 185 ). 

NOV40 is predicted to be expressed in at least the following tissues: liver tissue. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
15 Literature sources, and/or RACE sources. 

NOV40 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 40C. 



Table 40C. BLAST results for NOV40 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%> 


Positives 
(%) 


Expect 


gi | 4507825 | ref |NP_0 
01065 . 1 | 
(NM_001074) 


UDP 

gly cosyl t rans f er a 
se 2 family, 
polypeptide B7 


529 


353/536 
(65%) 


412/536 
(76%) 


0.0 


gi | 617 5 0 83 | sp| P0613 
3 |UDB4_HUMAN 


UDP- 

GLUCURONO S Y LTRAN S 
FERASE 2B4 
PRECURSOR , 
MICROSOMAL 
(UDPGT) 

( HYODEOX Y CHOL I C 
ACID) (HLUG25) 
(UDPGTH-1) 


528 


355/536 
(66%) 


409/536 
(76%) 


0 . 0 


gi|484383 |pir| j JN06 
19 


glucuronosyl trans 
ferase (EC 
2.4.1.17) 2B-4 
precursor - human 


528 


354/536 
(66%) 


408/536 
(76%) 


0.0 
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gi| 3153 832 |gb|AAC95 
UUz .1 | l/\r U b4 ^ uu ; 


UDP- 

glucuronosyl trans 
ferase 2B4 
precursor [Homo 
sapi ens] 


528 


354/536 
(66%) 


409/536 
(76%) 


0.0 


gi | 4 079707 |gb| AAC98 
726. 1| (AF016310) 


UDP 

glucuronosyl trans 
ferase [Macaca 
f ascicularis] 


529 


351/536 
(65%) , 


410/536 
(76%) 


0.0 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 40D. In the ClustalW alignment of the NOV40 protein, 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 40D. ClustalW Analysis of NOV40 



1) Novel NOV40 (SEQ ID NO: 152) 

2) gi I 45,07825 I ref | NP_001065 . 1 | (NM_001074) UDP glycosyltransf erase 2 family, 
polypeptide B7 (SEQ ID NO: 502) 

3) gi I 6175083 | sp | P0613 3 | UDB4_HUMAN UDP -GLUCURONOSYLTRANS FERASE 2B4 PRECURSOR 
MICROSOMAL (UDPGT) (HYODEOXYCHOLIC ACID) (HLUG25) (UDPGTH-1) (SEQ ID NO: 503) 

4) gi I 484383 I pir J |JN0619 glucuronosyl transferase (EC 2.4.1.17) 2B-4 precursor - 
human (SEQ ID NO: 504) 

5) gi|3153832|gb|AAC95002.l| (AF0642.00) UDP-glucuronosyltransf erase 2B4 precursor 
[Homo sapiens] (SEQ ID NO: 505) 

6) gi I 4079707 |gb I AAC98726.1| (AF016310) UDP-glucuronosyltransf erase [Macaca 
f ascicularis] (SEQ ID NO: 506) 



NOV40 

gi|4507825| 
gi I 6175083 j 
gi |484383 | 
gi|3153832 | 
gi j 4079707 | 



1 

1 
1 
1 
1 
1 



NOV40 


61 


SI 


L 




gi|4507825| 


61 


SI 


L 


£ 


gi j 6175083 | 


61 


SI 


S 


FD 


gi | 484383 | 


61 


SI 


S 


FD 


gi|3153832 | 


61 


SI 


S 


FD 


gi|4079707| 


61 


SI 




FD 



NOV40 

gi |4507825 | 
gi j 6175083 | 
gi [484383 | 
gi|3153832 | 
gi |4079707 | 



121 
119 
119 
119 
119 
119 
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190 



200 



NOV40 

gi | 4507825 | 
gi | 6175083 j 
gi | 484383 | 
gi|3153832 | 
gi | 4079707 | 



NOV40 

gi | 4507825 | 
gi|6175083 | 
gi j 484383 | 
gi|3153832 | 
gi | 4079707 | 



NOV4 0 

gi|4507825 | 
gi I 6175083 | 
gi|484383 | 
gi | 3153832 | 
gi|4079707 | 



gi 
gi 
gi 
gi 



4507825 | 
6175083 j 
484383 | 
3153832 | 
4079707 | 



181 TF 

179 TF 

179 AI 

179 AI 

179 AI 

179 NF 



210 220 



_L 



230 



I 



240 



EKgSGG^i3PPSY^V|i3SELS^MTF|E^j^IYVBYFlFWFQ£3FDMKKSDQFYSE 
EKHSGG^FPPSYVPVVMSELfDQMTFSERVKNMIYVLYFgFWF^IFDMKKWDQFYSE 

ekhsgg^fppsyvpvvmselsdqmtfHervknmiyvlyf|fwfqifdmkkwdqfyse 

EKHSGGfflLFPPSYVPVVMSELSDQMTFyERVKNMIYVLYFraFWFQIFDMKKWDQFYSE 
EKHSGGfflLFPPSYVPWMSELSDQMTF|ERVKNMIYVLYFiFWFQIFDMKKWDQFYSE 

ekhSggBlfppsyvpvvmselsd^mtf*ervknmiy.|^lyfBfBfqiS^mkkwdqfyse 



240 
238 
238 
238 
238 
238 



250 



260 



270 



280 



290 



300 



NOV40 


301 


gi| 4507825 | 


298 


gi j 6175083 | 


298 


gi | 484383 | 


298 


gi| 3153832 | 


298 


gi j 4079707 j 


298 




NOV4 0 


418 


gi|4507825 | 


416 


gi j 6175083 j 


416 


gi|484383 | 


416 


gi|3153832 | 


416 


gi |4079707 | 


416 


NOV40 


478 



430 440 

— I- - 
5ISVFVJSUFLSFF 



450 




460 
- I - • 



480 



HDQPVKPLDRAVF 
HDQPVKPLDRAVF 
HDQPVKPLDRAVF 
HDQPVKPLDRAVF 
HDQPVKPLDRAVF 
HDQPVKPLDRAVF' 




520 



530 




Table 40E lists the domain descriptions from DOMAIN analysis results against 
NOV40. This indicates that the NOV40 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 40E Domain Analysis of NOV40 

gnl | Pfam|pf am00201, UDPGT, UDP-glucoronosyl and UDP-glucosyl 

transferase. (SEQ ID NO: 835) 

CD-Length = 501 residues, 100.0% aligned 

Score = 587 bits (1514), Expect = 4e-169 



N0V4 0 : 


24 


Sbjct : 


1 


NOV4 0: 


84 


Sbjct: 


61 


NOV4 0: 


137 


Sbjct : 


110 


NOV4 0 : 


197 


Sbjct: 


170 


NOV4 0 : 


257 


Sbjct: 


229 


NOV4 0 : 


316 


Sbjct : 


288 


NOV4 0 : 


374 


Sbjct : 


346 


NOV4 0 : 


434 


Sbjct : 


405 


NOV4 0 : 


494 


Sbjct: 


464 



GNVPLWPME YS PWMNIKT I LDKLMQI SHEVTVLTLSAS I LVDPNI TS VTKFEVYS I S VI K 83 

I I +111+ I MM II +1 + 1 MUM MMI+ I I Ml I I I 

GKVLWPMDGSHWMNMKGILLELVQRGHEVTVLRPSASILIGPAKPSNLKFETYPDSATK 60 



DDFAGFFFTQQITKWIHDLPKHIFWFKCVPFKNILW EYSGYTEKFFKDWLNK 13 6 

I I + + M +1 Ml |++| II 

- -WFMEAAEAGTVWSYFSALQEYSDGARVSCKELVGNK 109 



EELENLF PKRVMN - 



II I I I II Ml 1+ + 



I I MM I Ml II I 



II 



M I I + I++ + I 



I I 



I l + l 



M + MM MMIIMM 11 + 



I I 



1 1 



Ml Ml I I 



1+ M 11+ I MIIIIIIM+ +1 I ll + ll +11+ +I + III+ +1 Il + I + M 

SNIPEEKANEIASALAQIPQKVLWRFDGTKPSTLGNNTRLVKWLPQND--LLGHPKTRAF 34 5 
I T YGGTNS I YEM I YRGVPSMG I P L FADQHDN I AHMKAKGAAV ILDLSTKSS TDLLDISVF 4 33 

+ 1+ l + l +11 I III +I + III II II. II + MIMI I++ I +1 111 + 

VTHAGSNGVYEAICHGVPMVGMPLFGDQMDNAKHMEAKGAAVTLNVLTMTSEDLLNALK- 404 
VSLFLSFRYKESVMKLtSRIQHDQPVKPLDRAVFWIEFVMRHKGAKHLRVAARDLTWFQYH 4 93 

MI++I + II I 1 1 1 1 1 II I 1 1 I 1 1 1 1 1 1 1 1 M I M I M I II MM + IM 

- TVINDPS YKENIMRLSS I HHDQPVKPLDRAVFW I EFVMRHKGAKHLRPAAHDLTWYQYH 463 
SLDVIGFLLACVATVTFI I TKCCLFCFWKFTRKVKKEK 531 

II II I M M M M 1 1 II Mill + 11 11+ I 

SLDVIGFLLACVATVAFI TFKCCLFGYRKFVGKKKRVK 501 

The UDP-glucuronosyltransferases, a group of isoenzymes located primarily in hepatic 
endoplasmic reticulum and nuclear envelope, are encoded by a large multigene family that has 
evolved to produce catalysts with differing but overlapping substrate specificities. Two 
subfamilies are recognized by sequence identities. UGT1 consists of at least 4 isoenzymes that 
catalyze the glucuronidation of phenols and bilirubin. All 4 map to chromosome 2 and 
probably derive from the same gene (UGT1). The UGT2 family contains at least 5 members 
catalyzing steroid or bile acid glucuronidation. Members of the subfamily share 65 to 90% 
amino acid sequence identity. However, unlike the phenol UGT cDNAs, where the high 
degree of identity is concentrated in the 3-prime region of the cDNA, the steroid UGTs have a 
high degree of sequence homology throughout the cDNA. The disclosed NOV40 nucleic acid 
of the invention encoding a UDP Glycosyltransferase -like protein includes the nucleic acid 
whose sequence is provided in Table 40A or a fragment thereof. The invention also includes a 
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mutant or variant nucleic acid any of whose bases may be changed from the corresponding 
base shown in Table 40A while still encoding a protein that maintains its UDP[ 
Glycosyltransferase -like activities and physiological functions, or a fragment of such a 
nucleic acid. The invention further includes nucleic acids whose sequences are 
complementary to those just described, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 19 percent of the bases may be so changed. 

The disclosed NOV40 protein of the invention includes the UDP Glycosyltransferase - 
like protein whose sequence is provided in Table 40B. The invention also includes a mutant 
or variant protein any of whose residues may be changed from the corresponding residue 
shown in Table 40B while still encoding a protein that maintains its UDP Glycosyltransferase 
-like activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 35 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fab>2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this UDP Glycosyltransferase -like 
protein (NOV40) is a member of a "UDP Glycosyltransferase family". Therefore, the NOV40 
nucleic acids and proteins identified here may be useful in potential therapeutic applications 
implicated in (but not limited to) various pathologies and disorders as indicated below. The 
potential therapeutic applications for this invention include, but are not limited to: protein 
therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV40 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in brain disorders including Crigler-Najjar syndrome, 
Gilbert syndrome, and/or other diseases and pathologies. 
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NOV40 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV40 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
5 NOVX Antibodies" section below. The disclosed NOV40 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 
10 NOV41 

NOV41 includes three novel adrenal secretory serine protease-like proteins disclosed 
below. The disclosed sequences have been named NOV41a and NOV41b. 

NOV41a 

A disclosed NOV4 la nucleic acid of 2155 nucleotides (also referred to as CG56680- 
15 01) encoding a Sodium/Hydrogen Exchanger 4-like protein is shown in Table 41 A. An open 
reading frame was identified beginning with a ATG initiation codon at nucleotides 1 6-18 and 
ending with a TAG codon at nucleotides 2140-2142. The start and stop codons are shown in 
bold in Table 41 A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 41 A. NOV41a nucleotide sequence (SEQ ID NO: 153). 

GAGAAGCCCACAGGA ATGGCTCTGCAGATGTTCGTGACTTACAnTrPTTnnAATT^TT-rnr'TAr'TGCTAGTG 
GCTCTTGAGTGTTCTGAAGCATCTTCTGATTTGAATGAATCTGCAAATTCCACTGCTCAGTATGCATCTAAC 
GCTTGGTTTGCTGCTGCCAGCTCAGAGCCAGAGGAAGGGATATCTGTTTTTGAACTGGATTATGACTATGTG 
CAAATTCCTTATGAGGTCACTCTCTGGATACTTCTAGCATCCCTTGCAAAAATAGGTTTCCACCTCTACCAC 
AGGCTGCCAGGCCTCATGCCAGAAAGCTGCCTCCTCATCCTGGTGGGGGCGCTGGTGGGCGGCATCATCTTC 
GGCACCGACCACAAATCGCCTCCGGTCATGGACTCCAGCATCTACTTCCTGTATCTCCTGCCACCCATCGTT 
CTGGAGGGCGGCTACTTCATGCCCACCCGGCCCTTCTTTGAGAACATCGGCTCCATCCTGTGGTGGGCAGTA 
TTGGGGGCCCTGATCAACGCCTTGGGCATTGGCCTCTCCCTCTACCTCATCTGCCAGGTGAAGGCCTTTGGC 
CTGGGCGACGTCAACCTGCTGCAGAACCTGCTGTTCGGCAGCCTGATCTCCGCCGTGGACCCAGTGGCCGTG 
CTAGCCGTGTTTGAGGAAGCGCGCGTGAACGAGCAGCTCTACATGATGATCTTTGGGGAGGCCCTGCTCAAT 
GATGGCATTACTGTGGTGGTCTTATACAATATGTTAATTGCCTTTACAAAGATGCATAAATTTGAAGACATA 
GAAACTGTCGACATTTTGGCTGGATGTGCCCGATTCATCGTTGTGGGGCTTGGAGGGGTATTGTTTGGCATC 
GTTTTTGGATTTATTTCTGCATTTATCACACGTTTCACTCAGAATATCTCTGCAATTGAGCCACTCATCGTC 
TTCATGTTCAGCTATTTGTCTTACTTAGCTGCTGAAACCCTCTATCTCTCCGGCATCCTGGCGATCACAGCC 
TGCGCAGTAACAATGAAAAAGTACGTGGAAGAAAACGTGTCCCAGACATCATACACGACCATCAAGTACTTC 
ATGAAGATGCTGAGCAGCGTCAGCGAGACCTTGATCTTCATCTTCATGGGTGTGTCCACTGTGGGCAAGAAT 
CACGAGTGGAACTGGGCCTTCATCTGCTTCACCCTGGCCTTCTGCCAAATCTGGAGAGCCATCAGTGTATTT 
GCTCTCTTCTATATCAGTAACCAGTTTCGGACTTTCCCCTTCTCCATCAAGGACCAGTGCATCATTTTCTAC 
AGTGGTGTTCGAGGAGCTGGAAGTTTTTCACTTGCATTTTTGCTTCCTCTGTCTCTTTTTCCTAGGAAGAAA 
ATGTTTGTCACTGCTACTCTAGTAGTTATATACTTTACTGTATTTATTCAGGGAATCACAGTTGGCCCTCTG 
GTCAGGTACCTGGATGTTAAAAAAACCAATAAAAAAGAATCCATCAATGAAGAGCTTCATATTCGTCTGATG 
GATCACTTAAAGGCTGGAATCGAAGATGTGTGTGGGCACTGGAGTCACTACCAAGTGAGAGACAAGTTTAAG 
AAGTTTGATCATAGATACTTACGGAAAATCCTCATCAGAAAGAACCTACCCAAATCAAGCATTGTTTCTTTG 
TACAAGAAGCTGGAAATGAAGCAAGCCATCGAGATGGTGGAGACTGGGATACTGAGCTCTACAGCTTTCTCC 
ATAC C C CAT C AGGC C C AG AGG AT AC AAGG AAT C AAAAG AC T T T C CC CTG AAG ATGTGGAGTC C AT AAGGG AC 
ATT CTG AC AT C C AAC ATGT AC C AAGTT CGG CAAAGGAC C CTGT C CT AC AAC AAATAC AACCT C AAAC C CC AA 
ACAAGTGAGAAGCAGGCTAAAGAGATTCTGATCCGCCGCCAGAACACCTTAAGGGAGAGCATGAGGAAAGGT 
CACAGCCTGCCCTGGGGAAAGCCGGCTGGCACCAAGAATATCCGCTACCTCTCCTACCCCTACGGGAATCCT 
CAGTCTGCAGGAAGAGACACAAGGGCTGCTGGGTTCTCAGGTAAGCTGCCCACCTGGCTGCTCTGCTGCTTT 
TCTGTAGAGTCAGGTGGTAAATATCTGGGGGTGTGGGCCAAGAGGCAACATTAAGAACATTATGTAG 
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In a search of public sequence databases, the NOV41a nucleic acid sequence, located 
on chromosome 2, has 1820 of 2156 bases (84%) identical to a gbrGENBANK- 
ID:RATNHEXIV|acc:M85301.1 mRNA from Rattus norvegicus (Rat sodium-hydrogen 
exchange protein-isoform 4 (NHE-4) mRNA, complete cds) (E = 6.4e" 217 ). 

The disclosed NOV41a polypeptide (SEQ ID NO: 154) encoded by SEQ ID NO: 1 53 
has 708 amino acid residues and is presented in Table 41B using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV41a has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.8200. Alternatively, 
NOV41a may also localize to the Golgi body with a certainty of 0.4600, to the endoplasmic 
reticulum (membrane) with a certainty of 0.3700, or to the endoplasmic reticulum (lumen) 
with a certainty of 0.1000. The most likely cleavage site for NOV41a is between positions 26 
and 27: SEA-SS. 



Table 41B. Encoded NOV41a protein sequence (SEQ ID NO:154). 

MALQMFVTYSPWNCL.LLLVALECSEASSDLNESANSTAQYASNAWFAAASSEPEEGISVFELDYDYVQIPYE 
VTLWILLASLAKIGFHLYHRLPGLMPESCLLILVGALVGGIIFGTDHKSPPVMDSSIYFLYLLPPIVLEGGY 
FMPTRPFFENIGSILWWAVLGALINALGIGLSLYLICQVKAFGLGDVNLLQNLLFGSLISAVDPVAVLAVFE 
EARWEQLYMMIFGEALLNDGITVWLYNMLIAFTKMHKFEDIETVDILAGCARFIWGLGGVLFGIVFGFI 
SAFITRFTQNISAIEPLIVFMFSYLSYLAAETLYLSGILAITACAVTMKKYVEENVSQTSYTTIKYFMKMLS 
SVSETLIFIFMGVSTVGKNHEWNWAFICFTLAFCQIWRAISVFALFYISNQFRTFPFSIKDQCIIFYSGVRG 
AGSFSLAFLLPLSLFPRKKMFVTATLWIYFTVFIQGITVGPLVRYLDVKKTNKKESINEELHIRLMDHLKA 
GIEDVCGHWSHYQVRDKFKKFDHRYLRKILIRKNLPKSSIVSLYKKLEMKQAIEMVETGILSSTAFSIPHQA 
QRIQGIKRLSPEDVESIRDILTSNMYQVRQRTLSYNKYNLKPQTSEKQAKEILIRRQNTLRESMRKGHSLPW 
GKPAGTKNIRYLSYPYGNPQSAGRDTRAAGFSGKLPTWLLCCFSVESGGKYLGVWAKRQH 

A search of sequence databases reveals that the NOV41a amino acid sequence has 599 
of 688 amino acid residues (87%) identical to, and 631 of 688 amino acid residues (91%) 
similar to, the 717 amino acid residue ptnr:SWISSPROT-ACC:P26434 protein from Rattus 
norvegicus (Rat) (Sodium/Hydrogen Exchanger 4 (NA(+)/H(+) Exchanger 4) (NHE-4)) (E = 
0.0). 

NOV41a is predicted to be expressed in at least the stomach. This information was 
derived by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or 
RACE sources. 
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In addition, the sequence is predicted to be expressed in stomach, colon and small 
intestine; lesser amounts in kidney, brain, uterus and skeletal muscle because of the expression 
pattern of (GENBANK-ID: gb:GENBANK-ID:RATNHEXIV|acc:M85301.1) a closely related 
Rat sodium-hydrogen exchange protein-isoform 4 (NHE-4) mRNA, complete cds homolog. 

NOV41b 

In the present invention, the target sequence identified previously, NOV41a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 
examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in . 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, or by translated homology of the predicted exons to closely 
related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV41b. This differs from the previously 
identified sequence (NOV41a) in having 17 different aminoacids. 

A disclosed NOV41b nucleic acid of 2436 nucleotides (also referred to as CG56680- 
02) encoding a Sodium/Hydrogen Exchanger 4-like protein is shown in Table 41C. An open 
reading frame was identified beginning with a ATG initiation codon at nucleotides 86-88 and 
ending with a TAA codon at nucleotides 2369-2371 . The start and stop codons are shown in 
bold in Table 41C, and the 5' and 3' untranslated regions, if any, are underlined. 
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Table 41C. NOV41b nucleotide sequence (SEQ ID NO:155). 

ATGCAGTCACTCTCTAGAAGCCTCCCCGACTTCAGATGTGTGGCACACATCCACACAGGGGTGTAGGTAGGA 

GAAGCCCACAGGA ATGGCTCTGCAGATGTTCGTGACTTArAnTrrTTrtHAa-rTrz-r-rT^CTACTGCTACTCGC 

TCTTGAGTGTTCTGAAGCATCTTCTGATTTGAATGAATCTGCAAATTCCACTGCTCAGTATGCATCTAACGC 

TTGGTTTGCTGCTGCCAGCTCAGAGCCAGAGGAAGGGATATCTGTTTTTGAACTGGATTATGACTATGTGCA 

AATTCCTTATGAGGTCACTCTCTGGATACTTCTAGCATCCCTTGCAAAAATAGGCTTCCACCTCTACCACAG 

GCTGCCAGGCCTCATGCCAGAAAGCTGCCTCCTCATCCTGGTGGGGGCGCTGGTGGGCGGCATCATCTTCGG 

CACCGACCACAAATCGCCTCCGGTCATGGACTCCAGCATCTACTTCCTGTATCTCCTGCCACCCATCGTTCT 

GGAGGG CGG CT ACT T C ATG C CC ACC CGG C C CTT CTTTG AGAAC ATCGGCT C CAT C CTGTGGTGGG C AGTATT 

GGGGGCCCTGATCAACGCCTTGGGCATTGGCCTCTCCCTCTACCTCATCTGCCAGGTGAAGGCCTTTGGCCT 

GGGCGACGTCAACCTGCTGCAGAACCTGCTGTTCGGCAGCCTGATCTCCGCCGTGGACCCAGTGGCCGTGCT 

AGCCGTGTTTGAGGAAGCGCGCGTGAACGAGCAGCTCTACATGATGATCTTTGGGGAGGCCCTGCTCAATGA 

TGGCATTACTGTGGTGTTATACAATATGTTAATTGCCTTTACAAAGATGCATAAATTTGAAGACATAGAAAC 

TGTCGACATTTTGGCTGGATGTGCCCGATTCATCGTTGTGGGGCTTGGAGGGGTATTGTTTGGCATCGTTTT 

TGGATTTATTTCTGCATTTATCACACGTTTCACTCAGAATATCTCTGCAATTGAGCCACTCATCGTCTTCAT 

GTTCAGCTATTTGTCTTACTTAGCTGCTGAAACCCTCTATCTCTCCGGCATCCTGGCGATCACAGCCTGCGC 

AGTAACAATGAAAAAGTACGTGGAAGAAAACGTGTCCCAGACATCATACACGACCATCAAGTACTTCATGAA 

GATGCTGAGCAGCGTCAGCGAGACCTTGATCTTCATCTTCATGGGTGTGTCCACTGTGGGCAAGAATCACGA 

GTGGAACTGGGCCTTCATCTGCTTCACCCTGGCCTTCTGCCAAATCTGGAGAGCCATCAGTGTATTTGCTCT 

CTTCTATATCAGTAACCAGTTTCGGACTTTCCCCTTCTCCATCAAGGACCAGTGCATCATTTTCTACAGTGG 

TGTTCGAGGAGCTGGAAGTTTTTCACTTGCATTTTTGCTTCCTCTGTCTCTTTTTCCTAGGAAGAAAATGTT 

TGTCACTGCTACTCTAGTAGTTATATACTTTACTGTATTTATTCAGGGAATCACAGTTGGCCCTCTGGTCAG 

GTACCTGGATGTTAAAAAAACCAATAAAAAAGAATCCATCAATGAAGAGCTTCATATTCGTCTGATGGATCA 

CTTAAAGGCTGGAATCGAAGATGTGTGTGGGCACTGGAGTCACTACCAAGTGAGAGACAAGTTTAAGAAGTT 

TGATCATAGATACTTACGGAAAATCCTCATCAGAAAGAACCTACCCAAATCAAGCATTGTTTCTTTGTACAA 

GAAGCTGGAAATGAAGCAAGCCATCGAGATGGTGGAGACTGGGATACTGAGCTCTACAGCTTTCTCCATACC 

CCATCAGGCCCAGAGGATACAAGGAATCAAAAGACTTTCCCCTGAAGATGTGGAGTCCATAAGGGACATTCT 

GACATCCAACATGTACCAAGTTCGGCAAAGGACCCTGTCCTACAACAAATACAACCTCAAACCCCAAACAAG 

TG AG AAGC AGG CT AAAGAG ATTCTG AT C CGC CG C C AG AAC AC C TT AAGGG AGAG C ATG AGG AAAGGT C AC AG 

CCTGCCCTGGGGAAAGCCGGCTGGCACCAAGAATATCCGCTACCTCTCCTACCCCTACGGGAATCCTCAGTC 

TGCAGGAAGAGACACAAGGGCTGCTGGGTTCTCAGGTAAGCTGCCCACCTGGCTGCTCCTTTGGTTGAGGTT 

CGGTCGAGGTGGACAGCTGACCATGGACACGGCAGGGACCATCACAGGTCCCATAGTCCTTTGCTCCAAAAA 

AAATAGTGTTATTGTCCACAAGATTGTTTTGGTGTTTCTCAAGAGTCTGTCTTCCTATAACTGTGAAAGGAG 

GATTTCTGGAATTCAGAAGAGAGCTATTGAGTTTGCTGTGTTGAAGCTATTAAACATGGATCTATAAGCAGC 

AGGAAGATTTTTTCCAAGGACTGGGAGCAAACTTGCAGGCTCTGCCATGTACTTATTGTG " 



In a search of public sequence databases, the NOV41b nucleic acid sequence, located 
on chromosome 2, has 1818 of 2163 bases (84%) identical to a gb:GENBANK- 
ID:RATNHEXIV|acc:M85301.1 mRNA from Rattus norvegicus (Rat sodium-hydrogen 
5 exchange protein-isoform 4 (NHE-4) mRNA, complete cds) (E = 0.0). 

The disclosed NOV41b polypeptide (SEQ ID NO: 1 56) encoded by SEQ ID NO: 155 
has 761 amino acid residues and is presented in Table 41 D using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV41 b has no signal peptide 
and is likely to be localized to the plasma membrane with a certainty of 0.8200. Alternatively, 
10 NOV41b may also localize to the Golgi body with a certainty of 0.4600, to the endoplasmic 
reticulum (membrane) with a certainty of 0.3700, or to the endoplasmic reticulum (lumen) 
with a certainty of 0.1000. The most likely cleavage site for NOV41b is between positions 26 
and 27: SEA-SS. 



Table 41D. Encoded NOV41b protein sequence (SEQ ID NO:156). 

M^QMFVTYSPWNCLLLLVALECSEASSDLNESANSTAQYASNAWFAAASSEPEEGISVFELDYDYVQIPYE 
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VTLWILLASLAKIGFHLYHRLPGLMPESCLLILVGALVGGIIFGTDHKSPPVMDSSIYFLYLLPPIVLEGGY 
FMPTRPFFENIGSILWWAVLGALINALGIGLSLYLICQVKAFGLGDVNLLQNLLFGSLISAVDPVAVLAVFE 
EARVNEQLYMM I FGEALLNDGI T WLYNML I AFTKMHKFEDI ETVDI LAGCARFI WGLGGVLFGI VFGF I S 
AFITRFTQNISAIEPLI VFMFSYLSYLAAETLYLSGILAITACAVTMKKYVEENVSQTSYTTIKYFMKMLSS 
VSETLIFIFMGVSTVGKNHEWNWAFICFTLAFCQIWRAISVFALFYISNQFRTFPFSIKXJQCIIFYSGVRGA 
GSFSLAFLLPLSLFPRKKMFVTATLWIYFTVFIQGITVGPLVRYLDVKKTNKKESINEELHIRLMDHLKAG 
IEDVCGHWSHYQVRDKFKKFDHRYLRKILIRKNLPKSSIVSLYKKLEMKQAIEMVETGILSSTAFSIPHQAQ 
RI QG I KRLS PEDVES I RDI LTSNMYQVRQRTLS YNKYNLKPQTSEKQAKE I L I RRQNTLRESMRKGHSL PWG 
KPAGTKNIRYLSYPYGNPQSAGRDTRAAGFSGKLPTWLLLWLRFGRGGQLTMDTAGTITGPIVLCSKKNSVI 
vHKIVLVFLKSLSSYNCERRI SG I QKRAIEFAVLKLLNMPL 



A search of sequence databases reveals that the NOV41b amino acid sequence has 606 
of 717 amino acid residues (84%) identical to, and 641 of 717 amino acid residues (89%) 
similar to, the 717 amino acid residue ptnr:SWISSPROT-ACC:P26434 protein from Rattus 
norvegicus (Rat) (Sodium/Hydrogen Exchanger 4 (NA(+)/H(+) Exchanger 4) (NHE-4)) (E = 
0.0). 

NOV41b is predicted to be expressed in at least adrenal gland, bone marrow, brain - 
amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea and uterus. . 

NOV41a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 4 IE. 



Table 41E. BLAST results for NOV41a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi| 127814 |sp|P26434 
| N AH 4 _ RAT 


SODIUM /HYDROGEN 
EXCHANGER 4 

(NA(+) /H(+) 
EXCHANGER 4) 

(NHE-4) 


717 


599/688 
(87%) 


631/688 
(91%) 


0 . 0 


gi | 134 6 65 8 | sp | P4 87 6 
3 | NAH2 RAT 


SODIUM /HYDROGEN 
EXCHANGER 2 
(NA(+)/H(+) 
EXCHANGER 2) 
(NHE-2) (H7) 


813 


421/659 
(63%) 


523/659 
(78%) 


0.0 


gi | 17 0 92 22 | sp | P504 8 
2 j NAH2RABIT 


SODIUM /HYDROGEN 
EXCHANGER 2 

(NA(+) /H(+) 
EXCHANGER 2) 

(NHE-2) 


809 


419/659 
(63%) 


522/659 
(78%) 


0 . 0 


gi | 15529998 | ref |NP 
003039. 2| 
<NM 003048) 


solute carrier 
family 9 

( sodium/hydrogen 
exchanger) , 
isoform 2 [Homo 
sapiens} 


812 


405/611 
(66%) 


499/611 
(81%) 


0.0 
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gi|698l560|ref |NP 0 


solute carrier 


697 


372/560 


457/560 


0.0 


36785.1) 


family 9 




(66%) 


(81%) 


(NM 012653) 


( sodium/hydrogen 










exchanger 2 ) , 












antiporter 2, 












Na+/H+ (Na+/H+ 












exchanger 2) 












[Rattus 












norvegicus] 











The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 41F. In the ClustalW alignment of the NOV41 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 41F. ClustalW Analysis of NOV41 

1) Novel NOV41a (SEQ ID NO: 154) 

2) Novel NOV41b (SEQ ID NO: 15 6) 

3) gi j 12 78 14 | sp | P2 64 34 | NAH4_RAT SODIUM/HYDROGEN EXCHANGER 4 (NA( + )/H( + ) EXCHANGER 

4) (NHE-4) (SEQ ID NO:507) 

4) gi|l346658|sp|P48763 | NAH2_RAT SODIUM/ HYDROGEN EXCHANGER 2 (NA(+)/H(+) EXCHANGER 
2) (NHE-2) (H7) (SEQ ID NO:508) 

5) gi | 1709222 | sp| P50482 |NAH2_RABIT SODIUM /HYDROGEN EXCHANGER 2 (NA(+)/H(+) 
EXCHANGER 2) (NHE-2) (SEQ ID NO:509) 

6) gi|l5529998|ref | NP_0 03 03 9 . 2 | (NM_003048) solute carrier family 9 
(sodium/hydrogen exchanger), isoform 2 [Homo sapiens] (SEQ ID NO:510) 

7) gi| 6981560 |ref | NP_03 6785 . 1 | (NM_012653) solute carrier family 9 (sodium/hydrogen 
exchanger 2), antiporter 2, Na+/H+ (Na+/H+ exchanger 2) [Rattus norvegicus] (SEQ ID 
NO: 511) 



NOV41a 

NOV41b 

gi | 127814 | 

gi | 1346658 | 

gi | 1709222 | 

gi|l5529998| 

gi I 6981560 | 



I 



10 



i 



I 



MALQMFOTYSPWNC 

MALQMF^T YS PWNC 

MGPAMLRAFSSWKVv 

mgpsgtahrMraplswi 
mesagtgrsSrtppprl 
meplgnwrsIraplppm 




30 

IC^eassdHnB 

IcgEAgSD^N a 

JtcleaSsy; 



40 



i-gdv^GPAG. 
El-SdvAGPVG 




50 
.|.. 
lAQY&N^ 
|AQYASNAW 



SS P^GQQT PDAp 

|^lldapgarga^npp^paS t " 

IliLNAPKAMGlKsPL^Pr fJ " 

illnapramgtBss ppBp 




NOV41a 

NOV41b 

gi | 127814 | 

gi 1 1346658 | 

gi | 1709222 | 

gi|l5529998| 

gi|6981560 | 



NOV41a 
NOV41b 




109 
109 
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gi 1 127814 I 
gi|l346658| 
gi|l709222 j 
gi | 15529998 | 
gi j 6981560 | 



NOV41a 

NOV41b 

gi | 127814 | 

gi|l346658| 

gi|l709222 j 

gi | 15529998 | 

gi | 6981560 | 




190 




GIGLSL 
GIGLSL 
GIGLSL 
GIGLSL 
GIGISSL 

gig|sl 

GIGLSL 



200 




210 



220 



- I 



LLQNLLFGSLISAVDPVAVLAVFE 
LLQNL L FG S L I S A VD P VAVL A V F E 
LLQNLLFGSLISAVDPVAVLAVFE 
LLQNLLFGSLI SAVDPVAVLAVFE 
LLQNLLFGSLISAVDPVAVLAVFE 
LLQNLLFGSLISAVDPVAVLAVFE! 
LLQNLLFGSLI SAVDP VAVLAVFe! 




NOV41a 

NOV41b 

gi | 127814 | 

gi|l346658| 

gi|l709222 j 

gi|l5529998| 

gi j 6981560 | 



NOV41a 

NOV41b 

gi | 127814 | 

gi|l346658| 

gi 1 1709222 | 

gi (15529998 | 

gi | 6981560 | 



NOV41a 

NOV41b 

gi | 127814 | 

gi|l346658| 

gi | 1709222 | 

gi |15529998| 

gi|6981560| 




370 



380 



390 



400 



410 



349 
348 
348 
357 
356 
356 
241 



YTTIKYFMKMLSSVSETLI 
YTTIKYFMKMLSSVSETLI 
YTTIKYFMKMLSSVSETLI 
YTTIKYFMKMLSSVSETLI 
YTTIKYFMKMLSSVSETLI 
YTTIKYFMKMLSSVSETLI 
YTTIKYFMKMLSSVSETLI! 



FI FMGVSTVGKNHEWNWAFMCFTLAFCffll W 
FI FMGVSTVGKNHEWNWAFyCFTLAFcffll W 
FIFMGVSTVGKNHEWNWAFVCFTLAFcffllW: 
FIFMGVSTVGKNHEWNWAFVCFTLAFCffllW 
FI FMGVSTVGKNHEWNWAFVCFTLAFCffll W 
F I FMGVSTVGKNHE WNWAFVC FTLAFcjfetw 
FI FMGVSTVGKNHEWNWAFVCFTLAFCai WRAI 
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NOV41a 

NOV41b 

gi 1 127814 | 

gijl346658| 

gi j 1709222 | 

gijl5529998| 

gi | 6981560 I 



NOV41a 

NOV41b 

gi 1 127814 | 

gi 1 1346658 | 

gi|l709222| 

gi|l5529998| 

gi j 6981560 | 



NOV41a 
NOV41b 




550 
. . I . . 



560 
I 



528 
527 



|YLRK|LIR^jgPKSSIVSLYKKLE^K[ 
aYLRK |LIR§NfflPKSSIVSLYKKLESK 



570 

1^ 



580 



\IEMffiETG 
\IEMKETG 



590 

TAFSIPHQAQgl 

tafsiphqaqHi 



600 
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gi 1 127814 I 
gi|l346658| 
gi j 1709222 j 
gijl5529998| 
gi|6981560| 



NOV41a 

10 NOV41b 

gi | 127814 | 
gi j 1346658 | 
gi | 1709222 j 
gi|l5529998| 

15 gi j 6981560 | 



NOV41a 
20 NOV41b 

gi | 127814 | 
x gi | 1346658 | 

gij 1709222 j 

gi j 15529998 | 
25 gi j 6981560 | 



NOV41a 

30 NOV41b 

gi | 127814 | 
gi j 1346658 | 
gi | 1709222 j 
gi j 15529998 | 

35 gij 6981560 | 



NOV41a 

40 NOV41b 

gi | 127814 | 
gi|l346658| 
gi | 1709222 j 
gi|l5529998| 

45 gij 6981560 | 




PTPYQSESld( 

ceT 



P||FASLND< 
jPgFASLNDC 

p||FASLNDC@e| 



610 



620 



630 



640 



650 









1 




EE 


SL 


P 


647 




SL 


P 


646 


GQ 


SL 


P 


64 6 


Pi 


SL 


N 


656 




SL 


N 


655 


Ids 


SL 


N 


655 


pk 


SL 


N 


540 



670 



680 




690 
■ - I- - 



700 



I 



710 



720 



541 re; 



[SAGRDTBA^OFSg SlLPT&ji 687 

|q J SAGJ|DT|AXGFsffl 8lPT^| 686 

^p a^garaMe^tH NPCr^ 

||KLpKKN^^^DgNS SDSDMDG - TTVLNLQPRAR: 
|KLMRrai^ADgDS SDSEADAGTTVLNLQPp" 
^"L^RRTlglMDgNSSDSDADAGTTVLNLQPI 
IKL^KKN1?vSn'XdBnS SDSDMDG - TTVLNLQ 




688 
687 
687 
716 
716 
716 
600 



708 



730 

I < 22 1§GG 

LLWLRFGRGGQLT| 
LHFLLC|^-AMVl 

PEpfflKpSSQA' 
PEQfflKHpQS 

"kkaJspa^ 

790 



750 



ZfTAGT&TG- 



PDOj 



760 

• I I - I I 

_. lYLg 

01 VLCS KKJ^S VI Vni|I VI 
J || IW S 

;WKME^DVGSARAPrasVTPAPRSKEGGTdTPg 
WKNEgDAGSGQGQ 3S P P AAPR^KEGGT^T P I 
pTPPTPH§REKGT|iTS5 

svtpaprskeggtSti 



770 



780 




VDSGRD1 
VGSARAP 



jWAKRQH 7 08 

|FlMs|Js@YNC 73 7 

[PGG@ETQP&LL 712 




800 



810 
..|.. 



I 



70Q 

73 8 ERjgiSGlQgRAIEFAVL^QlsrMDL 761 

713 CRNLH 717 

776 RC ^H S | T l DVPPKPPP p3^ RRASEp GNRKGRLGNEKP 813 

772 -G^HsETiGGRPKPPP|IJ|RRASEPGNRKSRLGSDKP 809 

7 75 SEggS|TfclPPKPPPpffi|wRASEPGSRKARFGSEKP 812 

66 0 RGgEgs|TgDGPPKPPP^|RRASEPGNRKGRLGNEKP 6 97 
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Na+/H+ antiporters are key transporters in maintaining the pH of actively metobolizing 
cells. Na+/H+ exchange proteins eject protons from cells, effectively eliminating excess acid 
from actively metabolising cells. Na+/H+ exchange activity is also crucial for the regulation of 
cell volume, and for the reabsorption of NaCl across renal, intestinal, and other epithelia. 
These antiports exchange Na+ for H+ in an electroneutral manner, and this activity is carried 
out by a family of Na+/H+ exchangers, or NHEs. In mammalian cells, Na+/H+ exchange 
activity is found in both the plasma membrane and inner mitochondrial membrane. To date, 
six mammalian isoforms have been identified (designated NHE1-NHE6). These exchangers 
are highly-regulated (glyco)phosphoproteins, which, based on their primary structure, appear 
to contain 10-12 transmembrane regions at the N-terminus and a large cytoplasmic region at 
the C-terminus. The transmembrane regions M3-M12 share identity with other members of the 
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family. The M6 and M7 regions are highly conserved. Thus, this is thought to be the region 
that is involved in the transport of sodium and hydrogen ions. The cytoplasmic region has little 
similarity throughout the family. There is some evidence that they may exist in the cell 
membrane as homodimers, but the molecular mechanisms of antiport are unclear. Na+/H+ 
5 antiporters play an important role in signal transduction. 

The disclosed NOV41 nucleic acid of the invention encoding a Sodium/Hydrogen 
Exchanger 4 -like protein includes the nucleic acid whose sequence is provided in Table 41 A, 
41C or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 41 A or 4 1C while 

10 still encoding a protein that maintains its UDP[ Gly cosy transferase -like activities and 

physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 

15 thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 

20 variant nucleic acids, and their complements, up to about 16 percent of the bases may be so 
changed. 

The disclosed NOV41 protein of the invention includes the Sodium/Hydrogen 
Exchanger 4 -like protein whose sequence is provided in Table 4 IB or 4 ID. The invention 
also includes a mutant or variant protein any of whose residues may be changed from the 
25 corresponding residue shown in Table 41 B or 41 D while still encoding a protein that 

maintains its Sodium/Hydrogen Exchanger 4 -like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 37 percent of the 
residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
30 (F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Sodium/Hydrogen Exchanger 4 - 
like protein (NOV41) is a member of a "Sodium/Hydrogen Exchanger 4 family". Therefore, 
the NOV41 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
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below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 

5 and cell types composing (but not limited to) those defined here. 

The NOV41 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in brain disorders including hypercalcemia, ulcers, 
inflammatory bowel disease, diverticular disease; diseases of the kidney including diabetes, 
autoimmune disease, renal artery stenosis, interstitial nephritis, and others; diseases of the 

0 brain including Von Hippel-Lindau (VHL) syndrome , Alzheimer's disease, epilepsy, and 
others; endometriosis, fertility, muscular dystrophy, Lesch-Nyhan syndrome, myasthenia 
gravis, and/or other diseases and pathologies. 

NOV41 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV41 substances for use in 

5 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV41 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 

0 understanding of pathology of the disease and development of new drug targets for various 
disorders. 



NOV42 includes three novel Kupffer Cell Receptor -like proteins disclosed below. 
The disclosed sequences have been named NOV42a, NOV42b, NOV42c, and NOV42d. 



A disclosed NOV42a nucleic acid of 1760 nucleotides (also referred to as CG56682- 
01) encoding a Kupffer Cell Receptor -like protein is shown in Table 42A. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 16-18 and ending 
with a TGA codon at nucleotides 1661-1663. The start and stop codons are shown in bold in 
30 Table 42 A, and the 5' and 3' untranslated regions, if any, are underlined. 



NOV42 



25 



NOV42a 
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Table 42 A. NOV42a nucleotide sequence (SEQ ID NO: 157). 

TGGCTGGGAGCAGTGCTGGAGG ATGAAGGAAGCAGAGATGGACGGTGAGGCAGTCCGCTTCTGCACAGATAA 
CCAGTGTGTCTCCCTGCACCCCCAAGGTGTGGACTCTGTGGCAATGGCTCCTGCAGCCCCCAAGATACCGAG 
GCTCGTTCAGGCTACCCCGGCATTTATGGCTGTGACCTTGGTCTTCTCTCTTGTGACTCTCTTTGTAGTGGG 
TAAGCCCCCAGGTGACCCAAATCTCACTAACTTTCTCTCCTTTCAGCACAAAGTCCCCAGGGGCCCCAGATG 
CACACTCGATCATCACCACTTTGGCAGGGAGGCAGAAATGCGAGAGCTTATCCAGACATTTAAAGGCCACAT 
GGAGAATTCCAGTGCCTGGGTAGTAGAAATCCAGATGTTGAAGTGCAGAGTGGACAATGTCAATTCGCAGCT 
CCAGGTGCTCGGTGATCATCTGGGAAACACCAATGCTGACATCCAGATGGTAAAAGGAGTTCTAAAGGATGC 
CACTACATTGAGTTTGCAGACACAGATGTTAAGGAGTTCCCTGGAGGGAACCAATGCTGAGATCCAGAGGCT 
CAAGGAAGACCTTGAAAAGGCAGATGCTTTAACTTTCCAGACGCTGAATTTCTTAAAAAGCAGTTTAGAAAA 
CACCAGCATTGAGCTCCACGTGCTAAGCAGAGGCTTAGAAAATGCAAACTCTGAAATTCAGATGTTGAATGC 
CAGTTTGGAAACGGCAAATGCTTTAAACTCCCAGACCCAGGCCTTTATAAAAAGCAGTTTTGACAACACTAG 
TGCTGAGATCCAGTTCTTAAGAGGTCATTTGGAAAGAGCTGGTGATGAAATTCACGTGTTAAAAAGGGATTT 
GAAAATGGTCACAGCCCAGACCCAAAAAGCAAATGGCCGTCTGGACCAGACAGATACTCAGATTCAGGTATT 
CAAGTCAGAGATGGAAAATGTGAATACCTTAAATGCCCAGATTCAGGTCTTAAATGGTCATATGAAAAATGC 
C AGC AG AGAGAT AC AGAC C CT AAAAC AAGG AATGAAGAATGCT T CAGC C TT AAC TT C CC AGAC CC AG ATGTT 
AGACAGCAATC TGC AG AAGGCC AGTGC CGAGAT CC AGAGGTT AAGAGGGGAT CT AGAGAACACC AAAGCT CT 
AACCATGGAAATCCAGCAGGAGCAGAGTCGCCTGAAGACCCTCCATGTGGTCATTACTTCACAGGAACAGCT 
ACAAAGAACCCAAAGTAAGCAGCTTCTCCAGATGGTCCTGCAAGGCTGGAAGTTCAATGGTGGAAGCTTATA 
TTATTTTTCTAGTGTCAAGAAGTCTTGGCATGAGGCTGAGCAGTTCTGCGTGTCCCAGGGAGCCCATCTGGC 
ATCTGTGGCCTCCAAGGAGGAGCAGGCATTTCTGGTAGAGTTCACAAGTAAAGTGTACTACTGGATCGGTCT 
CACTGACAGGGGCACAGAGGGCTCCTGGCGCTGGACAGATGGGACACCATTCAACGCCGCCCAGAACAAAGG 
GTTTTGGGAAAAGAATCAGTCTGACAACTGGCGGCACAAGAATGGGCAGACTGAAGACTGTGTCCAAATTCA 
GCAGAAGTGGAATGACATGACCTGTGACACCCCCTATCAGTGGGTGTGCAAGAAGCCCATGGGCCAGGGTGT 
GGCCTG AGGGCAGGCCAGAGCTGAGGGGCTGCTCCTGCTTGCCAATACTGACCCTCCTCCTCGATGCCTTCG 
GAGCCTCTGAGCTCTGCTTGTTCTCTGGGACC 



In a search of public sequence databases, the NOV42a nucleic acid sequence, located 
on chromosome 2, has 1214 of 1730 bases (70%) identical to a gb:GENBANK- 
lD:D88577|acc:D88577.1 mRNA from Mus musculus (mRNA for Kupffer cell receptor, 
complete cds) (E = 3.9e~ 162 ). 

The disclosed NOV42a polypeptide (SEQ ID NO: 158) encoded by SEQ ID NO: 157 
has 546 amino acid residues and is presented in Table 42B using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV42a has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.7900. Alternatively, 
NOV42a may also localize to the microbody (peroxisome) with a certainty of 0.3000, to the 
Golgi body with a certainty of 0.3000, or to the endoplasmic reticulum (membrane) with a 
certainty of 0.2000. The most likely cleavage site for NOV42a is between positions 65 and 66: 
VVG-KP. 



Table 42B. Encoded NOV42A protein sequence (SEQ ID NO: 158). 

MKEAEMDGEAVRFCTDNQCVSLHPQGVDSVAMAPAAPKIPRLVQATPAFMAVTLVFSLVTLFVVGKPPGDPN 
LTNFLSFQHKVPRGPRCTLDHHHFGREAEMRELIQTFKGHMENSSAWWEIQMLKCRVDNVNSQLQVLGDHL 
GNTNADIQMVKGVLKDATTLSLQTQMLRSSLEGTNAEIQRLKEDLEKADALTFQTLNFLKSSLENTSIELHV 
LS RGLENANSE IQMLNASLETANALNSQTQAFI KSS FDNTSAE I QFLRGHLERAGDE I HVLKRDLKMVTAQT 
QKANGRLDQTDTQIQVFKSEMENVNTIJ^AQIQVLNGHMKNASREIQTLKQGMKNASAIiTSQTQMLDSNLQKA 
SAEIQRLRGDLENTKALTMEIQQEQSRLKTLHWITSQEQLQRTQSKQLLQMVLQGWKFNGGSLYYFSSVKK 
SWHEAEQFCVSQGAHLASVASKEEQAFLVEFTSKVYYWIGLTDRGTEGSWRWTDGTPFNAAQNKGFWEKNQS 
DNWRH KNGQT ED C VQ I QQKWNDMTCDTP YQWVCKKPMGQGVA 
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A search of sequence databases reveals that the NOV42a amino acid sequence has 301 
of 546 amino acid residues (55%) identical to, and 396 of 546 amino acid residues (72%) 
similar to, the 548 amino acid residue ptnr:SWISSPROT-ACC:P70194 protein from Mus 
musculus (Mouse) (Kupffer Cell Receptor) (E = 0.0). 

NOV42a is predicted to be expressed in at least cartilage. This information was derived 
by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or 
RACE sources. 

In addition, the sequence is predicted to be expressed in rKupffer cells (liver) because 
of the expression pattern of (GENBANK-ID: gb:GENBANK-ID:D88577|acc:D88577.1) a 
closely related Mus musculus mRNA for Kupffer cell receptor, complete cds homolog. 

NOV42b 

In the present invention, the target sequence identified previously, NOV42a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 
examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, or by translated homology of the predicted exons to closely 
related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
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the sequence reported below, which is designated NOV42b. This differs from the previously 
identified sequence (NOV42a) in having 3 less aminoacids and 26 different ones. 

A disclosed NOV42b nucleic acid of 1769 nucleotides (also referred to as CG56682- 
02) encoding a Kupffer cell receptor-like protein is shown in Table 42C. An open reading 
5 frame was identified beginning with a ATG initiation codon at nucleotides 23-25 and ending 
with a TGA codon at nucleotides 1670-1672. The start and stop codons are shown in bold in 
Table 42C, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 42C. NOV42b nucleotide sequence (SEQ ID NO:159). 

TGGCTGGGAGCAGTGCTGGAGG ATGAAGGAAGCAGAGATGGACGGTGAGGCAGTCCGCTTCTGCACAGATAA 
CCAGTGTGTCTCCCTGCACCCCCAAGAGGTGGACTCTGTGGCAATGGCTCCTGCAGCCCCCAAGATACCGAG 
GCTCGTTCAGGCTACCCCGGCATTTATGGCTGTGACCTTGGTCTTCTCTCTTGTGACTCTCTTTGTAGTGGT 
TCAACAGCAGACAAGACCTGTTCCGAAGCCTGTGCAAGCCGTAATTCTGGGAGACAACATTACTGGGCATTT 
ACCTTTTGAACCCAACAATCATCACCACTTTGGCAGGGAGGCAGAAATGCGAGAGCTTATCCAGACATTTAA 
AGGTCACATGGAGAATTCCAGTGCCTGGGTAGTAGAAATCCAGATGTTGAAGTGCAGAGTGGACAATGTCAA 
TTCGCAGCTCCAGGTGCTCGGTGATCATCTGGGAAACACCAATGCTGACATCCAGATGGTAAAAGGAGTTCT 
AAAGGATGCCACTACATTGAGTTTGCAGACACAGATGTTAAGGAGTTCCCTGGAGGGAACCAATGCTGAGAT 
CCAGAGGCTCAAGGAAGACCTTGAAAAGGCAGATGCTTTAACTTTCCAGACGCTGAATTTCTTAAAAAGCAG 
TTTAGAAAACACCAGCATTGAGCTCCACGTGCTAAGCAGAGGCTTAGAAAATGCAAACTCTGAAATTCAGAT 
GTTGAATGCCAGTTTGGAAACGGCAAATGCTTTAAACTCCCAGACCCAGGCCTTTATAAAAAGCAGTTTTGA 
CAACACTAGTGCTGAGATCCAGTTCTTAAGAGGTCATTTGGAAAGAGCTGGTGATGAAATTCACGTGTTAAA 
AAGGGATTTGAAAATGGTCACAGCCCAGACCCAAAAAGCAAATGGCCGTCTGGACCAGACAGATACTCAGAT 
TCAGGTATTCAAGTCAGAGATGGAAAATGTGAATACCTTAAATGCCCAGATTCAGGTCTTAAATGGTCATAT 
GAAAAATGCCAGCAGAGAGATACAGACCCTAAAACAAGGAATGAAGAATGCTTCAGCCTTAACTTCCCAGAC 
CCAGATGTTAGACAGCAATCTGCAGAAGGCCAGTGCCGAGATCCAGAGGTTAAGAGGGGATCTAGAGAACAC 
CAAAGCTCTAACCATGGAAATCCAGCAGGAGCAGAGTCGCCTGAAGACCCTCCATGTGGTCATTACTTCACA 
GGAACAGCTACAAAGAACCCAAAGTAAGCAGCTTCTCCAGATGGTCCTGCAAGGCTGGAAGTTCAATGGTGG 
AAGCTTATATTATTTTTCTAGTGTCAAGAAGTCTTGGCATGAGGCTGAGCAGTTCTGCGTGTCCCAGGGAGC 
CCATCTGGCATCTGTGGCCTCCAAGGAGGAGCAGGCATTTCTGGTAGAGTTCACAAGTAAAGTGTACTACTG 
GATCGGTCTCACTGACAGGGGCACAGAGGGCTCCTGGCGCTGGACAGATGGGACACCATTCAACGCCGCCCA 
GAACAAAGGGTTTTGGGAAAAGAATCAGTCTGACAACTGGCGGCACAAGAATGGGCAGACTGAAGACTGTGT 
CCAAATTCAGCAGAAGTGGAATGACATGACCTGTGACACCCCCTATCAGTGGGTGTGCAAGAAGCCCATGGG 
CCAGGGTGTGGCCTGA GGGCAGGCCAGAGCTGAGGGGCTGCTCCTGCTTGCCAATACTGACCCTCCTCCTCG 
ATGCCTTCGGAGCCTCTGAGCTCTGCTTGTTCTCTGGGACC 



In a search of public sequence databases, the NOV42b nucleic acid sequence, located 
10 on chromosome 2, has 1054 of 1469 bases (71%) identical to a gb:GENBANK- 

ID:D88577|acc:D88577.1 mRNA from Mus musculus (mRNA for Kupffer cell receptor, 
complete cds) (E = l.le" 161 ). 

The disclosed NOV42b polypeptide (SEQ ID NO: 160) encoded by SEQ ID NO: 159 
has 549 amino acid residues and is presented in Table 42D using the one-letter amino acid 
15 code. Signal P, Psort and/or Hydropathy results predict that NOV42b has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.7900. Alternatively, 
NOV42b may also localize to the microbody (peroxisome) with a certainty of 0.3000, to the 
Golgi body with a certainty of 0.3000, or to the endoplasmic reticulum (membrane) with a 
certainty of 0.2000. The most likely cleavage site for NOV42b is between positions 67 and 68: 
20 VQQ-QT. 
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Table 42D. Encoded NOV42b protein sequence (SEQ ID NO: 160). 

MKEAEMDGEAVRFCTDNQCVSLHPQEVDSVAMAPAAPKIPRLVQATPAFMAVTLVFSLVTLFVWQQQTRPV 
PKPVQAVILGDNITGHLPFEPNNHHHFGREAEMRELIQTFKGHMENSSAVTVVEIQMLKCRVDNVNSQLQVLG 
DHLGNTNADIQMVKGVLKDATTLSLQTQMLRSSLEGTNAEIQRLKEDLEKADALTFQTLNFDKSSLENTSIE 
LHVLSRGLENANSEIQMLNASLETANALNSQTQAFI KS S FDNTS AE I QFLRGHLERAGDEIHVLKRDLKMVT 
AQTQKANGRLDQTDTQIQVFKSEMENVNTI^AQ 

Q KAS AE I QRL RGDLENT KALTME I QQE Q S RL KT L HW I T S Q E QLQ RTQ S KQLL QM VX.QGWK FNGG S L YY FS S 
VKKSWHEAEQFCVSQGAHLASVASKEEQAFLVEFTSKVYYWIGLTDRGTEGSWRWTDGTPFNAAQNKGFWEK 
NQSDNWRHKNGQTEDCVQIQQKWNDMTCDTPYQWVCKKPMGQGVA 



A search of sequence databases reveals that the NOV42b amino acid sequence has 304 
of 549 amino acid residues (55%) identical to, and 401 of 549 amino acid residues (73%) 
5 similar to, the 548 amino acid residue ptnr:SWISSPROT-ACC:P70194 protein from Mus 

158 

musculus (Mouse) (Kupffer Cell Receptor) (E = 3.7e" ). 

NOV42b is predicted to be expressed in at least adrenal gland, bone marrow, brain - 
amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
10 mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea and uterus. 

NOV42c 

A disclosed NOV42c nucleic acid of 1874 nucleotides (also referred to as CG56682- 
03) encoding a Kupffer cell receptor-like protein is shown in Table 42E. An open reading 
15 frame was identified beginning with a ATG initiation codon at nucleotides 1-3 and ending 

with a TAA codon at nucleotides 1702-1704. The start and stop codons are shown in bold in 
Table 42E, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 42E. NOV42c nucleotide sequence (SEQ ID NO:161). 

ATGGACGGTGAGGCAGTCCGCTTCTGCACAGATAACCAGTGTGTCTCCCTGCACCCCCAAGAGGTGGACTCT 
GTGGCAATGGCTCCTGCAGCCCCCAAGATACCGAGGCTCGTTCAGGCTACCCCGGCATTTATGGCTGTGACC 
TTGGTCTTCTCTCTTGTGACTCTCTTTGTAGTGGTTCAACAGCAGACAAGACCTGTTCCGAAGCCTGTGCAA 
GCCGTAATTCTGGGAGACAACATTACTGGGCATTTACCTTTTGAACCCAACAATCATCACCACTTTGGCAGG 
GAGGCAGAAATGCAAGAGCTTATCCAGACATTTAAAGGCCACATGGAGAATTCCAGTGCCTGGGTAGTAGAA 
ATCCAGATGTTGAAGTGCAGAGTGGACAATGTCAATTCGCAGCTCCAGGTGCTCGGTGATCATCTGGGAAAC 
ACCAATGCTGACATCCAGATGGTAAAAGGAGTTCTAAAGGATGCCACTACATTGAGTTTGCAGACACAGATG 
TTAAGGAGTTCCCTGGAGGGAACCAATGCTGAGATCCAGAGGCTCAAGGAAGACCTTGAAAAGGCAGATGCT 
TTAACTTTCCAGACGCTGAATTTCTTAAAAAGCAGTTTAGAAAACACCAGCATTGAGCTCCACGTGCTAAGC 
AGAGGCTTAGAAAATGCAAACTCTGAAATTCAGATGTTGAATGCCAGTTTGGAAACGGCAAATACCCAGGCT 
CAGTTAGCCAATAGCAGTTTAAAGAACGCTAATGCTGAGATCTATGTTTTGAGAGGCCATCTAGATAGTGTC 
AATGACTTGAGGACCCAGAACCAGGTTTTAAGAAATAGTTTGGAAGGAGCCAATGCTGAGATCCAGGGACTA 
AAGGAAAATTTGCAGAACACAAATGCTTTAAACTCCCAGACCCAGGCCTTTATAAAAAGCAGTTTTGACAAC 
ACTAGTGCTGAGATCCAGTTCTTAAGAGGTCATTTGGAAAGAGCTGGTGATGAAATTCACGTGTTAAAAAGG 
GATTTGAAAATGGTCACAGCCCAGACCCAAAAAGCAAATGGCCATCTGGACCAGACAGATACTCAGATTCAG 
GTATTCAAGTCAGAGATGGAAAATGTGAATACCTTAAATGCCCAGATTCAGGTCTTAAATGGTCATATGAAA 
AATGCCAGCAGAGAGATACAGACCCTAAAACAAGGAATGAAGAATGCTTCAGCCTTAACTTCCCAGACCCAG 
ATGTTAGACAGCAATCTGCAGAAGGCCAGTGCCGAGATCCAGAGGTTAAGAGGGGATCTAGAGAACACCAAA 
GCTCTAACCATGGAAATCCAGCAGGAGCAGAGTCGCCTGAAGACCCTCCATGTGGTCATTACTTCACAGGAA 
CAGCTACAAAGAACCCAAAGTCAGCTTCTCCAGATGGTCCTGCAAGGCTGGAAGTTCAATGGTGGAAGCTTA 
TATTATTTTTCTAGTGTCAAGAAGTCTTGGCATGAGGCTGAGCAGTTCTGCGTGTCCCAGGGAGCCCATCTG 
GCATCTGTGGCCTCCAAGGAGGAGCAGGCATTTCTGGTAGAGTTCACAAGTAAAGTGTACTACTGGATCGGT 
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CTCACTGACAGGGGCACAGAGGGCTCCTGGCGCTGGACAGATGGGACACCATTCAACGCCGCCCAGAACAAA 
nrrT < -rr'TAr,GAGrCACAGCACCAGGAAGGGATGCTGCCTTCATCTAA CAGTATAAAGCCCTGTTGTCTTCG 
GGTTTTGGGAAAAGAATCAGTCGACAACTGGCGGCACAAGAATGGGCAGACTGAAGACTGTGTCCAAATTCA 
GCAGAAGTGGAATGACATGACCTGTGACACCCCCTATCAGTGGGTGTGCAAGAAGCCCATGGGCCAGGGTGT 

GG 



In a search of public sequence databases, the NOV42c nucleic acid sequence, located 
on chromosome 2, has 689 of 993 bases (69%) identical to a gb:GENBANK- 
ID:D88577|acc:D88577.1 mRNA from Mus musculus (mRNA for Kupffer cell receptor, 
complete cds) (E = 1.6e" 120 ). 

The disclosed NOV42c polypeptide (SEQ ID NO: 162) encoded by SEQ ID NO: 161 
has 567 amino acid residues and is presented in Table 42F using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV42c has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.7900. Alternatively, 
NOV42c may also localize to the microbody (peroxisome) with a certainty of 0.3000, to the 
Golgi body with a certainty of 0.3000, or to the endoplasmic reticulum (membrane) with a 
certainty of 0.2000. The most likely cleavage site for NOV42c is between positions 62 and 63: 
VQQ-QT. 
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Table 42F Encoded NOV42c protein sequence (SEQ ID NO:162). 



MDGEAVRFCTDNQCVSLHPQEVDSVAMAPAAPKIPRLVQATPAFMAVTLVFSLVTLFVWQQQTRPVPKPVQ 
AVILGDNITGHLPFEPNNHHHFGREAEMQELIQTFKGHMENSSAWVVEIQMLKCRVDNVNSQLQVLGDHLGN' 
TNADIQMVKGVLKDATTLSLQTQMLRSSLEGTNAEIQRLKEDLEKADALTFQTLNFLKSSLENTSIELHVLS 
RGLENANSE I QMLNAS LETANTQAQL ANS SLKNANAE I YVLRGHLDS VNDLRTQNQVLRNSLEGANAE I QGL 
KENLQNTNALNSQTQAFIKSSFDNTSAEIQFLRGHLERAGDEIHVLKRDLKMVTAQTQKANGHLDQTDTQIQ 
VFKSEMENWTLNAQIQVLNGHMKNASREIQTLKQGMKNASALTSQTQMLDSNLQKASAEIQRLRGDLENTK 
ALTMEIQQEQSRLKTLHVVITSQEQLQRTQSQLLQIWLQGWKFNGGSLYYFSSVKKSWHEAEQFCVSQGAHL 
ASVASKEEQAFLVEFTSKVYYWIGLTDRGTEGSWRWTDGTPFNAAQNKASLGATAPGRDAAFI 



A search of sequence databases reveals that the NOV42c amino acid sequence has 191 
of 412 amino acid residues (46%) identical to, and 273 of 412 amino acid residues (66%) 
similar to, the 548 amino acid residue ptnr:SWTSSNEW-ACC:P70194 protein from Mus 
musculus (Mouse) (Kupffer Cell Receptor) (E = 5.3e" 92 ). 

NOV42c is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. . 
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NOV42d 

A disclosed NOV42d nucleic acid of 1985 nucleotides (also referred to as CG56682- 
04) encoding a Kupffer cell receptor-like protein is shown in Table 42G. An open reading 
frame was identified beginning with a GTC initiation codon at nucleotides 2-4 and ending 
5 with a TAA codon at nucleotides 1658-1660. The start and stop codons are shown in bold in 
Table 42G, and the 5' and 3' untranslated regions, if any, are underlined. Because the starting 
codon is not a traditionl initiation codon, NOV42d could be a partial reading frame extending 
further into the 5' 

Table 42G. NOV42d nucleotide sequence (SEQ ID NO:163). 

AGTCCGCTTCTGCACAGATAACCAGTGTGTCTCCCTGCACCCCCAAGAGGTGGACTCTGTGGCAATGGCTCC 
TGCAGCCCCCAAGATACCGAGGCTCGTTCAGGCTACCCCGGCATTTATGGCTGTGACCTTGGTCTTCTCTCT 
TGTGACTCTCTTTGTAGTGGTTCAACAGCAGACAAGACCTGTTCCGAAGCCTGTGCAAGCCGTAATTCTGGG 
AGACAACATTACTGGGCATTTACCTTTTGAACCCAACAATCATCACCACTTTGGCAGGGAGGCAGAAATGCA 
AGAGCT T AT C C AG AC AT TT AAAGGC C AC ATGGAGAATT C C AGTGCCTGGGT AGT AGAAATC C AGATGTTG AA 
GTGCAGAGTGGACAATGTCAATTCGCAGCTCCAGGTGCTCGGTGATCATCTGGGAAACACCAATGCTGACAT 
CCAGATGGTAAAAGGAGTTCTAAAGGATGCCACTACATTGAGTTTGCAGACACAGATGTTAAGGAGTTCCCT 
GGAGGGAACCAATGCTGAGATCCAGAGGCTCAAGGAAGACCTTGAAAAGGCAGATGCTTTAACTTTCCAGAC 
GCTGAATTTCTTAAAAAGCAGTTTAGAAAACACCAGCATTGAGCTCCACGTGCTAAGCAGAGGCTTAGAAAA 
TGCAAACTCTGAAATTCAGATGTTGAATGCCAGTTTGGAAACGGCAAATACCCAGGCTCAGTTAGCCAATAG 
CAGTTTAAAGAACGCTAATGCTGAGATCTATGTTTTGAGAGGCCATCTAGATAGTGTCAATGACTTGAGGAC 
C C AG AACC AGGT T TT AAGAAAT AGT TTGGAAGG AG C C AATGCT GAG ATC C AGGG AC T AAAGG AAAATTTG C A 
GAACACAAATGCTTTAAACTCCCAGACCCAGGCCTTTATAAAAAGCAGTTTTGGCAACACTAGTGCTGAGAT 
CCAGTTCTTAAGAGGTCATTTGGAAAGAGCTGGTGATGAAATTCACGTGTTAAAAAGGGATTTGAAAATGGT 
CACAGCCCAGACCCAAAAAGCAAATGGCCGTCTGGACCAGACAGATACTCAGATTCAGGTATTCAAGTCAGA 
GATGGAAAATGTGAATACCTTAAATGCCCAGATTCAGGTCTTAAATGGTCATATGAAAAATGCCAGCAGAGA 
GAT AC AGACC C TAAAACAAGGAATGAAG AATGCT T C AG C CTT AACT T C C C AGAC CC AGATGT T AG AC AGC AA 
TCTGCAGAAGGCCAGTGCCGAGATCCAGAGGTTAAGAGGGGATCTAGAGAACACCAAAGCTCTAACCATGGA 
AATCCAGCAGGAGCAGAGTCGCCTGAAGACCCTCCATGTGGTCATTACTTCACAGGAACAGCTACAAAGAAC 
CCAAAGTCAGCTTCTCCAGATGGTCCTGCAAGGCTGGAAGTTCAATGGTGGAAGCTTATATTATTTTTCTAG 
TGTCAAGAAGTCTTGGCATGAGGCTGAGCAGTTCTGCGTGTCCCAGGGAGCCCATCTGGCATCTGTGGCCTC 
CAAGGAGGAGCAGGCATTTCTGGTAGAGTTCACAAGTAAAGTGTACTACTGGATCGGTCTCACTGACAGGGG 
CACAGAGGGCTCCTGGCGCTGGACAGATGGGACACCATTCAACGCCGCCCAGAACAAAGCGGCCACTAGGGG 
ATQ AAGGACCCATCTCAAGTCAGCTCCCTAGACTCATCCCATGTCAGCTCCCTAGGAGCCACAGCACCAGGA 
AGGGATGCTGCCTTCATCTAACAGTATAAAGCCCTGTTGTCTTCGGGTTTTGGGAAAAGAATCAGT CTGACA 
ACTGGCGGCACAAGAATGGGCAGACTGAAGACTGTGTCCAAATTCAGCAGAAGTGGAATGACATGACCTGTG 
ACACCCCCTATCAGTGGGTGTGCAAGAAGCCCATGGGCCAGGGTGTGGCCTGAGGGCAGGCCAGAGCTGAGG 
GGCTGCTCCTGCTTGCCAATACTGACCCTCCTCCTCGATGC 



10 In a search of public sequence databases, the NOV42d nucleic acid sequence, located 

on chromosome 2, has 705 of 1023 bases (68%) identical to a gb:GENBANK- 
ID:D88577|acc:D88577.1 mRNA from Mus musculus (mRNA for Kupffer cell receptor, 
complete cds) (E = 3.7e" 124 ). 

The disclosed NOV42d polypeptide (SEQ ID NO: 164) encoded by SEQ ID NO: 163 

15 has 552 amino acid residues and is presented in Table 42H using the one-letter amino acid 

code. Signal P, Psort and/or Hydropathy results predict that NOV42d has a signal peptide and 

is likely to be localized to the plasma membrane with a certainty of 0.7900. Alternatively, 

NOV42d may also localize to the microbody (peroxisome) with a certainty of 0.3000, to the 

Golgi body with a certainty of 0.3000, or to the endoplasmic reticulum (membrane) with a 
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certainty of 0.2000. The most likely cleavage site for NOV42d is between positions 57 and 58: 
VQQ-QT. 



Table 42H. Encoded NOV42d protein sequence (SEQ ID NO: 164). 

VRFCTDNQCVSLHPQEVDSVA^PAAPKIPRLVQATPAFMAVTLVFSLVTLFVVVQQQTRPVPKPVQAVILG 
DNITGHLPFEPNNHHHFGREAEMQELIQTFKGHMENSSAWVEIQMLKCRVDISnmSQLQVLGDHLGNTNADI 
QMVKGVIjKDATTLSLQTQMLRSSLEGTNAEIQRLKEDLEKADALTFQTLNFLKSSLENTSIEIiHVLSRGLEN 
ANSE I QMLNASLETANTQAQLANS S LKNANAE I YVLRGHLDS VNDLRTQNQVLRNS LEGANAE I QGLKENLQ 
NTNALNSQTQAFI KS S FGNTS AE IQFLRGHLERAGDE IHVLKRDLKMVTAQTQKANGRLDQTDTQI QVFKSE 
MENWTLNAQIQVLNGHMKNASREIQTLKQGMKNASALTSQTQMLDSNLQKASAEIQRLRGDLENTKALTME 
IQQEQSRLKTLHWITSQEQLQRTQSQLLQMVLQGWKFNGGSLYYFSSVKKSWHEAEQFCVSQGAHLASVAS 
KEEQAFLVEFTSKVYYWIGLTDRGTEGSWRWTDGTPFNAAQNKAATRG 



A search of sequence databases reveals that the NOV42d amino acid sequence has 187 
of 404 amino acid residues (46%) identical to, and 269 of 404 amino acid residues (66%) 
similar to, the 548 amino acid residue ptnr:SWISSNEW-ACC:P70194 protein from Mus 
musculus (Mouse) (Kupffer Cell Receptor) (E = 1.2e" 89 ). 

NOV42d is predicted to be expressed in at least adrenal gland, bone marrow, brain - 
amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea and uterus. . 

NOV42a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 42L 



Table 421. BLAST results for NOV42a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 7 949066 | ref | NP_0 
58031. 1 | 
(NM_016751) 


C-type (calcium 
dependent , 
carbohydrate 
recognition 
domain) lectin, 
superf amily 
member 13 ; 
kupffer cell 
receptor; Kupffer 
cell c-type 
lectin receptor 
[Mus musculus] 


548 


300/548 
(54%) 


395/548 
(71%) 


e-153 


gi | 16758588 | ref |NP_ 
446205. l| 
(NM_053753) 


Kupffer cell 
receptor [Rattus 
norvegicus] 


550 


293/547 
(53%) 


382/547 
(69%) 


e-147 


gi | 7657291 | ref |NP_0 
56532 . 1 | 
(NM_015717) 


Langerhans cell 
specific c-type 
lectin; langerin 
[ Homo sap iens] 


328 


92/261 
(35%) 


143/261 
(54%) 


2e-4i 


gi | 17426713 | emb | CAC 
85632 . 1 | (AJ313164) 


langerin [Mus 
musculus] 


326 


91/264 
(34%) 


140/264 
(52%) 


ie-40 



368 
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gi | 17059581 | emb | CAC 
82936 . 1 | (AJ302711) 


C type lectin 
[Mus wusculus] 


331 


91/264 
(34%) 


140/264 
(52%) 


2e-40 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 42J. In the ClustalW alignment of the NOV42 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 42 J. ClustalW Analysis of NOV42 



Novel NOV4 2a 
Novel NOV4 2b 
Novel NOV4 2C 
Novel NOV4 2d 



{SEQ ID NO:158) 

(SEQ ID NO: 160) 

(SEQ ID NO: 162) 

(SEQ ID NO: 164) 



1) 
2) 
3) 
4) 

5) gi | 7949066 | ref | NP_058031 . 1 | (NM_016751) C-type (calcium dependent, carbohydrate 
recognition domain) lectin, superfamily member 13; kupffer cell receptor; Kupffer 
cell c-type lectin receptor [Mus wusculus] (SEQ ID NO: 512) 

6) gi | 16758588 | ref | NP_446205 . 1 1 ' (NM_053753) Kupffer cell receptor [Rattus 
norvegicus] (SEQ ID NO: 513) 

7) gi | 7657291 | ref |NP__C56532 . 1 | (NM_015717) Langerhans cell specific c-type lectin; 
langerin [Homo sapiens] (SEQ ID NO: 514) 

8) gi | 17426713 | emb | CAC85632 . 1 | (AJ313164) langerin [Mus wusculus] (SEQ ID NO: 515) 

9) gi| 17059581 | emb |CAC82936.l| (AJ302711) C type lectin [Mus musculus] (SEQ ID 
NO: 516) 



NOV42a 

NOV42b 

NOV42C 

NOV42d 

gi|7949066| 

gi|16758588| 

gi|7657291| 

gi | 17426713 | 

gi | 17059581 



NOV42a 
NOV42b 
NOV42c 
NOV42d 
gi|7949066| 
gi [16758588 | 
gi|765729l| 
gi j 17426713 | 
gi | 17059581 



59 
59 
54 
49 
59 
59 
58 
56 
61 



.... | .... | . . v | 
--MKEAlMpGEAVR 

- -mkeaImBgeav^ 

MB.GEA 1 



--MKEA|LNRD1 

--mkeaIlnrdv, „. 

MTV^K^APDAH 

M^EgAPEr" 

MPEAEM^EgAP 

70 




80 



G^GPKBAgjPI 
PPPK§|GPSL^ 
|PPPKQDLSPVI 
PPPKQDLSPVI 

90 



I 



Hi s gLftigTP! 
|iS3lBS' 

RTI. ShB 3 VI LgLMV| 
|TLgHB 8 gi Vgl|jv| 

cpcvAFTc: 

C|CVAFT( 1 




100 



110 



120 




— I — l — I — i . . | — | . . 

jGKPPGD- - PNLTNFLSFQHKgP-RGPRCTLpIH: 

qSqtrpvpkpvqavilgdnJ^tghlpfepnnhh: 
qqqtrpvpkpvqavi lgdnp|tghlpfepn^hhh 
q§qtrpvpkpvqavilgdis^tghlpfepn]|hhh 

sijPWRPEWNKEPPSLL ^RGSNNSGHDNHSQ 

LpPWRQKQNEDHPVKAG--gHGGNYSGSD|cSQ 
lYPiFMG 
jFYPHLMG 

ifypIlmg 




IF^GHMEg 

if^ghmeS 
tf^ghmeK 

TFpGHMES 

rliIdyeeS 
sd|asg-{S 



NOV42a 


116 


SS 


NOV42b 


119 


ss 


NOV42c 


114 


SS 


NOV42d 


109 


ss 




115 

118 

113 

108 

115 

115 

69 

67 

72 



180 



TQ 


M 


LRSSL 


175 


TQ 


M 


LRSSL 


178 


TQ 


M 


LRSSL 


173 


TQ 


M 


LRSSL 


168 
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gi (7949066 | 
gi|l6758588| 
gi |765729l| 
gi j 17426713 | 
gi|17059581 



NOV42a 

NOV42b 

NOV42C 

NOV42d 

gi |7949066 | 

gi|l6758588 | 

gi j 7657291 1 

gi|l7426713 | 

gi | 17059581 




NOV42a 

NOV42b 

NOV42c 

NOV42d 

gi |7949066 | 

gi | 16758588 | 

gi | 7657291 | 

gi|l7426713| 

gi j 17059581 



236 
239 



250 



270 



280 



290 



300 



239 
242 



23 4 jE^gTQAQLANSSLKNANAEIYVLRGHLDSVNDLRTQNQVLRNSLEGANAEIQGLKENLQ 2 93 
22 9 EtBStOAOLANSSLKNANAEIYVLRGHLDSVNDLRTQNQVLRNSLEGANAEIQGLKENLQ 2 88 



■sEB!-- 



23 6 QSgSj 
23 6 pSgH 




239 
239 
134 
132 
137 



NOV42a 

NOV42b 

NOV4 2c 

NOV42d 

gi |7949066 | 

gi j 16758588 | 

gi |7657291 | 

gi | 17426713 | 

gi|l7059581 
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380 



390 



410 



NOV42a 

NOV42b 

NOV42c 

NOV42d 

gi |7949066 | 

gi|l6758588| 

gi|765729l| 

gi|l7426713 | 

gi|l7059581 



NOV42a 

NOV42b 

NOV42c 

NOV42d 

gi |7949066 | 

gi|!6758588| 

gi|765729l| 

gi|l7426713 | 

gi|l7059581 



NOV42a 
NOV4 2b 
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NOV42C 


473 








< 


NOV42d 


468 




8NG 




SL 


gi | 7949066 | 


416 








3 

ISTF 


gi | 16758588 | 


416 




Yifn 




KF 


gi |7657291 | 


199 




Y|FK 




NF 


gi j 17426713 | 


197 








NF 

■ 5 


gi| 17059581 


202 








N ^ 



NOV42a 
NOV42b 
NOV42C 
NOV42d 
gi|7949066| 
gi j 16758588 | 
gi|765729l| 
gi j 17426713 | 
gi |17059581 



NOV42a 


537 


NOV42b 


540 


NOV42C 


567 


NOV42d 


552 


gi | 7949066 | 


536 


gi|l6758588 | 


536 


gi|765729l| 


319 


gij 17426713 | 


317 


gi| 17059581 


322 




550 



560 



570 



580 



sgd; 
.^iB^vd; 

gTAGGkJl! 
jKAADGXP; 

{gAADGgjpj 

590 




610 

- - I - • - - I ♦ - 
|PMGQGVA- - 

Ipmgqgva- - 



STGWSAARVG - 



546 
549 
567 
552 
548 



STDWSVARTDQS 55 0 




328 
326 
331 



SS^PW^ 53 5 
TAkNTO 535 
KT^IxFlf 318 
•NT^LFX 316 
•NTBLFlj 321 



Tables 42K-N list the domain descriptions from DOMAIN analysis results against 
35 NOV42. This indicates that the NOV42 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 42K Domain Analysis of NOV42a 

gnl | Smart | smart 0 00 34, CLECT, C-type lectin (CTL) or carbohydrate- 
recognition domain (CRD) ; Many of these domains function as calcium- 
dependent carbohydrate binding modules. (SEQ ID NO: 836) 
CD-Length = 124 residues, 98.4% aligned 
Score = 124 bits (311), Expect = le-29 



40 



45 



50 



NOV42 : 


415 


Sbjct : 


3 


NOV42 : 


471 


Sbjct: 


63 


NOV42 : 


527 


Sbj Ct : 


113 



QGWK-FNGGSLYYFSSVKKSWHEAEQFCVSQGAHLASVASKEEQAFLVEFTSKV YYW 

II + II I Ik ll + l +1+ II I MINI* Ml 11+ 1 1 1 

SGWVSYPGGKCYKFSTEKKTWADAQAFCQSLGAHLASIHSEEENDFLLSLLKNSNSDYYW 



I GLTDRGT E G S WR WTDGT P FNAAQNKG F WE KNQ S DNWRHKNGQTE DC VQ I QQ - - 

Ilk + III + I + II+ 11+ + + M 

IGLSRPDSNGSWQWSDGSGPVDYSN 



MTCDTPYQWVCK 
+ + | + ++ | + 
VSCTSKLPFICE 



538 



124 



470 



62 



526 



KWND 

+ +11 + II I I 

-WAPGEPGG SGNCWLSTSGGGKWND 112 
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Table 42L Domain Analysis of NOV42a 

gnl | Pfam|pf am0 005 9 , lectin_c, Lectin C-type domain. This family 
includes both long and short form C-type (SEQ ID NO: 83 7) 
CD-Length = 107 residues, 99.1% aligned 
Score = 115 bits (288) , Expect = 6e-27 



NOV4 2 : 431 KKSWHEAEQFCVSQGAHLASVASKEEQAFLVEFT - - SKVYYWIGLTDRGTEGS WRWTDGT 4 8 8 

M ||+ I MM III II I I I MINI MM I I M + 

Sbjct : 2 SKTWAEAQAACQKLGGGLVSIQSAEEQDFLTSLTKASNSYAWIGLTDINTEGTWWTDGS 61 



NOV42: 4 89 PFNAAQNKGFWEKNQSDNWRHKNGQTEDCVQIQ QKWNDMTCDTPYQWVCKK 

I I I + +1 +1 IMM 1 1 1 1 I + + 11 + 

Sbjct: 62 PVNYT NWAPGEPNNRGNK E D C VE I YTDGN KWNDE P CG S KL P Y VCE F 



539 



107 
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Table 42M Domain Analysis of NOV42a 



gnl | Pf am|pf am01576 , Myosin_tail 
multi-subunit complex made up o 
chains it is a fundamental cont 
cell types. This family consist 
tail region. The coiled-coil is 
molecules of myosin. These can 
thick filament. The coiled-coil 
backbone the thick filament. ( 
CD-Length =86 0 residues, 2 9.4% 
Score =43.1 bits (100), Expect 



Myosin tail . The myosin molecule is a 
f two heavy chains and four light 
ractile protein found in all eukaryote 
s of the coiled-coil myosin heavy chain 

composed of the tail from two 
then assemble into the macromolecular 

region provides the structural 
SEQ ID NO: 83 8) 
aligned 
= 4e-05 



NOV4 2 : 


121 


Sbjct : 


187 


NOV42 : 


181 


Sbjct : 


247 


NOV4 2 : 


241 


Sbjct : 


300 


NOV4 2 : 


301 


Sbjct : 


357 


NOV4 2 : 


361 


Sbjct : 


411 



VE I QM LKC R VDNVNS QLQ VLGDHLGNTNAD I QM VKG VLKD ATTL S LOT QM L RS S LE GTNA 

+ + |+ ++| + || | ++ + | ++ | | + ||| 

SQLSELQVKLDELQRQLNDLTSQKSRLQSENSDLTRQLEEAEAQVSNLSKLKSQLESQLE 

EIQRLKEDLEKADALTFQTLNFLKSSLENTSIELHVLSRGLENANSEIQMLNASLETANA 

I +1 |+ + | |++ I +11 M + II Mi 

EAKRS LEE ES RERAN LQAQLRQL EHDLDSLREQLEEESEAKAELERQLS KANA 

LNSQTQAFI KSS FDNTS AE I QFLRGHLERAGDE I HVLKRDLKMVTAQTQKANGRLDQTDT 

| + + +| + |+ + |+ I + 1+ +1 M 
EIQQWRSKFESEGALRAEELEELKKKLNQKISELEEAAEAANAKCDSLEKTKSRLQS 



SAEIQRLRGDLENTKALTMEIQQEQSRLK 

I 1+ M+ +1 1 I +++I l + 

STELFRLKNELEELKDQVEALRRENKNLQ 



389 



439 



180 



246 



240 



299 



300 



356 



360 



QIQVFKSEMENVNTLNAQIQVLNGHMKNASREIQTLKQGMKNASALTSQTQMLDSNLQKA 

+ + + +| + | | ++ I I I + + I + I ++ + 

ELEDLQIELERANAAASE LEKKQKNFDKILAEWK- - - RKVDELQAELDTAQREARNL 410 
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Table 42N Domain Analysis of NOV42c 

gnl | Pf am | pf am01576 , Myosin_tail , Myosin tail. The myosin molecule is a 
multi-subunit complex made up of two heavy chains and four light 
chains it is a fundamental contractile protein found in all eukaryote 
cell types. This family consists of the coiled-coil myosin heavy chain 
tail region. The coiled-coil is composed of the tail from two 
molecules of myosin. These can then assemble into the macromolecular 
thick filament. The coiled-coil region provides the structural 
backbone the thick filament. (SEQ ID NO: 838) 
CD-Length = 860 residues, 30.2% aligned 
Score = 39.3 bits (90), Expect = 6e-04 



NOV4 2 : 


135 


Sbjct : 


556 


NOV4 2 : 


195 


Sbjct : 


606 


NOV42 : 


255 


Sbjct : 


662 


N0V42 : 


315 


Sbjct : 


715 


N0V4 2 : 


369 


Sbjct: 


775 



SQLQVLGDHLGNTNADIQMVKGVLKDATTLSLQTQMLRSSLEGTNAEIQRLKEDLEKADA 194 
++ | ++ || | |+ | | + I + I+ + +1 I II +11 + I 

NELEIALDHANKANAEAQ KNVKKYQQQVKELQTQVE EEQRAREDAREQLA 6 0 5 

LTFQTI^FLKSSLENTSIELHVLSRGLENANSEIQMLNASLETANALNSQTQAFIKSSFD 2 54 

+ + |++ || I I +1+1 I + I I I +1 + I 

VAERRATALEAELEELRSALEQAERARKQAETE LAEASERVNELTAQNSSLIAQK-R 661 

10 NOV42 : 255 NTSAEIQFLRGHLERAGDEIHVLKRDLKJ^VTAQTQKANGRLDQTDTQIQVFKSEMENVNT 314 

|+ |+ I+I+I+ + + + l I + +++ + +++ 

KLEGELAALQSDLDEAVNELKAAEE RAK KAQ AD AARLAE E L RQ EQEH S QHLE R 714 

LNAQIQVLNGHMKNASREIQT- -LKQGMKNASALTSQTQMLDSNL QKASAEIQR-LR 368 

15 I |+ + ++ I + II I I I ++ + I++ I 1+ I I 1+ II 

LRKQLESQVKELQVRLDEAEAAALKGGKKMIQKLEARVRELEAELDGEQRRHAETQKNLR 774 

GDLENTKALTMEIQQEQSRLKTLHWITSQEQLQRTQSKQL 40 9 
|| ++++++ |+ | ++ + +| +| | 
20 Sbjct: 775 KMERRVKELQFQVEEDKKNLERLQDLVDKLQAKIKTYKRQL 815 

Kupffer cells are found in the linings of the liver sinusoids, and are phagocytic. A 
receptor uniquely found on the surface of rat Kupffer cells binds oligosaccharides terminating 
in galactose, N-acetylgalactosamine, and fucose. A number of different families of proteins 

25 share a conserved domain which was first characterized in some animal lectins. Animal lectins 
display a wide variety of architectures. They are classified according to the carbohydrate- 
recognition domain (CRD) of which there are two main types, S-type and C-type. C-type 
lectins (CTL) display a wide range of specificities and function as a calcium-dependent 
carbohydrate-recognition domain. They are found predominantly but not exclusively in 

30 vertebrates. CTLs can be classified into a number of subgroups based on their function and 
structure: 1) Collectins, represented by the soluble mannose-binding proteins of mammalian 
serum and liver; 2) Selectins, membrane-bound proteins involved in inflammation; and 3) 
Endocytic lectins, membrane-bound receptors that mediate endocytosis of glycoproteins. 
Endocytic lectins are type-II membrane proteins where the CTL domain is located at the C- 

35 terminal extremity of the proteins, and include the Kupffer Cell Receptor. 

The disclosed NOV42 nucleic acid of the invention encoding a Kupffer Cell Receptor- 
like protein includes the nucleic acid whose sequence is provided in Table 42A, 42C, 42E, 
42G, or a fragment thereof. The invention also includes a mutant or variant nucleic acid any 
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of whose bases may be changed from the corresponding base shown in Table 42A, 42C, 42E, 
or 42G while still encoding a protein that maintains its Kupffer Cell Receptor -like activities 
and physiological functions, or a fragment of such a nucleic acid. The invention further 
includes nucleic acids whose sequences are complementary to those just described, including 
5 nucleic acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
10 chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 32 percent of the bases may be so 
changed. 



15 like protein whose sequence is provided in Table 42B,42D, 42F, or 42H. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 42B, 42D, 42F, or 42H while still encoding a protein 
that maintains its Kupffer Cell Receptor -like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 66 percent of the 

20 residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 



(NOV42) is a member of a "Kupffer Cell Receptor family". Therefore, the NOV42 nucleic 
25 acids and proteins identified here may be useful in potential therapeutic applications 

implicated in (but not limited to) various pathologies and disorders as indicated below. The 
potential therapeutic applications for this invention include, but are not limited to: protein 
therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
30 delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 



The disclosed NOV42 protein of the invention includes the Kupffer Cell Receptor - 



The above disclosed information suggests that this Kupffer Cell Receptor -like protein 



The NOV42 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome, cirrhosis, 
transplantation, arthritis, tendinitis, and/or other diseases and pathologies. 
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NOV42 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV42 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
5 NOVX Antibodies" section below. The disclosed NOV42 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

10 NOV43 

A disclosed NOV43 nucleic acid of 1 108 nucleotides (also referred to as CG56690-01) 
encoding a P2Y Purinoceptor -like protein is shown in Table 43 A. An open reading frame 
was identified beginning with a ATG initiation codon at nucleotides 12-14 and ending with a 
TAA codon at nucleotides 1095-1097. The start and stop codons are shown in bold in Table 

15 43 A, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 43 A. NOV43 nucleotide sequence (SEQ ID NO: 165). 

GTCATGATGTT ATGCTGTCCATTTTGCTTCCTTCCAGGGGAAGCAGAAGCGGGAGCCGTCGTGGAGCTCTGC 
TCCTGGAGGGAGCCTCCCGGGACATGGAGAAGGTGGACATGAATACATCACAGGAACAAGGTCTCTGCCAGT 
TCTCAGAGAAGTACAAGCAAGTCTACCTCTCCCTGGCCTACAGTATCATCTTTATCCTAGGGCTGCCACTAA 
ATGGCACTGTCTTGTGGCACTCCTGGGGCCAAACCAAGCGCTGGAGCTGTGCCACCACCTATCTGGTGAACC 
TGATGGTGGCCGACCTGCTTTATGTGCTATTGCCCTTCCTCATCATCACCTACTCACTAGATGACAGGTGGC 
CCTTCGGGGAGCTGCTCTGCAAGCTGGTGCACTTCCTGTTCTATATCAACCTTTACGGCAGCATCCTGCTGC 
TG ACCTGCAT CT CTGTG CAC C AGTTC C T AGGTGTGTG C C AC C C AC TGTG TTCGCTGCCC T AC CGG AC C CG C A 
GGCATGCCTGGCTGGGCACCAGCACCACCTGGGCCCTGGTGGTCCTCCAGCTGCTGCCCACACTGGCCTTCT 
CCCACACGGACTACATCAATGGCCAGATGATCTGGTATGACATGACCAGCCAAGAGAATTTTGATCGGCTTT 
TTGCCTACGGCATAGTTCTGACATTGTCTGGCTTTCTTTCCCTCCTTGGTCATTTTGGTGTGTATTCACTGA 
TGGTCAGGAGCCTGATCAAGCCAGAGGAGAACCTCATGAGGACAGGCAACACAGCCCGAGCCAGGTCCATCC 
GGACCATCCTACTGGTGTGTGGCCTCTTCACCCTCTGTTTTGTGCCCTTCCATATCACTCGCTCCTTCTACC 
TCACCATCTGCTTTCTGCTTTCTCAGGACTGCCAGCTCTTGATGGCAGCCCAGTGTGGCCTACAAGATATGG 
AGGCCTCTGGTGAGTGTGAGCAGCTGCCTCAACCCAGTCCTGTACTTTCTTTCAAGGGGGGCAAAAATAGAG 
T C AGGCT CC T C CAG AAACT G AGGC AGAAC AAGTTGGGTGAG CAT C C AGC TGGGAGGAAG AGATGC C C AGGGT 
TGAACAGATCTGGGTAATGCCAAGGTGA 



In a search of public sequence databases, the NOV43 nucleic acid sequence, located on 
chromosome 2, has 585 of 924 bases (63%) identical to a gbiGENBANK- 
ID:GDP2Y3|acc:X98283.1 mRNA from Gallus gallus {G.domesticus mRNA for G protein- 
20 coupled P2 receptor) (E = 3.6c" 45 ). 

The disclosed NOV43 polypeptide (SEQ ID NO: 166) encoded by SEQ ID NO: 165 has 
361 amino acid residues and is presented in Table 43B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV43 has a signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
25 NOV43 may also localize to the mitochondrial inner membrane with a certainty of 0.5862, to 
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the mitochondrial intermembrane space with a certainty of 0.41 14, or to the Golgi body with a 
certainty of 0.4000. The most likely cleavage site for NOV43 is between positions 13 and 14: 
SRS-GS. 



Table 43B. Encoded NOV43 protein sequence (SEQ ID NO: 166). 

MLSILLPSRGSRSGSRRGALLLEGASRDMEKVDMNTSQEQGLCQFSEKYKQVYLSLAYSIIFILGLPLNGTV 
LWHSWGQTKRWSCATTYLVNLMVADLLYVLLPFLIITYSLDDRWPFGELLCKLVHFLFYINLYGSILLLTCI 
SVHQFLGVCHPLCSLPYRTRRHAWLGTSTTWALWLQLLPTLAFSHTDYINGQMIWYDMTSQENFDRLFAYG 
IVLTLSGFLSLLGHFGVYSLMVRSLIKPEENLMRTGNTARARSIRTILLVCGLFTLCFVPFHITRSFYLTIC 
FLLSQDCQLLMAAQCGLQDMEASGECEQLPQPSPVLSFKGGKNRVRLLQKLRQNKLGEHPAGRKRCPGLNRS 
_G 

5 

A search of sequence databases reveals that the NOV43 amino acid sequence has 105 
of 261 amino acid residues (40%) identical to, and 153 of 261 amino acid residues (58%) 
similar to, the 328 amino acid residue ptnr:SWISSNEW-ACC:Q98907 protein from Gallus 
gallus (Chicken) (P2Y Purinoceptor 3 (P2Y3) (Nucleoside Diphosphate Receptor)) (E = 0.0). 
10 NOV43 is predicted to be expressed in brain because of the expression pattern of 

(GENB ANK-ID : gb:GENBANK-ID:GDP2Y3|acc:X98283.1) a closely related G.domesticus 
mRNA for G protein-coupled P2 receptor homolog in species Gallus gallus,, 

NOV43 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 43C. 

15 



Table 43C. BLAST results for NOV43 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 282 968 0 | sp | P7 9 92 
8 |P2Y8_XENLA 


P2Y PURINOCEPTOR 
8 (P2Y8) 


537 


111/259 
(42%) 


154/259 
(58%) 


5e-49 


gi | 2707256 | gb| AAC60 
339. l| (AF031897) 


G protein coupled 
P2Y nucleotide 
receptor 
[Meleagris 
gal lopavo] 


374 


107/277 
(38%) 


158/277 
(56%) 


4e-46 


gi | 2495017 | sp| Q9890 
7 | P2Y3_CHICK 


P2Y PURINOCEPTOR 
3 (P2Y3) 
(NUCLEOSIDE 
DIPHOSPHATE 
RECEPTOR) 


328 


105/261 
(40%) 


153/261 
(58%) 


6e-45 


gi | 1072 018 0 | sp |093 3 
61 | P2Y3_MELGA 


P2Y PURINOCEPTOR 
3 (P2Y3) 
(NUCLEOSIDE 
DIPHOSPHATE 
RECEPTOR) 


328 . 


105/269 
(39%) 


155/269 
(57%) 


2e-44 



376 



gi | 13928944 | ref | NP_ 


purinergic 


361 


118/310 


169/310 


6e-44 


113868.1 | 


receptor P2Y, G- 




(38%) 


(54%) 




(NM_031680) 


protein coupled, 
4 ; pyrimidinergic 
receptor P2Y, G- 
protein coupled, 
4 [Rattus 
norvegicus] 











The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 43D. In the ClustalW alignment of the NOV43 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 43D. ClustalW Analysis of NOV43 

1) Novel NOV43 (SEQ ID NO: 166) 

2) gi | 2829680 | sp| P79928 | P2Y8_XENLA P2Y PUR I NOCE PTOR 8 (P2Y8) (SEQ ID NO: 517) 

3) gi | 2707256 |gb|AAC60339 . 1 | (AF031897) G protein' coupled P2Y nucleotide receptor 
[Meleagris gallopavo] (SEQ ID NO: 518) 

4) gi| 2495017 |sp|Q98907|P2Y3_CHICK P2Y PURINOCEPTOR 3 (P2Y3) (NUCLEOSIDE 
DIPHOSPHATE RECEPTOR) (SEQ ID NO: 519)' 

5) gi|l0720180|sp|O9336l|P2Y3_MELGA P2Y PURINOCEPTOR 3 (P2Y3) (NUCLEOSIDE 
DIPHOSPHATE RECEPTOR) (SEQ ID NO: 520) 

6) gi | 13928944 | ref |NP_113868 . 1 | (NM_031680) purinergic receptor P2Y, G-protein 
coupled, 4; pyrimidinergic receptor P2Y, G-protein coupled, 4 [Rattus .norvegicus] 
(SEQ ID NO:521) 



NOV43 

gi | 2829680 | 
gi | 2707256 | 
gi j 2495017 j 
gi | 10720180 | 
gi j 13928944 | 



10 20 30 

|....|....|....|....|..^! 
MLSILLPSRGSRSGSRRGgLLLEGASRDMEKVDMNTSQEQGI 
-MTEDIMATgYPTgLTTPYLPMKLLMNLTNDTEDlg 
-MDAPYRMfBlSp^TPTPTP - - WLGGNTTAAAEJ5 



NOV43 


61 






gi | 2829680 | 


53 






gi [2707256 | 


51 






gi|2495017| 


31 






gi | 10720180 | 


31 






gi | 13928944 | 


41 







NOV43 


120 




gi |2829680 | 


113 


E 


gi |2707256 j 


111 


KVF 


gi |2495017 j 


91 




gi j 10720180 | 


91 


|ft 


gij 13928944| 


101 


TGL 




377 



ii. £1 f li Q rjl.. wj* -„ T\ _ 3. '•"■> 1i, i! ' ii 7/. 



190 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



NOV43 


179 




gi|2829680 | 


172 


TLcmy 


gi|2707256| 


170 


Ticgx 


gi|2495017 j 


151 


IAQGL 


gij 10720180 | 


151 


IAQGL 


gij 13928944 j 


160 


AGC0V 



NOV43 


238 


vras 


gi | 2829680 | 


232 


T0E 


gi [2707256 j 


230 




gi|2495017 j 


211 




gij 10720180 | 


211 




gij 13928944 j 


220 





NOV43 

gi | 2829680 | 
gi j 2707256 j 
gi j 2495017 j 
gij 10720180 | 
gij 13928944 | 



NOV43 

gi|2829680 | 
gi j 2707256 j 
gi j 2495017 j 
gij 10720180 | 
gi j 13928944 j 



NOV43 

gi | 2829680 | 
gij 2707256 j 
gi j 2495017 j 
gi j 10720180 | 
gijl3928944 j 



NOV43 

gi | 2829680 | 
gij2707256 j 
gij 2495017 j 
gi j 10720180 | 
gi j 13928944 j 



NOV43 

gi | 2829680 | 
gij2707256| 
gij 2495017 j 
gi j 10720180 | 
gij 13928944 j 



200 

JdyingqmIi 1 

/spgv: 

JSSISDNSI 

w 

3TQRNR[ 

STgRNl 
MANGTTi: 



250 

I KPE ENLMRTG - OTARA 



210 




220 
I • 

GIvlTgSG 

staBmcll 

SSS|jMALL 

git|t|tg 
it|t^tg 

SSA* 




|PIVSGNQQT;OP§ , 

^]|rsfpspspr||p§y 
1|kdeligla-||hkk 
deligla-Phk] 

gPLPGAGQ SSSR%| 




290 



300 




LTICFLLSQD 




Q 296 


YYA 


g-LLGIK 




Y 290 


YTS 


1-YFQAD 




Q 288 


LIV 


gSSASLP 




P 269 


LIV 


BsSPTLP 




P 269 


YQA 


g-LLQAD 




H 275 


350 




360 



I 



I 



,QKL§QN 342 

jlRTV@RRSSVPNRRCMHTNHP 34 9 

p 334 

308 



t^EGAAQR 

- -estP 



308 
320 



342 

350 QTEPHMTAGP 

335 RPVP TSL 

308 Y L 

308 Y 

321 KPKPRTAASS 




I 



420 
-•I 



400 410 

• • • I • • • J — J - • 

5EHP AGRjKRCPGXiNR 3 59 

||PVi@AEEI PSNGSMVRDENGEGSREHRVEWTDTKEINQMMNRRSTIKRN 4 0 9 

■PSVDSSWGSCCN SESgGMGTj|wS 370 

»KM@ SKWgQDHC&Y 326 

SKWRHDHcStY 3 26 

LHEESISR W&THQDSTFSA 3 55 



440 



450 



480 



359 

410 STDKNDMK 

370 

326 

326 

355 




361 



1 



460 470 

|....|....|....|....|....|....|....|....|....| 

361 

SNYLPYVEWEKEDYETKRENRKTTEQSSKTNAEQDELQTQIDSRLK 4 6 9 

374 

328 

GS 328 

--YE0DRL 361 

490 500 510 520 530 540 

.|....|....|....|....|....|....|....|....|....|....| 

361 

470 RGKWQLSSKKGAAQENEKGHMEPSFEGEGTSTWNLLTPKMYGKKDRLAKNVEEVGYGKEK 52 9 

374 374 

328 328 

328 328 

361 361 



....|... 

361 361 

53 0 ELQNFPKA 53 7 

374 374 

328 328 

328 328 

361 361 



65 Tables 43E lists the domain descriptions from DOMAIN analysis results against 

NOV43. This indicates that the NOV43 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 43E Domain Analysis of NOV43 

gnl | Pf am | pf amOOOOl , 7tm_l, 7 transmembrane receptor (rhodopsin 
family). (SEQ ID NO:810) 

CD-Length = 254 residues, 85.0% aligned 
Score = ill bits (277) , Expect = 8e-26 



NOV43 : 


69 


NGTVLWHS WGQTKRWSCATTYLVNLMVADLLYVL - LPFLI ITYSLDDRWPFGELLCKLVH 


127 






II- 1 + +MI IIIII++I II + 1 + 111+ Mill 




Sbjct : 


2 


NLLVILVILRTKKLRTPTNIFLI^^VADLLFLLTLPPWALYYLVGGDWVFGDALCKLVG 


61 


NOV43 : 


128 


FLFYINLYGSILLLTCISVHQFLGVCHPLCSLPYRTRRHAWLGTSTTW^VVLQ-LLPTL 


186 






ii +i i Mini ii+ ++i + iii ii i i + i i i i i 




Sbjct : 


62 


ALFVVNGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILLVWVL7ALLLSLPPLL 


121 


NOV43 : 


187 


AFSHTDYINGQMIWYDMTSQENFDRLFAYGIVI/TI/5GFI/SZ/I/GHFGVYSLMVRSLIKPE- 


245 






1 + 1 + + + 1 1 1 1+ ++I + I 1 




Sbjct : 


122 


FSWLRTVEEGNTTVCLIDFPEESVKRSYVLLSTLVGFVLPLLVILVCYTRILRTLRKRAR 


181 


NOV43 : 


246 


-ENLMRTGNTARARSIRTILLVCGLFTLCFVPFHIT 2 8 0 








+ + + + + + ++ + +| + | +| M++I + M 




Sbjct : 


182 


SQRSLKRRSSSERKAAKMLLVWWFVLCWLPYHIV 217 





The P2Y Purinoreceptor belongs to the family of G-Protein Coupled Receptors. G- 

20 protein-coupled receptors (GPCRs) constitute a vast protein family that encompasses a wide 
range of functions (including various autocrine, paracrine and endocrine processes). They 
show considerable diversity at the sequence level, on the basis of which they can be separated 
into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of 
families for which there are indications of evolutionary relationship, but between which there 

25 is no statistically significant similarity in sequence [1]. The currently known clan members 
include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal 
mating pheromone receptors, and the metabotropic glutamate receptor family. The rhodopsin- 
like GPCRs themselves represent a widespread protein family that includes hormone, 
neurotransmitter and light receptors, all of which transduce extracellular signals through 

30 interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands 
vary widely in structure and character, the amino acid sequences of the receptors are very 
similar and are believed to adopt a common structural framework comprising 7 
transmembrane (TM) helices. See CMKRL2 (601805). Using degenerate PCR to find cDNAs 
encoding new G protein coupled-receptors in human B cells, Owman et al. (1996) identified a 

35 CMKRL1 cDNA which encodes a 3S2-amino acid polypeptide with a calculated mass of 43 

kD. The nearest homologs of this novel sequence are the chemoattractant leukocyte receptors, 

such as the C5a anaphylatoxin receptor and the FMLP receptor. Northern blotting revealed 

transcripts of 5 kb and 7.5 kb in several tissues of the immune system including spleen, 

thymus, and lymph node. Owman et al. (1996) considered the high level of expression in 
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lymphoid tissues suggestive of the role of CMKRL1 in the regulation of the inflammatory 
system. The authors mapped the CMKRL1 gene to 1 4q 1 1 ,2-q 1 2 by fluorescence in situ 
hybridization. Akbar et al. (1996) used a chicken P2Y3 cDNA to screen a human 
erythroleukemia (HEL) cell cDNA library and cloned a purinoceptor cDNA, which they 
5 termed P2Y7. Sequencing revealed an open reading frame coding for a polypeptide of 352 
amino acids having 7 putative transmembrane domains. The P2Y7 receptor has 23 to 30% 
identity to other P2Y receptors, but forms a unique branch within the P2Y family. Northern 
blot analysis showed that the P2Y7 gene produced a 1 .6-kb transcript which is expressed at 
highest levels in human heart, human skeletal muscle, rat heart, and rat cardiomyocytes and at 

10 lower levels in human brain and human liver. Akbar et al. (1996) noted that its expression in 
HEL cells is below the threshold of detection by Northern blot. Binding and displacement 
assays in COS-7 cells showed that P2Y7 has a high affinity for ATP and much less for UTP 
and ADP. The rank order of affinities in the binding series was distinct from any known for 
the P2Y1-P2Y6 receptors. Like other P2Y receptors, P2YR is coupled to phospholipase C and 

1 5 not to adenylate cyclase. Akbar et al. (1996) speculated that P2Y7 may be the cardiac P2Y 
receptor involved in the regulation of cardiac muscle contraction through modulation of L- 
type calcium currents. Akbar et al. (1996) used PCR on a panel of mouse-rodent somatic cell 
hybrids to localize the P2RY7 gene to human chromosome 14. Somers et al. (1997) did 
sequence tagged site (STS) mapping of the P2RY7 gene using the National Center for 

20 Biotechnology Information (NCBI) database. In this way, they positioned the P2RY7 gene 
between D14S283 and D14S264. Leukotriene B4 (LTB4) is a potent chemoattractant that is 
primarily involved in inflammation, immune responses, and host defense against infection 
(Samuelsson et al., 1987; Chen et al., 1994). LTB4 activates inflammatory cells by binding to 
its cell surface receptor, BLTR. LTB4 can also bind and activate the intranuclear transcription 

25 factor PPAR-alpha, resulting in the activation of genes that terminate inflammatory processes 
(Devchand et al., 1996). Yokomizo et al. (1997) cloned the cDNA encoding a cell surface 
LTB4 receptor that is highly expressed in human leukocytes. Two cDNA clones isolated from 
retinoic acid-differentiated HL-60 cells contained identical open reading frames encoding a 
protein of 352 amino acids and predicted to contain 7 membrane-spanning domains, but 

30 different 5-prime untranslated regions. In Chinese hamster ovary (CHO) cells stably 

expressing this receptor, LTB4 induced increases in intracellular calcium, accumulation of D- 
myo-inositol-l,4,5-triphosphate, and inhibition of adenyiyl cyclase. Furthermore, CHO cells 
expressing exogenous BLTR showed marked chemotactic responses toward low 
concentrations of LTB4 in a pertussis-toxin-sensitive manner. Yokomizo et al. (1997) found 
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that the putative purinoceptor P2Y7 has a primary structure identical to that of one of the 
BLTR clones, HL-5. To determine whether BLTR also functions as a purinoceptor, they 
established stable transformants of BLTR in glioma cells that possess negligible amounts of 
intrinsic purinoceptors. In these cells, up to 300 microM caused no change in intracellular 
5 calcium levels, but significant increases in the calcium concentrations were induced by 

exposure to 10 nanoM LTB4. These results were interpreted to indicate that this receptor is not 
a purinoceptor, but a BLTR. 

The disclosed NOV43 nucleic acid of the invention encoding a P2Y Purinoceptor - 
like protein includes the nucleic acid whose sequence is provided in Table 43A or a fragment 

10 thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 43 A while still encoding a protein 
that maintains its UDP[ Glycosy transferase -like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to those just described, including nucleic acid fragments that are 

15 complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 

20 stability of the modified nucleic acid, such that they may be used, for example, as antisense 

binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 37 percent of the bases may be so changed. 

The disclosed NOV43 protein of the invention includes the P2Y Purinoceptor -like 
protein whose sequence is provided in Table 43B. The invention also includes a mutant or 

25 variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 43B while still encoding a protein that maintains its P2Y Purinoceptor -like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 62 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 

30 (Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this P2Y Purinoceptor -like protein 
(NOV43) is a member of a "P2Y Purinoceptor family". Therefore, the NOV43 nucleic acids 
and proteins identified here may be useful in potential therapeutic applications implicated in 
(but not limited to) various pathologies and disorders as indicated below. The potential 
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therapeutic applications for this invention include, but are not limited to: protein therapeutic, 
small molecule drug target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 
research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing 
5 (but not limited to) those defined here. 

The NOV43 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome , Alzheimer's 
disease, Stroke, Tuberous sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, 
Cerebral palsy, Epilepsy,Lesch-Nyhan syndrome, Multiple sclerosis,Ataxia- 

10 telangiectasia,Leukodystrophies,Behavioral disorders, Addiction, Anxiety, Pain, 
Neuroprotection, and/or other diseases and pathologies. 

NOV43 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV43 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 

15 known in the art, using prediction from hydrophobic ity charts, as described in the "Anti- 

NOVX Antibodies" section below. The disclosed NOV43 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 

20 disorders. 

NOV44 

A disclosed NOV44 nucleic acid of 934 nucleotides (also referred to as CG56692-01) 
encoding a G Protein Coupled Receptor-like protein is shown in Table 44A. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 15-17 and ending 
25 with a TAA codon at nucleotides 921-923. The start and stop codons are shown in bold in 
Table 44A, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 44A. NOV44 nucleotide sequence (SEQ ED NO: 167). 

~TGAC CT TGG AAT CT ATGG AC AT AC C AC AAAAT AT C AC AGAATTTT T CATGCTGGGGCT CTC AC AG AACT C AG 
AGGTACAGAGAGTTCTCTTTGTGGTCTTTTTGCTGATCTATGTGGTCACGGTTTGTGGCAACATGCTCATTG 
TGGTCACTATCACCTCCAGCCCCACGCTGGCTTCCCCTGTGTATTTTTTCCTGGCCAACCTATCCTTTATTG 
ACACCTTTTATTCTTCTTCTATGGCTCCTAAACTCATTGCTGACTCATTGTATGAGGGGAGAACCATCTCTT 
ATGAGTGCTGCATGGCTCAGCTCTTTGGAGCTCATTTTTTGGGAGGTGTTGAGATCATTCTGCTCACAGTGA 
TGGCTTATGACCGCTATGTGGCCATCTGTAAGCCCCTGCACAATACTACCATCATGACCAGGCATCTCTGTG 
CCATGCTTGTAGGGGTGGCTTGGCTTGGGGGCTTCCTGCATTCATTGGTTCAGCTCCTCCTGGTCCTTTGGT 
TGCCCTTCTGTGGGCCCAATGTGATCAATCACTTTGCCTGTGACTTGTACCCTTTGCTGGAAGTTGCCTGCA 
CCAATACGTATGTCATTGGTCTGCTGGTGGTTGCCAACAGTGGTTTAATCTGCCTGTTGAACTTCCTCATGC 
TGGCTGCCTCCTACATTGTCATCCTGTACTCCTTGAGGTCCCACAGTGCAGATGGGAGATGCAAAGCCCTCT 
CCACCTGTGGAGCCCACTTCATTGTTGTTGCCTTGTTCTTTGTGCCCTGTATATTTACTTATGTGCATCCAT 
TTTCTACTTTACCTATAGACAAAAATATGGCATTATTTTATGGTATTCTGACACCTATGTTGAATCCACTCA 
TTTATACCCTGAGAAATGAAGAGGTAAAAAATGCCATGAGAAAGCTCTTTACATGGTAAGAAATTGCAGG 
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i ri o m. 4? ... n i t n ;p 

In a search of public sequence databases, the NOV44 nucleic acid sequence, located on 
chromosome 7, has 783 of 920 bases (85%) identical to a gb : GENB ANK- 
ID:AB030895|acc:AB030895.1 mRNA from Mus musculus (gene for odorant receptor 
MORI 8, complete cds) (E = 4.5e~ 146 ). 
5 The disclosed NOV44 polypeptide (SEQ ID NO: 1 68) encoded by SEQ ID NO: 1 67 has 

302 amino acid residues and is presented in Table 44B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV44 has a signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV44 may also localize to the Golgi body with a certainty of 0.4000, to the endoplasmic 
10 reticulum (membrane) with a certainty of 0.3000, or to the mitochondrial inner membrane with 
a certainty of 0.0300. The most likely cleavage site for NOV44 is between positions 39 and 
40: VCG-NM. 



Table 44B. Encoded NOV44 protein sequence (SEQ ID NO: 168). 

MDIPQNITEFFMLGLSQNSEVQRVLFVVFLLI YVVTVCGNMLIVVTITSSPTLiASPVYFFIjANLSFIDTFYS 
SSMAPKLI ADSLYEGRTI S YECCMAQLFGAHFL.GGVE 1 1 LLTVMAYDRYVAI CKPLHNTTIMTRHLCAMIjVG 
VAWLGGFLHSLVQLLLVLWLPFCGP1WINHFACDLYPLLEVACTNTYVIGLLVVANSGLICLLNFLMLAASY 
IVILYSLRSHSADGRCKALSTCGAHFIWALFFVPCIFTYVHPFSTLPIDKNMALFYGILTPMLNPLIYTLR 
NEEVKNAMRKLFTW 



15 A search of sequence databases reveals that the NOV44 amino acid sequence has 257 

of 301 amino acid residues (85%) identical to, and 280 of 301 amino acid residues (93%) 
similar to, the 308 amino acid residue ptnr:SPTREMBL-ACC:Q9R0K2 protein from Mus 
musculus (Mouse) (Odorant Receptor MOR18) (E - 5.0e-138). 

NOV44 also has homology to the amino acid sequences shown in the BLASTP data 

20 listed in Table 44C. 



Table 44C. BLAST results for NOV44 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 17472367 |ref |XP_ 
061659.1 | 
(XM_061659) 


similar to 
odorant receptor 
16 (H. sapiens) 
[Homo sapiens] 


324 


302/302 
(100%) 


302/302 
(100%) 


e-143 


gi | 11496249 | ref | NP 
067343. 1| 
(NM_021368) 


odorant receptor 
16 [Mus musculus] 


308 


257/301 
(85%) 


280/301 
(92%) 


e-127 


gi | 11464995 | ref |NP 
065261. l| 
(NM 020515) 


gene for odorant 
receptor A16 [Mus 
musculus] 


302 


234/300 
(78%) 


262/300 
(87%) 


e-111 


gi | 17459946 | ref |XP_ 
062088.1 | 
(XM_062088) 


similar to 
odorant receptor 
16 (H. sapiens) 
[Homo sapiens] 


316 


191/295 
(64%) 


232/295 
(77%) 


9e-95 
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gi | 17472365 | ref | XP 


similar to 


544 


183/296 


230/296 


le-91 


061658. 1| 


odorant receptor 




(61%) 


(76%) 




{XM 061658) 


16 (H. sapiens) 
[Homo sapiens] 











The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 44D. In the ClustalW alignment of the NOV44 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 44D. ClustalW Analysis of NOV44 

1) Novel NOV44 (SEQ ID NO: 168) 

2) gi | 17472367 | ref |XP_061659. 1 | (XM_061659) similar to odorant receptor 16 (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 522) 

3) gi | 11496249 | ref |NP_067343 . 1 | (NM_021368) odorant receptor 16 [Mus musculus] (SEQ 
ID NO: 523) 

4) gi | 11464995 | ref | NP_065261 . 1 | (NM_020515) gene for odorant receptor A16 [Mus 
musculus] (SEQ ID NO: 524) 

5) gi | 17459946 | ref |XP_062088 . 1 | (XM_062088) similar to odorant receptor 16 (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 525) 

6) gi | 17472365 | ref |XP_061658 . 1 | (XM_061658) similar to odorant receptor 16 (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 526) 



10 



20 



30 



NOV44 

gi | 17472367 | 
gi | 11496249 | 
gi j 11464995 | 
gi | 17459946 j 
gi 1 17472365 | 



40 



50 



60 



MTQISSNAFSRDFQNSNAFEVQVKDPIHVEDVPGPKSEFCSVFPSTPQASGNFQNQIFQD 



1 
1 
1 
1 
1 

60 



120 



NOV44 

gi|l7472367| 
gijll496249 j 
gi | 11464995 j 
gijl7459946| 
gi|l7472365| 




NOV44 


39 


gi | 17472367 | 


61 


gij 11496249 | 


39 


gij 11464995| 


39 


gij 17459946 | 


39 


gij 17472365 j 


121 


NOV44 


98 


gi |17472367 | 


120 



130 



140 



150 



160 



170 



180 




190 
..I.. 



200 



210 



220 



230 



240 



qlfgahflggvei illtvmaydryvaickplhgtt 
qlfBahflggvei illtvmaydryvaickplhStt 
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15 



20 



25 



30 



35 



40 



45 



50 



55 



gi | 11496249 | 
gi|ll464995| 
gi | 17459946 j 
gi 1 17472365 I 



NOV44 


158 


gi 1 17472367 | 


180 


gij 11496249] 


158 


gi | 11464995 | 


158 


gi | 17459946 | 


159 


gi j 17472365 j 


240 


NOV44 


218 


gi | 17472367 | 


240 


gi 1 11496249 j 


218 


gi j 11464995 j 


218 


gi 1 17459946 j 


219 


gi j 17472365 j 


300 


NOV44 


278 


gi|l7472367 | 


300 


gij 11496249| 


278 


gi|H464995| 


278 


gi jl7459946 j 


279 


gi 1 17472365 j 


352 


NOV44 


302 


gi | 17472367 | 


324 


gi j 11496249 j 


308 


gi j 11464995 j 


302 


gi j 17459946 j 


316 


gi j 17472365 j 


412 


NOV44 


302 


gi | 17472367 | 


324 


gi | 11496249 | 


308 


gij 11464995 j 


302 


gij 17459946 | 


316 


gi j 17472365 j 


472 


NOV44 


302 


gi| 17472367] 


324 


gi|H496249j 


308 


gij 11464995] 


302 


gij 17459946 j 


316 


gij 17472365 j 


532 





390 
..|.. 



400 
..|.. 



410 



420 
■-I 



302 
324 



IEXRSEV VGA 3 08 

RiC 3 02 

tWNl^LMV VSDEKENIKL 316 

I I WGNLFISVI I FISPALEHFFEAAEI ILLSVMAY 411 



430 



440 



450 
..|.. 



460 



470 
..|.. 



480 
■•I 



302 
324 
308 
302 
316 



490 



500 



510 



520 



530 



540 



302 
324 
308 
302 
316 



550 
..(... 

302 

324 

308 

302 

316 

532 KKLWKQ I MTTDDK 544 



60 



Table 44E lists the domain description from DOMAIN analysis results against 
NOV44. This indicates that the NOV44 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 44E Domain Analysis of NOV44 

gnl | Pf am| pf amOOOOl , 7tm_l, 7 transmembrane receptor (rhodopsin 
family). (SEQ ID NO:810) 

CD-Length = 254 residues, 100.0% aligned 
Score = 82.0 bits (201), Expect = 4e-17 



NOV43 : 


39 


Sbjct : 


1 


NOV4 3 : 


99 


Sbjct : 


61 


NOV4 3 : 


159 


Sbjct : 


121 


NOV4 3 : 


219 


Sbjct : 


181 


NOV4 3 : 


272 


Sbjct : 


241 



GNMLIVVTITSSPTLASPVYFFLANLSFIDTFYSSSMAPKLiIADSIjYEGRTISYECCMAQ 98 

II+I+++ I + I +1 II ll + I + ++ I + + I 

GNLLVILVILRTKKLRTPTNI FLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 6 0 

LFGAHFLGGVEI ILLTVMAYDRYVAICKPLHNTTIMTRHLCAMLVGVAWLGGFiHSIi VQL 158 

I l + l I I ++ I I l + l I II II +1+ + |+ I I I I 

GAL FWNGYAS I LLLTAI S I DRY LA I VH P LR YRR I RT PRRAKVL I LL VWVLALLLS LP PL 120 



1 + 



+ lll + ll 
LAYVNSCLNP II Y 2 54 

G-protein-coupled receptors (GPCRs) constitute a vast protein family that 
encompasses a wide range of functions (including various autocrine, paracrine and endocrine 
processes). They show considerable diversity at the sequence level, on the basis of which they 
can be separated into distinct groups. We use the term clan to describe the GPCRs, as they 
embrace a group of families for which there are indications of evolutionary relationship, but 
between which there is no statistically significant similarity in sequence. The currently known 
clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP 
receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor 
family. 

The rhodopsin-like GPCRs themselves represent a widespread protein family that 
includes hormone, neurotransmitter and light receptors, all of which transduce extracellular 
signals through interaction with guanine nucleotide-binding (G) proteins. Although their 
activating ligands vary widely in structure and character, the amino acid sequences of the 
receptors are very similar and are believed to adopt a common structural framework 
comprising 7 transmembrane (TM) helices 

The disclosed NOV44 nucleic acid of the invention encoding a G Protein Coupled 
Receptor -like protein includes the nucleic acid whose sequence is provided in Table 44A or 
a fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 44A while still encoding a 
protein that maintains its UDP[ Glycosyltransferase -like activities and physiological 

386 



o m 7 p n :i n i 7? .1. h-ip 



functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
5 include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 

10 acids, and their complements, up to about 15 percent of the bases may be so changed. 

The disclosed NOV44 protein of the invention includes the G Protein Coupled 
Receptor -like protein whose sequence is provided in Table 44B. The invention also includes 
a mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 44B while still encoding a protein that maintains its G Protein Coupled 

15 Receptor -like activities and physiological functions, or a functional fragment thereof. In the 
mutant or variant protein, up to about 39 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this G Protein Coupled Receptor -like 

20 protein (NOV44) is a member of a "G Protein Coupled Receptor family". Therefore, the 
NOV44 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 

25 targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV44 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Systemic lupus erythematosus , Autoimmune disease, 

30 Asthma, Emphysema, Scleroderma, allergy, ARDS, and/or other diseases and pathologies. 

NOV44 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV44 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
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NOVX Antibodies" section below. The disclosed NOV44 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
5 disorders. 

NOV45 

A disclosed NOV45 nucleic acid of 994 nucleotides (also referred to as CG56694-01) 
encoding a Mas Proto-Oncogene-like protein is shown in Table 45A. An open reading frame 
was identified beginning with a ATG initiation codon at nucleotides 17-19 and ending with a 
10 TGA codon at nucleotides 980-982. The start and stop codons are shown in bold in Table 
45A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 45A. NOV45 nucleotide sequence (SEQ ID NO:169). 

ACTAGGGTTCCTGAGCATGGATCCAACCATCCCAGCCTTGGGTACAGAACTGACACCAATCAATGGACGGGA 
GGAGACTCCTTGCTACAAGCAAACCCTGAGCCTCACAGGGCTGACGTGCATCGTTTCCCTTGTCGGGATGAC 
AGGAAATGCAGTCGTGCTCTGGCTCCTGGGCTTCCGCATGCGCAGGAACGCCTTCTCCATCTACATCTTCAA 
CCTGTCCATGGCCGACTTCCTCTTTCTCAGAAGCCACATTATACGTTTTCCGTTAAGCCTCATCAATATCCT 
CCATCCCATCTTCAAAATCCTCAGCCCTGTGATGATGTTTTCCTACCTTGCAAGCCTGAGCTTTCTAAGCGC 
CATGAGCACCGAGCGCTGCCTGTACGTCCTGTGGCCCATCTGGGAGCGCTGCCGCCCCCGCCCCTACACCTG 
TCAGCGGTCGTGTGTGTCATGCTCTGGGCCCTGTCTCTGCTGCGGAGCGTCCTGGAGTGGAGTTTCTGTGAC 
TTCCTGTTTAGTGGTGCTGATTCTGTTTGGTGTAAAACATCAGATTTCATCATAGTAGGGGGGCTGATTTTT 
TTATGTGTGGCTCTCTGTGGTTCCAGCCTGGTCCTGCTGGTCAGGATCCTTTGTGGGTCCCGGAAGATGCCA 
CTGACCAGGCTGTACGTGACCATCCTGCTCATAGCGCTGGTCTTCCTCCTCTGTGGCCTGCCCTTTGGCATT 
CGGTTTTTCCTATTTTCATGGAACCACGTGGATTTGGAAGTCTTATATTGTCACGTTCATCTAGTTTCCATT 
TTCCTTTCCTCTCTTAACGGCCAACCCCAACATTTACTTCTTCGTGGGCTCCTTAAGGCAGTGTCAAAAAAG 
GCAGAACCTGAAGCTGGTTCTCCAGAGGGCTCTGCAGGACACGACTGAGGTGAATGAAGGTGGACGATGGCT 
TCCTGAGGAAACCCTGGAGCTGTCAGGAAGCAGATTTGGGCAGTGAGGAAGAGCCTCT 



In a search of public sequence databases, the NOV45 nucleic acid sequence, located on 
chromosome 7, has 353 of 580 bases (60%) identical to a gb:GENBANK- 
15 ID:I08606|acc:I08606.1 mRNA from Unknown. (Sequence 1 from Patent WO 8707472) (E = 



The disclosed NOV45 polypeptide (SEQ ID NO: 170) encoded by SEQ ID NO: 169 has 
319 amino acid residues and is presented in Table 45B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV45 has no signal peptide and is 
20 likely to be localized to the plasma membrane with a certainty of 0.6400. Alternatively, 

NOV45 may also localize to the Golgi body with a certainty of 0.4600, to the endoplasmic 
reticulum (membrane) with a certainty of 0.3700, or to the endoplasmic reticulum (lumen) 
with a certainty of 0.1000. 



Table 45B. Encoded NOV45 protein sequence (SEQ ID NO: 170). 

MIQLLITSGLCSQGHQTRVPEHGSNHPSLGYRTDTNQWTGGDSLLQANPEPHRADVHRFPCRDDRKCSRAfA 
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PGLPHAQERLLHLHLQPVHGRLPLSQKPHYTFSVKPHQYPPSHLQNPQPCDDVFLPCKPELSKRHEHRALPV 
RPVAHLGALPPPPLHLSAWCWLWALSLLRSVLEWSFCDFLFSGADSVWCKTSDFI I VGGLI FL CVALCGS 
SLVLLVRILCGSRKMPLTRLYVTILLIALVFLLCGLPFGIRFFLFSWNHVDLEVLYCHVHLVSIFLSSLNGQ 
PQHLLLRGLLKAVSKKAEPEAGSPEGSAGHD 



A search of sequence databases reveals that the NOV45 amino acid sequence has 50 of 
168 amino acid residues (29%) identical to, and 87 of 168 amino acid residues (51%) similar 
to, the 378 amino acid residue ptnr:SWISSPROT-ACC:P35410 protein from Homo sapiens 
(Human) (Mas-Related G Protein-Coupled Receptor MRG) (E = 5.0e-138). 

NOV45 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 45 C. 



Table 45C. BLAST results for NOV45 



Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 15546 062 |gb| AAK9 
1804. 1| (AY042213) 


MrgXl G protein- 
coupled receptor 
[Homo sapiens) 


322 


108/130 
(83%) 


116/130 • 
(89%) 


2e-56 


gi | 17472340 | ref |XP 
061650.1 | 
<XM_061650) 


similar to MrgXl 
G protein-coupled 
receptor (H. 
s ap i en s ) [ Homo 
sapiens] 


1589 


108/130 
(83%) 


116/130 
(89%) ' 


3e-56 


gi | 16876453 | ref | NP 
473372 . 1 | 
(NM 054031) 


G protein-coupled 
receptor MRGX3 
[Homo sapiens] 


322 


104/130 
(80%) 


118/130 
(90%) 


2e-53 


gi | 17461239 | ref | XP 
062249.1 | 
(XM_062249) 


similar to MrgX3 
G protein- coupled 
receptor (H. 
sapiens) [Homo 
sapiens] 


322 


103/130 
(79%) 


118/130 
(90%) 


7e-53 


gi | 16876455 | ref | NP 
473373. l| 
(NM 054032) 


G protein-coupled 
receptor MRGX4 
[Homo sapiens] 


322 


97/124 
(78%) 


104/124 
(83%) 


2e-46 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 45D. In the ClustalW alignment of the NOV45 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 

« 

function. 



Table 45D. ClustalW Analysis of NOV45 

1) Novel NOV4 5 (SEQ ID NO: 170) 

2) gi | 15546062 |gb|AAK91804 .1 | (AY042213) MrgXl G protein-coupled receptor [Homo 
sapiens] (SEQ ID NO:527). - 



389 



3) gi | 17472340 | ref |XP_061650 . 1 | (XM 
receptor (H. sapiens) [Homo sapiens] 

4) gi | 16876453 | ref |NP_473372 . 1 | (NM 
sapiens] (SEQ ID NO: 529) 

5) gi | 17461239 | ref |XP_062249 . 1 | {XM 
receptor (H. sapiens) [Homo sapiens] 

6) gi | 16876455 | ref |NP_473373 . 1 | (NM 
sapiens] (SEQ ID NO: 531) 



061650) similar to MrgXl G protein-coupled 
(SEQ ID NO: 52 8) \ 

054031) G protein-coupled receptor MRGX3 [Homo 

062249) similar to MrgX3 G protein-coupled 
{SEQ ID NO:530) 

054 03 2) G protein- coupled receptor MRGX4 [Homo 



NOV45 

gi | 15546062 | 
gi | 17472340 | 
gi j 16876453 | 
gi j 17461239 j 
gi | 16876455 j 



NOV45 

gi 1 15546062 | 
gi j 17472340 j 
gi j 16876453 j 
gi | 17461239 | 
gi j 16876455 | 



NOV45 

gi | 15546062 | 
gi j 17472340 | 
gi | 16876453 j 
gi j 17461239 | 
gi|l6876455 | 



NOV4 5 

gi|l5546062 | 
gi | 17472340 | 
gi|l6876453| 
gi|l7461239| 
gi | 16876455 j 



1 
1 
1 
1 
1 
1 



61 
61 
61 
61 
61 
61 




130 140 
....|....|....|....|....| 

pcdd^hlpckpelskrhehraBpBrp 



150 160 170 180 

_| I I — j I I 

LGALPPPPLHLS AV^CVML WAL'SLL RgVLE W 18 0 

LgG OS^^ pgA BB o iBBB 1, 1 8 0 

FTGLSFLSAMSTERCLCVLWPICLVliL 18 0 

180 
IT 18 0 

IsEBEffiMSaP 18 0 




240 

[RfjgLCGS RKMP LTR L YHT I 24 0 



|EV3IFCHMHLVSIF 24 0 



FS 24 0 
FS 24 0 
IY 240 



NOV45 

gi | 15546062 
gi | 17472340 
gi j 16876453 
gi j 17461239 
gi | 16876455 



NOV45 

gi | 15546062 | 
gi j 17472340 | 
gi|l6876453| 
gi|l7461239| 
gi j 16876455 | 



241 
241 
241 
241 
241 
241 



301 
301 
301 
301 
301 
301 




.|....|....|....|....|....| 
HffiHLVSIFLiSLjSjGQPQHLLLRGLLKA 3 00 



250 260 270 280 290 300 

IipIALVFXj3c GLP FGpRF[gflFjS WgSHVDLEjyx|_ 

5S j3RQRQ[SRQNLKgljLQRAIjQDMPEVKV^GGFBREPW gC|^ElADS 300 
SSf 
33$ 



FL SgLNS SANPI I YFFVGS FRQRQNRQNLKLVLQRALQD 
FL SgLNS S ANP 1 1 YFFVGS FRQRQNRQNLKLVLQRALQD 



LSg^NSSANP I I YFFVGS FRQRQNRQNLKLVLQRALQD 



310 



320 




330 



340 
..|.. 



350 



TP 




300 


TP 




300 


KP 




300 


360 



319 

322 



GTVTAEYRRRNLNAHj^HHIGPNFSETLQDENTRKESRDRADLIPECASPELSNTEESSEV 36 0 




LELSGSR 
LELSGSR 
LELSGSR 



LEQ 
LEQ 
LGP 



322 
322 
322 



NOV45 

gi | 15546062 | 
gi | 17472340 j 
gi j 16876453 j 



370 
..|.. 



380 



390 
..|.. 



400 



410 
..|.. 



420 
•-I 



319 319 

322 322 

3 61 TESQKGPPPLAQGKSSPLDKREKDKGKYQELSSSIVSYGFCYKIVSVFPPRDTRLGFLSV 42 0 

322 322 

390 



gi I 17461239 I 
gi j 16876455 | 



322 
322 



322 
322 



430 440 450 460 470 480 

.-.-|....|-..-|----|....|.-..|....|....|....|....|....|....| 
NOV45 319 319 

gi|l5546062| 322 322 
gi j 17472340 j 421 DPTIPALGTELTPINGREETPCYKQTLSLTGLTCIVSLVGMTGNAWLWLLGFRMRRNAF 480 
gi j 16876453 j 322 322 
gi|l7461239| 322 322 
gij 16876455 j 322 322 

490 500 510 520 530 540 

...,|....|....|....|....|....|....|....|....|....|....|....| 
NOV45 319 319 

gi|l5546062| 322 322 
gi j 17472340 | 481 SIYIFISTLSMADFLFLRSHIIRFPLSLINILHPIFKILSPVMMFSYLASLSFLSAMSTERC 540 
gi|l6876453| 322 ■ 322 

gi|l7461239| 322 322 
gi|l6876455| 322 322 

550 560 570 580 590 600 

....|....|....|..-.|....|....|....|....|....|....|....|....| 
NOV45 319 319 

gi | 15546062 | 322 322 
gi j 17472340 j 541 LLVLLVRILCGSRKMPLTRLQCQNRQNLKLVLQRALQDTTEVNEGGRWLPEETLELSGSR 600 
gi|l6876453| 322 322 
gi j 17461239 | 322 322 
gi j 16876455 | 322 322 

610 620 630 640 650 660 

....|....|.-..|-..-|....|-...|....|....|....|....|....|....| 

NOV45 319 319 

gi|l5546062| 322 322 

gi j 17472340 j 601 FGQSVHCLWEEEWRGCGSLLMSCLLSKVNDITSSWFHIKEEHAGHPGVFSRKVTRLGFLS 660 

gi|l6876453| 322 322 

gi|l7461239| 322 322 

gi|l6876455| 322 322 



NOV45 

gi | 15546062 | 
gi | 17472340 j 
gij 16876453 j 
gi | 17461239 | 
gi | 16876455 | 



670 
..|.. 



680 



690 



700 
..|.. 



710 



720 



319 319 

322 322 

661 MDPTISTLDTELTPINGTEETLCYKQTLSLTVLTCIVSLVGLTGNAWLWLLGCRMRRNA 72 0 

322 322 

322 * 322 

322 322 



NOV45 


319 


gi 


15546062 | 


322 


gi 


17472340 j 


721 


gi 


16876453 | 


322 


gi 


17461239 | 


322 


gi 


16876455 | 


322 



730 740 750 760 770 780 

....|..-.|--..|.-..|....|....|....|....|....|....|....|....| 

319 

322 

FSIYILNLAAADFLFLSGRLIYSLLSFISIPHTISKILYPVMMFSYFAGLSFLSAVSTER 78 0 

322 
322 
322 



790 800 810 820 830 840 

..-.|....|...-|....|....|....|....|....|....|....|....|....| 
NOV45 319 ■ 319 

gi|l5546062 | 322 322 
gij 17472340 j 781 CLSVLWPIWYRCHRPTHLSAWCVLLWALSLLRSILEWMLCGFLFSGADSAWCQTSDFIT 840 
gi|l6876453| 322 322 
gi|l7461239| 322 322 
gi|l6876455| 322 322 



NOV45 

gi|l5546062| 



319 
322 



850 



860 



870 
..|.. 



880 



391 



890 



900 
-I 



319 

322 



gi|l7472340| 
gi|l6876453 | 
gi|l7461239| 
gi | 16876455 | 



841 VAWLIFLCWLCGSSLVLLIRILCGSRKIPLTRLYVTILLTVLVFLLCGLPFGIQFFLFL 90 0 



322 
322 
322 



322 
322 
322 



NOV45 

gi| 15546062 | 
gi | 17472340 j 
gi j 16876453 | 
gijl7461239| 
gi|l6876455| 



910 
..|.. 



920 



930 



940 



950 



960 
•-I 



319 319 

322 322 

901 WIHVDREVLFCHVHLVSIFLSALNSSANPII YFFVGSFRQRQNRQNLKLVIjQRALQDASE 96 0 

322 322 

322 322 

322 322 



NOV45 


319 




| 15546062 | 


322 


gi| 


| 17472340 | 


961 


gi| 


16876453 | 


322 


gi| 


17461239 j 


322 


gi| 


16876455 | 


322 



970 980 990 1000 1010 1020 

....|.-..|....|....|....|....|....|....|....|....|....|....| 

319 
322 

VDEGGGQLPEEILELSGSRLEQGTRLGFLSMDPTIPVLGTELTPINGTEETPCYNQTLSF 1020 

322 
322 
322 



NOV45 


319 


gi| 


| 15546062 | 


322 


gi| 


| 17472340 j 


1021 


gi| 


j 16876453 j 


322 


gi| 


17461239 | 


322 


gi| 


16876455 | 


322 



1030 
..|... 



1040 
..|... 



1050 
.|... 



1060 
..|... 



1070 
..|... 



1080 



319 

322 

1021 TVLTCIVSLVALTGNAWLWLLGFRMCRNAVSIYILNLVAANFLLLSSHIIHPCYTSSIT 10 8 0 

322 

322 
322 



NOV45 


319 


gi| 


|15546062 | 


322 


gi| 


1 17472340 | 


1081 


gi| 


| 16876453 | 


322 


gi| 


17461239] 


322 


gi| 


16876455 j 


322 



1090 



1100 



1110 

..I... 



1120 



1130 



1140 



319 

322 

10 81 PEYSECHEHQALPVNPVAHLPGPSGQDPLWI PEDAADQAVHDHLLTVLVFLLCGLPIGIQ c l 14 0 

322 
322 
322 



NOV45 

gi|l5546062| 
gi j 17472340 | 
gi|l6876453 | 
gi|l7461239| 
gi|l6876455| 



319 

322 



1150 



1160 



1170 
..|... 



1180 
..|... 



1190 
..|... 



1200 



319 
322 



1141 WALFSRIHMDWEVLYSHVHLPSIFLSSLNSSANPIIYFFMGFVRQHQNWQNLKLVLQRDL 12 0 0 
322 322 
322 322 
322 322 



1210 1220 1230 1240 1250 1260 

....|.-..|....|....|....|....|....|....|....|....|....|....| 

NOV45 319 319 

gi|l5546062| 322 , 322 

gi | 17472340 j 1201 QDTPEVDEEQNNVLRYI I LYIMYTLDTMSLHKNSLGNWFYGKSS AVI CLQNGDMLAL VSS 1260 

gijl6876453| 322 322 

gi|l7461239| 322 322 

gi|l6876455| 322 322 

1270 1280 1290 1300 1310 1320 

....|....|....|....|.-..|....|....|....|....|....|....|....| 

NOV45 319 319 

gi|l5546062| 322 322 

gi | 17472340 j 1261 KDNHGPFPLRTDIGGTGLLFLAVTAEYRRRNLNAHAHHIGPIVLKPCRMQTQGRREKTQW 1320 

gi|l6876453| 322 322 

gi|l7461239| 322 322 

gi|l6876455| 322 322 



1330 
..|... 



1340 
..|... 



1350 
|... 



1360 



1370 



1380 



392 



NOV45 

gi | 15546062 | 
gi | 17472340 j 
gi 1 16876453 j 
gijl7461239| 
gi j 16876455 j 



319 319 

322 322 

13 21 ISFLSVYVEISPKLHNTKGSSEVTESQKGSTSLARGSTSSTLDRRRRKDAQQSHIEPHFK 13 80 

322 322 

322 322 

322 322 



10 



15 



NOV45 

gi|l5546062 | 
gi j 17472340 | 
gi j 16876453 j 
gi j 17461239 | 
gi | 16876455 | 



1390 1400 1410 1420 1430 1440 

....|-...|.-..|....|....|....|....|....|....|....|....|....| 
319 319 
322 322 
13 81 GTLWNLRGTRLGFLSMNPTIPALDTEIAPISDTEETHPHRCGMEVLVLIVLILIIDLVG 1440 
322 322 
322 322 
322 322 



20 



25 



30 



35 



40 



NOV45 

gi|l5546062| 
gi | 17472340 j 
gi | 16876453 | 
gi j 17461239 | 
gi|16876455| 



NOV45 

gi | 15546062 | 
gi j 17472340 | 
gi j 16876453 j 
gi | 17461239 j 
gi | 16876455 | 



NOV45 

gi | 15546062 | 
gi j 17472340 | 
gi j 16876453 j 
gi j 17461239 | 
gi j 16876455 j 



319 

322 

1441 

322 

322 

322 



319 

322 

1501 

322 

322 

322 



319 

322 

1561 

322 

322 

322 



1450 
..|... 



1460 
..|... 



1470 
..|... 



1480 
..|... 



1490 
,.|... 



1500 



LAGNAVMLWLLGFCMHSNTFSLYILNLARADFLCTCFQI ITFINFFSDFVSSLSIHFSRF 



319 

322 

1500 

322 

322 

322 



1560 



1510 1520 1530 1540 1550 

.-..|....|....|....|....|....|....|....|....|....|....|....| 

319 
322 

VTTVLFSACITGLSMLSTISTEHRLSVLWPICSANPIIYFFMGSFRQLQNRKTLKLVLQR 1560 

322 
322 
322 



1570 



1580 



ALQDMLEVDEGGGQLPEETLKLSGSRLiGP 



319 

322 

1589 

322 

322 

322 



45 

Table 45E lists the domain description from DOMAIN analysis results against 
NOV45. This indicates that the NOV45 sequence has properties similar to those of other 
proteins known to contain this domain. 

50 

Table 45E Domain Analysis of NOV45 

gnl | Pf am | pf amOOOOl , 7tm_l, 7 transmembrane receptor (rhodopsin 
family). (SEQ ID NO: 810) 

CD-Length = 254 residues, only 55.9% aligned 
Score = 41.2 bits (95), Expect = 9e-05 



NOV44 : 165 CVMLWALSLLRSVLE WSFCDFLFSGADSVWCKTSDFI IVGGLIFLCV 211 

+++I M I 1+ +1 I + | I I I 

Sbjct: 105 ILLVWVLALLLSLPPLLFSWLRTVEEGNTTVCLIDFPEESVKRSYVLLSTLVGFVLPLLV 164 

55 - 
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NOV44 : 212 ALCGSSLVLL VRILCGSRKMPL TRLYVTILLIALVFLLCGLPFGIRFFLFSWN 264 

I - -I II + + I+ + +11+11 11+ I II 

Sbjct: 165 ILVCYTRILRTLRKRARSQRSLKRRSSSERKAAKMLLVWWFVLCWLPYHIVLLLDSLC 224 

5 NOV44 : 2 65 HVDLEVLYCHVHLVSIFLSSLN 2 86 

+ + + | ++++ | + +| 

Sbjct: 225 LLSIWRVLPTALLITLWLAYVN 246 

10 The Mas Proto-Oncogene belongs to the family of G-Protein Coupled Receptors.G- 

prote in-coupled receptors (GPCRs) constitute a vast protein family that encompasses a wide 
range of functions (including various autocrine, paracrine and endocrine processes). They 
show considerable diversity at the sequence level, on the basis of which they can be separated 
into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of 

1 5 families for which there are indications of evolutionary relationship, but between which there 
is no statistically significant similarity in sequence [1]. The currently known clan members 
include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal 
mating pheromone receptors, and the metabotropic glutamate receptor family. 

The human mas oncogene was originally detected by its ability to transform NIH 3T3 

20 cells. We previously showed that the protein encoded by this gene is unique among cellular 
oncogene products in that it has seven hydrophobic potential transmembrane domains and 
shares strong sequence similarity with a family of hormone-receptor proteins. We have now 
cloned the rat homolog of the mas oncogene, determined its DNA sequence, and examined its 
expression in various rat tissues. A comparison of the predicted sequences of the rat and 

25 human mas proteins shows that they are highly conserved, except in their hydrophilic amino- 
terminal domains. Our examination of the expression of mas, determined by RNA-protection 
studies, indicates that high levels of mas RNA transcripts are present in the hippocampus and 
cerebral cortex of the brain, but not in other neural regions or in other tissues. This pattern of 
expression and the similarity of mas protein to known receptor proteins suggest that mas 

30 encodes a receptor that is involved in the normal neurophysiology and/or development of 
specific neural tissues. 

The rhodopsin-like GPCRs themselves represent a widespread protein family that 
includes hormone, neurotransmitter and light receptors, all of which transduce extracellular 
signals through interaction with guanine nucleotide-binding (G) proteins. Although their 

35 activating ligands vary widely in structure and character, the amino acid sequences of the 
receptors are very similar and are believed to adopt a common structural framework 
comprising 7 transmembrane (TM) helices 

The disclosed NOV45 nucleic acid of the invention encoding a Mas Proto-Oncogene- 
like protein includes the nucleic acid whose sequence is provided in Table 45A or a fragment 
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thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 45A while still encoding a protein 
that maintains its Mas Proto-Oncogene -like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
5 are complementary to those just described, including nucleic acid fragments that are 

complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 

10 derivatized. These modifications are carried out at least in part to enhance the chemical 

stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 40 percent of the bases may be so changed. 

The disclosed NOV45 protein of the invention includes the Mas Proto-Oncogene-like 

15 protein whose sequence is provided in Table 45B. The invention also includes a mutant or 

variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 45B while still encoding a protein that maintains its Mas Proto-Oncogene-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 32 percent of the residues may be so changed. 

20 The invention further encompasses antibodies and antibody fragments, such as F a b or 

(F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Mas Proto-Oncogene -like protein 
(NOV45) is a member of a "Mas Proto-Oncogene family". Therefore, the NOV45 nucleic 
acids and proteins identified here may be useful in potential therapeutic applications 

25 implicated in (but not limited to) various pathologies and disorders as indicated below. The 
potential therapeutic applications for this invention include, but are not limited to: protein 
therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 

30 and cell types composing (but not limited to) those defined here. 

The NOV45 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome , Alzheimer's 
disease, Stroke, Tuberous sclerosis, hypercalcemia, Parkinson's disease, Huntington's 
disease, Cerebral palsy, Epilepsy,Lesch-Nyhan syndrome, Multiple sclerosis,Ataxia- 
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telangiectasia, Leukodystrophies,Behavioral disorders, Addiction, Anxiety, Pain, 
Neuroprotection, and/or other diseases and pathologies. 

NOV45 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV45 substances for use in 
5 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV45 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
10 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV46 

NOV46 includes three novel Mas Proto-Oncogene -like proteins disclosed below. The 
disclosed sequences have been named NOV46a and NOV46b. 
15 NOV46a 

A disclosed NOV46a nucleic acid of 997 nucleotides (also referred to as CG56696-01) 
encoding a Mas Proto-Oncogene- like protein is shown in Table 46A. An open reading frame 
was identified beginning with a ATG initiation codon at nucleotides 12-14 and ending with a 
TGA codon at nucleotides 978-980. The start and stop codons are shown in bold in Table 
20 46A, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 46A. NOV46a nucleotide sequence (SEQ ID NO: 171). 

~GGTTTCTGAGC ATGGATCCAACCATCTCAACCTTGGACACAGAACTGACACCAATCAACGGAACTGAGGAGA 
CTCTTTGCTACAAGCAGACCTTGAGCCTCACGGTGCTGACGTGCATCGTTTCCCTTGTCGGGCTGACAGGAA 
ACGCAGTTGTGCTCTGGCTCCTGGGCTGCCGCATGCGCAGGAACGCCTTCTCCATCTACATCCTCAACTTGG 
CCGCAGCAGACTTCCTCTTCCTCAGCGGCCGCCTTATATATTCCCTGTTAAGCTTCATCAGTATCCCCCATA 
CCATCTCTAAAATCCTCTATCCTGTGATGATGTTTTCCTACTTTGCAGGCCTGAGCTTTCTGAGTGCCGTGA 
GCACCGAGCGCTGCCTGTCCGTCCTGTGGCCCATCTGGTACCGCTGCCACCGCCCCACACACCTGTCAGCGG 
TGGTGTGTGTCCTGCTCTGGGCCCTGTCCCTGCTGCGGAGCATCCTGGAGTGGATGTTATGTGGCTTCCTGT 
TCAGTGGTGCTGATTCTGCTTGGTGTCAAACATCAGATTTCATCACAGTCGCGTGGCTGATTTTTTTATGTG 
TGGTTCTCTGTGGGTCCAGCCTGGTCCTGCTGATCAGGATTCTCTGTGGATCCCGGAAGATACCGCTGACCA 
GGCTGTACGTGACCATCCTGCTCACAGTACTGGTCTTCCTCCTCTGTGGCCTGCCCTTTGGCATTCAGTTTT 
TCCTATTTTTATGGATCCACGTGGACAGGGAAGTCTTATTTTGTCATGTTCATCTAGTTTCTATTTTCCTGT 
CCGCTCTTAACAGCAGTGCCAACCCCATCATTTACTTCTTCGTGGGCTCCTTTAGGCAGCGTCAAAATAGGC 
AGAACCTGAAGCTGGTTCTCCAGAGGGCTCTGCAGGACGCGTCTGAGGTGGATGAAGGTGGAGGGCAGCTTC 
CTGAGGAAATCCTGGAGCTGTCGGGAAGCAGATTGGAGCAGTGA GGAAGAGCCTCTGCCCT 



In a search of public sequence databases, the NOV46a nucleic acid sequence, located 
on chromosome 7, has 430 of 705 bases (60%) identical to a gb:GENBANK- 
ID:MMU249895|acc:AJ249895.1 mRNA from Mus musculus (mas proto-oncogene and Igf2r 
25 gene for insulin-like growth factor type 2 and L41ps and Au76 pseudogenes) (E = 9.3e" 22 ). 
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The disclosed NOV46a polypeptide (SEQ ID NO: 172) encoded by SEQ ID NO: 171 
has 322 amino acid residues and is presented in Table 46B using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV46a has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV46a may also localize to the Golgi body with a certainty of 0.4000, to the endoplasmic 
reticulum (membrane) with a certainty of 0.3000, or to the microbody (peroxisome) with a 
certainty of 0.3000. The most likely cleavage site for NOV46a is between positions 45 and 46: 
TGN-AV. 



Table 46B. Encoded NOV46a protein sequence (SEQ ID NO: 172). 

MDPTISTLDTELTPINGTEETLCYKQTLSLTVLTCIVSLVGLTGNAWLWLLGCRMRRNAFSIYILNLAAAD 
FLFLSGRLIYSLLSFISIPHTISKILYPVMMFSYFAGLSFLSAVSTERCLSVLWPIWYRCHRPTHLSAVVCV 
LLWALSLLRSILEWMLCGFLFSGADSAWCQTSDFITVAWLIFLCWLCGSSLVLLIRILCGSRKIPLTRLYV 
TILL.TVLVFLLCGLPFGI QFFLFLWIHVDREVLFCHVHLVS I FLSALNSSANP I I YFFVGS FRQRQNRQNLK 
LVLQRALQDASEVDEGGGQLPEEILELSGSRLEQ 

A search of sequence databases reveals that the NOV46a amino acid sequence has 1 10 
of 275 amino acid residues (40%) identical to, and 175 of 275 amino acid residues (63%) 
similar to, the 324 amino acid residue ptnr: S WIS SPROT-ACCrP 12526 protein from Rattus 
norvegicus (Rat) (Mas Proto-Oncogene) (E = 1 .6e" 45 ). 

NOV46b 

In the present invention, the target sequence identified previously, NOV46a, was 

subjected to the exon linking process to confirm the sequence. PCR primers were designed by 

starting at the most upstream sequence available, for the forward primer, and at the most 

downstream sequence available for the reverse primer. In each case, the sequence was 

examined, walking inward from the respective termini toward the coding sequence, until a 

suitable sequence that is either unique or highly selective was encountered, or, in the case of 

the reverse primer, until the stop codon was reached. Such primers were designed based on in 

silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 

sequence of the target sequence, or by translated homology of the predicted exons to closely 

related human sequences sequences from other species. These primers were then employed in 

PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 

marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 

brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 

lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 

gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 

uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
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redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
5 were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV46b. This differs from the previously 
identified sequence (NOV46a) in having 2 amino acid changes. 

A disclosed NOV46b nucleic acid of 964 nucleotides (also referred to as CG56696-02) 
encoding a Mas-Related G Protein-Coupled Receptor -like protein is shown in Table 46C. An 
10 open reading frame was identified beginning with a ACC initiation codon at nucleotides 3-5 

and ending with a TGA codon at nucleotides 960-962. The start and stop codons are shown in 
bold in Table 46C, and the 5' and 3' untranslated regions, if any, are underlined. Because the 
start codon is not a traditional initiation codon, NOV46b could be a partial reading frame 
extending further in the 5' direction. 

Table 46C. NOV46b nucleotide sequence (SEQ ID NO: 173). 

~CAACCATCTCAACCTTGGACACAGAACTGACACCAATCAACGGAACTGAGGAGACTCTTTGCTACAAGCAGA 
CCTTGAGCCTCACGGTGCTGACGTGCATCGTTTCCCTTGTCGGGCTGACAGGAAACGCGGTTGTGCTCTGGC 
TCCTGGGCTGCCGCATGCGCAGGAACGCCTTCTCCATCTACATCCTCAACTTGGCCGCAGCAGACTTCCTCT 
TCCTCAGCGGCCGCCTTATATATTCCCTGTTAAGCTTCATCAGTATCCCCCATACCATCTCTAAAATCCTCT 
ATCCTGTGATGATGTTTTCCTACTTTGCAGGCCTGAGCTCTCTGAGTGCCGTGAGCACCGAGCGCTGCCTGT 
CCGTCCTGTGGCCCATCTGGTACCGCTGCCACCGCCCCACACACCTGTCAGCGGTGGTGTGTGTCCTGCTCT 
GGGCCCTGTCCCTGCTGCGGAGCATCCTGGAGTGGATGTTATGTGGCTTTCTGTTCAGTGGTGCTGATTCTG 
C TTGGTGTC AAAC AT C AGATT T CAT C AC AGTCGCGTGGCTGATTTT TTT ATGTGTGGTT CTC TGTGGGT C C A 
GCCTGGTCCTGCTGATCAGGATTCTCTGTGGATCCCGGAAGATACCGCTGACCAGGCTGTACGTGACCATCC 
CGCTCACAGTACTGGTCTTCCTCCTCTGTGGCCTGCCCTTTGGCATTCAGTTTTTCCTATTTTTATGGATCC 
ACGTGGACAGGGAAGTCTTATTTTGTCATGTTCATCTAGTTTCTATTTTCCTGTCCGCTCTTAACAGCAGTG 
CCAACCCCATCATTTACTTCTTCGTGGGCTCCTTTAGGCAGCGTCAAAATAGGCAGAACCTGAAGCTGGTTC 
TCCAGAGGGCTCTGCAGGACGCGTCTGAGGTGGATGAAGGTGGAGGGCAGCTTCCTGAGGAAATCCTGGAGC 
TGTCGGGAAGCAGATTGGAGCAGTGAGG 

15 " " — " ^ - " 

In a search of public sequence databases, the NOV46b nucleic acid sequence, located 
on chromosome 1 1, has 494 of 800 bases (61%) identical to a gb : GENB ANK- 
ID:AF295365|acc:AF295365.1 mRNA from Mus musculus (G-protein coupled receptor 
GPR90 mRNA, complete cds) (E - 1 .2e" 28 ). 

20 The disclosed NOV46b polypeptide (SEQ ID NO: 1 74) encoded by SEQ ID NO: 1 73 

has 319 amino acid residues and is presented in Table 46D using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV46b has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV46b may also localize to the Golgi body with a certainty of 0.4000, to the endoplasmic 

25 reticulum (membrane) with a certainty of 0.3000, or to the microbody (peroxisome) with a 
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certainty of 0.3000. The most likely cleavage site for NOV46b is between positions 42 and 43: 
TGN-AV. 



Table 46D. Encoded NOV46b protein sequence (SEQ ID NO:174). 

TISTLDTELTPINGTEETLCYKQTLSLTVLTCIVSLVGLTGNAVVLWLLGCRMRRNAFSIYILNKAAADFLF 
LSGRIilYSLLSFISIPHTISKILYPVMMFSYFAGLSSLSAVSTERCLSVLWPIWYRCHRPTHLSAVVCVLLW 
ALSLLRSILEWMLCGFLFSGADSAWCQTSDFITVAWLIFLCWLCGSSLVLLIRILCGSRKIPLTRLYVTIP 
LTVLVFLLCGLPFGIQFFLFLWIHVDREVLFCHVHLVSIFLSALNSSANPIIYFFVGSFRQRQNRQNLKLVIi 
QRALQDASEVDEGGGQLPEEILELSG5RLEQ 

5 A search of sequence databases reveals that the NOV46b amino acid sequence has 1 10 

of 275 amino acid residues (40%) identical to, and 174 of 275 amino acid residues (63%) 
similar to, the 324 amino acid residue ptnr:SWISSPROT-ACC:P12526 protein from Rattus 
norvegicus (Rat) (Mas Proto-Oncogene) (E = 6.8e" 45 ). 
NOV46c 

10 A disclosed NOV46c nucleic acid of 1 030 nucleotides (also referred to as CG56698- 

01) encoding a Mas Proto-Oncogene-like protein is shown in Table 46E. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 17-19 and ending 
with a TGA codon at nucleotides 1007-1009. The start and stop codons are shown in bold in 
Table 46E, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 46E. NOV46c nucleotide sequence (SEQ ID NO:175). 

~ACTAGGGTTTCTGAGC ATQAATCCAACCATCCCAGCCTTGGATACAGAAATTGCACCAATTAGTGATACAGA 
GGAGACCCATCCTCATCGTTGTGGCATGGAGGTCCTGGTCCTCATAGTGCTGATCCTCATCATTGACCTGGT 
CGGGCTGGCAGGAAATGCAGTCATGCTCTGGCTCCTGGGCTTCTGCATGCACAGTAACACCTTCTCTCTCTA 
CAT C CT C AACC TGG CCAGGGCTGAC TT C CTCTG C AC CTGCTTC C AG ATT AT AAC ATT C ATT AATT TCTT C AG 
TGACTTTGTTAGTTCTCTCTCCATCCATTTCTCTAGATTTGTCACCACGGTGTTGTTCTCCGCCTGTATTAC 
AGGCCTGAGCATGCTGAGCACCATCAGCACCGAGCACCGCCTGTCCGTCCTGTGGCCCATCTGGTACTGCTG 
CCACTGCCCCACACACCTGTCAGCGGTCATGTGTGTCCTGCTCTGGGCCCTGTCCCTGTTGCAGAGCATCCT 
GGAGTGGATGTTCTGTAGCTTCCTGTTTAGTGATGTTGACTCTGATAATTGGTGTCAAATATTAGATTTCCT 
CACTGCTGTGTGGCTGATTTTTTTATCTGTGGTTCTCTGTGGGTTCACCCTGGTCCTGCTTGTCAGGATCAT 
ATGTGGATCCCAGAAGATGCCGCTGACCAGGCTGTATGTGACCATCCTGCTCACAGGGCTGGTCTTCCTCTT 
CTGCAGCCTGCCCCTCAGCATTCAGGGATTCCTATTATACTGGATCGAGAAGGATTTGGATGACTTACCTTG 
TGTTGTTCGTTTAATTTCCATTTTCCTGTCTGCTCTTAACAGCAGTGCCAACCCCATCATTTACTTCTTCAT 
GGGCTCCTTTAGGCAGCTTCAAAACAGGAAGACCCTCAAGCTGGTTCTCCAGAGGGCTCTGCAGGACATGCT 
TGAGGTGGATGAAGGTGGAGGGCAGCTTCCTGAGGAAACCCTGAAGCTGTCGGGAAGCAGATTGGGGCCATG 
AGGAAGAGCCTCTGCCCTGTTA 

15 ! ~ ~~ _ „ 

In a search of public sequence databases, the NOV46c nucleic acid sequence, located 
on chromosome 7, has 381 of 621 bases (61%) identical to a gb:GENBANK- 
ID:RATMAS|acc:J03823.1 mRNA from Rattus norvegicus (Rat mas oncogene, complete cds) 
(E = 9.3e" 22 ). 

20 The disclosed NOV46D polypeptide (SEQ ID NO: 176) encoded by SEQ ID NO: 175 

has 330 amino acid residues and is presented in Table 46F using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV46c has a signal peptide and 
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is likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV46c may also localize to the Golgi body with a certainty of 0.4000, to the endoplasmic 
reticulum (membrane) with a certainty of 0.3000, or to the microbody (peroxisome) with a 
certainty of 0.3000. The most likely cleavage site for NOV46c is between positions 65 and 66: 
TFS-LY. 



Table 46F. Encoded NOV46c protein sequence (SEQ ID NO:176). 

MNPTIPALDTEIAPISDTEETHPHRCGMEVLVLIVLILIIDLVGLAGNAVMLWLLGFCMHSNTFSLYILNLA 
RADFLCTCFQIITFINFFSDFVSSLSIHFSRFVTTVLFSACITGLSMLSTISTEHRLSVLWPIWYCCHCPTH 
LSAVMCVLLWALSLLQSILEWMFCSFLFSDVDSDNWCQILDFLTAVWLIFLSWLCGFTLVLLVRIICGSQK 
MPLTRLYVTILLTGLVFLFCSLPLSIQGFLLYWIEKDLDDLPCWRLISIFLSALNSSANPIIYFFMGSFRQ 
LQNRKTLKLVLQRALQDMLEVDEGGGQLPEETLKLSGSRLGP 

A search of sequence databases reveals that the NOV46c amino acid sequence has 106 
of 279 amino acid residues (37%) identical to, and 166 of 279 amino acid residues (59%) 
similar to, the 324 amino acid residue ptnr:SWISSPROT-ACC:P12526 protein from Rattus 
norvegicus (Rat) (Mas Proto-Oncogene) (E = 1.6e" 45 ). 

NOV46c is predicted to be expressed in at least teratocarcinoma cell. This information 
was derived by determining the tissue sources of the sequences that were included in the 
invention including but not limited to SeqCalling sources, Public EST sources, Literature 
sources, and/or RACE sources. 

In addition, the sequence is predicted to be expressed in hippocampus and brain 
because of the expression pattern of (GENBANK-ID: gb:GENBANK- 
ID:RATMAS|acc:J03823.1) a closely related Rat mas oncogene, complete cds homolog. 

NOV46d 

A disclosed NOV46d nucleic acid of 1005 nucleotides (also referred to as CG56702- 
01) encoding a Mas Proto-Oncogene-like protein is shown in Table 46G. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 17-19 and ending 
with a TGA codon at nucleotides 986-988. The start and stop codons are shown in bold in 
Table 46G, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 46G. NOV46d nucleotide sequence (SEQ ID NO: 177). 

ACTACCGTTTCTGAGC ATGGATCCAAGCAACCCAGCCTTGGATGCAGAACTGACACCAATTAACAGAACTGA 
GGAGACTCCTTGCTACAAGCAGACCCTGAGCCTCATGGGGCTGACGTGCATCATTTCCCTTGTCACGCTGAC 
AGGAAACGCGGTTGTGCTCTGGCTCCTGGGCTTCCGCATGCGCAGGAACGCCGTCTCCATCTACATCCTCAA 
CCTGGCTGCGGCAGACTTCCTCTTCCTCAGCGGCCACGTTATACATTCCGCCTCACTCCTCATCAATATCTG 
TCATCCCATCTCCAAAATCCTCATTCCTGTGATGACCTTTCTATACTTTACAGGCCTGAGCTTTCTGAGTGC 
CATGAGCACCGAGCGCTGCCTGTGCGTCCTGTGGCCCATCTGGTACCGCTGCCTCCTCCCCACACACCTGTC 
AGCGGTCGTGTGTGTCTTGCTTTGGGCCCTGTCCCTACTGCGGAGCATCCTGGAGGGAATGTTCTGTGACTT 
CCTGTTTAGTGATGCTGATTCTATTTGGTGTCAACCATCAGATTTCATCACAGTCGTGTGGCTGATTTTTTT 
ATGTGTGGTTCTCTGTGGGTCCAGCCTGGTCCTGCTGATTAGGATTCTCTGTGGATCCTGGAAGATGCCTCT 
GACCGGGCTGTACGTGACGATCCTGCTCACAGTGCTAGTCTTCCTACTCCGCAGCCTGCCCTTCGGCATTCG 
GTGGGCTCTGTCTACTGGGATCCACCTGGATTTGGAAGTCATTTTCTGTCATGTCCATCTAGTTTCCATTTT 

400 * 



CCTGTCCCCTCTAAACGGTAGTGCCAACCCCGTCATTTACTTCTTCGTGGGCTCCTTTAGGCAGCGTCAAAA 
TAGGCAGAACCTGAAGCTGGTTCTCCAGAGGGCTCTGCAGGACATGCCTGAGGTGAAGGTGGAAGGTGGAGG 
GCGGCTTCCTGAGGGAACCCTGGAGCTGTCGGGAAGCAGATTCGGGCAGTGAGGAAGAACCTCTGCCCT 



In a search of public sequence databases, the NOV46d nucleic acid sequence, located 
on chromosome 7, has 379 of 632 bases (59%) identical to a gb:GENBANK- 
ID:RATMAS|acc:J03823.1 mRNA from Rattus norvegicus (Rat mas oncogene, complete cds) 
(E = 4.7e- H ). 

The disclosed NOV46d polypeptide (SEQ ID NO: 178) encoded by SEQ ID NO: 177 
has 323 amino acid residues and is presented in Table 46H using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV46C has a signal peptide and 
is likely to be localized to the plasma membrane with a certainty of 0.6000. Alternatively, 
NOV46d may also localize to the Golgi body with a certainty of 0.4000, to the endoplasmic 
reticulum (membrane) with a certainty of 0.3000, or to the microbody (peroxisome) with a 
certainty of 0.3000. The most likely cleavage site for NOV46d is between positions 46 and 47: 
GNA-VV. 



Table 46H. Encoded NOV46d protein sequence (SEQ ID NO: 178). 

MDPSNPALDAELTPINRTEETPCYKQTLSLMGLTCIISLVTLTGNAWLWLLGFRMRRNAVSIYILNLAAAD 
FLFLSGHVIHSASLLINICHPISKILIPVMTFLYFTGLSFLSAMSTERCLCVLWPIWYRCLLPTHLSAWCV 
LLWALSLLRSILEGMFCDFLFSDADSIWCQPSDFITVWLIFLCVVLCGSSIiVLLIRILCGSWKMPLTGLYV 
TILLTVLVFLLRSLPFGIRWALSTGIHLDLEVIFCHVHLVSIFLSPLNGSANPVIYFFVGSFRQRQNRQNLK 
LVLQRALQDMPEVKVEGGGRLPEGTLELSGSRFGQ 

A search of sequence databases reveals that the NOV46d amino acid sequence has 107 
of 275 amino acid residues (38%) identical to, and 167 of 275 amino acid residues (60%) 
similar to, the 324 amino acid residue ptnr:SWISSPROT-ACC:P12526 protein from Rattus 
norvegicus (Rat) (Mas Proto-Oncogene) (E = L5e" 40 ). 

In addition, NOV46d is predicted to be expressed in hippocampus and cerebral cortex 
of the brain because of the expression pattern of (GENBANK-ID: gb:GENBANK- 
ID:RATMAS|acc:J03823.1) a closely related Rat mas oncogene, complete cds homolog. 

NOV46a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 461. 
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Table 461. BLAST results for NOV46a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 17472340] re f| XP 
061650. 1| 
(XM_061650) 


similar to MrgXl 
G protein-coupled 
receptor (H. 
sapiens) [Homo 
sapiens) 


1589 


322/322 
(100%) 


322/322 
(100%) 


e-172 


gi | 15546062 | gb j AAK9 
1804. l| (AY042213) 


MrgXl G protein- 
coupled receptor 
[ Homo sap i ens] 


322 


322/322 
(100%) 


322/322 
(100%) 


e-158 


gi | 16876453 | ref | NP_ 
4 73 3 72. l| 
(NM 054031) 


G protein-coupled 
receptor MRGX3 
[Homo sapiens] 


322 


269/322 
(83%) 


285/322 
(87%) 


e-131 


gi | 17461239 | ref |XP 
062249 . 1 | 
(XM062249) 


similar to MrgX3 
G protein-coupled 
receptor (H. 
sapiens) [Homo 
sapiens] 


322 


268/322 
(83%) 


285/322 
(88%) 


e-131 


gi | 16876455 | ref | NP 
473373. l| 
(NM 054032) 


G protein-coupled 
receptor MRGX4 
[Homo sapiens] 


322 


255/320 
(79%) 


275/320 
(85%) 


e-119 



The homology between these and other sequences is shown graphically in the 
Clustal W analysis shown in Table 46J. In the ClustalW alignment of the NOV46 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



1 0 Table 46 J. ClustalW Analysis of NOV46 

1) Novel NOV4 6a (SEQ ID NO: 17 2) 

2) Novel NOV4 6b (SEQ ID NO: 174) 

3) Novel NOV46C (SEQ ID NO: 176) 

4) Novel NOV46d (SEQ ID NO: 178) 

15 5) gi | 17472340 | ref |XP_061650 . 1 | (XM061650) similar to MrgXl G protein- coupled 
receptor (H. sapiens) [Homo sapiens] (SEQ ID NO: 532) 

6) gi | 15546062 |gb |AAK91804 . 1 | (AY042213) MrgXl G protein- coupled receptor [Homo 
sapiens] (SEQ ID NO: 533) 

7) gi | 16876453 | ref |NP_473 372 . 1 | (NM_054 031) G protein- coupled receptor MRGX3 [Homo 
20 sapiens] (SEQ ID NO: 534) 

8) gi | 17461239 | ref |XP_062249 . 1 | (XM_062249) similar to MrgX3 G protein-coupled 
receptor (H. sapiens) [Homo sapiens] (SEQ ID NO: 53 5) 

9) gi | 168764 55 | ref |NP_473 373 .1 | (NM_054 03 2) G protein-coupled receptor MRGX4 [Homo 
sapiens] (SEQ ID NO: 53 6) 



10 20 30 40 50 60 



NOV4 6b 1 i 

30 NOV4 6 c 1 1 

NOV46d 1 . 1 

gi | 17472340 | 1 MTQTTLHSHLPADPCTGNHRAVQYFDKHQASLQLLQAPPDTRLPFLS VDPSNPALDAELT 60 
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60 



65 
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gi | 15546062 | 
gi|l6876453 | 
gij 17461239| 
gi j 16876455 j 



NOV46a 
NOV46b 
NOV4 6C 
NOV4 6d 
gi | 17472340 | 
gi|l5546062 j 
gi | 16876453 j 
gi | 17461239 j 
gi | 16876455 j 



NOV46a 
NOV46b 
NOV4 6c 
NOV4 6d 
gi | 17472340 | 
gi j 15546062 j 
gijl6876453 j 
gi | 17461239 j 
gi|l6876455 j 



NOV4 6a 
NOV4 6b 
NOV4 6C 
NOV4 6d 
gi | 17472340 | 
gi | 15546062 j 
gi|l6876453 j 
gi|l7461239| 
gi | 16876455 j 



NOV46a 
NOV46b 
NO V4 6c 
NOV4 6d 
gi|l7472340| 
gi j 15546062 j 
gi|l6876453 | 
gi | 17461239 j 
gi|l6876455| 



NOV4 6a 

NOV4 6b 

NOV4 6C 

NOV46d 

gi 1 17472340 | 

gi j 15546062 j 

gi|l6876453| 

gij 17461239 | 

gi|l6876455| 



NOV4 6a 
NOV4 6b 
NOV4 6c 



1 i 

! , ! 

1 x 

! x 

70 80 90 100 110 120 

....|--.-|....|....|....|....|....|....|....|....|....|....| 

1 x 

1 i 

1 ! 

1 i 

61 PINRTEETPCYKQTLSLMGLTCIISLVTLTGNAWLWLLGFRMRRNAVSIYILNLAAADF 12 0 
1 x 

1 ± 

X 1 

! i 

130 140 150 160 170 180 

....|....|....|.-..|.-..|....|-...|....|....|....|....|....| 
1 1 

1 i 

1 x 

1 1 

121 LFLSGHVIHSASLLINICHPISKILIPVMTFLYFTGLSFLSAMSTERCLCVLWPICLVLL 18 0 

! i 

! i 

X ! 

1 ! 

190 200 210 220 230 240 

....|....|...-|....|...-|....|....|....t....|....|....|....| 

! ! 

1 x 

1 i 

! 1 

181 IRILCGSWKMPLTGLYVTILLTVLVFLLRSLPFGIRWALSTGIHLDLEVIFCHVHLVSIF 24 0 

! i 

1 i 

1 i 

X ! 

250 260 270 280 290 300 

...-|...-|-...|....|....|...-|...-|..-.|-...|....|....|....| 
1 i 

X 1 

1 i 

1 ! 

241 LSPLNGSANPVIYFFVGSFRQRQNRQNLKLVLQRALQDMPEVKVEGGFLREPWSCREADS 30 0 

X i 

1 i 

1 ! 

1 i 

310 320 330 340 350 360 

..-.|....|...-|....|....|.---|....|...-|..-.|....|....|....| 

1 i 

X x 

1 x 

X i 

3 01 GTVTAEYRRRNLNAHAHHIGPNFSETLQDENTRKESRDRADLIPECASPELSNTEESSEV 360 

X l 

1 ! 

1 ! 

1 ! 

370 380 390 400 410 420 

I I I I I I I I I I I I 

1 x 

X ! 

1 ! 

403 




NOV4 6d 
gi | 17472340 | 
gi | 15546062 | 
gi|l6876453 j 
gi | 17461239 | 
gi | 16876455 j 



3 61 TESQKGPPPLAQGKSSPLDKREKDKGKYQELSSSIVSYGFCYKIVSVFPPRDTRLGFLSV 42 0 

X 1 

! 1 

! x 

! 



10 



15 



20 



25 



30 



NOV46a 
NOV4 6b 
NOV4 6c 
NOV46d 
gi|l747 
gi | 1554 
gi|l687 
gi | 1746 
gi | 1687 



2340 | 
6062 j 
6453 I 
1239 | 
6455 



NOV46a 

NOV4 6b 

NOV4 6c 

NOV4 6d 

gi | 17472340 | 

gi|l5546062| 

gi|l6876453| 

gi|l7461239| 

gi|l6876455| 



1 
1 
1 
1 

42 

1 

1 

1 

1 



430 440 450 460 470 480 

...|..-.|....|-...|....|...-|....|....|....|....|....|....| 

1 

± 

. x 

1 DPTI PALGTELTPINGREETPCYKQTLSLTGLTCI VSLVGMTGNAWLWLLGFRMRRNAF 4 8 0 



490 500 510 520' 530 540 

|.---|--..|....|...-|-.-.|....|....|....|....|....|....| 



4 81 SIYIFNLSMADFLFLRSHIIRFPLSLINILHPIFKILSPVMMFSYLASLSFLSAMSTERC 54 0 



35 



40 



NOV4 6a 
NOV4 6b 
NO V4 6c 
NOV4 6d 
gi | 17472340 | 
gi | 15546062 j 
gi | 16876453 | 
gi|l7461239| 
gi j 16876455 j 



550 



560 



570 



|--..|...-|....|....|....| 



580 



590 



600 



5 41 LLVLLVRILCGSRKMPLTRLQCQNRQNLKLVLQRALQDTTEVNEGGRWLPEETLELSGSR 6 0 0 
! ± 



45 



50 



55 



60 



65 



610 



620 



630 



NOV4 6a 
NOV4 6b 
NOV4 6c 
NOV46d 
gi | 17472340 | 
gi|l5546062 j 
gi | 16876453 | 
gi j 17461239 | 
gi | 16876455 j 



NOV46a 
NOV4 6b 
NOV4 6c 
NOV4 6d 
gi | 1747 
gi | 1554 
gi|l687 
gi|l746 
gi | 1687 



2340| 
6062 j 
6453 | 
1239 j 
6455 | 



|....|....| 



640 



650 



660 



|....|....| 



6 01 FGQSVHCLWEEEWRGCGSLLMSCLLSKVNDITSSWFHIKEEHAGHPGVFSRKVTRLGFLS 660 
1 ! 




70 



NOV46a 



58 



i 



730 
. . I . . 



740 
. . I . . 



SIYILNLAAADFLFLSGffll 



750 
jYgL---j|s 

404 



760 



E3 



770 

I. 



780 



|f£ 113 




10 



15 



20 



NOV4 6b 
NO V4 6c 
NOV4 6d 
gi|l7472340| 
gi j 15546062 | 
gi 1 16876453 j 
gi | 17461239 | 
gi j 16876455 j 



NOV46a 

NOV46b 

NOV4 6c 

NOV46d 

gi | 17472340 | 

gi|l5546062| 

gi | 16876453 j 

gi j 17461239 | 

gi j 16876455 | 





STERCLSVLWPIWYRC 
STERCLSVLWPIWYRC 
STEALS VLWP I WYgc 

sterclsvlwpiwyrc 
sterclsvlwpiwyrc 
sterclsvlwpiwyrc 
sterclsRlwpiwyTic 

STERCLS§LWPIWYgC 
STERCLSVLWPIWYRC 



850 



25 



30 



NOV4 6a 

NOV46b 

NOV46c 

NOV4 6d 

gi | 17472340 | 

gi 1 15546062 | 

gi|l6876453 | 

gi | 17461239 j 

gi|l6876455| 



173 
170 
181 
173 
833 
173 
173 
173 
173 



860 
. . I . . 



I 



870 
. . I . . 



880 
. . I - . 



890 
. . I . - 



I 



900 
• - I 



CQTSDFITVAWLIFLCWLCGS 

cqts dfi tvawl i flcwlcgs 
cqEJ2df|1t3Bwl 1 fl@wlcg|| 

cqtsdfitvawliflcwlcgs 
cqts dfi tvawl i flcwlcgs 
cqtsdfitvawliflcwlcgs 
c||tsdfitRawlSflcwlcgs, 
cqtsdfitvawliflcwlcgs, 



slvllmrilcgsrk0 
slvllnrilcgsrk 
0lvllBri|JcgsRkS|: 
slvllHrilcgsSkI: 
slvllnr ilcgs rk h \ 
slvllHrilcgsrkH: 
slvllSri lcgs rkH: 
s lvllsr i l cgsrkI: 



awl i flcwlcBsslvllSrilcgsrkI 



[PLTRLYVTILLTVLVFLLCGLPF 
PLTRLYVTILLTVLVFLLCGLPF 
PLTRL YVTI LLTgLVFLgcgL P 

pltBlyvtilltvlvfllISBlpf 
pltrlyvtilltvlvfllcglpf 
pltrlyvtilltvlvfllcglpf 
pltrlyvtilltvlvfllcglpf 
pltrlyvtilltvlvfllcglpf 
pltrlyvtilltvlvfllcglpf 



35 



40 



45 



50 



55 



910 



920 



930 



940 



NOV46a 
NOV4 6b 
NOV4 6C 
NOV46d 
gi | 17472340 | 
gi|l5546062| 
gi | 16876453 j 
gi j 17461239 | 
gi j 16876455 j 



NOV46a 

NOV46b 

NOV4 6c 

NOV46d 

gi 1 17472340 | 

gijl5546062 j 

gi|l6876453| 

gi|l7461239| 

gi j 16876455 j 




1020 
I 

322 
319 
330 
323 

TRLGFLSMDPTI PVLGTELTPINGTEET 1011 

322 
322 
322 
322 



1040 



1050 



1060 



60 



65 



70 



NOV46a 

NOV46b 

NOV46c 

NOV4 6d 

gi 1 17472340 | 

gi j 15546062 | 

gijl6876453| 

gi|l7461239| 

gi | 16876455 j 



322 
322 
322 
322 



1070 
..|... 



322 

319 

330 

323 

1012 PCYNQTLSFTVLTCIVSLVALTGNAWLWLLGFRMCRNAVSIYILNLVAANFLLLSSHII 10 71 



1080 
-I 

--- 322 
--- 319 
■-- 330 
323 



322 
322 
322 
322 



1090 



1100 



1120 



1130 



1140 



405 



NOV4 6d 


323 


323 


gi | 17472340 


1072 


HPCYTSSITPEYSECHEHQALPVNPVAHLPGPSGQDPLWIPEDAADQAVHDHLLTVLVFL 1131 


gi | 15546062 


322 


322 


gi j 16876453 


322 


322 


gi|l7461239 


322 


322 


gi|l6876455 


322 


322 



1150 1160 1170 1180 1190 1200 

....|....|....|....|....|....|....|....|....|....|....|....| 

NOV46a 322 322 

NOV46b 319 319 

NOV46c 330 , 330 

NOV46d 323 323 

gi | 17472340 | 1132 LCGIiPIGIQWALFSRIHMDWEVLYSHVHLPSIFLSSLNSSANPIIYFFMGFVRQHQNWQN 1191 

gi|l5546062| 322 322 

gi|l6876453| 322 322 

gi|l7461239| 322 322 

gi|l6876455| 322 322 



1210 1220 1230 1240 1250 1260 

....|....|....|....|....|....|....|...-|....|....|....|....| 

NOV46a 322 322 

NOV46b 319 319 

NOV4 6 C . 330 33 0 

NOV4 6d 3 23 32 3 

gi | 17472340 | 1192 LKLVLQRDLQDTPEVDEEQNNVLRYI ILYIMYTLDTMSLHKNSLGNWFYGKSSAVICLQN 1251 

gi|l5546062| 322 322 

gi|l6876453| 322 322 

gi|l7461239| 322 322 

gi | 16876455 | 322 322 



1270 1280 1290 1300 1310 1320 

....|....|....|....|....|....|....|....|....|....|....|....| 

NOV4 6 a 3 22 322 

NOV46b 319 ■> 319 

NOV4 6 C 330 33 0 

NOV46d 323 323 

gi | 17472340 | 1252 GDMLALVSSKDNHGPFPLRTDIGGTGLLFLAVTAEYRRRNLNAHAHHIGPIVLKPCRMQT 1311 

gi|l5546062| 322 322 

gijl6876453| 322 322 

gi|l7461239| 322 322 

gi|l6876455| 322 322 



NOV4 6a 

NOV4 6b 

NOV4 6C 

NOV46d 

gi | 17472340 

gi j 15546062 

gi | 16876453 

gi|l7461239 

gi|l6876455 



13 3 0 134 0 1350 13 60 13 70 13 80 

....|....|....|....|....|....|....|....|....|....|....|-...| 

322 322 

319 319 

330 330 

323 323 

1312 QGRREKTQWISFLSVYVEISPKLHNTKGSSEVTESQKGSTSLARGSTSSTLDRRRRKDAQ 13 71 

322 ' 322 

322 322 

322 322 

322 322 



1390 1400 1410 1420 1430 1440 

....|....|....|....|....|....|....|....|....|....|-...|....| 

NOV46a 322 322 

NOV46b 319 319 

NOV46c 330 33 0 

NOV4 6d 323 323 

gi | 17472340 | 1372 QSHIEPHFKGTLWNLRGTRLGFLSMNPTIPALDTEIAPISDTEETHPHRCGMEVLVLIV 1431 

gi|l5546062| 322 322 

gi | 16876453 | 322 322 

gi|l7461239| 322 322 

gi|l6876455| 322 322 

406 
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1450 1460 1470 1480 1490 1500 

....|....|....|....|....|....|....|....|....|....|....|.-..| 

NOV46a 322 322 

NOV46b 319 319 

NOV4 6c 330 330 

NOV46d 323 323 

gi | 17472340 | 1432 LILIIDLVGL.AGNAVMLWLLGFCMHSNTFSLYILNLARADFLCTCFQIITFINFFSDFVS 14 91 

gi j 15546062 j 322 322 

gi | 16876453 | 322 322 

gi|l7461239| 322 322 

gi | 16876455 | 322 322 



1510 1520 1530 1540 1550 1560 

....|....|....|....|....|....|....|....|....|....|....|....| 

NOV46a 322 322 

NOV4 6b 319 319 

NOV4 6c 330 33 0 

NO V4 6 d 323 323 

gi | 17472340 | 1492 SLSIHFSRFVTTVLFSACITGLSMLSTISTEHRLSVLWPICSANPI IYFFMGSFRQLQNR 1551 

gi j 15546062 j 322 322 

gi j 16876453 | 322 322 

gijl7461239| 322 322 

gi j 16876455 j 322 322 



1570 1580 1590 



NOV4 6a 322 322 

NOV4 6b 319 319 

NOV4 6c 330 330 

NOV4 6d 323 323 

gi | 17472340 | 1552 KTLKLVLQRALQDMLEVDEGGGQLPEETLKLSGSRLGP 1589 

gi j 15546062 | 322 322 

gi | 16876453 | 322 322 

gi | 17461239 | 322 322 

gi j 16876455 | 322 322 



Table 46K lists the domain description from DOMAIN analysis results against 
NOV46. This indicates that the NOV46 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 46K Domain Analysis of NOV46 

gnl | Pf am | pf amOOOOl , 7tm_l / 7 transmembrane receptor (rhodopsin 

family). (SEQ ID NO:810) 

CD-Length = 254 residues, 3 7.8% aligned 

Score = 51.2 bits (121), Expect = 9e-08 



NOV53 : 44 GNAWLWLLGCRMR-RNAFSI YILNLAAADFLFLSGRLIYSLLSFIS IPHTISKIL 98 

|| + | + ++ + I +I++MM II II! ++I + + I+ + 

Sbjct : 1 GNLLVILVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 6 0 

NOV53 : 99 YPVMMFSYFAGLSFLSAVSTERCLSVLWPIWYRCHR 134 

- + + +| + | + | + | + | | +++ | + | | | 
Sbjct: 61 GALFWNGYASILLLTAISIDRYLAIVHPLRYRRIR 96 



The Mas Proto Oncogene belongs to the family of G-Protein Coupled Receptors.G- 
protein-coupled receptors (GPCRs) constitute a vast protein family that encompasses a wide 
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range of functions (including various autocrine, paracrine and endocrine processes). They 
show considerable diversity at the sequence level, on the basis of which they can be separated 
into distinct groups. The currently known clan members include the rhodopsin-like GPCRs, 
the secret in-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the 
5 metabotropic glutamate receptor family. 

The rhodopsin-like GPCRs themselves represent a widespread protein family that 
includes hormone, neurotransmitter and light receptors, all of which transduce extracellular 
signals through interaction with guanine nucleotide-binding (G) proteins. Although their 
activating ligands vary widely in structure and character, the amino acid sequences of the 

10 receptors are very similar and are believed to adopt a common structural framework 

comprising 7 transmembrane (TM) helices The human mas oncogene was originally detected 
by its ability to transform NIH 3T3 cells. We previously showed that the protein encoded by 
this gene is unique among cellular oncogene products in that it has seven hydrophobic 
potential transmembrane domains and shares strong sequence similarity with a family of 

15 hormone-receptor proteins. We have now cloned the rat homolog of the mas oncogene, 
determined its DNA sequence, and examined its expression in various rat tissues. A 
comparison of the predicted sequences of the rat and human mas proteins shows that they are 
highly conserved, except in their hydrophilic amino-terminal domains. Our examination of the 
expression of mas, determined by RNA-protection studies, indicates that high levels of mas 

20 RNA transcripts are present in the hippocampus and cerebral cortex of the brain, but not in 
other neural regions or in other tissues. This pattern of expression and the similarity of mas 
protein to known receptor proteins suggest that mas encodes a receptor that is involved in the 
normal neurophysiology and/or development of specific neural tissues. 



25 like protein includes the nucleic acid whose sequence is provided in Table 46A or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 46A while still encoding a protein 
that maintains its Mas Proto-Oncogene -like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 

30 are complementary to those just described, including nucleic acid fragments that are 

complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 



The disclosed NOV46 nucleic acid of the invention encoding a Mas Proto-Oncogene- 
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derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 40 percent of the bases may be so changed. 
5 The disclosed NOV46 protein of the invention includes the Mas Proto-Oncogene-like 

protein whose sequence is provided in Table 46B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 46B while still encoding a protein that maintains its Mas Proto-Oncogene-like 
activities and physiological functions, or a functional fragment thereof In the mutant or 

1 0 variant protein, up to about 2 1 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fat>)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Mas Proto-Oncogene -like protein 
(NOV46) is a member of a "Mas Proto-Oncogene family". Therefore, the NOV46 nucleic 

15 acids and proteins identified here may be useful in potential therapeutic applications 

implicated in (but not limited to) various pathologies and disorders as indicated below. The 
potential therapeutic applications for this invention include, but are not limited to: protein 
therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
target ing/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

20 delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV46 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Von Hippel-Lindau (VHL) syndrome , Alzheimer's 
disease, Stroke, Tuberous sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, 

25 Cerebral palsy, Epilepsy ,Lesch-Nyhan syndrome, Multiple sclerosis,Ataxia- 

telangiectasia,Leukodystrophies,Behavioral disorders, Addiction, Anxiety, Pain, 
Neuroprotection, and/or other diseases and pathologies. 

NOV46 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV46 substances for use in 

30 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV46 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
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understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV47 

A disclosed NOV47 nucleic acid of 523 nucleotides (also referred to as AF1 52363) 
encoding a Peptidyl-Prolyl Cis-Trans Isomerase -like protein is shown in Table 47A. An open 
reading frame was identified beginning with a ATG initiation codon at nucleotides 17-19 and 
ending with a TAA codon at nucleotides 509-5 1 1 . The start and stop codons are shown in 
bold in Table 47A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 47 A. NOV47 nucleotide sequence (SEQ ID NO: 179). 



CCCCGTATTACCAGCT ATQGTCAACCCCACTGTTTTCTTCGACATTGCTGTCAATAGCGAGCCCTTGGGCTG 
CGTCTCCTTCGAGCTGTTTGCAGACAAGCTTCCAAAGACAGCAGAAAATTTTCATGCTCTGAGCACTGGAGA 
AAAAGGATTTGATTATGAGGGTTACTGCTTTCACAGAATTATTCCAGGGTTTGTATGTCAGGGTGGTGACTT 
CACATGCCATAATGGCACTGGTAGCAAGTCCATCTACAGGGAGAAATTTGATGACGAGAACTTCATCCTGAA 
G C AT AC AGGT C CTGGC AT C CTGTCC ATGGC AAATGC TGG AC C C AACGC AAATGGTT C C C AGTTTT T C ATGTG 
CCCTGCCAAGACCAAGTGGTTGGATGGCAAGCAAGTGGTCTTTGGCAGGGTGAAAGAAGGCATGGATATTGT 
GGAGGCCATGGAGCGCTTTGTGTTCAGGAATGGCAAGACTAGCAAGAAGGTCACTATTGCTGACTGTGGACA 
GCTCTAATAAGTTTGACTT 



10 In a search of public sequence databases, the NOV47 nucleic acid sequence, located on 

chromosome 3, has 523 of 523 bases (100%) identical to a gb:GENBANK- 
ID:AF152363|acc:AF152363.1 mRNA from Homo sapiens (constitutive fragile region FRA3B 
sequence) (E = 5.7e _1 1 ] ). 

The disclosed NOV47 polypeptide (SEQ ID NO: 180) encoded by SEQ ID NO: 179 has 

15 164 amino acid residues and is presented in Table 47B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV47 has no signal peptide and is 
likely to be localized to the microbody (peroxisome) with a certainty of 0.6400. Alternatively, 

8 

NOV47 may also localize to the plasma membrane with a certainty of 0.6000, to the 
cytoplasm with a certainty of 0.4500, or to the mitochondrial matrix space with a certainty of 
20 0.1000. 



Table 47B. Encoded NOV47 protein sequence (SEQ ID NO:180). 



MVNPTVFFDI AVNSEPLGCVS FELFADKLPKTAENFHALSTGEKGFDYEGYCFHRII PGFVCQGGDFTCHNG 
TGSKSIYREKFDDENFILKHTGPGILSMANAGPNANGSQFFMCPAKTKWLDGKQVVFGRVKEGMDIVEAMER 
FVFRNGKTSKKVTIADCGQL 



A search of sequence databases reveals that the NOV47 amino acid sequence has 141 
of 164 amino acid residues (85%) identical to, and 151 of 164 amino acid residues (92%) 
25 similar to, the 165 amino acid residue ptnr:pir-id:CSHUA protein from human (peptidylprolyl 
isomerase (EC 5.2.1.8) A) (E = 5.6e -75 ). 
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NOV47 is predicted to be expressed in small intestine because of the expression 
pattern of (GENBANK-ID: gb:GENBANK-ID:E02765|acc: E02765.1) a closely related Sus 
scrofa peptidyl-prolyl cis-trans isomerase A sequence homolog. 

NOV47 also has homology to the amino acid sequences shown in the BLASTP data 
5 listed in Table 47C. 



Table 47C. BLAST results for NOV47 


Gene Index/ 
I dent i f i eir 


Protein/ Organism 


Length 
laa; 


Identity 


Positives 


Expect 


gi | 17440554 | ref |XP_ 
067503. 1| 
(XM_067503) 


similar to 
PEPTIDYL-PROLYL 
CIS -TRANS 

A (PPIASE) 
(ROTAMASE) 
(CYCLOPHILIN A) 
(rYfLO^PORTW 1 A- 

V V- X ^-f ±J\~/LD XT WIN. _L X>l 

BINDING PROTEIN) 
(H. sapiens) 
[Homo sapiens] 


164 


164/164 
(100%) 


164/164 
(100%) 


3e-91 


gi | 12 80433 5 | gb | AAHO 
3026 . 1 | AAH03026 
(BC003026) 


Unknown (protein 
for 

IMAGE:2823490) 
[Homo sapiens] 


174 


141/164 
(85%) 


151/164 
(91%) 


3e-76 


gi|4033689|sp| P043 7 
4 | CYPH_BOVIN 


PEPTIDYL-PROLYL 
CIS-TRANS 
ISOMERASE A 
(PPIASE) 

(ROTAMASE) (CYCLOP 
HILIN A) 

(CYCLOSPORIN A- 
BINDING PROTEIN) 


164 


141/164 
(85%) 


151/164 
(91%) 


le-75 ■ 


gi | 10863927 | ref | NP_ 
066953 .1 | 
(NM_021130) 


peptidylprolyl 
isomerase A 
(cyclophilin A) 
[Homo sapiens] 


165 


141/164 
(85%) 


151/164 
(91%) 


2e-75- 


gi | 68401 |pir| | CSBOA 
B 


peptidylprolyl 
isomerase (EC 
5.2.1.8) A - 
bovine 


163 


140/163 
(85%) 


150/163 
(91%) 


5e-75 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 47D. In the ClustalW alignment of the NOV47 protein, as 
10 well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 47D. ClustalW Analysis of NOV47 

1) Novel NOV47 (SEQ ID NO: 180) 

2) gi | 17440554 | ref |XP_067503 . 1 | (XM_067503) similar to PEPTIDYL -PROLYL CIS-TRANS 
ISOMERASE A(PPIASE) (ROTAMASE) (CYCLOPHILIN A) (CYCLOSPORIN A-BINDING PROTEIN) (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 53 7) 

3) gi | 1280433 5 | gb | AAH03 026 . 1 | AAH03 02 6 (BC003026) Unknown (protein for 
IMAGE: 2823490) [Homo sapiens] (SEQ ID NO: 538) 

4) gi|4033689|sp|P04374|CYPH_BOVIN PEPTIDYL -PROLYL CIS-TRANS ISOMERASE A .(PPIASE) 
(ROTAMASE) (CYCLOPHILIN A) (CYCLOSPORIN A-BINDING PROTEIN) (SEQ ID NO: 539) 

5) gi | 10863927 | ref | NP_066 953 . 1 | (NM_021130) pept idylprolyl isomerase A (cyclophilin 
A) [Homo sapiens] (SEQ ID NO: 540) 

6) gi | 684 01 | pir j | CSBOAB pept idylprolyl isomerase (EC 5.2.1-8) A - bovine (SEQ ID 
NO:541) 



20 



25 



30 



NOV47 

gi | 17440554 | 
gi|l2804335 j 
gi|4033689 | 
gi|l0863927 | 
gi|6840l| 



NOV47 

gi | 17440554 | 
gi |12804335 | 
gi|4033689 | 
gi|10863927 | 
gi|6840l| 



52 
52 
61 
52 
52 
51 




70 



80 



90 



100 



110 



120 
• - I 



CFHRI I PGFfflCQGGDFTBHNGT 
CFHRI I PGFQCQGGDFTgHNGT 



KSIYgEKFDDENFILKHTGPGILSMANAGPNgNGSQ 
KS I YraEKFDDENF I LKHTGPGI LSMANAGPNTNGSQ 



CFHRI IPGFMCQGGDFTRHNGTGGKSI YGEKFjgDENFILKHTGPGILSMANAGPNTNGSQ 
CFHRI I PGFMCQGGDFTRHNGTGGKS I YGE KFDDENF I LKHTGPGI LSMANAGPNTNGSQ 
CFHRI I PGFMCQGGDFTRHNGTGGKS I YGEKFj|]DENFI LKHTGPGI LSMANAGPNTNGSQ 
CFHRI I PGFMCQGGDFTRHNGTGGKS I YGEKFDDENFI LKHTGPGI LSMANAGPNTNGSQ 



111 
111 
120 
111 
111 
110 



35 



40 



130 



NOV47 


112 


gi| 17440554 | 


112 


gi j 12804335| 


121 


gi|4033689 | 


112 


gi [10863927 | 


112 


gi |68401| 


111 




Table 47E lists the domain descriptions from DOMAIN analysis results against 
45 NOV47. This indicates that the NOV47 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 47E Domain Analysis of NOV47 

gnl | Pf am | pf am00160 , pro_isomerase, Cyclophilin type peptidyl -prolyl 
cis-trans isomerase (SEQ ID NO: 839) 
CD-Length = 162 residues, 100.0% aligned 
Score = 219 bits (558) , Expect = le-58 



NOV47 : 5 TVFFDIAVNSEPLGCVSFELFADKLPKTAENFHALSTGEKGFD- YEGYCFHRI IPGFVCQ 63 

50 MM + +||| + MM | +||| HI || MUM 1 + 1 MMI 1+ I 

Sbjct : 1 KVYFDITIGGKPLGRIVFELFGDWPKTVENFRALCTGEKGFGFYKGSTFHRVIPNFMIQ 60 
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NOV47: S4 GGDFTCHNGTGSKSIYREKFDDENFILKHTGPGILSMANAGPNANGSQFFMCPAKTKWLD 123 

Mill 1 1 1 I MM III MM III II IMIIIMI IIIIII+ I III 

Sbjct: 61 GGDFTRGNGTGGKSI YGEKFKDENFNLKHDRPGTLSMANAGPNTNGSQFFITTVATPWLD 12 0 

5 NOV47: 124 GKQWFGRVKEGMDI VEAMERFVFRNG- KTSKKVTIADCGQL 164 

II IIIM IIIM+ -I +1 II I lllllll 

Sbjct: 121 GKHWFGKWEGMDWDKIENVGTDSGDVPSKDVKIADCGQL 162 

Cyclophilin A (peptidyl-prolyl cis-trans isomerase A) is the major high-affinity 

10 binding protein in vertebrates for the immunosuppressive drug cyclosporin A (CSA). Because 
of its dramatic effects on decreasing morbidity and increasing survival rates in human 
transplants, the molecular mechanism of immunosuppression by cyclosporin A has been a 
matter of much interest. Cyclophilin A is a member of the immunophilin class of proteins that 
all possess peptidyl-prolyl cis-trans isomerase (PPIase) activity and, therefore, are believed to 

15 be involved in protein folding and/or intracellular protein transport. PPIase accelerates protein 
folding by catalyzing the cis-trans isomerization of proline imidic peptide bonds in 
oligopeptides. It is probable that CSA mediates some of its effects via an inhibitory action on 
PPIase. Cyclophilin is a cytosolic protein that belongs to a family of isozymes, including 
cyclophilins B and C, PPIase, and natural killer cell cyclophilin-related protein. The sequences 

20 of the different forms of cyclophilin-type PPIases are well conserved. Additional interest in 
cyclophilin A stems from studies performed by Luban et al. (1993), who showed that 
cyclophilin A binds to the gag protein of human immunodeficiency virus type 1 (HIV-1). This 
interaction can be inhibited by the immunosuppressant cyclosporin A and also by 
nonimmunosuppressive, cyclophilin A-binding cyclosporin A derivatives, which were also 

25 shown to exhibit potent anti-HIV-1 activity. Thus, cyclophilin A may have ah essential 
function in HIV-1 replication. 

The disclosed NOV47 nucleic acid of the invention encoding a Peptidyl-Prolyl Cis- 
Trans Isomerase-like protein includes the nucleic acid whose sequence is provided in Table 
47A or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 

30 whose bases may be changed from the corresponding base shown in Table 47A while still 

encoding a protein that maintains its Peptidyl-Prolyl Cis-Trans Isomerase -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
„ nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 

35 invention additionally includes nucleic acids or nucleic acid fragments, or complements 

thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
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chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 1 percent of the bases may be so 
changed. 

5 The disclosed NOV47 protein of the invention includes the Peptidyl-Prolyl Cis-Trans 

Isomerase-like protein whose sequence is provided in Table 47B. The invention also includes 
a mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 47B while still encoding a protein that-maintains its Peptidyl-Prolyl 
Cis-Trans Isomerase-like activities and physiological functions, or a functional fragment 
10 thereof. In the mutant or variant protein, up to about 1 5 percent of the residues may be so 
changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Peptidyl-Prolyl Cis-Trans 
15 Isomerase -like protein (NOV47) is a member of a "Peptidyl-Prolyl Cis-Trans Isomerase 
family". Therefore, the NOV47 nucleic acids and proteins identified here may be useful in 
potential therapeutic applications implicated in (but not limited to) various pathologies and 
disorders as indicated below. The potential therapeutic applications for this invention include, 
but are not limited to: protein therapeutic, small molecule drug target, antibody target 
20 (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or prognostic 

marker, gene therapy (gene delivery/gene ablation), research tools, tissue regeneration in vivo 
and in vitro of all tissues and cell types composing (but not limited to) those defined here. 

The NOV47 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in inflammatory bowel disease, diverticular disease, and/or 
25 other diseases and pathologies. 

NOV47 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV47 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
30 NOVX Antibodies" section below. The disclosed NOV47 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 
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NOV48 

NOV48 includes three novel Phospholipase C Delta-4-like proteins disclosed below. 
The disclosed sequences have been named NOV48a and NOV48b. 
NOV48a 

5 A disclosed NOV48a nucleic acid of 3238 nucleotides (also referred to as CG56743- 

01) encoding a Phospholipase C Delta-4-like protein is shown in Table 48A. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 370-372 and 
ending with a TGA codon at nucleotides 2626-2628. The start and stop codons are shown in 
bold in Table 48 A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 48A. NOV48a nucleotide sequence (SEQ ID NO:181). 

TTCTGATCTTTATGTGATTTGAGGCAGGTTTCTAAACCTATCTAAAGTGTCAGAGTCACTAAACTCAAAATT 
AGAAGCAAAAATCAGCTACAGACTATCTTCAAGATTCACCCAGAGCCCTTTGCTCTTCCTTGCTCCTTTAGG 
TGATCTGGTGCCAGCTGGTGGAACAGTGGGTGATGGCGTCCCTGCTGCAAGACCGTGAGTGCCGGGGCCCCT 
GCAGGGGAAGAGGCCTTAGTGTACAGCTCAGGGAAGGGAAGGAGGTTGGACCCCTGTTCCAGAGCTCTCCCT 
GGGCCTGCTACCCTCTCTGCTGGCTACCTAACCCCTGCTTTTCCTGACCTAGAGCTGACCACTGATCAGGAC 
TTGCTGCTG ATGCAGGAAGGCATGCCGATGCGCAAGGTGAGGTCCAAAAGCTGGAAGAAGCTAAGATACTTC 
AGACTTCAGAATGACGGCATGACAGTCTGGCATGCACGGCAGGCCAGGGGCAGTGCCAAGCCCAGCGTCTCA 
ATCTCTGATGTGGAGACAATACGTAATGGCCATGATTCCGAGTTGCTGCGTAGCCTGGCAGAGGAGCTCCCC 
CTGGAGCAGGGCTTCACCATTGTCTTCCATGGCCGCCGCTCCAACCTGGACCTGATGGCCAACAGTGTTGAG 
GAGGCCCAGATATGGATGCGAGGGCTCCAGCTGTTGGTGGATCTTGTCACCAGCATGGACCATCAGGAGCGC 
CTGGACCAGTATCTGAGCGATTGGTTTCAACGTGGAGACAAAAATCAGGATGGTAAGATGAGTTTCCAAGAA 
GTTCAGCGGTTATTGCACCTAATGAATGTGGAAATGGACCAAGAATATGCCTTCAGTCTCTTGCAGGCAGCA 
GACACGTCCCAGTCTGGAACCCTGGAAGGAGAAGAATTCGTACAGTTCTATAAGGCATTGACTAAACGTGCT 
GAGGTGCAGGAACTGTTTGAAAGTTTTTCAGCTGATGGGCAGAAGCTGACTCTGCTGGAATTTTTGGATTTC 
CTCCAAGAGGAGCAGAAGGAGAGAGACTGCACCTCTGAGCTTGCTCTGGAACTCATTGACCGCTATGAACCT 
TCAGACAGTGGTAAGCTGCGGCATGTGCTGAGTATGGATGGCTTCCTCAGCTACCTCTGCTCTAAGGATGGA 
GACATCTTCAACCCAGCCTGCCTCCCCATCTATCAGGATATGACTCAACCCCTGAACCACTACTTCATCTGC 
TCTTCTCATAACACCTACCTAGTGGGGGACCAGCTTTGTGGCCAGAGCAGCGTCGAGGGATATATACGGGCC 
CTGAAGCGGGGGTGCCGCTGCGTGGAGGTGGATGTATGGGATGGACCTAGCGGGGAACCTGTCGTTTACCAC 
GGACACACCCTGACCTCCCGCATCCTGTTCAAAGATGTCGTGGCCACAGTAGCACAGTATGCCTTCCAGACA 
TCAGACTACCCAGTCATCTTGTCCCTGGAGACCCACTGCAGCTGGGAGCAGCAGCAGACCATGGCCCGTCAT 
CTGACTGAGATCCTGGGGGAGCAGCTGCTGAGCACCACCTTGGATGGGGTGCTGCCCACTCAGCTGCCCTCG 
C CTG AGGAG CTTCGGAGGAAGATCCTGGTGAAGGGGAAGAAGT TAAC AC TTG AGGAAGAC CTGGAAT ATG AG 
GAAGAGGAAGCAGAACCTGAGTTGGAAGAGTCAGAATTGGCGCTGGAGTCCCAGTTTGAGACTGAGCCTGAG 
C CCC AGGAG C AG AACCT T C AG AAT AAGGAC AAAAAG AAGGT AAGC C AGC TT CT C C AG AAAT C C AAGC CCATC 
TTGTGTCCAGCCCTCTCTTCCCTGGTTATCTACTTGAAGTCTGTCTCATTCCGCAGCTTCACACATTCAAAG 
GAGCACTACCACTTCTACGAGATATCATCTTTCTCTGAAACCAAGGCCAAGCGCCTCATCAAGGAGGCTGGC 
AATGAGTTTGTGCAGCACAATACTTGGCAGTTAAGCCGTGTGTATCCCAGCGGCCTGAGGACAGACTCTTCC 
AACTACTACAACCCCCAGGAACTCTGGAATGCAGGCTGCCAGATGGTGGCCATGAATATGCAGACTGCAGGG 
CTTGAAATGGACATCTGTGATGGGCATTTCCGCCAGAATGGCGGCTGTGGCTATGTGCTGAAGCCAGACTTC 
CTGCGTGATATCCAGAGTTCTTTCCACCCTGAGAAGCCCATCAGCCCTTTCAAAGCCCAGACTCTCTTAAAC 
CAGGTGATCAGCGGTCAGCAACTCCCCAAAGTGGACAAGACCAAAGAGGGGTCCATTGTGGATCCACTGGTG 
AAAGTGCAGATCTTTGGCGTTCGTCTAGACACAGCACGGCAGGAGACCAACTATGTGGAGAACAATGGTTTT 
AATCCATACTGGGGGCAGACACTATGTTTCCGGGTGCTGGTGCCTGAACTTGCCATGCTGCGTTTTGTGGTA 
ATGGATTATGACTGGAAATCCCGAAATGACTTTATTGGTCAGTACACCCTGCCTTGGACCTGCATGCAACAA 
GGTTACCGCCACATTCACCTGCTGTCCAAAGATGGCATCAGCCTCCGCCCAGCTTCCATCTTTGTGTATATC 
TGCATCCAGGAAGGCCTGGAGGGGGATGAGTCCTGA GGTGGGCATTTCACGGGAAGGGTTGGTGTGCTGGCT 
TTAGACGGGGAGAAACATCTGGAAGGATGCTCGAGAGAACAAATGGAGGTGGTGAAAATCAAGCTTTGGATT 
GTGCATTCCTAGGCACAAAATTACCTCATTCTTCCTAACAAGCAATCTGGGACCTGATTTTCCACCTTTTTT 
CTCTTTTCTTCCCTTCCTTTGTTTTCATAAGCCTTTGGTATCTTTCCTGCCCTTTTCCTTTGTGTACTCTAT 
ACTGGAGTTCCCTTCTTCCTCTTGCTGTAGGCTCAATCCCATACCGACATCTACAACTAATCTTTCCCATCA 
ACTCTGTGTGAAGGCAGGTTGCAACTAGAAATTCAGAGGGGCTTGGAATAGAGAAACCTAAAGAAGCATCAT 
CCCCTCCATCCCCAACTTCCTCAAAGCCCAAAGCCAAGGGAAGGATAAATCAAGGCTCAAGGCTTCCCCAGC 
AAAGATTAGGGAAAGAGACTTGACCCCAGGACTGTACTACGACTCTTAAGAGAACACTGCACAGCACTCAAA 
GTCCCCCACTGGACTGCTTCCTCCTTAGCCCCACTGGTATAAATACATCTCTCTCCAATTTGGCTTCAAA 
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In a search of public sequence databases, the NOV48a nucleic acid sequence, located 
on chromosome 3, has 1279 of 1285 bases (99%) identical to a gb:GENBANK- 
ID:AK023083|acc:AK023083.1 mRNA from Homo sapiens (cDNA FLJ13021 fis, clone 
NT2RP3000742, weakly similar to l-Phosphatidylinositol-4,5-Bisphosphate 
5 Phosphodiesterase Delta 1 (EC 3.1.4.1 1)) (E = 0.0). 

The disclosed NOV48a polypeptide (SEQ ID NO: 182) encoded by SEQ ID NO: 181 
has 752 amino acid residues and is presented in Table 48B using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV48a has no signal peptide 
and is likely to be localized to the cytoplasm with a certainty of 0.4500. Alternatively, 
10 NOV48a may also localize to the microbody (peroxisome) with a certainty of 0.1265, to the 
mitochondrial matrix space with a certainty of 0.1000, or to the lysosome (lumen) with a 
certainty of 0.1000. 



Table 48B. Encoded NOV48a protein sequence (SEQ ID NO: 182). 



MQEGMPMRKVRSKSWKKLRYFRLQNDGMTVWHARQARGSAKPSVSISDVETIRNGHDSELLRSLAEELPLEQ 
GFTIVFHGRRSNLDLMANSVEEAQIWMRGLQLLVDLVTSMDHQERLDQYLSDWFQRGDKNQDGKMSFQEVQR 
IJLHLMNVEMDQEYAFSLLQAADTSQSGTLEGEEFVQF 

EQKERDCTSELALELIDRYEPSDSGKLRHVLSMDGFLSYLCSKDGDIFNPACLPIYQDMTQPLNHYFICSSH 
NTYLVGDQLCGQSSVEGYIRALKRGCRCVEVDVWDGPSGEPWYHGHTLTSRILFKDWATVAQYAFQTSDY 
PVILSLETHCSWEQQQTMARHLTEILGEQLLSTTLDGVLPTQLPSPEELRRKILVKGKKLTLEEDLEYEEEE 
AEPELEESELALESQFETEPEPQEQNLQNKDKKKVSQLLQKSKPILCPALSSLVIYLKSVSFRSFTHSKEHY 
HFYEISSFSETKAKRLIKEAGNEFVQHNTWQLSRVYPSGLRT 

DICDGHFRQNGGCGYVLKPDFLRDIQSSFHPEKPISPFKAQTLLNQVISGQQLPKVDKTKEGSIVDPLVKVQ 

IFGVRLDTARQETNYVENNGFNPYWGQTLCFRVLVPELAML 

H I HLL S KDGI SLR PAS I FVY I C I QEGLEGDE S 



15 A search of sequence databases reveals that the NOV48a amino acid sequence has 619 

of 752 amino acid residues (82%) identical to, and 675 of 752 amino acid residues (89%) 
similar to, the 764 amino acid residue ptnr:pir-id:S141 13 protein from bovine (1- 
phosphatidylinositol-4,5-bisphosphate phosphodiesterase (EC 3.1.4.1 1) delta-2) (E = 0.0). 
NOV48a is predicted to be expressed in at least Amygdala, Bone Marrow, Brain, 

20 Epidermis, Heart, Hypothalamus, Lung, Mammary gland/Breast, Pituitary Gland, Placenta, 
Retina, Skeletal Muscle, Small Intestine, Stomach. This information was derived by 
determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or RACE 
sources. 

25 NOV48b 

In the present invention, the target sequence identified previously, NOV48a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
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downstream sequence available for the reverse primer. In each case, the sequence was 
examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 
5 silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, or by translated homology of the predicted exons to closely 
related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on' the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 

10 brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 

15 other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV48b. This differs from the previously 

20 identified sequence (NOV48a) in having a deletion of 6 amino acids in one region and one 
amino acid at another region. 

A disclosed NOV48b nucleic acid of 2341 nucleotides (also referred to as CG56743- 
02) encoding a Phospholipase C Delta-4-like protein is shown in Table 48C. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 55-57 and ending 

25 with a TGA codon at nucleotides 2278-2280. The start and stop codons are shown in bold in 
Table 48C, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 48C. NOV48b nucleotide sequence (SEQ ID NO: 183). 

TTTCCTGACCTAGAGCTGACCACTGATCAGGGCTTGCTGCTGGTGCAGGAAGGC ATGCCGATGCGCAAGGTG 
AGGTCC AAAAGCTGGAAG AAGC T AAGAT AC TT C AGATTTC AGAATGACGGC ATGACAGT CTGGC ATG C ACGG 
CAGGCCAGGGGCAGTGCCAAGCCCAGCTTCTCAATCTCTGATGTGGAGACAATACGTAATGGCCATGATTCC 
GAGTTGCTGCGTAGCCTGGCAGAGGAGCTCCCCCTGGAGCAGGGCTTCACCATTGTCTTCCATGGCCGCCGC 
TCCAACCTGGACCTGATGGCCAACAGTGTTGAGGGGGCCCAGATATGGATGCGAGGGCTCCAGCTGTTGGTG 
GATCTTGTCACCAGCATGGACCATCAGGAGCGCCTGGACCAATGGCTGAGCGATTGGTTTCAACGTGGAGAC 
AAAAATCAGGATGGTAAGATGAGTTTCCAAGAAGTTCAGCGGTTATTGCACCTAATGAATGTGGAAATGGAC 
CAAGAATATGCCTTCAGTCTTTTTCAGGCAGCAGACACGTCCCAGTCTGGAACCCTGGAAGGAGAAGAATTC 
GTACAGTTCTATAAGGCATTGACTAAACGTGCTGAGGTGCAGGAACTGTTTGAAAGTTTTTCAGCTGATGGG 
CAGAAGCTGACTCTGCTGGAATTTTTGGATTTCCTCCAAGAGGAGCAGAAGGAGAGAGACTGCACCTCTGAG 
CTTGCTCTGGAACTCATTGACCGCTATGAACCTTCAGACAGTGGCAAACTGCGGCATGTGCTGAGTATGGAT 
GGCTTCCTCAGCTACCTCTGCTCTAAGGATGGAGACATCTTCAACCCAGCCTGCCTCCCCATCTATCAGGAT 
ATGACTCAACCCCTGAACCACTACTTCATCTGCTCTTCTCATAACACCTACCTAGTGGGGGACCAGCTTTGT 
GGCCAGAGCAGCGTCGAGGGATATATACGGGCCCTGAAGCGGGGGTGCCGCTGCGTGGAGGTGGATGTATGG 
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GATGGACCTAGCGGGGAACCTGTCGTTTACCACGGACACACCCCGACCTCCCGCATCCTGTTCAAAGATGTC 
GTGGCCACAGTAGCACAGTATGCCTTCCAGACATCAGACTACCCAGTCATCTTGTCCCTGGAGACCCACTGC 
AGCTGGGAGCAGCAGCAGACCATGGCCCGTCATCTGACTGAGATCCTGGGGGAGCAGCTGCTGAGCACCACC 
TTGGATGGGGTGCTGCCCACTCAGCTGCCCTCGCCTGAGGAGCTTCGGAGGAAGATCCTGGTGAAGGGGAAG 
AAGTT AACACT TGAGGAAGACC TGGAAT ATG AGGAAGAGGAAG C AG AAC CTG AGT TGG AAG AGTC AG AATTG 
GCGCTGGAGTCCCAGTTTGAGACTGAGCCTGAGCCCCAGGAGCAGAACCTTCAGAATAAGGACAAAAAGAAG 
AAATCCAAGCCCATCTTGTGTCCAGCCCTCTCTTCCCTGGTTATCTACTTGAAGTCTGTCTCATTCCGCAGC 
TTCACACATTCAAAGGAGCACTACCACTTCTACGAGATATCATCTTTCTCTGAAACCAAGGCCAAGCGCCTC 
ATCAAGGAGGCTGGCAATGAGTTTGTGCAGCACAATACTTGGCAGTTAAGCCGTGTGTATCCCAGCGGCCTG 
AGGACAGACTCTTCCAACTACAACCCCCAGGAACTCTGGAATGCAGGCTGCCGGATGGTGGCCATGAATATG 
CAGACTGCAGGGCTTGAAATGGACATCTGTGATGGGCATTTCCGCCAGAATGGCGGCTGTGGCTATGTGCTG 
AAGCCAGACTTCCTGCGTGATATCCAGAGTTCTTTCCACCCTGAGAAGCCCATCAGCCCTTTCAAAGCCCAG 
ACTCTCTTAATCCAGGTGATCAGCGGTCAGCAACTCCCCAAAGTGGACAAGACCAAAGAGGGGTCCATTGTG 
GATCCACTGGTGAAAGTGCAGATCTTTGGCGTTCGTCTAGACACAGCACGGCAGGAGACCAACTATGTGGAG 
AACAATGGTTTTAATCCATACTGGGGGCAGACACTATGTTTCCGGGTGCTGGTGCCTGAACTTGCCATGCTG 
CGTTTTGTGGTAATGGATTATGACTGGAAATCCCGAAATGACTTTATTGGTCAGTACACCCTGCCTTGGACC 
TGCATGCAACAAGGTTACCGCCACATTCACCTGCTGTCCAAAGATGGCATCAGCCTCCGCCCAGCTTCCATC 
TTTGTGTATATCTGCATCCAGGAAGGCCTGGAGGGGGGTGAGTCCTGA GGTGGGCATTTCACGGGAAGGGTT 
GGTGTGCTGGCTTTAGACGGGGAGAAACATCTGGAAG 



In a search of public sequence databases, the NOV48b nucleic acid sequence, located 
on chromosome 2, has 1069 of 1075 bases (99%) identical to a gb:GENBANK- 
ID:AK023083|acc:AK023083.1 mRNA from Homo sapiens (cDNA FLJ13021 fis, clone 
NT2RP3000742, weakly similar to l-Phosphatidylinositol-4,5-Bisphosphate 
Phosphodiesterase Delta 1 (EC 3.1.4.1 1)) (E = 0.0). 

The disclosed NOV48b polypeptide (SEQ ID NO: 1 84) encoded by SEQ ID NO: 1 83 
has 741 amino acid residues and is presented in Table 48D using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV48b has no signal peptide 
and is likely to be localized to the mitochondrial matrix space with a certainty of 0.6523. 
Alternatively, NOV48b may also localize to the mitochondrial inner membrane with a 
certainty of 0.3462, to the mitochondrial intermembrane space with a certainty of 0.3462, or to 
the mitochondrial outer membrane with a certainty of 0.3462. 



Table 48D. Encoded NOV48b protein sequence (SEQ ID NO:184). 

MPMRKVRSKSWKKLRYFRFQNDGMTVWHARQARGSAKPSFSISDVETIRNGHDSELLRSLAEELPLEQGFTI 
VFHGRRSNLDLMANSVEGAQIWMRGLQLLVDLWSMDHQERLDQWLSDWFQRGDKNQDGKMSFQEVQRLLHL 
MNVEMDQEYAFSLFQAADTSQSGTLEGEEFVQFYKALTKRAEVQELFESFSADGQKLTLLEFLDFLQEEQKE 
RDCTSELALELIDRYEPSDSGKLRHVLSMDGFLSYLCSKDGDIFNPACLPIYQDMTQPLNHYFICSSHNTYL 
VGDQLCGQSSVEGYIRALKRGCRCVEVDVWDGPSGEPWYHGHTPTSRILFKDWATVAQYAFQTSDYPVIL 
SLETHCSWEQQQTMARHLTEILGEQLLSTTLDGVLPTQLPSPEELRRKILVKGKKLTLEEDLEYEEEEAEPE 
LEESELALESQFETEPEPQEQNLQNKDKKKKSKPILCPALSSLVIYLKSVSFRSFTHSKEHYHFYEISSFSE 
TKAKRLIKEAGNEFVQHNTWQLSRVYPSGLRTDSSNYNPQELWNAGCRMVAMNMQTAGLEMDICDGHFRQNG 
GCGYVLKPDFLRDIQSSFHPEKPISPFKAQTLLIQVISGQQLPKVDKTKEGSI VDPLVKVQIFGVRLDTARQ 
ETNYVENNGFNPYWGQTLCFRVLVPELAMLRFVVMDYDWKSRNDFIGQYTLPWTCMQQGYRHIHLLSKDGIS 
LRPAS I FVYI C IQEGLEGGES 

15 

A search of sequence databases reveals that the NOV48b amino acid sequence has 736 
of 741 amino acid residues (99%) identical to, and 737 of 741 amino acid residues (99%) 
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5 



.10 




similar to, the 762 amino acid residue ptnr:TREMBLNEW-ACC:AAH06355 protein from 
Homo sapiens (Human) (Unknown (Protein For MGC: 12837)) (E = 0.0). 

NOV48b is predicted to be expressed in at least Heart, Stomach, Small Intestine, Bone 
Marrow, Skeletal Muscle, Brain, Hypothalamus, Pituitary Gland. .The sequence is predicted to 
5 have the expression pattern of (GENBANK-ID: gb:GENBANK- 

ID:AK023083|acc:AK023083.1) a closely related Homo sapiens cDNA FLJ13021 fis, clone 
NT2RP3000742, weakly similar to l-Phosphatidylinositol-4,5-Bisphosphate 
Phosphodiesterase Delta 1 (EC 3.1.4.1 1) homolog. 

NOV48a also has homology to the amino acid sequences shown in the BLASTP data 
10 listed in Table 48E. 



Table 48E. BLAST results for NOV48a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 1424 93 40 | ref | NP_ 
116115. l| 
(NM_032726) 


hypothetical 
protein MGC12837 
[Homo sapiens] 


762 


736/752 
(97%) 


738/752 
(97%) 


0 . 0 


gi | 108854 |pir | |S141 
13 


1- 

phosphatidylinosi 
tol-4,5- 
bisphosphate 
phosphodiesterase 
(EC3 .1.4. 11) 
delta-2 - bovine 


764 


613/757 
(80%) 


671/757 
(87%) 


0 . 0 


gi|l8093100|ref | NP 
542419. l| 
(NM 080688), 


phospholipase C, 
delta 4 [Rattus 
norvegicus] 


772 


550/755 
(72%) 


631/755 
(82%) 


0 . 0 


gi | 13 0418 9 | dbj | BAA0 
9046. l| (D50455) 


phodpho lipase C 
delta4 [Rattus 
norvegicus] 


771 


548/756 
(72%) 


629/756 
(82%) 


0.0 


gi | 12855950 | db j | BAB 
30513. l| (AK016945) 


data source :MGD, 
source 

key :MGI : 107469, ev 
idence : ISS-phosph 
olipase C, delta 
4 -putative [Mus 
musculus] 


447 


335/430 
(77%) 


375/430 
(86%) 


0.0 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 48F. In the ClustalW alignment of the NOV48 protein, as 
15 well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 48F. ClustalW Analysis of NOV48 



1) Novel NOV4 8a (SEQ ID NO: 18 2) 

2) Novel NOV4 8b (SEQ ID NO: 184) 

3) gi | 1424 934 0 | ref | NP_116 115 . 1 | (NM_032726) hypothetical protein MGC12837 [Homo 
sapiens] (SEQ ID NO: 542) 

4) gi | 108854 |pir | | S14113 1 -phosphatidylinositol-4 , 5-bisphosphate phosphodiesterase 
(EC3.1.4.11) delta-2 - bovine (SEQ ID NO:543) 

5) gi | 18093100 |ref | NP_5424 19 . 1 | (NM_080688) phospholipase C, delta 4 [Rattus 
norvegicus] (SEQ ID NO: 544) 

6) gi | 13 0418 9 | dbj | BAA0 9 04 6 . 1 | (D50455) phodpholipase C delta4 [Rattus norvegicus] 
(SEQ ID NO: 545) 

7) gi j 12855950 |dbj | BAB30513 . 1 | (AK016945) data source :MGD, source 

key :MGI : 107469, evidence : ISS^phospholipase C, delta 4~putative [Mus musculus] (SEQ ID 
NO: 54 6) 



NOV48a 

NOV48b 

gi | 14249340 | 

gi | 108854 | 

gi j 18093100 | 

gi | 1304189 | 

gi | 12855950 | 



1 
1 
1 
1 
1 
1 
1 




30 



40 

J.. 



50 
■ I - 



60 



mrkvr|kswkklryfrlqgdgmtvwhgrq 
mrkvr|kswkklryfr|3qSdgmtvwhBrq 
rkvrIkswkklryfrlqSdgmtvwhHrq 
mrkvr|kswkklr|frlq|dgmtvwh8rq 
mrkvr«kswkklryfrlqSdgmtvwhSrJ 
mrkvr|kswkklryfrlq|dgmtvwhSr8 

MRKVRjlKS W KKL R Y FRLqSsDGMT VWhSSq 



argsa 


1 


2<; 


i 43 


args^ 






I 39 


ARGS^ 






I 60 


AGGRA 






1 60 


LESljS 






1 60 


LESl| 






1 60 


PESMP 






1 60 



100 



NOV48a 

NOV48b 

gi | 14249340 | 

gi | 108854 | 

gi|l8093100| 

gi | 1304189 | 

gi j 12855950 | 



NOV48a 

NOV48b 

gi | 14249340 | 

gi | 108854 | 

gi | 18093100 | 

gi | 1304189 | 

gi | 12855950 | 




190 



200 



210 



220 



230 



240 



NOV48a 

NOV48b 

gi | 14249340 | 

gi | 108854 | 

gi | 18093100 | 

gi j 1304189 | 

gi | 12855950 | 



NOV48a 

NOV48b 

gi | 14249340 | 

gi j 108854 | 

gi j 18093100 | 

gi j 1304189 | 

gi j 12855950 | 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



NOV48a 

NOV48b 

gi | 14249340 

gi j 108854 | 

gi j 18093100 

gi | 1304189 | 

gi | 12855950 



NOV4 8a 

NOV48b 

gi | 14249340 

gi | 108854 | 

gi | 18093100 

gi j 1304189 | 

gi j 12855950 



NOV4 8a 

NOV48b 

gi | 14249340 

gi j 108854 | 

gi j 18093100 

gi j 1304189 | 

gi j 12855950 



NOV48a 

NOV4 8b 

gi | 14249340 

gi j 108854 | 

gi j 18093100 

gi|l304189| 

gi|l2855950 



NOV48a 

NOV48b 

gi | 14249340 

gi j 108854 | 

gi|l8093100 

gi j 1304189 | 

gi j 12855950 



NOV48a 

NOV48b 

gi | 14249340 

gi | 108854 | 

gi 1 18093100 

gi | 1304189 | 

gi j 12855950 



NOV48a 

NOV48b 

gi | 14249340 

gi j 108854 | 

gi j 18093100 

gi | 1304189 | 

gi | 12855950 
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HLTEILGEQLLSTTLgG^. 
HLTEILGEQLLSTTLgGffiL 

hlteilgeqllsttlBgSl 



HLTE I LGSgQLLSTgLlG 
jHLffglLGEQLLSTTLl 
AjgHLyg ILGE Q LLSTTL i 

mthlteilgeollBStlisgi 



470 



420 

TflPTQ 4 02 

PTQ 3 98 

PTQ 419 

QPPTQ 419 

ijJlDI 419 

f-IDI 419 

,|JVDS 419 



480 



PE 


QEQ 




458 


PE 


QEQ 


M 


454 


PE 


QEQ 


1ST 


475 




4 76 LQij 

4 73 PRSED 

4 80 SG»Ws 

479 SGiffls 



560 



570 



580 



590 



600 




YNPQE 
YNPQE 
YNPQE 
YNPQE 
YNPQE 
YNPQE 




610 




GYVLKPgFLRD 

gyvlkpSflrd 
gyvlkpSflrd 
gyvlkpSflrd 
gyvlkp|flrd 
gyvlkpSflrd 




660 



qvisgqSlpkvd 
qvisgqBlpkvd 
qvisgqBlpkvd 
qvisgqBlpkvd 
qvisgqBlpkvd 
qvisgqIlpkvd 



633 
622 
643 
645 
653 
652 
447 




680 
J 



690 



700 



710 



720 



GVglDT^QETgYVENNG 

gv^dt^qetSyvenng 
gv^dt^qetSyvenng 
gv^dt^qetByvenng 
gv^dt^qetIyvenng 
gvBSdtBIqetIIyvenng 
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730 



740 



NOV48a 


694 


M 


DY 


DWK 


NOV48b 


683 


M 


DY 


DWK 


gi|l4249340| 


704 


M 


DY 


DWK 


gij 108854 | 


706 


K 


DY 


DWK 


gi|l8093100| 


714 


K 


DY 


SRT 


gi|l304189| 


713 




DY 


SRT 


gi 12855950 1 


447 









SRNgFIGQYTLPWTCMgfflGYRH 

srnSfigqytlpwtcm^gyrh 
srnSfigqytlpwtcmkKgyrh 
srnBfigqytlpwBcmBHgyrh 
srnSfigqytlpwtcm|Sgyrh 
srn|figqytlpwtcm|Kgyrh 




10 
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Tables 48G-N lists the domain descriptions from DOMAIN analysis results against 
NOV48. This indicates that the NOV48 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 48G Domain Analysis of NOV48 

gnl | Smart | smart 0 0 14 8 , PLCXc, Phospholipase C, catalytic domain (part) ; ■ 
domain X; Phosphoinositide-specif ic phospholipases C. These enzymes 
contain 2 regions (X and Y) which together form a TIM barrel- like 
structure containing the active site residues. Phospholipase C enzymes 
(PI -PLC) act as signal transducers that generate two second 
messengers, inositol- 1 , 4 , 5 - trisphosphate and diacylglycerol . The 
bacterial enzyme appears to be a homologue of the mammalian PLCs. (SEQ 
ID NO: 840) 

CD-Length = 145 residues, 100.0% aligned 
Score = 202 bits (514) , Expect = 6e-53 
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NOV48: 273 QDMTQPLNHYFICSSHNTYLVGDQLCGQSSVEGYIRALKRGCRCVEVDVWDGPSGEPWY 332 

III++II+IIII 1 1 I 1 1 I I I II l + IIMMI + IM IIIIM + I I I 1 1 1 1 1 1 — I 

Sbjct: 1 QDMSKPLSHYFINSSHNTYLTGKQLWGESSVEGYIQALKHGCRCVELDCWDGPDGEPVIY 6 0 



25 



NOV4 8 : 3 3 3 



Sbjct: 61 



NOV4 8 : 



393 



Sbjct: 121 



HGHTLTSRILFKDWATVAQYAFQTSDYPVILSLETHCSWEQQQTMARHLTEILGEQLLS 3 92 

llll I I +1+ + ++II II llllllll III +11 II- II 1+ I + 

HGHTFTLPIKLSEVLEAIKKFAFVTSPYPVILSLENHCSPDQQAKMAQMFKEIFGDLLYT 120 

TTLDGVLPTQLPSPEELRRKILVKGK 418 

I 11111+1+ MI + IM 

PPTTSSL - EYLPSPEQLKGKI LLKGK 145 



30 



Table 48H Domain Analysis of NOV48 

gnl | Pfam|pfam003 88, PI-PLC-X, Phosphatidylinositol-specif ic 
phospholipase C # X domain. This associates with pfam003 87 to form a 
single structural unit. (SEQ ID NO: 841) 
CD-Length = 145 residues, 100.0% aligned 
Score = 192 bits (489) , Expect = 4e-50 
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NOV4 8 : 


274 


Sbjct : 


1 


NOV4 8 : 


334 


Sb j Ct : 


61 


NOV4 8 : 


394 



DMTQPLNHYFICSSHNTYLVGDQLCGQSSVEGYIRALKRGCRCVEVDVWDGPSGEPWYH 

11+ IMIII II MM I II M M | + I NIMH MM II++II 

DMSIPLSHYFISSSHNTYLTGKQLWGKSQVESYRQQLDHGCRCVELDCWDGPDDEPIIYH 



333 



60 



393 



GHTLTSRILFKDWATVAQYAFQTSDYPVILSLETHCSWEQQQTMARHLTEILGEQLLST 

III I 111+ + + l+ll ll+IIMI 11+ +11+ II++ 111+11+ 
GGTFTLE I KLKDVLEAI KDFLFKTS P YP I ILSLENHCNSDQQRKMAKYFEE I FGDYLLTK 12 0 



3 94 TLDGVLPTQLPSPEELRRKILVKGKK 419 



422 



Sbjct: 121 



II I MM ++I+ lll + l II 

PLDS-LTTKLPSLKDLKGKILLKNKK 



145 



Table 481 Domain Analysis of NOV48 

gnl | Smart | smart 0 0 14 9, PLCYc, Phospholipase C, catalytic domain (part) ; 
domain Y; Phosphoinos i tide -specif ic phospho lipases C. These enzymes 
contain 2 regions (X and Y) which together form a TIM barrel -like 
structure containing the active site residues. Phospholipase C enzymes 

(PI -PLC) act as signal transducers that generate two second 
messengers, inositol- 1 , 4 , 5-trisphosphate and diacylglycerol . The 
bacterial enzyme appears to be a homologue of the mammalian PLCs . 

(SEQ ID NO: 842) 

CD-Length = 117 residues, 100.0% aligned 
Score = 182 bits (462) , Expect = 6e-47 



NOV4 8 : 4 82 LSSLVI YLKSVSFRSFTHSKEHYHFYEISSFSETKAKRLIKEAGNEFVQHNTWQLSRVYP 

II II I I Mil ++I I I l + l I I I I I I I I + I+++I +II++I MMIM 

Sbjct: 1 LSELVSYCAPVKFRSFELAEEKNPFYEMSSFSETKAKKLLEKAPTDFVRYNQRQLSRVYP 



541 



60 



10 NOV4 8: 542 SGLRTDSSNYYNPQELWNAGCQMVAMNMQTAGLEMDICDGHFRQNGGCGYVLKPDFLR 5 99 

I I Mill III II llllll+l II I ♦ I II II ill I 

Sbjct: 61 KGTRVDSSNY-NPQVFWNHGCQMVALNFQTPDKAMQLNQGMFRANGGCGYVLKPDFLR 117 



Table 48 J Domain Analysis of NOV48 i 

gnl | Pf am |pfam00387 , PI-PLC-Y, Phosphatidylinositol - specif ic 
phospholipase C, Y domain. This associates with pfam0 03 8 8 to form a 
single structural unit, (SEQ ID NO: 843) 
CD-Length = 118 residues, 99.2% aligned ^ 
Score = 163 bits (412) , Expect = 4e-41 



15 NOV48: 482 LSSLVIYLKSVSFRSFTHSKEHYHFYEISSFSETKAKRLIKEAGNEFVQHNTWQLSRVYP 541 

IMI 1-1 + I I I l + I ll+lllll III + I + II+ MMI lllllll 

Sbjct: 2 LSNLVNYIQSIKFRSFSLPTEKNTSYEMSSFSERKAKQLLKESPIEFVKHNKRQLSRVYP 61 

20 NOV4 8: 542 SGLRTDSSNYYNPQELWNAGCQMVAMNMQTAGLEMDICDGHFRQNGGCGYVLKPDFLR 599 

I I I I I I ' I I MINIUM I 1+ I I I [ I III ll + IM III 

Sbjct: 62 KGTRFDSSN- FMPQPFWNAGCQMVALNFQTSDLPMQINLGMFEYNGGSGYLLKPPFLR 118 



Table 48K Domain Analysis of NOV48 



gnl|Pfam|pfam00168, C2 , C2 domain. (SEQ ID NO: 844) 
CD-Length = 88 residues, 95.5% aligned 
Score = 88.2 bits (217), Expect = 2e-18 



25 NOV48: 623 VI SGQQLPKVDKTKEGS I VDPLVKVQIFGVRLDTARQETNYVENNGFNPYWGQTLCFR - V 681 

III + MM Ml III + 1 IM +1 1+ II Ml I I 

Sbjct: 5 VISARNLPKMDMN GLSDPYVKVDLDGDPKDTKKFKTKTVKKT-LNPVWNETFVFEKV 60 

NOV48: 682 LVPELAMLRFWMDYDWKSRNDFIGQYT 709 
30 + | + || HI | | | IMIIII I 

Sbjct: 61 PLPDLASLRFAVYDEDRFSRDDFIGQVT 88 
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Table 48L Domain Analysis of NOV48 

gnl | Smart | smart 0023 9 , C2 , Protein kinase C conserved region 2 (CalB) ; 
Ca2+-binding motif present in phospholipases , protein kinases C, and 
synaptotamins (among others) . Some do not appear to contain Ca2 + - 
binding sites. Particular C2s appear to bind phospholipids, inositol 
polyphosphates, and intracellular proteins. Unusual occurrence in 
perforin. Synaptotagmin and PLC C2s are permuted in sequence with 
respect to N- and C-terminal beta strands. SMART detects C2 domains 
using one or both of two profiles. (SEQ ID NO: 84 5) 
CD-Length = 101 residues, 100.0% aligned 
Score = 83.6 bits (205), Expect = 4e-17 



NOV48: 618 TLLNQ V I S GQ QL P K VD KT KEG S I VD PL VK VQ I FG VRLDT ARQ E TN Y VENNG FNP Y WGQTL 677 

II ++|| + II II II III + I + + I l+l II I +1 

Sbjct: 1 TLTVKIISARNLPPKDKG GKS DP YVKVSLDGDPREKKK — TKWKNTL - NPVWNETF 54 



NOV4 8: 67 8 CFRVLVPELAMLRFWMDYDWKSRNDFIGQYTLPWTCMQQGYRHIHL 7 24 

II 111+ I III 11+1111+ M + + III I 

Sbjct : 55 EFEVPPPELSELEIEVYDKDRFSRDDFIGRVTIPLSDLLLGGRHEKL 101 



Table 48M Domain Analysis of NOV48 

gnl | Smart | smart00233 , PH, Pleckstrin homology domain.,- Domain commonly 
found in eukaryotic signalling proteins. The domain family possesses 
multiple functions including the abilities to bind inositol 
phosphates, and various proteins. PH domains have been found to 
possess inserted domains (such as in PLC gamma, syntrophins) and to be 
inserted within other domains. Mutations in Brutons tyrosine kinase 
(Btk) within its PH domain cause X- linked agammaglobulinaemia (XLA) in 
patients. Point mutations cluster into the positively charged end of 
the molecule around the predicted binding site for 
phosphatidylinositol lipids. (SEQ ID NO: 84 6) 
CD-Length = 104 residues, 87.5% aligned 
Score = 47.8 bits (112), Expect = 2e-06 
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NOV48 : 


9 




Sbjct : 


12 


15 


NOV4 8 : 


68 




Sbjct : 


64 



KVRSKS WKKLRYFRLQNDGMTVWHARQARGS AKPSVS IS- DVET I RNGHDSELLRSLAEE 
| | | | | | | | | | + + + + + + | + | | || | + | || + 

SGGKKSWKK- RYFVLFNGVLLYYKSKKKKSSSKPKGSIPLSGCTVREAPDSD S 



67 



63 



L P LE QG FT I V FHGRR S NL DLM ANS VE E AQ I WM RGL QL L VD 

+ III I++ I I I I I I + 1+ 1+ + 
DKKKNCFEIVTPDRKT-LLLQAESEEERKEWVEALRKAIA 



107 



102 



Table 48N Domain Analysis of NOV48 

gnl | Pf am | pf am00169 , PH, PH domain. PH stands for pleckstrin homology 
(SEQ ID NO: 847) 

CD-Length = 100 residues, 88.0% aligned 
Score = 45.1 bits (105), Expect = le-05 



20 NOV48: 9 KVRSKSWKKLRYFRLQNDGMTVWHARQARGSAKPSVSISDVETIRNGHDSELLRSLAEEL 

I- I III III I II + + I I- -I - I 

Sbjct: 12 TVKKKRWKK- RYFFLFNDVLIYYKDKKKSYEPKGSIPLSGCS VEDVPDSEF 



68 



61 



NOV48: 69 PLEQG FT I VFHGRRSNLDLMANS VEE AQ I WMRGLQLL V 106 
25 | + + | | | | | | |++ +1 + 

Sbjct: 62 KRPNCFQLRSRDGKETFILQAESEEERQDWIKAIQSAI 99 
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Phosphatidylinositol-specific phospholipase C (EC 3.1 .4.1 1), an eukaryotic 
intracellular enzyme, plays an important role in signal transduction processes. It catalyzes the 
hydrolysis of l-phosphatidyl-D-myo-inositol-3,4,5-triphosphate into the second messenger 
molecules diacylglycerol and inositoM, 4, 5 -triphosphate. This catalytic process is tightly 
5 regulated by reversible phosphorylation and binding of regulatory proteins. In mammals, there 
are at least 6 different isoforms of PI-PLC, they differ in their domain structure, their 
regulation, and their tissue distribution. Lower eukaryotes also possess multiple isoforms of 
PI-PLC. All eukaryotic PI-PLCs contain two regions of homology, sometimes referred to as 
'X-box' and 'Y-box'. The order of these two regions is always the same (NH2-X-Y-COOH), 

10 but the spacing is variable. In most isoforms, the distance between these two regions is only 
50-100 residues but in the gamma isoforms one PH domain, two SH2 domains, and one SH3 
domain are inserted between the two PLC-specific domains. The two conserved regions have 
been shown to be important for the catalytic activity. At the C-terminal of the Y-box, there is a 
C2 domain possibly involved in Ca-dependent membrane attachment. Phosphoinositide- 

1 5 specific phospholipase C (PLC) mediates the cellular actions of a variety of hormones, 

neurotransmitters and growth factors. Agonist-dependent activation of PLC causes hydrolysis 
of membrane phosphatidylinositol 4,5-bisphosphate (PIP2), generating the second messengers 
inositol 1,4,5-trisphosphate (IP3) and diacylglycerol (DAG). IP3 binds specific intracellular 
receptors to trigger Ca2+ mobilisation, while DAG mediates activation of a family of protein 

20 kinase C isozymes. Based on molecular size, immunoreactivity and amino acid sequence, 

several subtypes have been classified. In PLC-beta subtypes, X and Y domains are separated 
by a stretch of 70-120 amino acids rich in Ser, Thr and acidic residues. Their C -terminus is 
rich in basic residues. In PLC-gammas, there is an insert of more than 400 residues containing 
an SH3 and two SH2 domains. PLCs show little similarity in the 300-residue N-terminal 

25 region preceding the X-domain. 

The disclosed NOV48 nucleic acid of the invention encoding a Phospholipase C Delta- 
4-Iike protein includes the nucleic acid whose sequence is provided in Table 48A, 48C or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 48A or 48C while still 

30 encoding a protein that maintains its Phospholipase C Delta-4 -like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
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include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
5 binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 1 percent of the bases may be so changed. 

The disclosed NOV48 protein of the invention includes the Phospholipase C Deltas- 
like protein whose sequence is provided in Table 48B or 48D. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
10 residue shown in Table 48B or 48D while still encoding a protein that maintains its 

Phospholipase C Delta-4-like activities and physiological functions, or a functional fragment 
thereof. In the mutant or variant protein, up to about 28 percent of the residues may be so 
changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 

1 5 (F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Phospholipase C Delta-4 -like 
protein (NOV48) is a member of a "Phospholipase C Delta-4 family". Therefore, the NOV48 
nucleic acids and proteins identified here may be useful in potential therapeutic applications 
implicated in (but not limited to) various pathologies and disorders as indicated below. The 

20 potential therapeutic applications for this invention include, but are not limited to: protein 
therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

25 The NOV48 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in Cardiomyopathy, Atherosclerosis,Hypertension, 
Congenital heart defects, Aortic stenosis Atrial septal defect (ASD) Atrioventricular (A-V) 
canal defect, Ductus arteriosus , Pulmonary stenosis , Subaortic stenosis, Ventricular septal 
defect (VSD), valve diseases,Tuberous sclerosis, Scleroderma, Obesity,Transplantation, 

30 Osteoporosis,Hypercalceimia, Arthritis, Ankylosing spondylitis, Scoliosis, Von Hippel-Lindau 
(VHL) syndrome , Alzheimer's disease, Stroke, Tuberous sclerosis, hypercalceimia, 
Parkinson's disease, Huntington's disease, Cerebral palsy, Epilepsy, Lesch-Nyhan syndrome, 
Multiple sclerosis, Ataxia-telangiectasia, Leukodystrophies, Behavioral disorders, Addiction, 
Anxiety, Pain, Neuroprotection, and/or other diseases and pathologies. 
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NOV48 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV48 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV48 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV49 

A disclosed NOV49 nucleic acid of 1588 nucleotides (also referred to as CG56739-01) 
encoding a Leukotriene-B4 Omega- Hydroxylase -like protein is shown in Table 49A. An 
open reading frame was identified beginning with a ATG initiation codon at nucleotides 2-4 
and ending with a TGA codon at nucleotides 1577-1 579. The start and stop codons are shown 
in bold in Table 49 A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 49 A. NOV49 nucleotide sequence (SEQ ID NO: 185). 

GATGTCGCTGCTGAGCCTGTCCTGGCTGGGCCTCGGGCCGGTGGCAGCATCCCCATGGCTGCTCCTGCTGCT 
GGTCGGGGCCTCCTGGCTCCTGGCCCGTGTCCTGGCCTGGACGTACGCCTTCTATGACAACTGCCACCGCCT 
CCAGTGTTTCCAGCAGCCTCCAAAACGGAACTGCTTTTTAGGTCACCTGAGCCTGGTGCGGGGCAATGAGGA 
GGACATGAGGCTGATGGAGGATCTGGGCCACTACTTCCGTGATGTCCAACTCTGGTGGCTTGGGTCTTTCTA 
CCCTGTCCTGCATCTCGTCCACCCTACGTTCACTGCCCCTGTGCTCCAGGCTTCAGCTGCTGTTGCACTCAA 
GGATATGAGTTTCTATGGCTTCCTGAAGCCCTGGCTGGGTCCTGATGGGCTCCTGATTAGTGCCGGTGACAA 
GTGGAGATGGCACCGCCACCTGCTCACACCTGCCTTCCACTTCAAAATCCTGAAGCCCTATGTGAAGATTTT 
CAATGAGAGCACGAACATCATGCACGCCAAATGGCAACGCCTGGCCTTGGAGGGCAGTGTCCGTCTGGAAAT 
GTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTGCAGAAATGCATCTTCAGCTTTGACAGCAATTGTCA 
GGAGAAGCCCAGCGAATATATTGATGCCATCTTGGAGCTCAGTGCCCTCAGTCTGAAACGGCACCAGCACAT 
CTTCCTGCTCACGGACTTCTTGTACTTCCTCACTCCCAATGGGCGACGCTTCTGCAGGGCCTGTGACATAGT 
GC ACAACTT C AC AG ATG CTGT C AT C C AGG AGCGGCGT C GCAC C CT C AC T AG C C AGGGTGTCG ATG AC TT C CT 
GCAGGCCAAGGCCAAGTCCAAGACTTTGGACTTCATTGACGTGCTCTTGCTGGCCAAGGATGAAAATGGAAA 
GAAGTTGTCAGATGAGAACATAAGAGCGGAGGCTGACACCTTCATGTCTGGGGGCCATGACACCTCGGCCAG 
TGGTCTCTCCTGGGTCCTGTACAACCTCGCGAGGTACCCAGAATACCAGGAGCACTGCCGACAGGAGGTGCA 
AGAGCTCCTGAAGAACGGTGATCCTAAAGAGATTGAATGGGATGACCTGGCCCAGTTGCCCTTCCTGACCAT 
GTGCCTGAAGGAGAGCCTGCGGCTGCATTCCCCAGTCTCCAGGATCCACCGCTGCTGCCCCCAGGACGGGGT 
GCTCCCGGATGGCCGGGTCATCCCCAAAGGTAACACTTGCACCATCAGCATCTTTGGGATCCATCACAACCC 
TTCAGTCTGGCCGGACCCGGAGGTGTATGACCCCTTTCGCTTCGACCCAGAAAATCTCCAGAAGACATCACC 
TCTGGCTTTTATTCCCTTCTCAGCAGTGCCCAGGAACTGCATCGGCCAGACGTTCGCCATGGCTGAGATGAA 
GGTGGTCCTGGCGCTCACGCTGCTGCGCTTCCGCGTCCTGCCGGACCACGCGGAGCCCCGCAGGAAGCTGGA 
GCTGATCGTGCGCGCGGAGGATGGACTTTGGCTACGGGTGGAGCCCCTGAGCGCGGATCTGCAGTGACCCAC 
CACT . 



In a search of public sequence databases, the NOV49 nucleic acid sequence, located on 
chromosome 19, has 1320 of 1 584 bases (83%) identical to a gb:GENBANK- 
ID:HUMLB4OH|acc:D26480.1 mRNA from Homo sapiens (Human mRNA for leukotriene 
B4 omega-hydroxylase, complete cds) (E = 9.7e ~ 237 ). 

The disclosed NOV49 polypeptide (SEQ ID NO: 1 86) encoded by SEQ ID NO: 1 85 has 
525 amino acid residues and is presented in Table 49B using the one-letter amino acid code. 
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Signal P, Psort and/or Hydropathy results predict that NOV49 has a signal peptide and is 
likely to be localized extracellularly with a certainty of 0.8200. Alternatively, NOV49 may 
also localize to the lysosome (lumen) with a certainty of 0.4520, to the microbody 
(peroxisome) with a certainty of 0.161 1, or to the endoplasmic reticulum (membrane) with a 
5 certainty of 0.1000. The most likely cleavage sit for NOV49 is between positions 36 and 37: 
VLA-WT. 



Table 49B. Encoded NOV49 protein sequence (SEQ ID NO:186). 

MSLLSLSWLGLGPVAASPWLLLLLVGASWLLAR^ 

DMRLMEDLGHYFRDVQLWWLGSFYPVLHLVHPTFTAPVLQASAAVALKDMSFYGFLKPWLGPDGLLISAGDK 
WRWHRHLLTPAFHFKILKPYVKIFNESTNIMHAKWQRLALEGSVRLEMFEHISLMTLDSLQKCIFSFDSNCQ 
EKPSEYIDAILELSALSLKRHQHIFLLTDFLYFLTPNGRRFCRACDIVHNFTDAVIQERRRTLTSQGVDDFL 
QAKAKSKTLDFIDVLLLAKDENGKKLSDENIRAEADTFMSGGHDTSASGLSWVLYNLARYPEYQEHCRQEVQ 
ELLKNGDPKEIEWDDLAQLPFLTMCLKESLRLHSPVSRIHRCCPQDGVLPDGRVIPKGNTCTISIFGIHHNP 
SWPDPEVYDPFRFDPENLQKTSPLAFIPFSAVPRNCIGQTFAMAEMKWIoALTLLRFRVLPDHAEPRRKLE 
L I VRAEDGLWLRVE PLSADLQ 

A search of sequence databases reveals that the NOV49 amino acid sequence has 397 
10 of 521 amino acid residues (76%) identical to, and 444 of 521 amino acid residues (85%) 

similar to, the 520 amino acid residue ptnr:SWISSPROT-ACC:Q08477 protein from Homo 
sapiens (Human) (CYTOCHROME P450 4F3 (EC 1.14.13.30) (CYPIVF3) (Leukotriene-B4 
Omega- Hydroxylase) (Leukotriene-B4 20-Monooxygenase) (Cytochrome P450- LTB- 
Omega))(E = 4.3e" 219 ). 

15 NOV49 is predicted to be expressed in at least Prostate. This information was derived 

by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or 
RACE sources. 

In addition, the sequence is predicted to be expressed in Bone Marrow, Peripheral 
20 Blood, Brain, Colon, Coronary Artery, Hippocampus, Kidney, Kidney Cortex, Liver, Lyniph 
node, Pituitary Gland, and Prostate because of the expression pattern of (GENBANK-ID: 
gb:GENBANK-ID:HUMLB4OH|acc:D26480.1) a closely related Human mRNA for 
leukotriene B4 omega-hydroxylase, complete cds homolog. 

NOV49 also has homology to the amino acid sequences shown in the BLASTP data 
25 listed in Table 49C. 
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Table 49C. BLAST results for NOV49 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 17456512 | ref |XP_ 
065069. 1| 
(XM_065069) 


similar to 
CYTOCHROME P4 50 
4F6 (CYPIVF6 ) (H. 
sapiens) [Homo 
sapiens] 


491 


452/495 
(91%) 


455/495 
(91%) 


0.0 


gi | 14 767705 | ref | XP_ 
029072. 1| 
(XM_029072) 


cytochrome P450 , 
subfamily IVF, 
polypeptide 3 

r T-T/~\m/~i c* ^ t~\ V ^ n c? 1 
L xiCJi/IC tzcipj. Crlii? J 


520 


399/521 
(76%) 


446/521 
(85%) 


0 . 0 


gi | 4503241 | ref | NP_0 
00887. 1| 
(NM_000896) 


cy t ochrome P4 5 0 , 

subfamily IVF, 

polypeptide 

3 ; leukotriene B4 

omega 

hydroxylase ; 
leukotriene -B4 
2 0 -monooxygenase ; 
cytochrome P4 5 0 - 
LTB - omega [ Homo 
sapiens] 


520 


397/521 - 
(76%) 


444/521 
(85%) 


0.0 


gi | 2 99773 7 |gb | AAC08 
589. 1| (AF054821) 


cytochrome P-4 50 
[Homo sapiens] 


520 


395/521 
(75%) 


440/521 
(83%) 


0.0 


gi | 1706095 | sp | P5187 
. 1 | CPF6_RAT 


CYTOCHROME P4 50 
4F6 (CYPIVF6) 


537 


392/521 
. (75%) , 


443/521 
(84%) 


0.0 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 49D. In the ClustalW alignment of the NOV49 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



1 0 Table 49D. ClustalW Analysis of NOV49 

1) Novel NOV4 9 (SEQ ID NO: 186) 

2) gi | 17456512 | ref |XP_065069 . 1 | (XM_065069) similar to CYTOCHROME P450 4F6 
(CYPIVF6) (H. sapiens) [Homo sapiens] (SEQ ID NO: 547) 

3) gi | 14767705 | ref |XP_029072 .1 | (XM_029072) cytochrome P450, subfamily IVF, 
15 polypeptide 3 [Homo sapiens] (SEQ ID NO: 54 8) 

4) gi | 4503241 | ref | NP_000887 . 1 | (NM_000896) cytochrome P450, subfamily IVF, 
polypeptide 3 ; leukotriene B4 omega hydroxylase; leukotriene -B4 2 0 -monooxygenase ; 
cytochrome P450- LTB -omega [Homo sapiens] (SEQ ID NO: 54 9) 

5) gi|2997737|gb|AAC08589.l| (AF054821) cytochrome P-450 [Homo sapiens] (SEQ ID 
20 NO:550) 

6) gi|l706095|sp|P5187l|CPF6_RAT CYTOCHROME P450 4F6 (CYPIVF6) (SEQ ID NO:551) 
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NOV49 

gi | 17456512 | 
gi | 14767705 j 



10 



20 



....|....|....|. 



30 



40 



50 



60 



SPWLLLLLVGASWLLARraLAWTYgFYDNCjSjRLffiCFSQPPKRN 



eg 



QLSLSHLGL^P-iAASPWLLLLLVGASWLLARILAWTYTFYDNCCRLRCFPQPPKRNWF 



60 
1 

60 
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10 



15 



20 



gi 
gi 
gi 



4503241 | 
2997737 | 
1706095 | 



N0V4 9 

gi | 17456512 | 
gi j 14767705 | 
gi|450324l| 
gi | 2997737 j 
gi | 1706095 | 



NOV49 

gi | 17456512 | 
gi | 14767705 | 
gi|450324l| 
gi | 2997737 j 
gi | 1706095 | 




121 emt: 
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30 



NOV4 9 

gi 1 17456512 | 
gi | 14767705 j 
gi j 4503241 | 
gi|2997737| 
gi j 1706095 | 



181 
114 
180 
180 
180 
180 



I 



190 
..[.. 



I 



200 
. . I . . 



I 



210 
. . I . . 



220 



230 
. . I . . 



LAfflEGSfflRL^FEHISLMTLDSLQKCMFSFDSlCQEKPSEYlgAILELSAL 

laBegs3ri^mfehislmtldslqkcHfsfds|^ 

laHegsarldmfehislmtldslqkcBfsfdsBcqekpseyiaailelsal 

la Be gsarldmfehislmtldslqkcBfsfdsBcqekpseyiaailelsal 

la^gBarldmfehislmtldslqkcBfsfdsBcqekpseyiaailelsal 

l^BBgsarijDmfehislmtldslqkcBfsfdsBcqe^BseyiaailelsBl 




250 



260 



270 



35 



40 



45 



50 



55 



60 



65 



70 



NOV4 9 

gi I 17456512 | 
gi|l4767705| 
gi j 4503241 | 
gi (2997737 j 
gi|l706095 j 



NOV49 

gi 
gi 
gi 
gi 
gi 



17456512 | 
14767705 | 
4503241 | 
2997737 | 
1706095 j 



NOV4 9 

gi | 17456512 | 
gi | 14767705 j 
gi | 4503241 | 
gi | 2997737 j 
gi j 1706095 j 



NOV4 9 

gi | 17456512 | 
gi | 14767705 | 
gi [4503241 | 
gi|2997737 | 
gi | 1706095 | 



NOV4 9 




467 



490 
..|.. 



500 
..|.. 



510 



520 



530 



540 



504 



430 



gi | 17456512 | 
gi | 14767705 j 
gi | 4503241 | 
gi | 2997737 j 
gi | 1706095 j 



411 GSRRDGAGMGVLGTVRVPTPS POj 
466 
466 
466 
466 



NOV49 


505 


gi 


17456512 | 


471 


gi 


14767705 | 


504 


gi 


4503241 | 


504 


gi 


2997737 | 


504 


gi 


1706095] 


504 




525 
491 
520 
520 
520 

7TSQLPWDLLAHPPTS 537 



Table 49F lists the domain description from DOMAIN analysis results against NOV49. 
This indicates that the NOV49 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 49F Domain Analysis of NOV49 

gnl | Pf am | pf am00067 , p4 50, Cytochrome P450. Cytochrome P450s are 
involved in the oxidative degradation of various compounds. 
Particularly well known for their role in the degradation of 
environmental toxins and mutagens. Structure is mostly alpha, and 
binds a heme cof actor. (SEQ ID NO: 84 8) 
CD-Length = 445 residues, 98.9% aligned 
Score = 308 bits (790) , Expect = 4e-85 



NOV4 9 : 
Sbjct : 
NOV4 9: 
Sbjct : 
NOV4 9 : 
Sbjct : 
NOV4 9 : 
Sbjct : 
NOV4 9: 
Sbjct : 
NOV4 9: 
Sbjct: 
NOV4 9: 
Sbjct: 
NOV4 9 : 
Sbjct: 



5 5 PKRNCFLGHLSLVRGNEEDMRLMEDLGHY FRDVQLWWLGSFYPVLHLVHPTFTAPVLQAS 114 

I + 1 + 1 + + + +1 + I +11 I 1+ + I II 

4 PPPLPL I GNLLQLGRG - - P I HS LTELRKKYGPVFTLYLGP - RPWWTGPE AVKE VLI DK 60 

115 AAVALKDMSFYGFLKPWLGPDGLLISAGDKWRWHRHLLTPAFHFKILKPYVKIFNESTNI 174 

I I I I I I l + l I I +11 I III I I + I 1 + 

61 GEEFAGRGDFPVF- - - PWLG- YGILFSNGPRWRQLRRLLTLRF- FGMGKRS-KLEERIQEE 115 

175 MHAKWQRLALEGSVRLEMFEHI SLMTLDSLQKCI FSFDSNCQEKPSE - YIDAILELSALS 233 



412 



337 



I I I 



+++ | ++ 



I 



I I 



116 T^DLVERLRKEQGSPIDITELLAPAPLNVICSLLFGVRFDYEDPEFLKLIDKLNELFFLV 175 

234 LKRHQHIFLLTDFLYFLTPNGRRFCRACDIVHNFTDAVIQERRRTLTSQGVDDFLQAKAK 2 93 

I I I I +1 + 1+ +1 + + + I +1 + 1 I I I I 

176 S PWGQ LLDFFRYLPGSHRKAFKAAKDLKDYLDKLI EERRETLE P 219 

2 94 SKTLDFIDVLLL-AKDENGKKLSDENIRAEADTFMSGGHDTSASGLSWVLYNIiARYPEYQ 352 

ll+l 11+ II I I + 1 + 11 ++I + I II++I III II II++II I 

220 GDPRDFLDSLLI EAKREGGSELTDEELKATVLDLLFAGTDTTSSTLSWALYLLAKHPEVQ 279 

353 EHCRQEVQELLKNGDP KE I E WDDLAQL P FLTMCLKES LRLHS PV - SR I HRCCPQDGVLPD 411 

1+1+ I++ I + +111 +1+1 +11+1111 I +1 +1+1 

2 80 AKLREEIDEVI--GRDRSPTYDDRANMPYLDAVIKETLRLHPWPLLLPRVATEDTEI-D 336 



GRVIPKGNTCTISIFGIHHNPSVWPDPEVYDPFRFDPENLQKTSPLAFIPFSAVPRNCIG 

I +1111 + + + + +1 +1 l + l + ll +11 II II + I l + l I I I I I l + l 

GYLIPKGTLVIVNLYSLHRDPKVFPNPEEFDPERFLDENGKFKKSYAFLPFGAGPRNCLG 



4 72 QTFAMAEMKWLALTLLRFRV - LPDHAE PRRKLELI VRAEDGLWLRV 517 

+ I 1+ + I I I I I + I 
3 97 ERLARMELFLFLATLLQRFELELVPPGDIPLTPKPLGLPSKPPLYQL 443 



471 



396 
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Leukotrienes are a group of bioactive compounds that play important roles in such 
processes as inflammation. Kikuta et al. (1993) (J. Biol. Chem. 268: 9376-9380) isolated a 
cDNA for the human leukotriene B4 omega-hydroxylase (LTB4H), an enzyme which 
catalyzes the omega-hydroxylation of leukotriene B4. Their cDNA encoded a 520-amino acid 
5 protein with a predicted molecular weight of 59,805 Da. The deduced amino acid sequence 
contains a cysteine in the conserved heme-binding domain near the C-terminus, which is a 
characteristic feature of the cytochrome P450 superfamily; the protein shares 31 to 44% 
similarity with CYP4A, CYP4B, and CYP4C. Kikuta et al. (1993) (J. Biol. Chem. 268: 9376- 
9380) detected transcript from the LTB4H gene in polymorphonuclear leukocytes and 

10 leukocytes. Kikuta et al. (1998) (DNA Cell Biol. 17: 221-230) determined that the CYP4F3 
gene contains 13 exons and spans approximately 22.2 kb. By fluorescence in situ 
hybridization, they mapped the CYP4F3 gene to 19pl3.2. The cytochrome P450 enzymes 
usually act as terminal oxidases in multicomponent electron transfer chains, called P450- 
containing monooxygenase systems. P450-containing monooxygenase systems primarily fall 

15 into two major classes: bacterial/mitochondrial (type I), and microsomal (type II). All P450 
enzymes can be categorised into two main groups, the so-called B- and E-classes: P450 
proteins of prokaryotic 3-component systems and fungal P450nor (CYP55) belong to the B- 
class; all other known P450 proteins from distinct systems are of the E-class. This family 
contains a number of subtypes of both B and E classes. 

20 The disclosed NOV49 nucleic acid of the invention encoding a Leukotriene-B4 

Omega- Hydroxylase-like protein includes the nucleic acid whose sequence is provided in 
Table 49A or a fragment thereof. The invention also includes a mutant or variant nucleic acid 
any of whose bases may be changed from the corresponding base shown in Table 49A while 
still encoding a protein that maintains its Leukotriene-B4 Omega- Hydroxylase -like activities 

25 and physiological functions, or a fragment of such a nucleic acid. The invention further 

includes nucleic acids whose sequences are complementary to those just described, including 
nucleic acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 

30 of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
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variant nucleic acids, and their complements, up to about 17 percent of the bases may be so 
changed. 

The disclosed NOV49 protein of the invention includes the Leukotriene-B4 Omega- 
Hydroxyiase-like protein whose sequence is provided in Table 49B. The invention also 
5 includes a mutant or variant protein any of whose residues may be changed from the 

corresponding residue shown in Table 49B while still encoding a protein that maintains its 
Leukotriene-B4 Omega- Hydroxy lase-1 ike activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 25 percent of the 
residues may be so changed. 

10 The invention further encompasses antibodies and antibody fragments, such as F a b or 

(Fab)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Leukotriene-B4 Omega- 
Hydroxylase -like protein (NOV49) is a member of a "Leukotriene-B4 Omega- Hydroxylase 
family". Therefore, the NOV49 nucleic acids and proteins identified here may be useful in 

15 potential therapeutic applications implicated in (but not limited to) various pathologies and 

disorders as indicated below. The potential therapeutic applications for this invention include, 
but are not limited to: protein therapeutic, small molecule drug target, antibody target 
(therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or prognostic 
marker, gene therapy (gene delivery/gene ablation), research tools, tissue regeneration in vivo 

20 and in vitro of all tissues and cell types composing (but not limited to) those defined here. 

The NOV49 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Atherosclerosis, Aneurysm, Hypertension, 
Fibromuscular dysplasia, Stroke, Scleroderma, Obesity, Transplantation, Myocardial 
infarction, Embolism, Cardiovascular disorders, Bypass surgery, Osteoporosis, 

25 Hypercalceimia, Arthritis, Ankylosing spondylitis, Scoliosis, Von Hippel-Lindau (VHL) 
syndrome , Alzheimer's disease, Stroke, Tuberous sclerosis, hypercalceimia, Parkinson's 
disease, Huntington's disease, Cerebral palsy, Epilepsy, Lesch-Nyhan syndrome, Multiple 
sclerosis, Ataxia-telangiectasia, Leukodystrophies, Behavioral disorders, Addiction, Anxiety, 
Pain, Neuroprotection, Diabetes, Autoimmune disease, Renal artery stenosis, Interstitial 

30 nephritis, Glomerulonephritis, Polycystic kidney disease, Systemic lupus erythematosus, 
Renal tubular acidosis, IgA nephropathy, Hypercalceimia, Lesch-Nyhan syndrome, and/or 
other diseases and pathologies. 

NOV49 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV49 substances for use in 
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therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV49 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV50 

NOV50 includes three novel Protein Arginine N-Methyltransferase 2-like proteins 
disclosed below. The disclosed sequences have been named NOV50a and NOV50b. 
NOV50a 

A disclosed NOVSOa nucleic acid of 1 196 nucleotides (also referred to as CG56771- 
01) encoding a Protein Arginine N-Methyltransferase 2-like protein is shown in Table 50A. 
An open reading frame was identified beginning with a ATG initiation codon at nucleotides 
13-15 and ending with a TGA codon at nucleotides 1068-1070. The start and stop codons are 
shown in bold in Table 50 A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 50 A. NOVSOa nucleotide sequence (SEQ ID NO: 187). 

AGAGCGGCCAAGATGTCGCAGCCCAAGAAAAGAAAGCTTGAGTCGGGGGGCGGCGCCGAAGGAGGGGAGGGA 
ACTGAAGAGGAAGATGGCGCGGAGCGGGAGGCGGCCCTGGAGCGACCCCGGAGGACTAAGCGGGAACGGGAC 
CAGCTGTACTACGAGTGCTACTCGGACGTTTCGGTCCACGAGGAGATGATCGCGGACCGCGTCCGCACCGAT 
GCCTACCGCCTGGGTATCCTTCGGAACTGGGCAGCACTGCGAGGCAAGACGGTACTGGACGTGGGCGCGGGC 
ACCGGCATTCTGAGCATCTTCTGTGCCCAGGCCGGGGCCCGGCGCGTGTACGCGGTAGAGGCCAGCGCCATC 
T GG C AAC AGG C C CGGG AGGTGGTGCGGTT C AAC GGGCTGGAGG AC CGGGTG C ACGT CCTGC CGGG AC C AGTG 
GAGACTGTAGAGTTGCCGGAACAGGTGGATGCCATCGTGAGCGAGTGGATGGGCTACGGACTCCTGCACGAG 
TCCATGCTGAGCTCCGTCCTCCACGCGCGAACCAAGTGGCTGAAGGAGGGCGGTCTTCTCCTGCCGGCCTCC 
GCCGAGCTCTTCATAGCCCCCATCAGCGACCAGATGCTGGAATGGCGCCTGGGCTTCTGGAGCCAGGTGAAG 
CAGCACTATGGTGTGGACATGAGCTGCCTGGAGGGCTTCGCCACGCGCTGTCTCATGGGCTAGAGCTCTCCC 
GCGCCGGCTTGGAGCAGGAGCTGGAGGCCGGAGTGGGCGGGCGCTTCCGCTGCAGCTGCTATGGCTCGGCGC 
CCATGCATGGCTTTGCCATCTGGTTCCAGGTGACCTTCCCTGGAGGGGAGTCGGAGAAACCCCTGGTGCTGT 
CCACCTCGCCTTTTCACCCGGCCACTCACTGGAAACAGGCGCTCCTCTACCTGAACGAGCCGGTGCAAGTGG 
AGCAAGACACGGACGTTTCAGGAGAGATCACGCTGCTGCCCTCCCGGGACAACCCCCGTCGCCTGCGCGTGC 
TGCTGCGCTACAAAGTGGGAGACCAGGAGGAGAAGACCAAAGACTTTGCCATGGAGGACTGA GCGTOGCC1T 
TTCTCCCAGCTACCTCCCAAAGCAGCCTGACCTGCGTGGGAGAGGCGCCACTCGGAGATCGTTGTGCAGGGA 
TTGTCCGGCGAGGACGTGCTGGCCCGGCCGCAGCGCTTTGCTCA 



In a search of public sequence databases, the NOV50a nucleic acid sequence, located 
on chromosome 19, has 681 of 719 bases (94%) identical to a gb:GENBANK- 
ID:AK001421|acc:AK001421.1 mRNA from Homo sapiens (cDNA FLJ10559 fis, clone 
NT2RP2002618, weakly similar to Protein Arginine N-Methyltransferase 2 (EC 2.1 .1 .-)) (E = 



The disclosed NOV50a polypeptide (SEQ ID NO: 188) encoded by SEQ ID NO: 187 
has 375 amino acid residues and is presented in Table 50B using the one-letter amino acid 
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code. Signal P, Psort and/or Hydropathy results predict that NOV50a has no signal peptide 
and is likely to be localized to the nucleus with a certainty of 0.7000. Alternatively, NOV50a 
may also localize to the microbody (peroxisome) with a certainty of 0.2641, to the 
mitochondrial matrix space with a certainty of 0.1000, or to the lysosome (lumen) with a 
certainty of 0.1000. 



Table 50B. Encoded NOVSOa protein sequence (SEQ ID NO:188). 

MSQPKKRKLESGGGAEGGEGTEEEDGAEREAALERPRRTKRERDQLYYECYSDVSVHEEMIADRVRTDAYRL 

GILRNWAALRGKTVLDVGAGTGILSIFCAQAGARRVYAVEASAIWQQAREWRFNGLEDRVHVLPGPVETVE 

LPEQVDAIVSEWMGYGLLHESMLSSVLHARTKWLKEGGLLLPASAELFIAPISDQMLEWRLGFWSQVKQHYG 

VDMSCLEGFATRCLMGHSEIWQGLSGEDVLARPQRFAQLELSRAGLEQELEAGVGGRFRCSCYGSAPMHGF 

AIWFQVTFPGGESEKPLVLSTSPFHPATHWKQALLYLNEPVQVEQDTDVSGEITLLPSRDNPRRLRVLLRYK 
VGDQEEKTKD FAMED 

A search of sequence databases reveals that the NOV50a amino acid sequence has 316 
of 316 amino acid residues (100%) identical to, and 316 of 316 amino acid residues (100%) 
similar to, the 316 amino acid residue p tnr : S PTREMB L- ACC : Q9N VR8 protein from Homo 
sapiens (Human) (CDNA FLJ10559 FIS, Clone NT2RP2002618, Weakly Similar To Protein 
Arginine N-Methyl transferase 2 (EC 2.1.1.-)) (E = 1.7e~ 169 ). 

NOVSOa is predicted to be expressed in at least lung, bronchus, kidney. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 

NOVSOb 

In the present invention, the target sequence identified previously, NOV50a, was 
subjected to the exon linking process to confirm the sequence. PCR primers were designed by 
starting at the most upstream sequence available, for the forward primer, and at the most 
downstream sequence available for the reverse primer. In each case, the sequence was 
examined, walking inward from the respective termini toward the coding sequence, until a 
suitable sequence that is either unique or highly selective was encountered, or, in the case of 
the reverse primer, until the stop codon was reached. Such primers were designed based on in 
silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 
sequence of the target sequence, or by translated homology of the predicted exons to closely 
related human sequences sequences from other species. These primers were then employed in 
PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 
marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 
brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 
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lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV50b. This differs from the previously 
identified sequence (NOV50a) at aminoacid position 1 5 A->G. 

A disclosed NOV50b nucleic acid of 1 165 nucleotides (also referred to as CG56771- 
02) encoding a Protein Arginine N-Methyltransferase 2-like protein is shown in Table 50C. 
An open reading frame was identified beginning with a ATG initiation codon at nucleotides 4- 
6 and ending with a TGA codon at nucleotides 1 129-1 131 . The start and stop codons are 
shown in bold in Table 50C, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 50C. NOVSOb nucleotide sequence (SEQ ID NO:189). 

AAGATGTCGCAGCCCAAGAAAAGAAAGCTTGAGTCGGGGGGCGGCGGCGAAGGAGGGGAGGGAACTGAAGAG 
GAAGATGGCGCGGAGCGGGAGGCGGCCCTGGAGCGACCCCGGAGGACTAAGCGGGAACGGGACCAGCTGTAC 
TACGAGTGCTACTCGGACGTTTCGGTCCACGAGGAGATGATCGCGGACCGCGTCCGCACCGATGCCTACCGC 
CTGGGTATCCTTCGGAACTGGGCAGCACTGCGAGGCAAGACGGTACTGGACGTGGGCGCGGGCACCGGCATT 
CTGAGCATCTTCTGTGCCCAGGCCGGGGCCCGGCGCGTGTACGCGGTAGAGGCCAGCGCCATCTGGCAACAG 
GCCCGGGAGGTGGTGCGGTTCAACGGGCTGGAGGACCGGGTGCACGTCCTGCCGGGACCAGTGGAGACTGTA 
GAGTTGCCGGAACAGGTGGATGCCATCGTGAGCGAGTGGATGGGCTACGGACTCCTGCACGAGTCCATGCTG 
AGCTCCGTCCTCCACGCGCGAACCAAGTGGCTGAAGGAGGGCGGTCTTCTCCTGCCGGCCTCCGCCGAGCTC 
TTCATAGCCCCCATCAGCGACCAGATGCTGGAATGGCGCCTGGGCTTCTGGAGCCAGGTGAAGCAGCACTAT 
GGTGTGGACATGAGCTGCCTGGAGGGCTTCGCCACGCGCTGTCTCATGGGCCACTCGGAGATCGTTGTGCAG 
GGATTGTCCGGCGAGGACGTGCTGGCCCGGCCGCAGCGCTTTGCTCAGCTAGAGCTCTCCCGCGCCGGCTTG 
GAGCAGGAGCTGGAGGCCGGAGTGGGCGGGCGCTTCCGCTGCAGCTGCTATGGCTCGGCGCCCATGCATGGC 
TTTGCCATCTGGTTCCAGGTGACCTTCCCTGGAGGGGAGTCGGAGAAACCCCTGGTGCTGTCCACCTCGCCT 
TTTCACCCGGCCACTCACTGGAAACAGGCGCTCCTCTACCTGAACGAGCCGGTGCAAGTGGAGCAAGACACG 
GACGTTTCAGGAGAGATCACGCTGCTGCCCTCCCGGGACAACCCCCGTCGCCTGCGCGTGCTGCTGCGCTAC 
AAAGTGGGAG AC C AGGAGG AGAAGAC C AAAGAC TT TGC CATGG AGG AC TG AGCGTTG CC TTT TCC C CCAG CT 
ACCTCCCAAAGCA ~ ; 



In a search of public sequence databases, the NOV50b nucleic acid sequence, located 
on chromosome 19, has 1090 of 1091 bases (99%) identical to a gb:GENBANK- 
ID:AK001421|acc:AK001421.1 mRNA from Homo sapiens (cDNA FLJ10559 fis, clone 
NT2RP2002618, weakly similar to Protein Arginine N-Methyltransferase 2 (EC 2.1.1.-)) (E = 

2.2e" 240 ). 

The disclosed NOV50b polypeptide (SEQ ID NO: 190) encoded by SEQ ID NO: 189 
has 375 amino acid residues and is presented in Table 50D using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV50b has no signal peptide 

436 



.IS ii'Sf li 7" ir" !i ft 1 IP - fa 1! "5 t Or 

• ' • " 



and is likely to be localized to the nucleus with a certainty of 0.7000. Alternatively, NOV50b 
may also localize to the microbody (peroxisome) with a certainty of 0.2766, to the 
mitochondrial matrix space with a certainty of 0.1000, or to the lysosome (lumen) with a 
certainty of 0.1000. 

5 



Table SOD. Encoded NOVSOb protein sequence (SEQ ID NO:190). 

MSQPKKRKLESGGGGEGGEGTEEEDGAEREAALERPRRTKRERDQLYYECYSDVSVHEEMIADRVRTDAYRL 
GILRNWAALRGKTVLDVGAGTGILSIFCAQAGARRVYAVEASAIWQQAREVVRFNGLEDRVHVLPGPVETVE 
LPEQVDAIVSEWMGYGLLHESMLSSVLHARTKWLKEGGLLLPASAELFIAPISDQMLEWRLGFWSQVKQHYG 
VDMSCLEGFATRCLMGHSEIWQGLSGEDVLARPQRFAQLELSRAGLEQELEAGVGGRFRCSCYGSAPMHGF 
AIWFQVTFPGGESEKPLVLSTSPFHPATHWKQALLYLNEPVQVEQDTDVSGEITLLPSRDNPRRLRVLLRYK 
VGDQEEKTKD FAMED 



A search of sequence databases reveals that the NOVSOb amino acid sequence has 316 
of 316 amino acid residues (100%) identical to, and 316 of 316 amino acid residues (100%) 
similar to, the 316 amino acid residue ptnr:TREMBLNEW-ACC:AAH02729 protein from 
1 0 Homo sapiens (Human) (Hypothetical 35.2 Kda Protein) (E = 1.8e" 169 ). 

NOV50b is predicted to be expressed in at least lung, bronchus, kidney. . 

NOV50a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 50E 



Table 50E. BLAST results for NOV50a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 15822 6 52 | gb | AAK8 
5733. l| (AY043278) 


arginine 

methyl trans f erase 
6 [Homo sapiens] 


375 


374/375 
(99%) 


374/375 
(99%) 


0.0 


gi | 8922515 | ref | NP_0 
60607 . 1 | 
<NM_018137) 


hypothetical 
protein FLJ10559 
[Homo sapiens] 


316 


316/316 
(100%) 


316/316 
(100%) 


0 . 0 


gi| 9293956 |dbj | BABO 
1859. l| (AP000383) 


protein arginine 
N- 

methyl transferase 
-like 

protein [Arabidops 
is thaliana] 


399 


148/317 
(46%) 


193/317 
(60%) 


5e-66 


gi | 15231011 | ref | NP_ 
188637. 1| 
(NC_003074) 


arginine 

methyl transferase 
, putative 
[Arabidopsis 
thaliana] 


409 


143/310 
(46%) 


185/310 
(59%) 


le-65 


gi | 1523 3 6 06 | ref |NP_ 
194680 . 1 | 
(NC_003075) 


arginine 

methyl transferase 
(paml) 

[Arabidopsis 


390 


135/365 
(36%) 


201/365 
(54%) 


4e-58 



15 

The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 50F. In the ClustalW alignment of the NOV50 protein, as 
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well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
5 function. 
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Table 50F. ClustalW Analysis of NOV50 



10 



15 



20 



1) Novel NOV5 0a (SEQ ID NO: 18 8) 

2) Novel NOV5 0b (SEQ ID NO: 190) 

3) gi | 15822652 |gb | AAK85733 . 1 | (AY043278) arginine methyltransf erase 6 [Homo 
sapiens] (SEQ ID NO: 552) 

4) gi | 8922515 | ref |NP_0G0607 . 1 | (NM_018137) hypothetical protein FLJ10559 [Homo 
sapiens] (SEQ ID NO: 553) 

5) gi | 9293956 | dbj |BAB01859 . 1 | (AP000383) protein arginine N-methyl trans f erase- like 
protein [Arabidopsis thaliana] (SEQ ID NO: 554) 

6) gi | 15231011 | ref |NP_188637 . 1 | (NC_003074) arginine methyltransf erase , putative 
[Arabidopsis thaliana] (SEQ ID NO: 555) 

7) gi | 15233606 | ref |NP_194680 . 1 | (NC_003075) arginine methyltransf erase (paml) 
[Arabidopsis (SEQ ID NO: 556) 



25 



30 



35 



40 



45 



50 



55 



60 



NOV50a 

NOV50b 

gi | 15822652 | 

gi|8922515| 

gij 9293956 | 

gi j 15231011 | 

gi|l5233606| 



NOVSOa 
NOV5 0b 
gi | 15822652 | 
gi j 8922515 | 
gi | 9293956 j 
gi | 15231011 | 
gi I 15233606 j 



NOV50a 

NOVSOb 

gi | 15822652 | 

gi | 8922515 | 

gi j 9293956 j 

gi | 15231011] 

gi|l5233606| 



10 



20 
- I-- 



30 
•I 



-msqp 

-Ms|p 




40 

|....|.. 
ESGGGAE 
ESGGGGE 
ESGGGGE 





50 






60 


GG 


EG- 


..|.. 
-TEEE 




-1 

GA 2 7 


GG 


EG- 


-TEEE 


D 


GA 2 7 


GG 


EG- 


-TEEE 


D 


GA 2 7 



mqsggdfsngfhgdhhreleledkqgpslssfgra^gshagardp: 
mqsggdfsngfhgdhhreleledkqgpslssfgJa^^shagardp: 
mtknsnhdenefi sfepnqntkir- fedadgdevaegsgv, 




LRVSgQL 6 0 
[LANVLRVSgQL 6 0 
ET--PQDESMF 51 



70 

■ • - J - r- - I - - - - I - ■ 

2 8 EREAAggR PRRTK RER 

2 8 EREAA^RPRR|jK R&] 

2 8 EREAA^BrPRrSIk R^R 



100 



110 



61 GEHKSTOTSES^PPPCTgFgVAgHHSi 

6 1 gehks{3|tses^pppct&f3vaB^hs| 

5 2 DAGESADTAE VtllDD - - TTS AD' 




120 



^Ag 
fAAj| 

{^aEr 

iQHQj|L^E 
,Qi|QSLlE 
yjjKFL^|D 




130 
• . | ■ • 



140 
I | • . 



150 

._J_. 



160 



170 



180 



NOVSOa 


85 


T 


NOV50b 


85 


T 


gi | 15822652 | 


85 


T 


gi | 8922515 | 


26 


T 


gi | 9293956 | 


121 


V 


gi j 15231011| 


121 


V 


gi|l5233606| 


110 


I 



LDVGAGTGI LS I FCAQAGAgRVYAVEASgl 
VLDVGAGTGILS I FCAQAGAgRVYAVEAsHl 
VLDVGAGTGI LS I FCAQAGAjgRVYAVEAsHl 
VLDVGAGTGILS I FCAQAGA|RVYAVEASgI 

aDVGfflGTG I LS I FCAQAGaHrVYAVIAsJi 
. m dvgBgtgilsifcaqagaBrvyavSasBi 
vldvgagtgils3fcaSagaSSvyaveBs" 



ELS I FCAQAGAgRVYAVEASgl 



[LSI FCAQAGAjgRVYAVgASgl 

llsi fcaqagaHrvyav§asBi 
n ls3fcaSagaS5vyaveSsh 




190 
I 



200 



SEWMGYgLL 
SEWMGYgLL 
SEWMGYgLL 
SEWMGYgLL 

sewmgySll 
sewmgyBll 
sewmgySll 
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250 



260 



270 



280 



290 



300 



NOV5 0a 

NOV50b 

gi | 15822652 | 

gi j 8922515 | 

gij 9293956 | 

gi | 15231011 | 

gi|l5233606 j 



NOVSOa 
NOV5 Ob 
gi|l5822652 | 
gij 8922515 | 
gi | 9293956 | 
gi|l523101l| 
gi|l5233606| 



2 03 WRXiG 
2 03 w||!g 





RFAQpELg- 

'RFAQLrEI 
RFAQgEI 
•RFAQggb 

_ ^ _ KHf^CKTIK 2 94 
iT||DQNQ^TDSyLLKTMpig- - - 2 77 

360 



!c^CYGgAPj 
!CSCYG S API 
.C^CYGSAP] 
FSGPASSPAjM|SET j IASGSS 
FSGPASSPA|lSiySET@IASGSS 



2 95 IQELDSVTARYKFNSMM^PMHGFAFWFg 1 _ ^ ^ 

2 77 KMSSGt^SFTAP^LVAQRtoYfgjALVA: 





NOV50a 

NOV50b 

gi | 15822652 | 

gi | 8922515 | 

gi j 9293956 | 

gi|l523101l| 

gi | 15233606 j 



PGGE;SEK 
PGGESE: 

Ipggesek 

|PGGpSE: 
VNQKpR' 
3 55 VNQiSr' 

311 [Jtmchkllg 



296 
296 
296 
237 
319 




400 

^tdjtsSeJ 
^qvJe@s|; 

. -NWSYI 

|Tic|GETilST^ls| 



420 




NOVSOa 

NOVSOb 

gi | 15822652 | 

gi | 8922515 | 

gi | 9293956 j 

gij 15231011 | 

gijl5233606| 




430 440 

iKV.GDgEEgTKDFAMED 
K^GD 1EE0TKDFAMED 

(K&GD 
l GD 

JSSAGgSFVK^SVMR 

ISNSASAYIliKN 4 09 

iK|3sgsgNG@HC[2lSRTQHYKMR 3 90 



SEE JTKDFAMED- - 
IE E|gTiD FAMED - - 



375 
375 
375 
316 
399 



Table 49G lists the domain description from DOMAIN analysis results against 
NOV50. This indicates that the NOV50 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 50G Domain Analysis of NOV50 

gnl | Pf am | pfam0l2 09 , Ubie_methyltran, ubiE/COQ5 methyl transferase 
family. (SEQ ID NO: 84 9) 

CD-Length = 237 residues, 63.3% aligned 
Score = 35.0 bits (79), Expect - 0.008 



NOV5 0 : 


40 


Sbjct : 


5 


NOV5 0 : 


91 


Sbj Ct : 


60 


NOV50 : 


148 



KRERDQLYYECYSDVSVHEEMIADRVRTDAYRLGILRNW AALRGKTVLDVG 9 0 

I | ++ | + |+ +++ I II I I III III 

KEEKEQKVHHVFASVAKKYDLM ND VM S FG I HRL WKDH FTM KLMG P KRG KK FLD VA 5 9 

AGTGILSI - FCAQAGAR- RVYAVEASA- I WQQAREWRFNGLEDRVHVLPGPVETVELPE 14 7 

| | | ++ || +| + + + ++ +1+1 I I I I 

GGTGDIAFRLLRHAGTSGKWVLDINENMLKVGKKRAE - EEGKIRIEWLCANAE - - ELPF 116 

QVDAIVSEWMGYGLLHESMLSSVLHARTKWLKEGGLLL 18 5 
+ +1+ + + II + I I I I 1 + 
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Sbjct: 117 EDNTFDLVTISFGIRNFTDYLKVLREAFRVLKPGGQLV 154 

Methyl transfer from S-Adenosyl-L-methionine (SAM) to either nitrogen, oxygen or 
5 carbon atoms is frequently employed in diverse organisms ranging from bacteria to plants and 
mammals. The reaction is catalyzed by methyltransferases (MTases) and modifies DNA, 
RNA, proteins and small molecules like catechol. The catalytic domain of SAM-MTases is of 
the alpha/beta type with a central mixed beta-sheet around which several alpha-helices are 
arranged. Topologically it can be divided into two halves. The first half, formed by betal- 

10 alphaA-beta2-alphaB-beta3-alphaC, is mainly responsible for SAM binding. The second half, 
beta4-alphaD-beta5-alphaE-beta6-beta7, is primarily responsible for catalysis. According to 
the sequential order of these two sites, the SAM-MTases can be divided into three families 
Protein arginine methylation has been implicated in signal transduction, nuclear transport and 
transcription regulation. Protein arginine methyltransferases (PRMTs) mediate the AdoMet- 

1 5 dependent methylation of many proteins, including many RNA binding proteins involved in 
various aspects of RNA processing and/or transport. 

The bulk of methylated arginine residues in eukaryotic cells are found in 
heterogeneous nuclear ribonucleoproteins (hnRNPs), RNA-binding proteins that play essential 
roles in the metabolism of nuclear pre-mRNA. Lin et al. (1 996) identified a rat cDNA 

20 encoding PRMT1 (protein-arginine N-methy transferase 1; EC 2.1.1.23). Recombinant 
PRMT1 methylated histones and hnRNPAl (164017) in vitro. By using a yeast 2-hybrid 
screen to identify proteins that interact with the intracytoplasmic domain of the interferon- 
alpha/beta receptor-1 (IFNAR1; 107450), Abramovich et al. (1997) identified a human cDNA 
encoding a protein that was nearly identical to PRMT1 . The deduced 361 -amino acid protein 

25 was designated IR1B4 for 'interferon receptor-1 -bound protein 4.' Epitope-tagged IR1B4 
bound the IFNAR1 intracytoplasmic domain in vitro. Antibodies against IFNAR1 
coimmunoprecipitated a methyltransferase activity from human cell extracts. An antisense 
oligonucleotide strongly reduced methyltransferase activity in human cells, and caused them 
to become more resistant to growth inhibition by interferon. Abramovich et al. (1997) 

30 concluded that protein methylation, like phosphorylation, may be an important signaling 

mechanism for certain cytokine receptors. Scott et al. (1998) identified HRMT1L2 transcripts 
with variable 5-prime ends that encode 3 protein variants with different N-terminal regions. 
Rat PRMT1 and HRMT1L2 variant 2 (v.2) share 95% sequence identity, but diverge at their N 
termini. The amino acid sequences of HRMT1L2 and HRMT1L1 (601961) are 27% identical. 

35 Recombinant protein methylated human hnRNPAl and a yeast hnRNP in vitro. The 

HRMT1L2 gene complemented mutations in the yeast hnRNP methyltransferase gene HMT1 . 
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Northern blot analysis revealed that HRMT1L2 is expressed as a predominant 1 .4-kb mRNA 
in various adult and fetal tissues. Additional larger and smaller bands were observed in some 
tissues. 

The disclosed NOV50 nucleic acid of the invention encoding a Protein Arginine N- 
5 Methyltransferase 2-like protein includes the nucleic acid whose sequence is provided in Table 
50A, 50C, or a fragment thereof. The invention also includes a mutant or variant nucleic acid 
any of whose bases may be changed from the corresponding base shown in Table 50A or 50C 
while still encoding a protein that maintains its Protein Arginine N-Methyltransferase 2 -like 
activities and physiological functions, or a fragment of such a nucleic acid. The invention 

10 further includes nucleic acids whose sequences are complementary to those just described, 
including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 

15 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 6 percent of the 
bases may be so changed. 

20 The disclosed NOV50 protein of the invention includes the Protein Arginine N- 

Methyltransferase 2-like protein whose sequence is provided in Table 50B or SOD. The 
invention also includes a mutant or variant protein any of whose residues may be changed 
from the corresponding residue shown in Table SOB or SOD while still encoding a protein that 
maintains its Protein Arginine N-Methyltransferase 2-like activities and physiological 

25 functions, or a functional fragment thereof. In the mutant or variant protein, up to about 63 
percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Protein Arginine N- 

30 Methyltransferase 2 -like protein (NOV50) is a member of a "Protein Arginine N- 

Methyltransferase 2 family". Therefore, the NOV50 nucleic acids and proteins identified here 
may be useful in potential therapeutic applications implicated in (but not limited to) various 
pathologies and disorders as indicated below. The potential therapeutic applications for this 
invention include, but are not limited to: protein therapeutic, small molecule drug target, 
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antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or 
prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue 
regeneration in vivo and in vitro of all tissues and cell types composing (but not limited to) 
those defined here. 

5 The NOV50 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in systemic lupus erythematosus , autoimmune disease, 
asthma, emphysema, scleroderma, allergy, ards, diabetes, autoimmune disease, renal artery 
stenosis, interstitial nephritis, glomerulonephritis, polycystic kidney disease, renal tubular 
acidosis, 1GA nephropathy, hypercalceimia, Lesch-Nyhan syndrome, and/or other diseases and 
10 pathologies. 

NOV50 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV50 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

15 NOVX Antibodies" section below. The disclosed NOV50 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

20 NOV51 

A disclosed NOV51 nucleic acid of 984 nucleotides (also referred to as CG56759-01) 
encoding a Olfactory Receptor -like protein is shown in Table 51 A. An open reading frame 
was identified beginning with a ATG initiation codon at nucleotides 9-1 1 and ending with a 
TAA codon at nucleotides 954-956. The start and stop codons are shown in bold in Table 

25 5 1 A, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 51 A. NOV51 nucleotide sequence (SEQ ID NO: 191). 

~CTGGCCTAA TGAA.TGTCTCTGAGCCAAATTCCAGCTTTGCTTTTGTAAATGAATTTATACTCCAAGGTTTCT 
CTTGTGAGTGGACAATTCAGATCTTCCTCTTCTCACTCTTTACTACAATATATGCACTGACTATAACAGGGA 
ATGGAGCCATTGCTTTTGCCCTGTGGTGTGACCGGCGACTTCACACTCCCATGTACATGTTCCTGGGAGATT 
TCTCCTTTTTAGAGATATGGTATGTCTTTTCTACAGTTCCCAAGATGTTGGTCAACTTCCTTTCAGAGAAAA 
CAAACATCTCCTTTGCTGGATGTTTTCTCCAGTTTTATTTCTTCTTCTCTTTGGGTACATCAGAATGCTTGC 
TTTTGACTGTGATGGCCTTTGATCAGTACCTTGCTATCTGCCGTCCCTTGCACTATCCTAATATCATGACTG 
GGCATCTCTGTGCCAAACTGGTCATACTGTGCTGGGTTTGTGGATTTCTGTGGTTCCTGATCCCCATTGTTC 
TCATCTCTCAGATGCCCTTCTGTGGCCCAAACATTATTGACCATGTTGTGTGTGACCCAGGGCCACTATTTG 
C ATTGG AT TGTGTTT C TGC C C C AAG AAT CC AAC TGTTT TGCT AC ACTCT AAAC TC ATT AGTT ATT TTTGGT A 
ACTTCCTCTTTATTATTGGATCCTATACTATTGTCCTGAAAGCTGTGTTGGGTACACCTTCAAGCACTGGGA 
GACATAAGGCCTTCTCTACCTGTGGGTCTCATTTGGCTGTGGTATCACCGTGCTATGGCTCTCTTATGGTCA 
TGTATGTGAGCCCAGGACTCGGACATTCTACGGGGATGCAGAAAATTGTAACTTTGTTCTATGCTATGGTGA 
CCCCACTCTTCAATCCCCTTATCTATAGCCTCCAGAATAAGGAGATAAAGGCAGCCCTGAGGAAAGTTCTGG 
GGAGTT C C AAC AT AAT CT AA GG CAT AT TAG ATT AT T CC T C C ATG AT C A 
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In a search of public sequence databases, the NOV5 1 nucleic acid sequence, located on 
chromosome 22, has 967 of 984 bases (98%) identical to a gb:GENBANK- 
ID:AP000534|acc:AP000534.1 mRNA from Homo sapiens (genomic DNA, chromosome 
22ql 1 .2, Cat Eye Syndrome region, clone:c23H5) (E - 2.9e~ 208 ). 
5 The disclosed NOV5 1 polypeptide (SEQ ID NO: 1 92) encoded by SEQ ID NO: 1 9 1 has 

315 amino acid residues and is presented in Table 5 1 B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV51 has a signal peptide and is 
likely to be localized to the plasma mambrane with a certainty of 0.6000. Alternatively, 
NOV51 may also localize to the Golgi body with a certainty of 0.4000, to the endoplasmic 
1 0 reticulum (membrane) with a certainty of 0.3000, or to the microbody (peroxisome) with a 

certainty of 0.3000. The most likely cleavage site for NOV51 is between positions 40 and 41: 
IYA-LT. 



Table 51B. Encoded NOV51 protein sequence (SEQ ID NO:192). 

MNVSEPNSSFAFVNEFILQGFSCEWTIQIFLFSLFTTIYALTITGNGAIAFALWCDRRLHTPMYMFLGDFSF 
LEIWYVFSTVPKMLVNFLSEKTNISFAGCFLQFYFFFSLGTSECLLLTVMAFDQYLAICRPLHYPNIMTGHL 
CAKLVILCWVCGFIiWFLIPIVLISQMPFCGPNIIDHWCDPGPLFALDCVSAPRIQLFCYTLNSLVIFGNFL 
FIIGSYTIVLKAVLGTPSSTGRHKAFSTCGSHLAWSPCYGSLMVMYVSPGLGHSTGMQKIVTLFYAMVTPL 
FNPLIYSLQNKEIKAALRKVLGSSNI I 

15 A search of sequence databases reveals that the NOV51 amino acid sequence has 191 

of 314 amino acid residues (60%) identical to, and 226 of 314 amino acid residues (71%) 
similar to, the 324 amino acid residue ptnr:SPTREMBL-ACC:Q9WU86 protein from Mus 
musculus (Mouse) (Odorant Receptor SI) (E - 7.0e~ 100 ). 

NOV51 is predicted to be expressed in at least Apical microvilli of the retinal pigment 

20 epithelium, arterial (aortic), basal forebrain, brain, Burkitt lymphoma cell lines, corpus 
callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral tissue, 
cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary artery and 
umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal hematopoietic 
cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, lung lymphoma 

25 cell lines, fetal lymphoid tissue, adult lymphoid tissue, Those that express MHC II and III 
nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, pons, prostate, 
putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary artery in aortic) 
spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, thalamus, and thymus 
tissue. This information was derived by determining the tissue sources of the sequences that 

30 were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 
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NOV5 1 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 5 1C 



Table 51 C. BLAST results for NOV51 


rTf^Ti p» TtiHpy / 

Identifier 


Protein/ Oraanism 


Length 
(aa) 


I dent i ty 
(%) 


Positives 
(%) 


Expect 


gi | 15293 807 | gb | AAK9 
5096. 1| (AF399611) 


olfactory 
receptor [Homo 
sapiens] 


217 


205/217 
(94%) 


208/217 
(95%) 


4e-99 


gi | 9938010 | ref | NP_0 
64684 . 1 | 
(NM_020288) 


odorant receptor 
SI gene [Mus 
musculus] 


324 


191/314 
(60%) 


226/314 
(71%) 


5e-90 


gi | 17476501 | ref |XP_ 
063251 . 1 | 
(XM_063251) 


similar to 
OLFACTORY 
RECEPTOR- LIKE 
PROTEIN F6 (H. 
sapiens) [Homo 
sapiens] 


1056 


134/293 
(45%) 


181/293 
(61%) 


2e-66 


gi | 15293805 | gb | AAK9 
5095 .1 | (AF399610) 


olfactory 
receptor [Homo 
sapiens] 


217 


142/217 
(65%) 


163/217 
(74%) 


2e-65 


gi | 17476700 | ref | XP_ 
063315 . 1 | 
(XM_063315) 


similar to 
odorant receptor 
SI gene (H. 
sapiens) [Homo 
sapiens] 


195 


131/189 
(69%) 


156/189 
(82%) 


2e-64 



5 The homology between these and other sequences is shown graphically in the 

ClustalW analysis shown in Table 5 ID. In the ClustalW alignment of the NOV51 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
10 can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 51D. ClustalW Analysis of NOV51 

1) Novel NOV51 (SEQ ID NO: 192) 
15 2) gi | 15293807 |gb|AAK95096 .1 | (AF399611) olfactory receptor [Homo sapiens] (SEQ ID 
NO: 557) 

3) gi | 9938010 | ref |NP_064684 . 1 | (NM_020288) odorant receptor SI gene [Mus musculus] 
(SEQ ID NO:558) 

4) gi | 17476501 | ref | XP_063251 . 1 | (XM_063251) similar to OLFACTORY RECEPTOR-LIKE 
20 PROTEIN F6 (H. sapiens) [Homo sapiens] (SEQ ID NO: 559) 

5) gi | 152 93 805 | gb | AAK95095 . 1 | (AF399610) olfactory receptor [Hojtio sapiens] (SEQ ID 
NO:560) 

6) gi | 17476700 | ref |XP_063315 . 1 | (XM_063315) similar to odorant receptor SI gene (H. 
sapiens) [Homo sapiens] (SEQ ID NO: 5 61) 

25 

10 20 30 40 50 60 
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gi 
gi 
gi 



9938010 | 
17476501 | 
15293805 j 
17476700 j 



NOV51 

gi | 15293807 | 
gi j 9938010 | 
gi j 17476501 | 
gi | 15293805 j 
gi j 17476700 j 



NOV51 

gi | 15293807 | 
gi | 9938010 | 
gi j 17476501 | 
gi | 15293805 | 
gi | 17476700 | 



NOV51 

gi | 15293807 | 
gi j 9938010 | 
gi | 17476501 | 
gi j 15293805 j 
gi j 17476700 j 



NOV51 

gi | 15293807 | 
gi j 9938010 | 
gi | 17476501 | 
gi j 15293805 | 
gi j 17476700 j 



NOV51 

gi | 15293807 | 
gi | 9938010 | 
gi | 17476501 | 
gi|l5293805| 
gi | 17476700 j 



NOV51 

gi|l5293807 | 
gi | 9938010 | 
gi | 17476501 | 
gi|l5293805| 
gi | 17476700 | 



NOV51 

gi | 15293807 | 
gi j 9938010 | 
gi j 17476501 1 
gi j 15293805 j 
gi j 17476700 j 



! 1 

1 MPVLLPVHFSAKCPLLLLCDPANPPSEPLPSQGCFIFIHRVLLDLSTAGESGNTAGFICD 6 0 

! 1 

! ! 

70 80 90 100 110 120 

....|....|....|....|....|....|....|....|....|....|....|....| 

! 1 

1 1 

! 1 

61 QALLTSPVREDGAENGLGFHQPVELHICGDAVGFVGMGQRRKPMSVPWSHPKISEKCASD 12 0 

1 1 

! 1 

130 140 150 160 170 180 

....|....|....|....|....|....|....|....|....|....|....|....| 

1 1 

1 1 

1 1 

121 TWCTDATYHREHSKPSGPWEHGPLKPFEDWVPALPYPLWPQELLHCGSQSGDCMCLLLLE 18 0 

1 1 

1 1 

190 200 210 220 230 240 

....|....|....|....|....|....|....|....|....|....|....|....| 

! 1 

! 1 

x 1 

181 SSRRSPPTLPIPLTFPRLCQSFPLLTASGKEPSCGFTSALRRLYGCGAAERPQSPVTPKT 24 0 

! 1 

! 1 

250 260 270 280 290 300 

....|....|....|....|....|....|....|....|....|....|....|....| 

1 1 

1 1 

! 1 

241 ETSEQGPKDPPIHLAHPSDRALSPSCFLSLRAVILTCKNRDAQVEEGHRREPPVLDCGYQ 300 

1 1 

1 1 

310 320 330 340 350 360 

I I I I I I I I I I I I 

! 1 

x 1 

! 1 

3 01 RSGTRGNHTRRICSTLRGSRIEAWVAAATLQRGPYFRKQQPLGKDSWSVAEDWIEAFMLA 3 6 0 

x 1 

1 1 

370 380 390 400 410 420 

....|....|....|....|....|....|....|....|....|....|.--.|....| 

! 1 

! 1 

! 1 

361 FGVRVLWDASMALEAQRDPSSNDTKGKDQLTKRDQRNPQNFALLQKSAASDWNSQPVCRR 42 0 

! 1 

1 1 

430 440 450 460 470 480 

....|....|....|....|....|....|....|....|....|....|....|....| 

! 1 

! 1 

! 1 

421 GYLTCASASLGEISSPHFPVHLNAPKCHWGLSSSPVERWMLRERKAVTDESSSSWMVAIR 4 80 

1 1 

x X 



490 500 510 520 530 540 
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NOV51 


1 


gi 


15293807 | 


1 


g± 


9938010 | 


1 


gi 


17476501| 


481 


gi 


15293805 | 


1 


gi 


17476700) 


1 



1 

1 

1 

ARETPGILAQRICSALKGVWCQAAQGSLPRLLSSLSISTGCDKTAVLTFDRALLTREHSK 54 0 

1 

1 



10 



15 



NOV51 

gi | 15293807 | 
gi | 9938010 | 
gi|l747650l| 
gi|15293805| 
gi|l7476700| 



550 



560 



570 



580 
..|.. 



590 
..|.. 



600 



1 

- - 1 
-M 1 



541 PNGPWERGPLKPSGDWDTCLHYLLWPQELFHCRSQTEDYTVTWFDWDRQMQKYSQSPFL 60 0 



20 



25 



NOV51 

gi | 15293807 | 
gi|9938010| 
gi j 17476501 | 
gi | 15293805 j 
gi j 17476700 j 



610 620 630 640 650 660 

|....|....|....|....|....|...-|....|....|...-|..--|.-..| 
-MNVSEPNSSFAFVNEFILQGFSCEWTIQIFLFSLFTTIYALTITGNGAIAFALWC 55 



1 

1 1 

2 S L FPQRNLDAMNRS AAHVTE FVLLGF PG S WK I Q I FL FVL FLVF Y VLTLLGNGAI I CAVRC 61 
601 E QR VKKTM S P DGNH S S D PT E FVL AGL PNLNS ARVE L F S VFLL VY LLNLTGNVL I VGWRA 66 0 
1 1 

1 1 



720 



30 



35 



40 



45 



50 



55 



60 



NOV51 

gi 1 15293807 | 
gi j 9938010 | 
gi | 17476501 | 
gi|l5293805| 
gi 1 17476700 | 



NOV51 

gi|l5293807| 
gi | 9938010 | 
gi | 17476501| 
gi|l5293805| 
gi | 17476700 j 



NOV51 

gi|l5293807 | 
gi|9938010| 
gi|l747650l| 
gi|15293805| 
gi j 17476700 j 



56 
1 

62 
661 
1 
1 




DSRLHTPMYFLLGNFS 
DTRLQTPMYFFLGNLS 



NOV51 


235 


ST 


GR 


gi 


15293807 | 


164 


ST 


GR 


gi 


9938010 | 


241 


A^ 


GR 


gi 


17476501| 


841 


M 


GR 


gi 


15293805 j 


164 


AA 


GR 


gi 


17476700 | 


114 




GR 




65 



70 



NOV51 

gi 1 15293807 | 
gi | 9938010 | 
gi|l747650l| 
gi 1 15293805 | 
gi j 17476700 j 



295 
217 
301 
901 
217 
174 



910 



920 930 940 
,|....|....|....|....|....|.. 
JLG SSNII-- 



950 



960 
••I 



315 

217 

L.L GMRIVKNM 3 24 

3FRKGCDFAFERCNSACNCRKGSLTTTTKSATLRCGAGAKARAGARL 96 0 

217 

195 



-GLTVSQN- 
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10 



15 



NOV51 

gi|l5293807 | 
gij 9938010 | 
gi | 17476501 | 
gi|15293805 | 
gi 1 17476700 j 



NOV51 

gi | 15293807 | 
gi | 9938010 | 
gi j 17476501 | 
gi|l5293805 | 
gi 1 17476700 j 



970 980 990 1000 1010 1020 

....|....|....|....|....|....|....|....|....|....|....|....| 

315 315 

217 217 

324 324 

961 HPAAGSPRDSRKVNVRVQKDPRRSVPKVETFISGSGPSCVGQCTGRVCILKGTRTISGGL 1020 

217 217 

195 195 

1030 1040 1050 

....|....|....|....|....|....|...-|. 

315 315 

2 1 7 217 

324 324 

1021 WL.ED PRKTRTTDFTHRKI KVTAGLAGE KVE PTLPRC 1056 

217 217 

1 95 195 



20 



Table 5 IE lists the domain descriptions from DOMAIN analysis results against 
NOV51. This indicates that the NOV51 sequence has properties similar to those of other 
proteins known to contain this domain. 



25 



30 



35 



40 



45 
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Table 51E Domain Analysis of NOV51 

gnl | Pf am|pfam00001, 7tm_l, 7 transmembrane receptor (rhodopsin 

family). (SEQ ID NO:810) 

CD-Length = 254 residues, 100.0% aligned 

Score = 105 bits (262) , Expect = 4e-24 



NOV51 : 


45 


Sbjct : 


1 


NOV51 : 


105 


Sbjct : 


61 


NOV51: 


165 


SbjCt : 


121 


NOV51 : 


225 


SbjCt: 


181 


NOV51 : 


281 


SbjCt : 


241 



+ I I 



11+ I 



1 1 + 1 



I I 



+ l + + 



GNGAIAFALWCDRRLHTPMYMFLGDFSFLEIWYVFSTVPKMLVNFLSEKTNISFAGCFLQ 104 

||+ + ++| I I +1 I + + ++ ++ + I I + III 
GNLLVI LVI LRTKKLRTPTNI FLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 6 0 



I I + I I I I ++ 1 + 1 I I I I I I I I I++I II I 11 + 

GALFWNGYAS I LLLTAI S I DRYLAI VHPLRYRRI RTPRRAKVL I LLVWVLALLLSLPPL 12 0 



G-Protein Coupled Receptor (GPCRs) have been identified as an extremely large 
family of protein receptors in a number of species. At the phylogenetic level they can be 
classified into four major subfamilies. These receptors share a seven transmembrane domain 
structure with many neurotransmitter and hormone receptors. They are likely to be involved in 
the recognition and transduction of various signals mediated by G-Proteins, hence their name 
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G-Protein Coupled Receptors. The human GPCR genes are generally intron-less and belong to 
four gene subfamilies, displaying great sequence variability. These genes are dominantly 
expressed in olfactory epithelium. 

Olfactory receptors (ORs) have been identified as extremely large family of GPCRs in 
5 a number of species. As members of the GPCR family, these receptors share a seven 

transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. Like 
GPCRs, the ORs they can be expressed in a variety of tissues where they are thought to be 
involved in recognition and transmission of a variety of signals. The human OR genes are 

10 typically intron-less and belong to four different gene subfamilies, displaying great sequence 
variability. These genes are dominantly expressed in olfactory epithelium. 

The disclosed NOV51 nucleic acid of the invention encoding a Olfactory Receptor-like 
protein includes the nucleic acid whose sequence is provided in Table 51 A or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 

15 be changed from the corresponding base shown in Table 5 1 A while still encoding a protein 

that maintains its Olfactory Receptor -like activities and physiological functions, or a fragment 
of such a nucleic acid. The invention further includes nucleic acids whose sequences are 
complementary to those just described, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 

20 nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 

25 binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 2 percent of the bases may be so changed. 

The disclosed NOV51 protein of the invention includes the Olfactory Receptor-like 
protein whose sequence is provided in Table 5 IB. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 

30 in Table 5 IB while still encoding a protein that maintains its Olfactory Receptor-like activities 
and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 
up to about 54 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fab)2, that bind immunospecifically to any of the proteins of the invention. 
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The above disclosed information suggests that this Olfactory Receptor -like protein 
(NOV51) is a member of a "Olfactory Receptor family". Therefore, the NOV51 nucleic acids 
and proteins identified here may be useful in potential therapeutic applications implicated in 
(but not limited to) various pathologies and disorders as indicated below. The potential 
5 therapeutic applications for this invention include, but are not limited to: protein therapeutic, 
small molecule drug target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 
research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing 
(but not limited to) those defined here. 

10 The NOV51 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in developmental diseases, MHCII and III diseases 
(immune diseases), Taste and scent detectability Disorders, Burkitt's lymphoma, 
Corticoneurogenic disease, Signal Transduction pathway disorders, Retinal diseases including 
those involving photoreception, Cell Growth rate disorders; Cell Shape disorders, Feeding 

1 5 disorders;control of feeding; potential obesity due to over-eating; potential disorders due to 
starvation (lack of apetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, 
fungal, protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, 
cancer (including but not limited to Neoplasm; adenocarcinoma; lymphoma; prostate cancer; 
uterus cancer), anorexia, bulimia, asthma, Parkinson's disease, acute heart failure, hypotension, 

20 hypertension, urinary retention, osteoporosis, Crohn's disease; multiple sclerosis; and 

Treatment of Albright Hereditary Ostoeodystrophy, angina pectoris, myocardial infarction, 
ulcers, asthma, allergies, benign prostatic hypertrophy, and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, severe 
mental retardation. Dentatorubro-pallidoluysian atrophy(DRPLA) Hypophosphatemic rickets, 

25 autosomal dominant (2) Acrocallosal syndrome and dyskinesias, such as Huntington's disease 
or Gilles de la Tourette syndrome and/or other pathologies and disorders of the like.. The 
polypeptides can be used as immunogens to produce antibodies specific for the invention, and 
as vaccines. They can also be used to screen for potential agonist and antagonist compounds. 
For example, a cDNA encoding the OR -like protein may be useful in gene therapy, and the 

30 OR-like protein may be useful when administered to a subject in need thereof. By way of 
nonlimiting example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from bacterial, fungal, protozoal and viral infections 
(particularly infections caused by HIV-1 or HIV-2), pain, cancer (including but not limited to 
Neoplasm; adenocarcinoma; lymphoma; prostate cancer; uterus cancer), anorexia, bulimia, 
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asthma, Parkinson's disease, acute heart failure, hypotension, hypertension, urinary retention, 
osteoporosis, Crohn's disease; multiple sclerosis; and Treatment of Albright Hereditary 
Ostoeodystrophy, angina pectoris, myocardial infarction, ulcers, asthma, allergies, benign 
prostatic hypertrophy, and psychotic and neurological disorders, including anxiety, 
5 schizophrenia, manic depression, delirium, dementia, severe mental retardation and 

dyskinesias, such as Huntington's disease or Gilles de la Tourette syndrome, and/or other 
diseases and pathologies. 

NOV5 1 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV51 substances for use in 

10 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV51 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 

15 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV52 

A disclosed NOV52 nucleic acid of 3828 nucleotides (also referred to as CG56731-01) 
encoding a H326-like protein is shown in Table 52A. An open reading frame was identified 
20 beginning with a ATG initiation codon at nucleotides 177-179 and ending with a TAA codon 
at nucleotides 1968-1970. The start and stop codons are shown in bold in Table "52A, and the 
5' and 3' untranslated regions, if any, are underlined. 



Table 52 A. NOV52 nucleotide sequence (SEQ ID NO: 193). 

CTCTTAGCGCTCAGGTCTTTTCCTTCCGCCGACCCGAAGTCATCGCTGGGAGTACTGGTTGCCCTTTCCTCA 
GTCCTTCAGTGAATCTACAGAGCCTATTTCCTCAGGAGCCTCAGCCTGGTCCTTACTTCAGTGATAAAAGGA 
GGAAAGGCTGGCTACAGCAAACATCATTCAAG ATGTCCAGCAAAGGGAGCAGCACAGATGGCAGAACAGACT 
TAGCTAATGGAAGCCTGTCTAGCAGTCCAGAGGAGATGTCTGGAGCTGAAGAGGGGAGGGAGACATCCTCAG 
GCATTGAAGTGGAGGCCTCAGACCTGAGTTTGAGCTTGACTGGGGATGATGGTGGCCCCAACCGCACCAGCA 
CAGAAAGTCGAGGCACAGACACAGAGAGCTCAGGTGAAGATAAGGACTCTGACAGCATGGAGGACACTGGTC 
ATT ACT CC ATT AAT GATGAAAAT CG AGTC C ATG AC CGC T CAG AGGAAG AGGAAG AGG AGGAAG AAGAGGAGG 
AAGAAGAGCAGCCTCGGCGCCGTGTACAGCGCAAGCGGGCTAACCGTGACCAGGACTCATCAGATGATGAGC 
GGGCCCTAGAGGACTGGGTGTCCTCAGAAACATCAGCTCTACCCCGACCTCGCTGGCAAGCCCTCCCTGCCC 
TTCGGGAGCGGGAGCTGGGTTCAAGTGCCCGCTTTGTCTATGAGGCCTGTGGGGCAAGAGTCTTTGTGCAGC 
GTTTCCGCCTGCAGCATGGGCTTGAGGGCCATACTGGTTGTGTCAATACCCTGCACTTTAACCAGCGCGGCA 
CCTGGCTGGCCAGTGGCAGCGATGACCTGAAGGTGGTGGTGTGGGATTGGGTACGGCGGCAGCCAGTACTGG 
ACTTTGAGAGTGGCCACAAAAGTAATGTGTTCCAGGCCAAGTTTCTTCCTAACAGTGGTGATTCTACTCTGG 
CCATGTGTGCCCGTGACGGGCAGGTTCGAGTAGCAGAACTGTCTGCCACACAGTGTTGCAAGAATACAAAAC 
GTGTGGCCCAGCACAAGGGAGCGTCCCACAAGTTGGCACTGGAACCAGACTCTCCCTGTACGTTCTTATCTG 
CAGGTGAAGATGCAGTTGTTTTCACCATTGACCTGAGACAAGACCGCCCAGCGTCGAAACTGGTGGTGACAA 
AAGAGAAAGAGAAGAAAGTGGGGCTGTATACGATCTATGTGAATCCTGCCAATACCCACCAGTTTGCAGTGG 
GTGGACGAGATCAGTTTGTAAGGATTTATGACCAGAGGAAAATTGATGAGAATGAGAACAATGGAGTACTCA 
AGAAGTTCTGTCCTCATCACCTGGTGAACAGTGAGTCCAAAGCAAACATCACCTGTCTTGTGTACAGCCACG 
ACGGC AC AG AG CTCCTGGC C AGTT AC AATG ATG AAG AC ATT TACCTCTT C AACT C CT CT C AC AGTG ATGGGG 
CCCAGTATGTTAAGAGATACAAGGGCCACAGAAATAATGCCACAGTAAAAGGCGTCAATTTCTATGGCCCCA 
AGAGTGAGTTTGTGGTGAGCGGTAGTGACTGTGGGCACATCTTCCTCTGGGAGAAATCATCCTGCCAGATTA 
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TTCAGTTCATGGAGGGGGACAAGGGAGGCGTGGTAAACTGTCTTGAGCCCCACCCTCACCTGCCTGTGCTGG 
CAACCAGTGGCCTAGACCATGATGTGAAGATCTGGGCACCCACAGCTGAAGCTTCCACTGAGCTGACAGGGT 
TAAAAGATGTGATTAAGAAGAACAAGCGGGAGCGGGATGAAGATAGCTTGCACCAAACTGACCTGTTTGATA 
GTCACATGCTGTGGTTCCTTATGCATCACCTGAGACAGAGACGCCATCACCGGCGCTGGCGAGAACCTGGGG 
TTGGGGCCACAGACGCGGACTCTGATGAGTCTCCCAGCTCCTCAGACACATCGGACGAGGAGGAGGGCCCTG 
ACCGGGTGCAGTGCATGCCATCTTG AGGCCTCATACCTAGGTGGGGCAGGCTGGGGCTGCCAACCTGATCCT 
GCCTGGGCAACCCTTTCCTGTCCCAGGCCCTACATTCAGCAGAAACGCACTTTGGACTTTTTGCTTTAGATA 
AAAGAAAGAC ATC C C AGG AG AAGGAC AAAC CAGAGGAGTG AAC C AAC AAAG AGTACC TAGGAATGGGAGTTG 
AGCCTGGAATGGGCTCCATGGAGAGGTGCATAGGACTCGGCAGAAATGGCCTCTCCCCAAAGCCTCTTTTTG 
AGAGGAGAGGGAAGCCTATTTGTTAACTGGTTTGGGATAGGGAATGGGGTTTCTTTTTCTTTAATCTCCCTT 
GTTTCTTGGGCTGGGGGAGGGGTGGGGGGAACAACTGGCTATTCAGTACCAAGGGGCCAGAGTGGAGGGTAG 
GAGTGCCACTCTCTCTTTGGTTTAGGTTTTTGACCTTTTCTTCCTTTGTTTTTTAAAAGTTTATGACAGTTN 
C TC C CNNNACC CC AC AAC C C CAT C C C AGAAT C C T ATTT T CC TGGG AAGT CC T T AAAG CC C C T AAC CAT C C C A 
CACTCTTCACTTTCCTTTCCACCTTATTCATTCTCTGTACTTACCACAGTATTTTGCACTTGATTACATATC 
CTTCATCTCTTCTCTTCATCCCATCACCCCCTAAATAGGTCAGGTGAGGGAGGCTGGGAAGAGGTGGGAGGA 
GGGGAGAAGTGAAGGAAGATAGGAAGGATATTACCTCTTCTGTTATTTTTTTAAGAAACATTGTTTGGTGGC 
AGCAATCTCCCTGTCCCTATCACTGTTAGAGGCCTAATTTTATATCTATAAATATATTAAAAAGCAAGTCAA 
ACTTGGATGTATCAAGGTAAAATTATTGTCAAAGTTTAAATACCTATATATTCTCTGAATGCAATAAAGGGA 
CTTAAGAGTGAACAAGAGTAATGGTGTGGAAGTGACACCTGGGGTCAGTTTACCTCTGTGTATGGTCACTAG 
AGATTGGGACTTACCCTTTAGGTTTTAGGAGGCTTGAGAATGGAAGGATCCTCATTTCTGCCCTTCCTGGTT 
CCCTGCTTTGGTGTAGGGGTTGGGAAAAACAGGAAATTCCTCTCAGCTCTGCCTCAGATCTCCTACCTCTCC 
TTAAGTCTTGTAGGGGGTTCCAAGGATGGCTCTTCTAACCAGAGGCTGGCCTGTCTTTAAAACTTAACTACT 
TTAGGGTGGTGCCACCACTGCAGACTATTGTGGTACTTTGTGACAGAAGACATGTACACACACACCACACAC 
ATACATACACACTCTCTCACTCTGTCTCTCTTACCTTTAGCTGCTTGATCATTAAGCCATCCAACTTCATGC 
CAGTTCCCTTCTTTATAGAAGAGTGAAGGGAAAGACTTCCTGGGTTTGACTTAAACCTTGTCCACCTTCTTG 
ATATTTTAGGATTGAGGAATAAAGTCATTAATCTAAGGAACTGATTACAGTGGCTGGAGCTTGGGCACTTGT 
CTTATCACTGGTCACTGAGTCTGAAAGTCCCAGNTGAATTCTTGCCCTTAAGTGCTTTTGCTGCTATTTTTT 
TGCCCCCAGTTCCACAAGATCCAACCAAGAATTCTGTATCCTGGCAACAGTCAGATTCTTCTAAATCAGCCA 
GCAAGAGGGNAAAGAGTGAGAGATGGTATTCCCAGATCATTCTTCCTCCTGCCCCTTTCCCAGCAGCTCTAG 
ACCAGATGTTGGCTGCTGTACTTACTCCCTGAGGTAGGGAATGTGTGGTGATCGAGTGGTCTGTGTTCCTAT 
TGCTGGTGGGGTGATAGGGTGGGCTAAAAACCATGCACTCTGGAATTTGTTGTATTTTCTCCCAGTAAAGCT 
TTTCTTCTCCCG 



In a search of public sequence databases, the NOV52 nucleic acid sequence, located on 
chromosome 22, has 3818 of 3828 bases (99%) identical to a gb:GENBANK- 
ID:HSU06631|acc:U0663Ll mRNA from Homo sapiens (Human (H326) mRNA, complete 
5 cds) (E = 0.0). 

The disclosed NOV52 polypeptide (SEQ ID NO: 194) encoded by SEQ ID NO: 193 has 
597 amino acid residues and is presented in Table 52B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV52 has no signal peptide and is 
likely to be localized to the microbody (peroxisome) with a certainty of 0.3000. Alternatively, 
10 NOV52 may also localize to the nucleus with a certainty of 0.3000, to the mitochondrial 

matrix space with a certainty of 0.1000, or to the lysosome (lumen) with a certainty of 0.1000. 



Table 52B. Encoded NOV52 protein sequence (SEQ ID NO: 194). 

MSSKGSSTDGRTDLANGSLSSSPEEMSGAEEGRETSSGIEVEASDLSLSLTGDDGGPNRTSTESRGTDTESS 
GEDKDSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSET 
SALPRPRWQALPALRERELGSSARFVYEACGARVFVQRFRLQHGLEGHTGCVNTLHFNQRGTWLASGSDDLK 
VVWDWVRRQPVLDFESGHKSNVFQAKFLPNSGDSTLAMCARDGQVRVAELSATQCCKNTKRVAQHKGASHK 
LALEPDSPCTFLSAGEDAWFTIDLRQDRPASKLWTKEKEKKVGLYTIYVNPANTHQFAVGGRDQFVRIYD 
QR K I D ENENNG VL KK FC P HHL VN S E S KAN I T CLV Y S HDGT E L LAS YNDED I YL FNS S H S DG AQY VKR Y KGHR 
NNATVKGVNFYGPKSEFWSGSDCGHIFLWEKSSCQIIQFMEGDKGGWNCLEPHPHLPVLATSGLDHDVKI 
WAPTAEASTELTGLKDVI KKNKRERDEDSLHQTDLFDSHMLWFLMHHLRQRRHHRRWREPGVGATDADSDES 
PSSSDTSDEEEGPDRVQCMPS 
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A search of sequence databases reveals that the NOV52 amino acid sequence has 588 
of 597 amino acid residues (98%) identical to, and 589 of 597 amino acid residues (98%) 
similar to, the 597 amino acid residue ptnr:SPTREMBL-ACC:Q 12839 protein from Homo 
5 sapiens (Human) (H326) (E - 0.0). 

NOV52 is predicted to be expressed in at least adrenal gland, bone marrow, brain - 
amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 

10 small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, uterus, Amnion, 

Appendix, Bone, Bronchus, Brown adipose, Cervix, Chorionic Villus, Colon, Coronary 
Artery, Dermis, Epidermis, Foreskin, Hair Follicles, Hypothalamus, Kidney Cortex, Liver, 
Lung, Lung Pleura, Lymph node, Lymphoid tissue, Muscle, Ovary, Oviduct/Uterine 
Tube/Fallopian tube, Parathyroid Gland, Parotid Salivary glands, Peripheral Blood, 

15 Respiratory Bronchiole, Retina, Right Cerebellum, Skin, Synovium/Synovial membrane, 
Temporal Lobe, Thymus, Tonsils, Umbilical Vein, Urinary Bladder, Vein, Vulva, Whole 
Organism. 

This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
20 sources, Literature sources, and/or RACE sources. 

In addition, the sequence is predicted to be expressed in plasma cells (myeloma) 
because of the expression pattern of (GENBANK-ID: gb:GENBANK-ID:HSU0663 1 |acc: 
U06631.1) a closely related Human (H326) mRNA, complete cds homolog. 

NOV52 also has homology to the amino acid sequences shown in the BLASTP data 
25 listed in Table 52C 



Table 52C. BLAST results for NOV52 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 13 63 6 6 82 | ref |XP_ 
010501 .2 | 
(XM_010501) 


similar to H326 
(H. sapiens) 
[Homo sapiens] 


597 


597/597 
(100%) 


597/597 
(100%) 


0.0 


gi | 7657148 | ref |NP_0 
56541 . 1 | 
(NM_015726) 


H3 2 6 [Homo 
sapiens] 


597 


588/597 
(98%) 


589/597 
(98%) 


0 . 0 


gi | 17485807 | ref |XP_ 
066683 . 1 | 
(XM 066683) 


similar to H326 
(H. sapiens) 
[Homo sapiens] 


779 


401/603 
(66%) 


463/603 
(76%) 


0.0 
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gi | 17485821 | ref | XP_ 
066690 .1 | 
(XM 066690) 


similar to H326 
(H. sapiens) 
[Homo sapiens) 


577 


387/601 
(64%) 


449/601 
(74%) 


0.0 


gi | 6679281 | ref | NP_0 
32847. 1 | 
(NM 008821) 


plasmacytoma 
expressed 
transcript 2 [Mus 
musculus] 


747 


308/469 
(65%) 


383/469 
(80%) 


0.0 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 52D. In the ClustalW alignment of the NOV52 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence (i.e. f regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



1 0 Table 52D. ClustalW Analysis of NOV52 

1) Novel NOV52 (SEQ ID NO: 194) 

2) gi | 13636682 | ref | XP_010501 . 2 | (XM_010501) similar to H326 (H. sapiens) [Homo 
sapiens] (SEQ ID NO: 562) 

3) gi | 7657148 | ref |NP_056541 . 1 | (NM_015726) H326 [Homo sapiens] (SEQ ID NO: 563) 
15 4) gi j 17485807 | ref | XP_066683 . 1 | (XM_066683) similar to H326 (H. sapiens) [Homo 

sapiens] (SEQ ID NO: 564) 

5) gi | 17485821 | ref | XP_066690 . 1 | (XM_066690) similar to H326 (H. sapiens) [Homo 
sapiens] (SEQ ID NO: 565) 

6) gi | 6679281 | ref |NP_032847 . 1 | (NM_008821) plasmacytoma expressed transcript 2 [Mus 
20 musculus] (SEQ ID NO: 566) 
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NOV52 

gi | 13636682 | 
gi|7657148| 
gi j 17485807 | 
gi | 17485821 j 
gi | 6679281 | 



NOV52 

gi|l3636682| 
gi|7657148| 
gi j 17485807 | 
gi|l748582l| 
gi|667928l| 



NOV52 


1 


gi 


13636682 | 


1 


gi 


7657148 | 


1 


gi 


17485807 | 


121 


gi 


17485821| 


1 


gi 


6679281| 


103 
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1 1 

1 MANS KNPVE I E IMVFSSTLNMPDVGDNKPLS I KQVNDI SRQSTMNS VHLLNL LLC FATAL 6 0 



- MSSHES YTNAAETPENI S I LSCLGETSGALVDTKTI SDI KTMDPRVSLTP - 
70 80 90 100 110 
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6 1 KHHGLGDLQAACTLRQRVDVSLIGEQHPTRI PVI I G Y KS E E Q L P VLD KT KFL VLANQ AF F 120 

1 1 

51 SSDVTGTEDSSVLTPQSTDVN SVDSYQGYEGDDDDEEDDEDDKDGDSNLPSL 10 2 

130 140 150 160 170 180 
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NOV52 

gi|l3636682 | 
gi|7657148| 
gi j 17485807 | 
gij 17485821 j 
gi|667928l| 



NOV52 

gi|l3636682 | 
gi|7657148| 
gi|l7485807 | 
gi | 17485821 j 
gi|667928l| 



NOV52 

gi|l3636682| 
gi j 7657148 | 
gi | 17485807 | 
gi | 17485821 | 
gi|667928l| 



NOV52 

gi|l3636682 | 
gi|7657148| 
gi j 17485807 | 
gi | 17485821 | 
gi|667928l| 



NOV52 

gi|l3636682| 
gi |7657148 | 
gi|l7485807| 
gi|l748582lj 
gi|667928l[ 



NOV52 

gi|l3636682| 
gi|7657148| 
gi j 17485807 | 
gi j 17485821 | 
gi|667928l| 




FLgGEDAWFT I DLRQDRPAS K 
FL|||gEDAWFTIDLRODRPASK 
Fl |1| gEDAVVFTIDLRQDRPASK 
FLggGE DAWFT I DLRQDRPAS K 
FlSgEDAWFT I DLRQDRPAS K 

fl™gedawfSji dlrqSSpas k 



321 
321 
321 
504 
301 
481 



NOV52 

gi 1 13636682 | 
gi|7657148| 
gi 1 17485807 | 
gi j 17485821 j 
gi|667928l| 
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I 



I 



560 



I 



570 
.. I . . 



580 



590 



600 



I 



lwtke^kkvglytimvnpantjqfavgggdqfvriydqrkidenenngvlkkfcphhl 
lwtkeISkkvglytiSvnpantSqfavggBdqfvriydqrkidenenngvlkkfcphhl 
lwtke^kkvglytiSvnpantSqfavggSdqfvriydqrkidenenngvlkkfcphhl 

WTBeIBIkKVGL YT lBvijPANT5QFAVGG|DQFVRI YDQRKI DggENNGVLKKFjPHHL 

wT ! E ii KKvGLYTi i^ 

L^VgKi^KKVGLYT^VNPAN^QFAVGGSDQFlRIYDQRKIDENffiNNGVLKKFCPHHL 



660 



NOV52 

gi | 13636682 | 
gi|7657148| 
gi | 17485807 | 
gi | 17485821] 
gi|667928l| 
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680 



690 



700 



710 



NOV52 


442 


gi|l3636682| 


442 


gi|7657148| 


442 


gi|l7485807| 


625 


gi |l748582l| 


422 


gi | 6679281 | 


602 


NOV52 


502 


gi|l3636682 | 


502 


gi|7657148| 


502 


gi|l7485807 | 


685 


gi|l748582l| 


482 


gi|667928l| 


662 


NOV52 


562 


gi|l3636682 | 


562 


gi|7657148 | 


562 


gij 17485807 | 


745 


gi|l748582l| 


- 542 


gi|667928l| 


722 



I 



I 



720 
• ■ I 



FYGPjgSEFVVSGSpCGHI FgWEKSSCQI IjQFMEGDjaGGcJVNCLE PHPffiLPVLATSGLDHD 



fygpSsefwsgsdcghifb 

F YG Pras E FWS iGSpffGH I FB 
FYGpSsEFWSGSbCGKI FU 

fygpSsefwsgsd'&ghHfB 
fygpIsefvIsgsdcghi FB 



MEKSSCQI IQFMEGDgGG 



CLE PHPfflLPVLATSGLDHD 



WEKSSCQIIQFMEGDjSG^^NCLEPHPgLPVLATSGLDHD 

WEKSSjgQl^ 

|WE KS S§Q I 1 Q FMEGD^S^^NC LE P H PwLPVLAT S GLPj BS 
|WEKS'sCQlBQFgEBb>gGG^jNCg|^HP§LPVLAgSGLDHi5 



501 
501 
501 
684 
481 
661 



760 




790 



800 



810 




Tables 52E-F list the domain descriptions from DOMAIN analysis results against 
NOV52. This indicates that the NOV52 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 52E Domain Analysis of NOV52 

gnl | Smart | smart00320, WD40, WD40 repeats; Note that these repeats are 
permuted with respect to the structural repeats (blades) of the be.ta 
propeller domain. (SEQ ID NO: 85 0). 
CD-Length = 40 residues, 82.5% aligned 
Score = 43.1 bits (100), Expect = 4e-05 



NOV5 2 : 189 LE GHTGCVNT LH FNQRGTW LAS G S DDLKWVWD 221 

1*1111 I ++ 1+ I lllllll + +11 
Sbjct: 8 LKGHTGPVTSVAFSPDGKXiLASGSDDGTIKLWD 4 0 



Table 52F Domain Analysis of NOV52 

(SEQ ID NO:851) 



gnl | Pf am|pf am00400 , WD40, WD domain, G-beta repeat 
CD-Length = 39 residues, 97.4% aligned 
Score = 43.1 bits (100), Expect = 4e-05 



NOV52 : 184 RLQHGLEGHTGCVNTLHFNQRGTWLASGSDDLKWVWD 

+ 1 i M I I I ++ 1+ I lllllll I +11 
Sbjct : 2 KLLRTLSGHTGSVTSVAFSPDGNLLASGSDDGTVKIWD 



221 



39 
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Beta-transducin (G-beta) is one of the three subunits (alpha, beta, and gamma) of the 
guanine nucleotide-binding proteins (G proteins) which act as intermediaries in the 
transduction of signals generated by transmembrane receptors. The alpha subunit binds to and 
hydrolyzes GTP; the functions of the beta and gamma subunits are less clear but they seem to 
5 be required for the replacement of GDP by GTP as well as for membrane anchoring and 
receptor recognition. 

In higher eukaryotes G-beta exists as a small multigene family of highly conserved 
proteins of about 340 amino acid residues. Structurally G-beta consists of eight tandem repeats 
of about 40 residues, each containing a central Trp-Asp motif (this type of repeat is also called 

10 a WD-40 repeat). Such a repetitive segment has been shown to exist in a number of other 

proteins, including G-beta-like peptides, yeast STE4, MSI1, CDC4, CDC20, MAK1 1, PRP4, 
PWP1 and TUP1, slime-mould AAC3 and coronin, and Drosophila Groucho protein. The 
number of repeats within these proteins varies between 5 (PRP4, TUP1, and Groucho) and 8 
(G-beta, STE4, MSI1, AAC3, CDC4, PWP1, etc.). In G-beta and G-beta like proteins, the 

15 repeats span the entire length of the sequence, while in other proteins, they make up the N- 
terminal, the central or the C-terminal section. 

The protein of this invention contains 7 WD-40 repeats. Although the function of this 
H326-like protein is not precisely known, it has potential importance in the intracellular 
transduction of signals, similarly to other WD-40 repeat-containing proteins. 

20 The disclosed NOV52 nucleic acid of the invention encoding a H326-Iike protein 

includes the nucleic acid whose sequence is provided in Table 52A or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 52A while still encoding a protein that maintains 
its H326 -like activities and physiological functions, or a fragment of such a nucleic acid. The 

25 invention further includes nucleic acids whose sequences are complementary to those just 

described, including nucleic acid fragments that are complementary to any of the nucleic acids 
just described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 

30 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1 percent of the 
bases may be so changed. 
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The disclosed NOV52 protein of the invention includes the H326-like protein whose 
sequence is provided in Table 52B. The invention also includes a mutant or variant protein 
any of whose residues may be changed from the corresponding residue shown in Table 52B 
while still encoding a protein that maintains its H326-like activities and physiological 
functions, or a functional fragment thereof. In the mutant or variant protein, up to about 36 
percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fabk that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this H326 -like protein (NOV52) is a 
member of a "H326 family". Therefore, the NOV52 nucleic acids and proteins identified here 
may be useful in potential therapeutic applications implicated in (but not limited to) various 
pathologies and disorders as indicated below. The potential therapeutic applications for this 
invention include, but are not limited to: protein therapeutic, small molecule drug target, 
antibody target (therapeutic, diagnostic, drug target ing/cytotoxic antibody), diagnostic and/or 
prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue 
regeneration in vivo and in vitro of all tissues and cell types composing (but not limited to) 
those defined here. 

The NOV52 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in intracellular transduction of signals and any relevant 
diseases that may result from dysregulation of signal transduction, and/of other diseases and 
pathologies. 

NOV52 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV52 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV52 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV53 

A disclosed NOV53 nucleic acid of 1233 nucleotides (also referred to as CG56745-01) 
encoding a uracil phosphoribosyltransferase-like protein is shown in Table 53A. An open 
reading frame was identified beginning with a ATG initiation codon at nucleotides 142-144 
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and ending with a TAA codon at nucleotides 1069-1071. The start and stop codons are shown 
in bold in Table 53A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 53A. NOV53 nucleotide sequence (SEQ ID NO:195). 



CTAGGGGT GAAAGGACAGCCAGGGTTAGATGTTCTGAGGAGGCGGGAGCAACCGAGAGAGCACGTGAGCAT C 
TGTCCTTTCTACCCGTTCCTCTTTATCTTTAGTGTTCAGTAGCAGCGGGGATAGCCCGGGGCCCGGTGTA TG 
GCCACGGAGTTACAGTGTCCGGACTCCATGCCCTGTCACAACCAGCAAGTAAACTCTGCCTCAACCCCAAGT 
CCCGAGCAGCTGCGACCTGGCGATCTGATCCTGGACCACGCAGGGGGAAACAGAGCCTCCAGGGCCAAGGTG 
ATTCTCCTCACGGGGTACGCCCATTCTAGCCTGCCGGCCGAGCTGGACTCTGGGGCCTGCGGCGGCTCCAGC 
CTCAACTCAGAGGGCAACAGTGGTAGTGGTGACAGTAGCAGCTATGACGCACCAGCTGGCAACTCCTTCCTA 
GAGGACTGCGAACTCTCCCGGCAGATCGGGGCGCAGCTTAAGCTGCTGCCTATGAATGATCAGATACGGGAG 
CTACAGACCATCATCCGGGACAAGACAGCCAGTAGAGGTGACTTCATGTTTTCTGCGGATCGTTTGATCAGA 
CTTGTTGTGGAAGAGGGATTGAATCAGCTGCCATATAAAGAATGCATGGTGACCACTCCAACAGGGTACAAG 
T ATGAAGG AGTG AAAT TTG AGAAGGGAAATTGTGGGGT C AG C AT AATGAGAAGCGGTGAGGC AAT GG AAC AA 
GGTTTACGAGACTGCTGTCGATCCATACGAATTGGAAAGATCCTGATTCAGAGTGATGAGGAGACACAAAGA 
GCCAAAGTATATTATGCCAAATTCCCCCCAGACATTTACCGGAGAAAAGTCCTTCTGATGTATCCAATTCTC 
AGCACTGGAAATACTGTAATTGAAGCTGTAAAGGTTCTTATAGAACATGGAGTTCAACCCAGTGTTATCATC 
CTACTCAGTCTGTTCTCCACTCCTCATGGTGCCAAATCAATCATTCAGGAGTTTCCAGAGATCACAATTTTA 
ACTACTGAAGTTCATCCTGTTGCACCTACACATTTTGGACAGAAATACTTTGGAACAGACTA AGTTATTTAA 
GTAAAATAA TTGTCTTATGTAATATTACAATCATGTTTTGATTTTCTATTTGTTTT ACTGATTCACTTGAGG 
GTGGCAGAGACAAATGTGTTACAATGCTTTTTAGTTTTGGAAGTGGGTATATTTGAGGTTATATCTCACTTA 

GTTATTTGT 



In a search of public sequence databases, the NOV53 nucleic acid sequence, located on 
5 the X chromosome, has 312 of 544 bases (57%) identical to a gb:GENBANK-ID: 

YSCFURlA|acc: M36485.1 mRNA from Saccharomyces cerevisiae {S.cerevisiae uracil 
phosphoribosy transferase (FUR1) gene, complete cds). 

The disclosed NOV53 polypeptide (SEQ ID NO: 196) encoded by SEQ ID NO: 195 has 
309 amino acid residues and is presented in Table 53B using the one-letter amino acid code. 
10 Signal P, Psort and/or Hydropathy results predict that NOV53 has no signal peptide and is 

likely to be localized to the nucleus with a certainty of 0.3000. Alternatively, NOV53 may also 
localize to the mitochondrial matrix space with a certainty of 0.1000, or to the lysosome 
(lumen) with a certainty of 0.1000. 



Table 53B. Encoded NOV53 protein sequence (SEQ ID NO:196). 

^4ATELQCPDSMPCHNQQVNSASTPSPEQLRPGDLILDHAGGNRASRAKVILLTGYAHSSLPAELDSGACGGS 
SLNSEGNSGSGDSSSYDAPAGNSFLEDCELSRQIGAQLKLLPMNDQIRELQTIIRDKTASRGDFMFSADRLI 
RLWEEGLNQLPYKECMVTTPTGYKYEGVKFEKGNCGVSIMRSGEAMEQGLRDCCRSIRIGKILIQSDEETQ 
RAKVYYAKFPPDIYRRKVLLMYPILSTGNTVIEAVKVLIEHGVQPSVIILLSLFSTPHGAKSIIQEFPEITI 

LTTEVHPVAPTHFGQKYFGTD „ . 

15 

A search of sequence databases reveals that the NOV53 amino acid sequence has 588 
of 597 amino acid residues (98%) identical to, and 138 of 209 amino acid residues (66%) 
identical to, and 165 of 209 amino acid residues (78%) similar to, the 261 amino acid residue 
ptnr : SPTREMBL- ACC :Q9 V RQ 1 protein from Drosophila melanogaster (Fruit fly) (CG5537 
20 Protein) (E = 8.5e -72 ). 
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NOV53 is predicted to be expressed in at least Bone Marrow, Brain, Bladder, Eye, 
Cervix, Kidney, Liver, Lymph node, Prostate, Small Intestine, Umbilical Vein. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, and/or 
5 RACE sources. 

NOV53 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 53C 



Table 53C. BLAST results for NOV53 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 174 8617 9 | ref |XP_ 
060041. l| 
(XM_060041) 


similar to 
Unknown (protein 
for MGC:23937) 
(H. sapiens) 
[Homo sapiens] 


309 


309/309 
(100%) 


309/309 
(100%) 


e-172 


gi | 143 8 84 54 | db j | BAB 
60766. l| (AB063019) 


hypothetical 
protein [Macaca 
fascicularis] 


309 


300/309 
(97%) 


302/309 
(97%) 


e-167 


gi | 138744 65 | db j | BAB 
46861. l| (AB060829) 


hypothetical 
protein [Macaca 
f ascicularis] 


309 


299/309 
(96%) 


302/309 
(96%) 


e-166 


gi | 14388519 | db j | BAB 
60785. l| (AB063065) 


hypothetical 
protein [Macaca 
f ascicularis] 


309 


298/309 
(96%) 


301/309 
(96%) 


e-166 


gi | 8217490 | emb | CAB 9 
2761. l| (AL137013) 


bA311P8 .3 
(probable uracil 
phosphor ibosyltra 
nf erase) [Homo 
sapiens] 


166 


166/166 
(100%) 


166/166 
(100%) 


3e-94 



10 The homology between these and other sequences is shown graphically in the 

Clustal W analysis shown in Table 53D. In the ClustalW alignment of the NOV53 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence {i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 

15 can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 53D. ClustalW Analysis of NOV53 

1) Novel NOV53 (SEQ ID NO: 196) 
20 2) gi | 17486179 | ref |XP_060041 . 1 | (XM_060041) similar to Unknown (protein for 
MGC: 23937) (H . sapiens) [Homo sapiens] (SEQ ID NO: 567) 

3) gi | 14388454 |dbj |BAB60766 . 1 | (AB063019) hypothetical protein [Macaca 
fascicularis] (SEQ ID NO:568) 

4) gi | 13874465 | dbj |BAB46861 . 1 | (AB060829) hypothetical protein [Macaca 
25 fascicularis] (SEQ ID NO: 569) 
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5) gi | 14388519 |dbj | BAB60785. l| (AB063065) hypothetical protein [Macaca 
fascicularis] (SEQ ID NO: 570) 

6) gi| 8217490 |emb|CAB92761.l| (AL137013) DA311P8.3 (probable uracil 
phosphoribosyltranf erase) [Homo sapiens] (SEQ ID NO: 571) 



10 



15 



20 



25 



45 



50 



NOV53 


1 


gi 


17486179 | 


1 


g± 


14388454 j 


1 


gi 


13874465 j 


1 


gi 


14388519| 


1 


gi 


8217490 | 


1 



NOV53 


61 


gi 


17486179 | 


61 


gi 


14388454 j 


61 


gi 


13874465 j 


61 


gi 


14388519 j 


61 


gi 


8217490 | 


1 





NOV53 


121 




gi 1 17486179 | 


121 




gi |14388454 j 


121 




gij 13874465 j 


121 


30 


gi j 14388519 j 


121 




gi| 8217490 | 


1 


35 


NOV53 


181 




gi| 17486179 | 


181 




gi 1 14388454 j 


181 




gi | 13874465 | 


181 




gi j 14388519 j 


181 


40 


gi|8217490| 


38 



NOV53 


241 


gi 


17486179| 


241 


gi 


14388454 j 


241 


gi 


13874465 j 


241 


gi 


14388519] 


241 


gi 


8217490 | 


98 




I 



70 
• I - - 



I 



80 
. I . . 



90 



I 



100 



110 



120 



J_ 



PAELDBGACgGSSLNSES 
PAELDg GACgGSSLNSES 
PAE LdHgAcSg S S LNS EB 



PAE LDgGACffiG S S LNS EB 
P AELDgGACMGS S LNS Eg 
PAE LDSGACfflG S S LNS E 



tfsgsgdsssydapagnsflgdcelsrqigaqlkllpmndqi 
stsgsgdsssydapagnsflSdcelsrqigaqlkllpmndqir 
^sgsgdsssydapagnsflSdcelsrqigaqlkllpmndqir 
stsgsgdsssydapagnsflSdcelsrqigaqlkllpmndqir 

S GS GD S S S YDAP AGNS FlSdC^L S RQ I GffiQLKLLPMNDQIR 



120 
120 
120 
120 
120 
1 



I 



130 
. . I . . 



I 



140 
. . I . . 



150 
. . I . . 



160 
. . I - . 



170 
. . I . . 



180 



I 



ELQTIIRDKTASRGDFMFSADRLIRLVVEEGLNQLPYKECMVTTPTGYKYEGVKFEKGNC[ 
ELQTIIRDKTASRGDFMFSADRLIRLVVEEGLNQLPYKECMVTTPTGYKYEGVKFEKGNC ! 
ELQTI IRDKTASRGDFMFSADRLIRLVVEEGLNQLPYKECMVTTPTGYKYEGVKFEKGNC 
ELQTglRDKTASRGDFMFSADRLIRLVVEEGLNQLPYKECMVTTPTGYKYEGVKFEKGNC 
ELQTIIRDKTASRGDFMFSADRLIRLVVEEGLNQLPYKECMVTTPTGYKYEGVKFEKGNC 

IRLVVEEGLNQLPYKECMVTTPTGYKYEGVKFEKGNC 



180 
180 
180 
180 
180 
37 



I 



190 
. . I . . 



I 



200 
. . I . . 



210 



I 



220 
. . I . . 



230 
. . I . . 



240 
_i 



GVSIMRSGEAMEQGLRDCCRSIRIGKILIQSDEETQRAKVYYAKFPPDIYRRKVLLMYPI 
GVS IMRSGEAMEQGLRDCCRS I RIGKI LI QSDEETQRAKVYYAKFPPDI YRRKVLLMYP I 
GVSIMRSGEAMEQGLRDCCRSIRIGKILIQSDEETQRAKVYYAKFPPDIYRRKVLLMYPI 
GVS IMRSGEAMEQGLRDCCRS I RIGKI LI QSDEETQRAKVYYAKFPPDI YRRKVLLMYP I 
GVS IMRSGEAMEQGLRDCCRS IRIGKI LI QSDEETQRAKVYYAKFPPDI YRRKVLLMYP I 
GVS IMRSGEAMEQGLRDCCRS I RIGKI LI QSDEETQRAKVYYAKFPPDI YRRKVLLMYP I 



240 
240 
240 
240 
240 
97 



I 



250 
. . I . . 



260 
. . I . . 



I 



270 
. . I . . 



I 



280 
. . I . . 



290 
. . I I 



300 
• - I 



LSTGNTVIEAVKVLIEHGVQPSVI 
L S TGNTV I E AVK VL I E HG VQ P S V I 
LSTGNTVIEAVKVLIEHGVQPSVI 
LSTGNTVIEAVKVLIEHGVQPSVI 
LSTGNTVIEAVKVLIEHGVQPSVI 
LSTGNTVIEAVKVLIEHGVQPSVI 



ILLSLFSTPHGAKS 
ILLSLFSTPHGAKS 
ILLSLFSTPHGAKS 
ILLSLFSTPHGAKS 
ILLSLFSTPHGAKS 
ILLSLFSTPHGAKS 



IIQEFPEITILTTEVHPVAPTH 
I I QEFPE ITI LTTEVHPVAPTH 
IIQEFPEITI LTTEVHPVAPTH 
IIQEFPEITILTTEVHPVAPTH 
IIQEFPEITILTTEVHPVAPTH 
IIQEFPEITILTTEVHPVAPTH 



300 
300 
300 
300 
300 
157 



55 



NOV5 3 


301 


FGQKYFGTD 


309 


gi 


17486179 | 


301 


FGQKYFGTD 


309 


gi 


14388454 j 


301 


FGQKYFGTD 


309 


gi 


13874465 j 


301 


FGQKYFGTD 


309 


gi 


14388519 j 


301 


FGQKYFGTD 


309 


gi 


8217490 1 


158 


FGQKYFGTD 


166 



60 



Table 53E lists the domain descriptions from DOMAIN analysis results against 
NOV53. This indicates that the NOV53 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 53E Domain Analysis of NOV53 

gnl | Pfam |pf am00156 , Pribosyltran, Phosphoribosyl transferase domain. 
This family includes a range of diverse phosphoribosyl transferase 
enzymes. This family includes: Adenine phosphoribosyltransf erase 
EC : 2 . 4 . 2 . 7 , Hypoxanthine -guanine -xanthine phosphoribosyltransf erase , 
Hypoxanthine phosphoribosyltransf erase EC : 2 . 4 . 2 . 8 , Ribose-phosphate 
pyrophosphokinase i EC : 2 . 7 . 6 . 1 , Amidophosphoribosyltransf erase 
EC: 2. 4. 2. 14, Orotate phosphoribosyltransf erase EC: 2. 4. 2. 10, Uracil 
phosphoribosyltransf erase EC: 2. 4. 2. 9, Xanthine-guanine 
phosphoribosyltransf erase EC: 2. 4. 2. 22. (SEQ ID NO: 8 52) 
CD-Length - 153 residues, 43.1% aligned 
Score = 35.0 bits (79), Expect = 0.006 



NOV53 : 226 PPDIYRRKVLLMYPILSTGNTVIEAVKVLIEHGVQPSVI ILLSLFSTPHGAKSIIQEFPE 285 

|+ ++| |++ ++ | | |+ | ++| | | + + +| + + + | | 

Sbjct : 87 VGDVGGKRVLIVDDVIDTGGTIRAAAELLKEAGAKWGVAVLVDRPEGGARERLDKGFPI 14 6 



NOV53 : 2 86 ITILTT 291 
+ + + 

Sbjct: 147 PSLIVL 152 

10 The gene of invention is a novel uracil phosphoribosyltransferase (UPRT)-like gene. 

UPRT catalyzes the formation of uridine 5 , -monophosphate in the pyrimidine salvage pathway 
from uracil and 5-phospho-alpha-D-ribose 1 -diphosphate. The Saccharomyces cerevisiae 
FUR1 gene encodes UPRT (Kern et al., Gene 88:149-157(1990)). Mutations in the FUR1 gene 
have been correlated to resistance to 5-fluorouraciI, a common chemotherapeutic agent (Kern 

15 et al., Curr Genet 1991 May;19(5):333-7). 

The novel gene belongs to a family of phosphoribosyl transferases, as evidenced by the 
presence of a characteristic domain. It is anticipated that this gene plays a role in the 
pyrimidine salvage pathway and that it influences the growth or growth restriction of various 
tissues, and especially of tumor cells. 

20 The disclosed NOV53 nucleic acid of the invention encoding a Uracil 

Phosphoribosyltransferase-like protein includes the nucleic acid whose sequence is provided 
in Table 53A or a fragment thereof. The invention also includes a mutant or variant nucleic 
acid any of whose bases may be changed from the corresponding base shown in Table 53 A 
while still encoding a protein that maintains its Uracil Phosphoribosyltransferase -like 

25 activities and physiological functions, or a fragment of such a nucleic acid. The invention 
further includes nucleic acids whose sequences are complementary to those just described, 
including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 

30 include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
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in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 43 percent of the 
bases may be so changed. 
5 The disclosed NOV53 protein of the invention includes the Uracil 

Phosphoribosyltransferase-like protein whose sequence is provided in Table 53B. The 
invention also includes a mutant or variant protein any of whose residues may be changed 
from the corresponding residue shown in Table 53B while still encoding a protein that 
maintains its Uracil Phosphoribosyltransferase-like activities and physiological functions, or a 

10 functional fragment thereof. In the mutant or variant protein, up to about 4 percent of the 
residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 
(Fab>2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Uracil Phosphoribosyltransferase- 

15 like protein (NOV53) is a member of a "Uracil Phosphoribosyltransferase family". Therefore, 
the NOV53 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 

20 targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 

delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 

The NOV53 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, 

25 stroke, tuberous sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, cerebral 
palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, 
leukodystrophies, behavioral disorders, addiction, anxiety, pain, neurodegeneration, 
hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, autoimmune disease, 
allergies, immunodeficiencies, transplantation, graft versus host disease, fertility disorders, 

30 anemia, bleeding disorders, scleroderma, cystitis, incontinence, diabetes, renal artery stenosis, 
interstitial nephritis, glomerulonephritis, polycystic kidney disease, systemic lupus 
erythematosus, renal tubular acidosis, IgA nephropathy, hypercalceimia, cirrhosis, 
inflammatory bowel disease, diverticular disease, lymphedema, cancer, trauma, tissue 
degeneration, bacterial/viral/parasitic infections, and/or other diseases and pathologies. 
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NOV53 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV53 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV53 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV54 

NOV54 includes two novel protein phosphatase 2C -like proteins disclosed below. 
The disclosed sequences have been named NOV54a and NOV54b. 
NOV54a 

A disclosed NOV54a nucleic acid of 2185 nucleotides (also referred to as CG56773- 
01) encoding a protein phosphatase 2C-like protein is shown in Table 54A. An open reading 
frame was identified beginning with a ATG initiation codon at nucleotides 1-3 and ending 
with a TGA codon at nucleotides 1402-1404. The start and stop codons are shown in bold in 
Table 54A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 54A. NOV54a nucleotide sequence (SEQ ID NO: 197). 

ATGTCCGCCGGCTGGTTCCGGCGCCGCTTCCTGCCTGGGGAGCCGCTCCCCGCGCCGCGGCCGCCTGGGCCG 
CATGCCAGCCCCGTGCCCTACCGACGGCCCCGCTTCCTTCGCGGCTCCAGCTCCAGCCCCGGGGCGGCCGAC 
GCCTCGCGCCGCCCAGACTCCCGGCCCGTGCGCAGCCCCGCACGAGGACGCACGCTACCCTGGAATGCAGGC 
T ACGCCGAG AT TAT C AATG C AG AGAAATCTGAATT CAATGAGG ATC AAG C CGC CTGTGGGAAGCTGT GCATC 
CGGAGATGTGAGTTTGGGGCTGAAGAAGAGTGGCTGACCCTGTGCCCAGAGGAGTTCCTGACAGGCCATTAC 
TGGGCACTGTTCGATGGGCACGGCGGTCCTGCAGCAGCCATCTTGGCTGCCAACACCCTGCACTCCTGCTTG 
CGCCGGCAGCTGGAGGCCGTGGTGGAAGGCTTGGTGGCCACTCAGCCCCCCATGCACCTCAATGGCCGCTGC 
ATCTGCCCCAGTGACCCTCAGTTTGTGGAGGAAAAGGGCATCAGGGCAGAAGACTTGGTGATCGGGGCATTG 
GAGAGTGCCTTTCAGGAATGTGATGAGGTGATCGGGCGGGAGCTGGAGGCCTCAGGCCAGATGGGCGGCTGC 
ACAGCCCTGGTGGCTGTGTCCCTGCAGGGAAAGCTGTACATGGCCAATGCTGGGGATAGCAGGGCCATCTTG 
GTGCGGAGAGATGAGATACGGCCACTGAGCTTCGAGTTCACCCCAGAGACTGAGCGGCAGCGGATCCAGCAG 
CTGGCCTTTGTCTATCCTGAGCTTCTGGCTGGTGAGTTCACCCGACTGGAGTTCCCTCGGCGGCTGAAGGGG 

TCGGATCTCAAGTACCCACTGATCCATGGACAGGGTAGGCAGGCTCGGTTACTAGGAACACTGGCTGTCTCC 
CGGGGCCTGGGAGACCATCAGCTCAGAGTCCTGGACACAAACATCCAGCTCAAGCCCTTCTTGCTCTCTGTG 
CCACAGGTGACTGTGCTGGATGTGGACCAGCTGGAGCTACAGGAGGATGATGTGGTTGTCATGGCAACTGAT 
GGACTCTGGGATGTACTGTCCAACGAGCAGGTGGCATGGCTGGTGCGGAGCTTCCTCCCTGGGAACCAAGAG 
GACCCACACAGCTATCTGCAGGATGGTCTTCACAGGTTCTCAAAGCTGGCCCAGATGCTGATACACAGCACA 
CAGGGAAAGGAAGACAGTCTCACAGAGGAAGGGCAGGTGTCCTACGATGACGTCTCTGTGTTCGTGATTCCC 
TTGCACAGTCAGGGCCAAGAGAGCAGTGACCACTQ AGGATTCAGACACTGTATCCCAGAACTGCTCTAGTGC 
CCGGGTGTGGTCTGGGCATCCCTCCAGTGTGACCAAGAGCAAATCCTGCCTGCCCTATCCCTAGCCACCGCC 
CAGTGCTCTCACTATCCACCTCAACACACATCCATCTCAAGAGGAACATTTATACCAGGCAGTCAGAGCTGG 
AAGTGTATGGAGAGCCCAGCCCACCAGGTCCTGCCTTTTGCGGTGATAACCTTCTCTGGCAGAGTGACTTTA 
CAACTTAACTAGGAAACCCATGTGAGGCTCCTCAGACAGGATCTTGAACAGCCCAAAGTATCATTCTCAGAT 
AGGGGCACCCAAGCTAAGGGTATTAGCCAAAGATGCCAGGATGGGTAGCTAGCCCATGTTTAGATCCAGGTC 
T C C AAT T C ATGGTT ATC AGGG C ATGTGTTC AAC AACC C C C AAAGT C C ACGC AGGTGGCT TGT AGAAAC CT TT 
GGGCAGCCTCATGTCTGCTAAAACAGCCATCTTCAAGACAGCCCCTGAAAAGAGACCAGTTCAGGTCCTGCC 
CTGCTGTTCTTTGCTGGAGATGAGGAACAGGTGCTGGGGCTAAAGTTTGGGGTAGAGCACAAGGGACAAGAG 
GAACTCTTGGAGTTGGCTGGGTGAGAGGGCTCTCCATTTGCTACCTGTAGTAGCCTGCCTCTTAACTGGTTG 
CTTCTCCCTAGTTCCAGCCCTGCCCTGGTCTGATGCCCCAACACTGCCCTTGCTTTGTTTTCCCTGTCACCT 
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C C CT ATTATT AAATGTT TT CT ACAG 



In a search of public sequence databases, the NOV54a nucleic acid sequence, located 
on the p21.1 region of chromosome 3, has 592 of 928 bases (63%) identical to a 
gb:GENBANK-ID:AK023315|acc:AK023315.1 mRNA from Homo sapiens (cDNA 
5 FLJ13253 fis, clone OVARC1 000751) (E = 4.5e~ 41 ). 

A disclosed NOV54a polypeptide (SEQ ID NO: 198) encoded by SEQ ID NO: 197 has 
467 amino acid residues and is presented in Table 54B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV54a has no signal peptide and is 
likely to be localized to the microbody (peroxisome) with a certainty of 0.3941 . Alternatively, 
10 NOV54a may also localize to the nucleus with a certainty of 0.3000, to the lysosome (lumen) 
with a certainty of 0.1 558, or to the mitochondrial matrix space with a certainty of 0.1000. 



Table 54B. Encoded NOV54a protein sequence (SEQ ID NO: 198). 

MSAGWFRRRFLPGEPLPAPRPPGPHASPVPYRRPRFLRGSSSSPGAADASRRPDSRPVRSPARGRTLPWNAG 
YAEIINAEKSEFNEDQAACGKLCIRRCEFGAEEEWLTLCPEEFLTGHYWALFDGHGGPAAAILAANTLHSCL 
RRQLEAWEGLVATQPPMHLNGRCICPSDPQFVEEKGIRAEDLVIGALESAFQECDEVIGRELEASGQMGGC 
TALVAVSLQGKLYMANAGDSRAILVRRDEIRPLSFEFTPETERQRIQQLAFVYPELLAGEFTRLEFPRRLKG 
DDLGQKVLFRDHHMSGWSYKRVEKSDLKYPLIHGQGRQARLLGTLAVSRGLGDHQLRVLDTNIQLKPFLLSV 
PQWVLDVDQLELQEDDVVVMATDGLWDVLSNEQVAWLVRSFLPGNQEDPHSYLQDGLHRFSKLAQMLIHST 
QGKEDSLTEEGQVSYDDVSVFVIPLHSQGQESSDH 

A search of sequence databases reveals that the NOV54a amino acid sequence has 32 
15 of 77 amino acid residues (41%) identical to, and 48 of 77 amino acid residues (62%) similar 
to, the 413 amino acid residue ptnr:SPTREMBL-ACC:Q9M3Vl protein from Fagus sylvatica 
(Beechnut) (Protein Phpsphatase 2C (PP2C) (EC 3.1.3.16)) (E = 9.2e~ 16 ). 

NOV54a is predicted to be expressed in at least bone marrow, lymphoid tissue, tonsils, 
brain, colon, uterus, endometrium, placenta, mammary gland/breast, prostate, testis, foreskin, 
20 heart, kidney, lung, spleen, peripheral blood, pituitary gland, retina, and pooled germ cell 

tumors. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
sources, Literature sources, and/or RACE sources. 

In addition, the sequence is predicted to be expressed in ovarian carcinoma because of 
25 the expression pattern of (GENBANK-ID: gb:GENBANK-ID:AK023315|acc:AK023315.1) a 
closely related Homo sapiens cDNA FLJ 13253 fis, clone OVARC 1000751 homolog. 
NOV54b 

A disclosed NOV54b nucleic acid of 1930 nucleotides (also referred to as CG56773- ' 
02) encoding a protein phosphatase 2C-like protein is shown in Table 54C. An open reading 
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frame was identified beginning with a ATG initiation codon at nucleotides 1-3 and ending 
with a TGA codon at nucleotides 1 147-1 149. The start and stop codons are shown in bold in 
Table 54C, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 54C. NOV54b nucleotide sequence (SEQ ID NO: 199). 

ATGTCCGCCGGCTGGTTCCGGCGCCGCTTCCTGCCTGGGGAGCCGCTCCCCGCGCCGCGGCCGCCTGGGCCG 
CATGCCAGCCCCGTGCCCTACCGACGGCCCCGCTTCCTTCGCGGCTCCAGCTCCAGCCCCGGGGCGGCCGAC 
GCCTCGCGCCGCCCAGACTCCCGGCCCGTGCGCAGCCCCGCACGAGGACGCACGCTACCCTGGAATGCAGGC 
TACGCCGAGATTATCAATGCAGAGAAATCTGAATTCAATGAGGATCAAGCCGCCTGTGGGAAGCTGTGCATC 
CGGAGATGTGAGTTTGGGGCTGAAGAAGAGTGGCTGACCCTGTGCCCAGAGGAGGATGAGGTGATCGGGCGG 
GAGCTGGAGGCCTCAGGCCAGATGGGCGGCTGCACAGCCCTGGTGGCTGTGTCCCTGCAGGGAAAGCTGTAC 
ATGGCCAATGCTGGGGATAGCAGGGCCATCTTGGTGCGGAGAGATGAGATACGGCCACTGAGCTTCGAGTTC 
ACCCCAGAGACTGAGCGGCAGCGGATCCAGCAGCTGGCCTTTGTCTATCCTGAGCTTCTGGCTGGTGAGTTC 
ACCCGACTGGAGTTCCCTCGGCGGCTGAAGGGGGATGACTTGGGACAGAAGGTTTTGTTCAGGGATCACCAC 
ATGAGTGGCTGGAGCTACAAACGTGTGGAGAAATCGGATCTCAAGTACCCACTGATCCATGGACAGGGTAGG 
CAGGCTCGGTTACTAGGAACACTGGCTGTCTCCCGGGGCCTGGGAGACCATCAGCTCAGAGTCCTGGACACA 
AACATCCAGCTCAAGCCCTTCTTGCTCTCTGTGCCACAGGTGACTGTGCTGGATGTGGACCAGCTGGAGCTA 
CAGGAGGATGATGTGGTTGTCATGGCAACTGATGGACTCTGGGATGTACTGTCCAACGAGCAGGTGGCATGG 
CTGGTGCGGAGCTTCCTCCCTGGGAACCAAGAGGACCCACACAGCTATCTGCAGGATGGTCTTCACAGGTTC 
TCAAAGCTGGCCCAGATGCTGATACACAGCACACAGGGAAAGGAAGACAGTCTCACAGAGGAAGGGCAGGTG 
TCCTACGATGACGTCTCTGTGTTCGTGATTCCCTTGCACAGTCAGGGCCAAGAGAGCAGTGACCACTGAGGA 
TTCAGACACTGTATCCCAGAACTGCTCTAGTGCCCGGGTGTGGTCTGGGCATCCCTCCAGTGTGACCAAGAG 
CAAATCCTGCCTGCCCTATCCCTAGCCACCGCCCAGTGCTCTCACTATCCACCTCAACACACATCCATCTCA 
AGAGGAACATTTATACCAGGCAGTCAGAGCTGGAAGTGTATGGAGAGCCCAGCCCACCAGGTCCTGCCTTTT 
GCGGTGATAACCTTCTCTGGCAGAGTGACTTTACAACTTAACTAGGAAACCCATGTGAGGCTCCTCAGACAG 
GAT C T TGAAC AGCC C AAAGT AT C ATTCT CAGAT AGGGG C AC C C AAG CT AAGGGT ATT AG C CAAAGATG CC AG 
GATGGGTAGCTAGCCCATGTTTAGATCCAGGTCTCCAATTCATGGTTATCAGGGCATGTGTTCAACAACCCC 
CAAAGTCCACGCAGGTGGCTTGTAGAAACCTTTGGGCAGCCTCATGTCTGCTAAAACAGCCATCTTCAAGAC 
AGCCCCTGAAAAGAGACCAGTTCAGGTCCTGCCCTGCTGTTCTTTGCTGGAGATGAGGAACAGGTGCTGGGG 
CTAAAGTTTGGGGTAGAGCACAAGGGACAAGAGGAACTCTTGGAGTTGGCTGGGTGAGAGGGCTCTCCATTT 
GCTACCTGTAGTAGCCTGCCTCTTAACTGGTTGCTTCTCCCTAGTTCCAGCCCTGCCCTGGTCTGATGCCCC 
AACACTGCCCTTGCTTTGTTTTCCCTGTCACCTCCCTATTATTAAATGTTTTCTACAG 



In a search of public sequence databases, the NOV54b nucleic acid sequence, located 
on chromosome 3, has 446 of 660 bases (67%) identical to a gb:GENBANK- 
ID:BC01 1803|acc:BC01 1803.1 mRNA from Homo sapiens {Homo sapiens, Similar to R1KEN 
cDNA 2310008J22 gene, clone MGC: 19531 IMAGE:4336762, mRNA, complete cds) (E = 
2.0e 56 ). 

The disclosed NOV54b polypeptide (SEQ ID NO:200) encoded by SEQ ID NO:199 
has 382 amino acid residues and is presented in Table 54D using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV54b has no signal peptide 
and is likely to be localized to the microbody (peroxisome) with a certainty of 0.4037. 
Alternatively, NOV54b may also localize to the nucleus with a certainty of 0.3000, to the 
lysosome (lumen) with a certainty of 0.1 000, or to the mitochondrial matrix space with a 
certainty of 0.1000. 



Table 54D. Encoded NOV54b protein sequence (SEQ ID NO:200). 

MSAGWFRRRFLPGEPLPAPRPPGPHASPVPYRRPRFLRGSSSSPGAADASRRPDSRPVRSPARGRTLPWNAG 



465 



YAEIINAEKSEFNEDQAACGKLCIRRCEFGAEEEWLTLCPEEDEVIGRELEASGQMGGCTALVAVSLQGKLY 
MANAGDSRAI LVRRDE IRPLS FE FTPETERQR I QQLAFVYPE LLAGE FT R L E F P RRL KGDDLGQ KVL FRDHH 
MSGWSYKRVEKSDLKYPLIHGQGRQARLLGTLAVSRGLGDHQLRVLDTNIQLKPFLLSVPQVTVLDVDQLEL 
QEDDWVMATDGLWDVLSNEQVAWLVRSFLPGNQEDPHSYLQDGLHRFSKLAQMLIHSTQGKEDSLTEEGQV 
SYDDVSVFVI PLHSQGQESSDH 

A search of sequence databases reveals that the NOV54b amino acid sequence has 231 
of 270 amino acid residues (85%) identical to, and 244 of 270 amino acid residues (90%) 
similar to, the 453 amino acid residue ptnr:SPTREMBL-ACC:Q9CSD6 protein from Mus 
5 musculus (Mouse) (2810423O19RIK Protein) (E = 1 .6e' 167 ). 

NOV54b is predicted to be expressed in at least the following tissues: bone marrow, 
lymphoid tissue, tonsils, brain, colon, uterus, endometrium, placenta, mammary gland/breast, 
prostate, testis, foreskin, heart, kidney, lung, spleen, peripheral blood, pituitary gland, retina, 
and pooled germ cell tumors. 
10 NOV54 also has homology to the amino acid sequences shown in the BLASTP data 

listed in Table 54E 



Table 54E. BLAST results for NOV54 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 128 503 32 | db j | BAB 
28679. l| (AK013149) 


Protein 

phosphatase 2C 
containing 
protein-datasourc 
e : Pf am, source 
key :PF004 81 / 
evidence : ISS~puta 
tive [Mus 
musculus] 


453 


400/459 
(87%) 


423/459 
(92%) 


0.0 


gi | 165 52416 | dbj | BAB 
71302. 1| (AK056894) 


unnamed protein 
product [Homo 
sapiens] 


270 


251/252 
(99%) 


251/252 
(99%) 


e-143 


gi | 174 623 96 | ref | XP_ 
059571. l| 
(XM_059571) , 


similar to 
putative (H. 
sapiens) [Homo 
sapiens] 


247 


247/255 
(96%) 


247/255 
(96%) 


e-137 


gi | 128 563 86 | dbj | BAB 
30649. 1 | (AK017245) 


Protein 

phosphatase 2C 
containing 
protein-datasourc 
erPfam, source 
key:PF00481, 
evidence : ISS-puta 
tive [Mus 
musculus] 


254 


222/261 
(85%) 


234/261 
(89%) 


e-121 


gi | 17455719 | ref |XP_ 
051093 .2 | 
(XM 051093) 


KIAA1157 protein 
[ Homo sapi en s ] 


514 


213/480 
(44%) 


288/480 
(59%) 


6e-98 



The homology between these and other sequences is shown graphically in the 
15 ClustalW analysis shown in Table 54F. In the ClustalW alignment of the NOV54 protein, as 
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well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 54F. ClustalW Analysis of NOV54 

1) Novel NOV54a (SEQ ID NO: 19 8) 

2) Novel NOV54b (SEQ ID NO: 20 0) 

3) gi | 12850332 |dbj | BAB28679 . 1 | (AK013149) Protein phosphatase 2C containing 
protein-datasource :Pf am, source key:PF00481, evidence : ISS-putative [Mus muscuius] 

(SEQ ID NO: 572) 

4) gi | 16 552416 | dbj | BAB713 02 . 1 | (AK056894) unnamed protein product [Homo sapiens] 
(SEQ ID NO: 57 3) 

5) gi | 17462396 | ref |XP_059571 . 1 | <XM_05 957 1 ) similar to putative (H. sapiens) [Homo 
sapiens] (SEQ ID NO: 5 74) 

6) gi | 12856386 |dbj | BAB30649 . 1 | (AK017245) Protein phosphatase 2C containing 
protein-datasource : Pf am, source key:PF00481, evidence : ISS -putative [Mus muscul us] 

(SEQ ID NO: 575) 

7) gi | 17455719 | ref |XP_051093 .2 | (XM_051093) KIAA1157 protein [Homo sapiens] (SEQ ID 
NO: 57 6) 



NOV54a 
NOV54b 
gi|l2850332| 
gi 1 16552416 | 
gi 1 17462396 | 
gi j 12856386 | 
gi | 17455719 j 



10 20 30 40 50 60 

MSAGWFRRRFLPGEPLPAPRPPGPHASPV PYRRPRFLRGSSSSPGAADAS 5 0 

MSAGWFRRRFLPGEPLPAPRPPGPHASPV PYRRPRFLRGSSSSPGAADAS 50 

VLPGGPLPEPRPAGPRSSPV P YHRPRFLRGSGS S PGATDAS 41 



MLTRVKSAVANFMGGIMAGSSGSEHGGGSCGGSDLPLRFPYGRPEFLG- 



-LSQDEVECS 57 



NOV54a 

NOV54b 

gi|l2850332 | 

gi|l6552416| 

gi|l7462396| 

gi|l2856386| 

gi 1 17455719 | 



70 80 90 100 110 120 

....|....|....|....|....|....|....|....|....|....|....|....| 

51 RRPDSRPVRSPARGRTLPWNAGYAEIINAEKSEFNEDQAACGKLCIR 97 

51 RRPDSRPVRSPARGRTLPWNAGYAEIINAEKSEFNEDQAACGKLCIR 97 

42 RRPDARPVRSPARGRTLPWNAGYAEVINAEKSEFNEDQAACGKLCIR 88 

1 ! 

! ± 

1 x 

58 ADH I AR PILILKET RRL P W ATGYAE V I NAGKS THNEDQ AS CE VLT VK K KAG AVT S T PNRN 117 



NOV54a 
NOV54b 

gi 
gi 
gi 
gi 
gi 



12850332 | 
16552416 | 
17462396 j 
12856386 j 
17455719 



130 140 150 160 170 180 

....|....|....|....|....|....|....|....|....|....|....|....| 

97 RCEFG- - AEEEWLTLCPEEFLTGHYWALFDGHGGPAAAILAANTLHSCLRRQLEAV 151 

97 RCEFG --AEEEWLTLCPEE 114 

88 RCEFGIEEHQEWLTVCPEEFLTGHYWALFDGHGGPAAAILAANTLHSCLRRQLEAV 144 

x x 

! x 

1 ± 

118 SSKRRSSLPNGEGLQLKENSESEGVSCHYWSLFDGHAGSGAAWASRLLQHHITEQLQDI 177 



NOV54a 
NOV54b 
gi|l2850332| 
gi | 16552416 j 
gi|l7462396| 
gi j 12856386 | 



I 



190 



152 VEGLV- -ATQPPMHL- 
114 



200 



210 
.|....|.. 
-NGRCIC-- 



220 



230 240 

•I I I I 

-PSDP- -QFVEEKGIRA 184 



14 5 VEGMI - -APQPPMHL- 

! MHL- 

1 

1 



-SGRCVC- 
-NGRCIC- 



H4 

-PSDP- -QFVEEKGIQA 177 

-PSDP- -QFVEEKGIRA 2 3 

! 

1 
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gi | 17455719 | 178 VDILKNSAVLPPTCLGEEPENTPANSRTLTRAASLRGGVGAPGSPSTPPTRFFTEKKIPH 237 



NOV54a 
NOV54b 
gi|l2850332| 
gi | 16552416 | 
gi | 17462396 | 
gi|l2856386 | 
gi 1 17455719 j 



250 

... | .... | . 
185 EDLVI GALES AFQEC 
114 

17 8 EDLVI GALENAFQEC 
24 EDLVI GALES AFQE< 
1 
1 



238 ECL VI GALES AFKEMj3LQ0E| 




NOV54a 


245 


NOV54b 


160 


gi | 12850332 | 


238 


gi j 16552416 j 


84 


gi|l7462396 j 


33 


gi j 12856386 j 


39 


gi j 17455719 j 


298 



NOV54a 


305 


NOV54b 


220 


gi | 12850332 | 


298 


gi| 16552416 j 


144 


gi | 17462396 | 


93 


gi|l2856386 j 


99 


gij 17455719 | 


358 



NOV54a 


365 


NOV54b 


280 


gi|!2850332 


358 


gij 16552416 


204 


gi j 17462396 


153 


gi j 12856386 


159 


gi | 17455719 


418 


NOV54a 


425 


NOV54b 


340 


gi 


12850332 


411 


gi 


16552416 


252 


gi 


17462396 


205 


gi 


12856386 


212 


gi 


17455719 


471 



310 



320 



330 



340 



350 



360 





490 500 

....|....|_ 
EDSL 
EDSL 

dJga 

PAWAWVG 

LKDgGWRlgNgRLGSG® 



520 



T a EGQVSYDDVSVFVI PLHSQgQE |SgH 
T a EGQVSYDDVSVFVI PLHSQgQE |SgH 
TS EGQVSYDDVSVFVI PLHSQSQeBsBh 




|IHGNJCLS 



4 67 
382 
453 
270 
247 
254 
514 



Tables 54G-H list the domain descriptions from DOMAIN analysis results against 
NOV54. This indicates that the NOV54 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 54G Domain Analysis of NOV54 

gnl | Smart | smart 00332 , PP2Cc, Serine/threonine phosphatases, family 2C, 
catalytic domain; The protein architecture and deduced catalytic 
mechanism of PP2C phosphatases are similar to the PPl, PP2A, PP2B 
family of protein Ser/Thr phosphatases, with which PP2C shares no 
sequence similarity. (SEQ ID NO: 853) 
CD-Length = 260 residues, 73.5% aligned 
Score = 114 bits (286) , Expect = 9e-27 



10 



15 



20 



N0V54 : 
Sbjct : 
NOV54 : 
Sbjct : 
N0V54 : 
Sbjct : 
NOV54 : 
Sbjct: 
NOV54 : 
Sbj Ct : 



118 



40 



178 



76 



237 



13 0 



297 



155 



GHYWALFDGHGGPAAAILAANTLHSCLRRQLEAWEGLVATQPPMHLNGRCICPSDPQFV 
I ++ +1 I I I I I II +1 I +1 

GG F FG VFDGHGG S E AAKFL S KNL P E I L AE ELI KD KD 

EEKGIRAEDLVIGALESAFQECDEVIGRELEASG-QMGGCTALVAVSLQGKLYMANAGDS 

+ + I II II II I |||+ I I 11+11+ lll+ll III 
EDEDVEDALRKAFLRTDEEILEELESLEDQRSGTTAWALIRGNKLYVANVGDS 

RAILVRRDEIRPLSFEFTPETERQRIQQLAFVYPELLAGEFTRLEFPRRLKGDDLGQKVL 
11+11+ 1+ + I I +1 

RAVLCRNGKAVQLTEDHKPSNEDER 



75 



236 



129 



296 



154 



356 



FRDHHMSGWSYKRVEKSDLKYPLIHGQGRQARLLGTLAVSRGLGDHQLRVLDTNIQLKPF 

+ 1+ ++ I 1+ I I l + l I I I I I 11 + 

ERI RE A GG F VSNGR VNG VL AL S RALGD F FL KPY 187 

403 



3 57 LLSVPQVT VLDVDQLELQEDDVVVMATDGIMDVIj SNEQVAWL VRS FL 

+++ I III+++ + + +I+IIIIIIIII++I +11 I 

18 8 VIAEPDVTWELTEKDDFLI LAS DG L WD VL SNQ E WD I VRKHL 230 



Table 54H Domain Analysis of NOV54a 

gnl | Pf am | pf am004 81 , PP2C, Protein phosphatase 2C. Protein phosphatase 
2C is a Mn++ or Mg++ dependent protein S/ threonine phosphatase. (SEQ 
ID NO: 854) 

CD-Length = 252 residues, 77.0% aligned 
Score = 93.2 bits [230) , Expect = 3e-20 



25 



30 



35 



40 



N0V54 : 


119 


Sbjct: 


35 


N0V54 : 


179 


Sbjct : 


70 


N0V54 : 


238 


Sbjct : 


123 


NOV54 : 


295 


Sbjct : 


155 


NOV54 : 


355 


Sbjct : 


184 



HYWALFDGHGGPAAAILAANTLHSCLRRQLEAWEGLVATQPPMHLNGRCICPSDPQFVE 17 8 

++I+IIIIII II I I + I + 11+ 

GFFAVFDGHGGSQAAKYAGKHLETKLALR KSFPEL- - 69 

EKGIRAEDLVIGALESAFQEC-DEVIGRELEASGQMGGCTALVAVSLQGKLYMANAGDSR 23 7 

I + 11+ +1 I II + + 111 + 11+ lll + ll I I II 

DDL ENALKE SFLESTDEELRS S AANTD LD S G S T AWAL I RGNKL Y VANVGD S R 122 

AILVRRDE-IRPLSFEFTP- - ETERQRIQQLAFVYPELLAGEFTRLEFPRRLKGDDLGQK 2 94 
l+l I 1+ 1+ + I I 11+11+ 

AVLCRNGNAI KQLTEDHKPSNEDERRRI EGAG 154 

VLFRDHHMSGWSYKRVEKSDLKYPLIHGQGRQARLLGTLAVSRGLGDHQLRVLDTNIQLK 3 54 

+ II + I Mill II +1+ 
GFVSRNGR VNGVLAVSRAFGDFELK PGVL 183 



PFLLSVPQVTVLDVDQLELOEDDlOa'MATDGLWDVLSNEQVAWLVRSFL 

+ + I II 1+ +++I+IIIIIIII+++I +111 I 

QPVTAEPDVT SHKITPSDEFLILASDGLWDVLSDQEWDIVRSEL 



403 



228 



Protein phosphorylation plays a key role in the regulation of cellular functions through 
the activation or inhibition of enzymes involved in various biochemical pathways. Kinases and 



469 




phosphatases that determine the phosphorylation state of an enzyme (and its activity) are 
frequently regulated through the action of hormones and growth factors (1). Four distinct 
subfamilies of serine/threonine protein phosphatases have been identified in mammals: PP1, 
PP2A, PP2B and PP2C (2). The PP2C subfamily contains structurally diverse protein 
5 phosphatases with a wide range of functions in cellular signal transduction; however, the exact 
physiological role of most PP2C enzymes is still unclear. 

The protein described in this invention contains protein phosphatase 2C domains and is 
therefore likely to play a role in signal transduction and cellular proliferation. The protein is 
also homologous, but not identical, to the rat petrin protein, a PP2C subfamily member that 

10 has been shown to modulate neurite growth inhibition and may therefore be useful in the 
treatment of nerve damage resulting from traumatic injury, stroke or CNS degenerative 
disorders (3). The PP2C-like gene described in this invention is also expressed in the brain and 
may therefore have similar functions in the CNS. However, it is also expressed in a number of 
other tissues and based on its expression pattern may contribute to additional human diseases, 

15 such as cancer, inflammation/autoimmune diseases, and metabolic disorders. The PP2C-like 
gene maps to human chromosome 3p21 .1. 

Protein phosphatase 2C domain is found in protein phosphatase 2C, as well as other 
proteins e.g. adeylate cyclase. Protein phosphatase 2C (PP2C) is one of the four major classes 
of mammalian serine/threonine specific protein phosphatases. PP2C is a monomeric enzyme 

20 of about 42 Kd that shows broad substrate specificity and is dependent on divalent cations 

(mainly manganese and magnesium) for its activity. Its exact physiological role is still unclear. 
Three isozymes are currently known in mammals: PP2C-alpha, -beta and -gamma. In yeast, 
there are at least four PP2C homologs: phosphatase PTC1 that has weak tyrosine phosphatase 
activity in addition to its activity on serines, phosphatases PTC2 and PTC3, and hypothetical 

25 protein YBR125c. Isozymes of PP2C are also known from Arabidopsis thaliana (ABI1, 
PPH1), Caenorhabditis elegans (FEM-2, F42G9.1, T23F1 1.1), Leishmania chagasi and 
Paramecium tetraurelia. In Arabidopsis thaliana, the kinase associated protein phosphatase 
(KAPP) is an enzyme that dephosphorylates the Ser/Thr receptor-like kinase RLK5 and which 
contains a C-terminal PP2C domain. 

30 PP2C does not seem to be evolutionary related to the main family of serine/ threonine 

phosphatases: PP1, PP2A and PP2B. However, it is significantly similar to the catalytic 
subunit of pyruvate dehydrogenase phosphatase (PDPC), which catalyzes dephosphorylation 
and concomitant reactivation of the alpha subunit of the El component of the pyruvate 
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dehydrogenase complex. PDPC is a mitochondrial enzyme and, like PP2C, is magnesium- 
dependent. 

The disclosed NOV54 nucleic acid of the invention encoding a Protein phosphatase 
2C-like protein includes the nucleic acid whose sequence is provided in Table 54A, 54C or a 
5 fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 54A or 54C while still 
encoding a protein that maintains its Protein phosphatase 2C -like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 

10 that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 

15 stability of the modified nucleic acid, such that they may be used, for example, as antisense 

binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 37 percent of the bases may be so changed. 

The disclosed NOV54 protein of the invention includes the Protein phosphatase 2C- 
like protein whose sequence is provided in Table 54B or 54D. The invention also includes a 

20 mutant or variant protein any of whose residues may be changed from the corresponding 

residue shown in Table 54B or 54D while still encoding a protein that maintains its Protein 
phosphatase 2C-like activities and physiological functions, or a functional fragment thereof. 
In the mutant or variant protein, up to about 56 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F a b or 

25 (F a b)2, that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Protein phosphatase 2C-like protein 
(NOV54) is a member of a "Protein phosphatase 2C family". Therefore, the NOV54 nucleic 
acids and proteins identified here may be useful in potential therapeutic applications 
implicated in (but not limited to) various pathologies and disorders as indicated below. The 

30 potential therapeutic applications for this invention include, but are not limited to: protein 
therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 
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The NOV54 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated cancer, trauma, bacterial and viral infections, in vitro and 
in vivo regeneration, fertility, endometriosis, hypogonadism, cardiomyopathy, atherosclerosis, 
hypertension, congenital heart defects, aortic stenosis, atrial septal defect (ASD), 
5 atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis, subaortic stenosis, 
ventricular septal defect (VSD), valve diseases, tuberous sclerosis, scleroderma, obesity, 
transplantation, hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, 
autoimmune disease, allergies, immunodeficiencies, transplantation, graft versus host disease 
(GVHD), lymphaedema, anemia, Alzheimer's disease, stroke, hypercalceimia, Parkinson's 

10 disease, Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple 
sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, anxiety, 
pain, neurodegeneration, systemic lupus erythematosus, asthma, emphysema, allergy, ARDS, 
diabetes, renal artery stenosis, interstitial nephritis, glomerulonephritis, polycystic kidney 
disease, renal tubular acidosis, IgA nephropathy, hypercalceimia, Von Hippel-Lindau (VHL) 

15 syndrome, endocrine dysfunctions, growth and reproductive disorders, tonsillitis, 

Hirschsprung's disease, Crohn's Disease, appendicitis, and/or other diseases and pathologies. 

NOV54 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV54 substances for use in 
therapeutic or diagnostic methods. These antibodies may be generated according to methods 

20 known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 

NOVX Antibodies" section below. The disclosed NOV54 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
understanding of pathology of the disease and development of new drug targets for various 

25 disorders. 

NOV55 

A disclosed NOV55 nucleic acid of 1500 nucleotides (also referred to as CG56806-01) 
encoding a Heparan Sulfate 6-Sulfotransferase 3-like protein is shown in Table 55A. An open 
reading frame was identified beginning with a ATG initiation codon at nucleotides 74-76 and 
30 ending with a TGA codon at nucleotides 1490-1492. The start and stop codons are shown in 
bold in Table 55 A, and the 5' and 3' untranslated regions, if any, are underlined. 

Table 55A. NOV55 nucleotide sequence (SEQ ID NO:201). 

~CTTCCGAGCGGGCGCCCGTCCGCCCTGCCGCCGCCGCCGCCGCCGCTTCGCCTGCCGGCCTGAGAGCGGGAC~ 
CATGGATGAAAGGTTCAACAAGTGGCTGCTGACGCCGGTGCTCACTCTCCTCTTCGTGGTCATCATGTACCA 
GTACGTGTCCCCCTCCTGCACCAGCTCCTGCACCAACTTCGGGGAGCAGCCCCGCGCGGGGGAGGCCGGCCC 
G C C CGC CGT C C CGGGTC CCG CC CGC CGGG CTC AGG CGC CGC CGG AGG AGTGGGAGC AG AGG AGG C C C C AGTT 
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GCCCCCGCCGCCCCGGGGGCCCCCCGAGGGACCTCGGGGGGCCGCGGCGCCGGAGGAGGAGGACGAGGAGCC 
CGGAGACCCCCGGGAGGGGGAGGAAGAGGAGGAGGAAGACGAGCCGGACCCCGAGGCCCCGGAAAACGGCTC 
CCTGCCCCGATTCGTGCCGCGCTTCAACTTCAGCCTGAAGGACCTGACCCGCTTCGTGGATTTCAACATCAA 
AGGGCG CGACGTGAT CGTGT T C C T C C AC ATC C AG AAG ACGGGGGGC AC C ACTT T CGGC CGGC AC CTGGTG AA 
GAACATCCGGCTGGAGCAGCCTTGTAGCTGCAAAGCGGGTCAGAAGAAGTGCACCTGCCACCGGCCTGGCAA 
GAAGGAGACGTGGCTCTTCTCCCGCTTCTCCACCGGCTGGAGCTGCGGGCTGCACGCCGACTGGACGGAGCT 
CACCAACTGCGTGCCGGCCATCATGGAGAAGAAGGACTGTCCCCGCAACCACAGCCACACCAGGAATTTCTA 
TTACATCACAATGTTACGGGATCCAGTGTCACGTTACCTGAGCGAGTGGAAACATGTCCAGAGAGGGGCCAC 
TTGGAAAACCTCTCTTCATATGTGTGATGGAAGAAGCCCCACCCCAGATGAGCTGCCTACCTGCTACCCTGG 
GGATGACTGGTCTGGGGTCAGCTTGCGGGAGTTTATGGATTGCACCTACAACCTGGCTAACAATCGCCAGGT 
G CG C AT GCT GGCTG AC CTC AGC CTGGTGGG C TGCT AT AACTTG ACTT T CAT G AACG AGAGTG AAAGAAAC AC 
CATCCTGTTGCAGAGTGCAAAGAACAACCTGAAGAACATGGCCTTCTTTGGGCTCACTGAGTTCCAGAGGAA 
GACACAGTTTCTCTTTGAGAGAACATTCAACCTCAAGTTCATCTCCCCCTTCACACAGTTCAACATCACGCG 
GGC TTC T AACGTGG AGAT C AACG AGGGTG CC CGC C AACG C ATTG AGGAT CT AAACT T C CTGG AC ATG C AGCT 
TTACGAGTATGCAAAAGATCTCTTCCAGCAGCGCTACCACCACACCAAGCAGCTAGAGCACCAGAGGGACCG 
CCAGAAGCGGCGGGAGGAGCGGAGGCTGCAGCGAGAGCACAGGGACCACCAGTGGCCCAAAGAAGATGGGGC 
TGCAGAAGGGACTGTCACCGAGGACTACAACAGCCAGGTGGTGAGATGGTGACCTCCTGC 



In a search of public sequence databases, the NOV55 nucleic acid sequence, located on 



chromosome 13, has 1329 of 1492 bases (89%) identical to a gb :GENB ANK- 
ID:AB024567|acc:AB024567.1 mRNA from Mus musculus (mRNA for heparan sulfate 6- 
5 sulfotransferase 3, complete cds) (E = 7. 1 e 263 ). 

A disclosed NOV55 polypeptide (SEQ ID NO:202) encoded by SEQ ID NO:201 has 
944 amino acid residues and is presented in Table 55B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV55 has a signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.6850. Alternatively, 
1 0 NOV55 may also localize to the endoplasmic reticulum (membrane) with a certainty of 

0.6400, to the Golgi body with a certainty of 0.3700, or to the microbody (peroxisome) with a 
certainty of 0.1269. The most likely cleavage site for NOV55 is between positions 28 and 29: 



MDERFNKWLLTPVLTLLFWIMYQYVSPSCTSSCTNFGEQPRAGEAGPPAVPGPARRAQAPPEEWEQRRPQL 
PPPPRGPPEGPRGAAAPEEEDEEPGDPREGEEEEEEDEPDPEAPENGSLPRFVPRFNFSLKDLTRFVDFNIK 
GRDVIVFLHIQKTGGTTFGRHLVKNIRLEQPCSCKAGQKKCTCHRPGKKETWLFSRFSTGWSCGLHADWTEL 
TNCVPAIMEKKDCPRNHSHTRNFYYITMLRDPVSRYLSEWKHVQRGATWKTSLHMCDGRSPTPDELPTCYPG 
DDWSGVS LREFMDCT YNLANNRQVRMLADL SLVGC YNLT FMNE S E RNT I LLQS AKNNL KNMAFFGLTEFQRK 
TQFLFERTFNLKFISPFTQFNITRASNVEINEGARQRIEDLNFLDMQLYEYAKDLFQQRYHHTKQLEHQRDR 
QKRREERRLQREHRDHQWPKEDGAAEGTVTEDYNSQWRWMDERFNKWLLTPVLTLLFWIMYQYVSPSCTS 
SCTNFGEQPRAGEAGPPAVPGPARRAQAPPEEWEQRRPQLPPPPRGPPEGPRGAAAPEEEDEEPGDPREGEE 
EEEEDEPDPEAPENGSLPRFVPRFNFSLKDLTRFVDFNIKGRDVIVFLHIQKTGGTTFGRHLVKNIRLEQPC 
SCKAGQKKCTCHRPGKKETWLFSRFSTGWSCGLHADWTELTNCVPAIMEKKDCPRNHSHTRNFYYITMLRDP 
VSRYLSEWKHVQRGATWKTSLHMCDGRSPTPDELPTCYPGDDWSGVSLREFMDCTYNLANNRQVRMLADLSL 
VGCYNLTFMNESERNTILLQSAKNNLKNMAFFGLTEFQRKTQFLFERTFNLKFISPFTQFNITRASNVEINE 
GARQRIEDLNFLDMQLYEYAKDLFQQRYHHTKQLEHQRDRQKRREERRLQREHRDHQWPKEDGAAEGTVTED 
YNSQWRW 



A search of sequence databases reveals that the NOV55 amino acid sequence has 447 
of 472 amino acid residues (94%) identical to, and 458 of 472 amino acid residues (97%) 



VSP-SC. 



Table 55B. Encoded NOV55 protein sequence (SEQ ID NO:202). 



15 
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similar to, the 470 amino acid residue ptnr:SPTREMBL-ACC:Q9QYK4 protein from Mus 
musculus (Mouse) (Heparan Sulfate 6-Sulfotransferase 3) (E = 7.5e" 254 ). 

NOV55 is predicted to be expressed in at least Right Cerebellum, Oviduct/Uterine 
Tube/Fallopian tube, Amygdala, and Kidney. This information was derived by determining the 
5 tissue sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, Public EST sources, Literature sources, and/or RACE sources. 

NOV55 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 55C 



Table 55C. BLAST results for NOV55 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 7657192 | ref |NP_0 
56635. l| 
(NM_015820) 


heparan sulfate 
6-0- 

sulf otransf erase 
3 [Mus musculus] 


470 


440/472 
(93%) 


452/472 
(95%) 


0.0 


gi | 16 5 52186 | dbj | BAB 
71260. l| (AK056706) 


unnamed protein 
product [Homo 
sapiens] 


605 


241/330 
(73%) 


284/330 
(86%) 


e-144 


gi | 14 042 611 | dbj | BAB 
55322 . 1 | (AK027720) 


unnamed protein 
product [Homo 
sapiens] 


459 


242/330 
(73%) 


285/330 
(86%) 


e-143 


gi | 7657190 | ref |NP_0 
56634. 1| 
(NM_015819) 


heparan sulfate 
6-0- 

sulf otransf erase 
2 [Mus musculus] 


506 


246/369 
(66%) 


290/369 
(77%) 


e-140 


gi | 12545389 | ref |NP_ 
004798 . 2 | 
(NM_004807) 


heparan sulfate 

6-0- 

sulf otransf erase 
[Homo sapiens] 


401 


238/353 
(67%) 


286/353 
(80%) 


e-138 



10 

The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 55D. In the ClustalW alignment of the NOV55 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence {i.e., regions that may be required to preserve structural or 
15 functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 



Table 55D. ClustalW Analysis of NOV55 

20 1) Novel NOV55a (SEQ ID NO: 202) 

2) gi | 7657192 | ref |NP_056635 . 1 | (NM_015820) heparan sulfate 6 -O- sulf otransf erase 3 
[Mus musculus] (SEQ ID NO: 577) 

3) gi | 16552186 | dbj | BAB71260 . 1 1 (AK05 67 06) unnamed protein product [Homo sapiens] 
(SEQ ID NO: 578) 
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4) gi | 14042611 | dbj | BAB55322 . 1 | (AK027720) unnamed protein product [Homo sapiens] 
(SEQ ID NO: 579) 

5) gi | 7657190 | ref |NP_056634 . 1 | (NM_015819) heparan sulfate 6-O-sulf otransf erase 2 
[Mus musculus] (SEQ ID NO: 5 80) 

6) gi | 125453 89 | ref | NP_004 798 . 2 | (NM_004807) heparan sulfate 6-O-sulf otransf erase 
[Homo sapiens] (SEQ ID NO: 5 81) 



NOV55 

gi|7657192 | 
gijl6552186| 
gi 1 14042611 j 
gi |7657190 | 
gi 1 12545389 | 



NOV55 

gi|7657192 | 
gi j 16552186 | 
gi | 14042611 j 
gi |7657190 | 
gij 12545389 | 



NOV55 

gi|7657192 | 
gi | 16552186 | 
gij 14042611 | 
gi j 7657190 | 
gi|l2545389| 



NOV55 

gi|7657192 | 
gi|l6552186| 
gij 14042611 | 
gi j7657190 | 
gi jl2545389 | 



NOV55 

gi | 7657192 | 
gij 16552186 | 
gi j 14042611 j 
gi j7657190 | 
gi j 12545389 | 
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20 



30 
■|.- 



40 



50 



60 
•I 



1 1 

1 MALPACAVREFEPPRQPERGAPVRTTCPRRHSRVEAELAASRPGSVAASVRAGPPRGVSH 6 0 



70 



80 
•I-- 



90 



100 
..|.. 



110 



120 
•-I 



1 1 

6 1 GFHTRPLLDKPRKAS S SLAGAACAPLFALLSRGRRRRMHVLRRRWDLGSLCRALLTRGLA 12 0 



130 



140 150 
|....|....| 



1 

1 

121 alghslkhvlgaifskifgpmasvg: 

! 

1 

1 



190 200 

tnfgeqprageagppavpgp^ 
tnfgeqlrsgearppavpspjj 

LL | 

LL I 

LLg 

LG APGG& 




180 



israscTssc 34 



SCTSSC 34 

ECQ- 179 
STECQ- 
TECQ- 
«LS 



33 
33 
31 



35 

35 

179 

33 

33 

31 




210 220 
.|....|. 
!PPEEWEQRRPQLP[|P 
IPLDEWE - RRPQLPgP 

|fss 

FSS 

FSS 

PPDDLY L 




230 240 

GPPEGPRGAAAPEEEDE 94 

GPPEGSRGVAAPEDEDE 93 

191 

45 

45 

50 



250 




260 

..|....|....| 
EEDEPDPEAPENC 
EEEEPDPEAPEN 




310 
■ . I ■ . 



320 



330 



340 



I 



350 



360 



NOV55 


155 


gi |7657192 | 


153 


gi|l6552186 | 


235 


gij 14042611 j 


89 


gij7657190| 


89 


gi jl2545389| 


85 


NOV55 


215 


gi|7657192 | 


213 


gi j 16552186 | 


295 


gi j 14042611 1 


149 


gi j 7657190 | 


149 


gijl2545389| 


145 



QKTGGTTFGRHLVg] 
QKTGGTTFGRHLvi] 
QKTGGTTFGRHL 
QKTGGTTFGRHL 
QKTGGTTFGRHLVjg] 
QKTGGTTFGRHL Vgy 



ji[gleqpcgc|ggqkkctchrpgkgetwlfsrfstgwscglhadwt 
jiSleqpc|c™gqkkctchrpgk|etwlfsrfstgwscglhadwt 
ijgleqpc^crqgqkkctchrpgkretwlfsrfstgwscglhadwt 
i!leqpc|crSgqkkctchrpgkretwlfsrfstgwscglhadwt 
jigleqpc|cragqkkctchrpgkretwlfsrfstgwscglhadwt 

Tg§LESPcScR§GQKKCTCgRPj^RETWLFSRFSTGWSCGLHADWT 



214 
212 
294 
148 
14 8 
144 




390 
..|.. 



400 



410 



420 



236 

234 

315 

169 

|p||RWR I FQ I LDGT S KDRWGS S NFNS G ANS PSSTKPRSTSKS 2 08 

-|p 165 
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440 



450 



NOV55 

gi | 7657192 | 
gi|l6552186| 
gi | 14042611 | 
gi|7657190| 
gi j 12545389 | 



NOV55 

gi |7657192 | 
gi | 16552186 | 
gi|l404261l| 
gi|7657190 | 
gi j 12545389 | 



NOV55 

gi [7657192 | 
gi j 16552186 | 
gi j 14042611 | 
gi | 7657190 | 
gi|l2545389| 



NOV55 

gi | 7657192 | 
gi | 16552186 | 
gi | 14042611 | 
gi|7657190 | 
gi | 12545389 | 




460 



480 



s 


SLH 


| 


Haa i 

CDGR 


a 


SLH 


I 


CDGR 


s 


SLH 


! 


CDGR 


9 


SLH 




CDGR 


9 


SLH 




CDGR 


S 


SLH 


1 


CDGR 




NOV55 


296 


gi|7657192| 


294 


gij 16552186| 


375 


gi j 14042611 1 


229 


gi|7657190| 


269 


gijl2545389| 


225 



356 
354 
435 
289 
329 
285 



490 



500 



510 



520 



530 



540 




610 



620 



630 



640 



650 



rSkrI 




I 



LQH 

LQQ 

KflBgrllqthfqsqgqgqsqn- 
kflkgrllqthfqsqgqgqsqn- 



660 

.|....|....| 

443 

441 

-PNQNQSQNPNPN 552 
-PNQNQSQNPNPN 4 06 



j^FL&GRFLQTHFQSQSQGQSQSQSPGQNLSQNPNPN 

LLH$i 



448 
372 



670 



680 



700 



710 



— I — | — | — -I — I — 

443 E J RD HQW 

441 feMRA HRW 

5 53 ANQNLTQNLMQNLT QNLSQKENRES 

4 07 ANQNLTQNLMQNLT i^SLSQKENRES 

44 9 PNQNLTQNLSHNLTPSSNP||STQRENRGS 
372 AK EAL 



I 




3TV 

- TV 

2NDN-TSN< 
3NDN-TSN( 
jdG^GdGQGDSGTSNi 

Ipgrvp--- 




472 
470 
605 



SQ 
SQI 

[GSfflEK 

fGsRE^-|R 45 9 

;gsqet-Br 50 6 

ISHglE 



401 



Heparan-sulfate 6-sulfotransferase (HS6ST) catalyzes the transfer of sulfate from 3'- 
phosphoadenosine 5'-phosphosulfate to position 6 of the N-sulfoglucosamine residue of 
heparan sulfate. 

The disclosed NOV55 nucleic acid of the invention encoding a Heparan Sulfate 6- 
Sulfotransferase 3-like protein includes the nucleic acid whose sequence is provided in Table 
55A, or a fragment thereof. The invention also includes a mutant or variant nucleic acid any 
of whose bases may be changed from the corresponding base shown in Table 55 A while still 
encoding a protein that maintains its Heparan Sulfate 6-Sulfotransferase 3 -like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to those just described, including nucleic 
acid fragments that are complementary to any of the nucleic acids just described. The 
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invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of nonlimiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
5 chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 1 1 percent of the bases may be so 
changed. 



10 Sulfotransferase 3-like protein whose sequence is provided in Table 55B. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 55B while still encoding a protein that maintains its 
Heparan Sulfate 6-Sulfotransferase 3-like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 34 percent of the 

15 residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(Fab>2,that bind immunospecifically to any of the proteins of the invention. 

The above disclosed information suggests that this Heparan Sulfate 6-Sulfotransferase 
3-like protein (NOV55) is a member of a "Heparan Sulfate 6-Sulfotransferase 3 family". 

20 Therefore, the NOV55 nucleic acids and proteins identified here may be useful in potential 

therapeutic applications implicated in (but not limited to) various pathologies and disorders as 
indicated below. The potential therapeutic applications for this invention include, but are not 
limited to: protein therapeutic, small molecule drug target, antibody target (therapeutic, 
diagnostic, drug target ing/cytotoxic antibody), diagnostic and/or prognostic marker, gene 

25 therapy (gene delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of 
all tissues and cell types composing (but not limited to) those defined here. 

The NOV55 nucleic acids and proteins of the invention are useful in potential 
therapeutic applications implicated in Diabetes, Autoimmune disease, Renal artery stenosis, 
Interstitial nephritis, Glomerulonephritis, Polycystic kidney disease, Systemic lupus 

30 erythematosus, Renal tubular acidosis, IgA nephropathy, Hypercalceimia, Lesch-Nyhan 

syndrome, Von Hippel-Lindau (VHL) syndrome , Alzheimer's disease, Stroke, Tuberous 
sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, Cerebral palsy, Epilepsy, 
Lesch-Nyhan syndrome, Multiple sclerosis, Ataxia-telangiectasia,Leukodystrophies, 



The disclosed NOV55 protein of the invention includes the Heparan Sulfate 6- 
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Behavioral disorders, Addiction, Anxiety, Pain, Neuroprotection, and/or other diseases and 
pathologies. 

NOV55 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV55 substances for use in 
5 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV55 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 
1 0 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV56 

NOV56 includes two novel N-Hydroxyarylamine Sulfotransferase-like proteins 
disclosed below. The disclosed sequences have been named NOV56a and NOV56b. 
15 NOV56a 

A disclosed NOV56a nucleic acid of 1223 nucleotides (also referred to as CG56816- 
01) encoding a N-Hydroxyarylamine Sulfotransferase-like protein is shown in Table 56A. An 
open reading frame was identified beginning with a ATG initiation codon at nucleotides 2-4 
and ending with a TGA codon at nucleotides 974-976. The start and stop codons are shown in 
20 bold in Table 5 6 A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 56A. NOV56a nucleotide sequence (SEQ ID NO:203). 

TATGGCGAAGATTGAGAAAAACGCTCCCACGATGGAAAAAAAGCCAGAACTGTTTAACATCATGGAAGTAGA 
TGGAGTCCCTACGTTGATATTATCAAAAGAATGGTGGGAAAAAGTATGTAATTTCCAAGCCAAGCCTGATGA 
TCTTATTCATGCATCCATGTTGTACCTGACTTTGGGTAAGTTGCCAGAAGAAGATCATCAGGCTTGGCTTGG 
AAATTACCCAAAGTCAGGTACAACATGGATGCATGAAATTTTAGACATGATTCTAAATGATGGTGATGTGGA 
GAAATGCAAAAGAGCCCAGACTCTAGATAGACACGCTTTCCTTGAACTGAAATTTCCCCATAAAGAAAAACC 
AGATTTGGAGTTCGTTCTTGAAATGTCCTCACCACAACTGATAAAAACACATCTCCCTTCACATCTGATTCC 
ACCATCTATCTGGAAAGAAAACTGCAAGATTGTCTATGTGACCAGAAATCCCAAGGATTGCCTGGTGTCCTA 
CTACCACTTTCACAGGATGGCTTCCTTTATGCCTGATCCTCAGAACTTAGAGGAATTTTATGAGAAATTCAT 
GTCCGGAAAAGTTGTTGGCGGGTCCTGGTTTGACCATATGAAAGGATGGTGGGCTGCAAAAGACATGCACCG 
GATCCTCTACCTCTTCTACGAGGATATTAAAAAAAATCCAAAACATGAGATCCACAAGGTGTTGGAATTCTT 
GGAGAAAACTTGGTCAGGTGATGTTATAAACAAGATTGTCCACCATACCTCATTTGATGTAATGAAGGATAA 
TCCCATGGCCAACCATACTGCGGTACCTGCTCACATATTCAATCACTCCATCTCAAAATTTATGAGGAAAGG 
GATGCCTGGAGACTGGAAGAACCACTTTACTGTGGCTATGAATGAGAACTTTGATAAGCATTATGAAAAGAA 
riATnnrAGGGTGCACACTGAACTTCTGCCTGGAGATCTG AGAGGAACAACAACAAACTAGGTGACAGAGACT 
ATGCCAACTATTTCGCCTTTTATTCTGTTGAGCAAGGAACTGTGACTGAATGTGGAGCTTATGAGCTTCAGT 
CCATCTCCTATAGTGTGGCTAGTTTGCTATAATATTAAAACATGATTTAAAATATCAACAAACCAGTTACTC 
CAGCAAATAAAATAAGAGAATTAGAGACCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGG 



In a search of public sequence databases, the NOV56a nucleic acid sequence, located 
on chromosome 2, has 633 of 921 bases (68%) identical to a gb:GENBANK- 
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ID:AF033653|acc:AF033653.1 mRNA from Mus musculus (phenol sulfotransferase mRNA, 
complete cds) (E - 7.3e' 270 ). 

A disclosed NOV56a polypeptide (SEQ ID NO:204) encoded by SEQ ID NO:203 has 
324 amino acid residues and is presented in Table 56B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV56a has no signal peptide and is 
likely to be localized to the microbody (peroxisome) with a certainty of 0.7480. Alternatively, 
NOV56a may also localize to the mitochondrial membrane space with a certainty of 0. 1000, or 
to the iysosome (lumen) with a certainty of 0.1000. 



Table 56B. Encoded NOV56a protein sequence (SEQ ID NO:204). 

MAKIEKNAPTMEKKPELFNIMEVDGVPTLILSKEWWEKVCNFQAKPDDLIHASMLYLTLGKLPEEDHQAWLG 
NYPKSGTTWMHEILDMILNDGDVEKCKRAQTLDRHAFLELKFPHKEKPDLEFVLEMSSPQLIKTHLPSHLIP 
PSIWKENCKIVYVTRNPKDCLVSYYHFHRMASFMPDPQNLEEFYEKFMSGKWGGSWFDHMKGWWAAKDMHR 
ILYLFYEDIKKNPKHEIHKVLEFLEKTWSGDVINKIVHHTSFDVMKDNPMANHTAVPAHIFNHSISKFMRKG 
MPGDWKNHFTVAMNE NFDKHYEKKMAGSTLNFCLEI 

A search of sequence databases reveals that the NOV56a amino acid sequence has 155 
of 254 amino acid residues (61%) identical to, and 196 of 254 amino acid residues (77%) 
similar to, the 304 amino acid residue ptnr:SWISSPROT-ACC:P50237 protein from Rattus 
norvegicus (Rat) (N-Hydroxyaryl amine Sulfotransferase (EC 2.8.2.-) (HAST-I)) (E - 6.7e" 96 ). 

NOV56b 

In the present invention, the target sequence identified previously, NOV56a, was 

subjected to the exon linking process to confirm the sequence. PCR primers were designed by 

starting at the most upstream sequence available, for the forward primer, and at the most 

downstream sequence available for the reverse primer. In each case, the sequence was 

examined, walking inward from the respective termini toward the coding sequence, until a 

suitable sequence that is either unique or highly selective was encountered, or, in the case of 

the reverse primer, until the stop codon was reached. Such primers were designed based on in 

silico predictions for the full length cDNA, part (one or more exons) of the DNA or protein 

sequence of the target sequence, or by translated homology of the predicted exons to closely 

related human sequences sequences from other species. These primers were then employed in 

PCR amplification based on the following pool of human cDNAs: adrenal gland, bone 

marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, 

brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 

lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 

gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 

uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
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redundancy. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence traces 
were evaluated manually and edited for corrections if appropriate. These procedures provide 
the sequence reported below, which is designated NOV56b. This is identical to the previously 
identified sequence (NOV56a). 

A disclosed NOV56b nucleic acid of 1 167 nucleotides (also referred to as CG56816- 
02) encoding a N-Hydroxyarylamine Sulfotransferase-like protein is shown in Table 56C. An 
open reading frame was identified beginning with a ATG initiation codon at nucleotides 33-35 
and ending with a TGA codon at nucleotides 918-920. The start and stop codons are shown in 
bold in Table 56C, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 56C. NOV56b nucleotide sequence (SEQ ID NO:205). 

TATGGCGAAGATTGAGAAAAACGCTCCCACGC ATGGGAAAAGAAAGCCAGGAACTOTTTAAraTraTfyzanr: 
TAGATGGAGTCCCTACGTTGATATTATCAAAAGAATGGTGGGAAAAAGTATGTAATTTCCAAGCCAAGCCTG 
ATGATCTTATTCTGGCAACTTACCCAAAGTCAGGTACAACATGGATGCATGAAATTTTAGACATGATTCTAA 
ATGATGGTGATGTGGAGAAATGCAAAAGAGCCCAGACTCTAGATAGACACGCTTTCCTTGAACTGAAATTTC 
CCCATAAAGAAAAACCAGATTTGGAGTTCGTTCTTGAAATGTCCTCACCACAACTGATAAAAACACATCTCC 
CTTC ACAT CTGATT C C ACC ATC T AT CTGG AAAGAAAACTGC AAGATTGT C T ATG TGGCC AGAAATC C CAAGG 
ATTGCCTGGTGTCCTACTACCACTTTCACAGGATGGCTTCCTTTATGCCTGATCCTCAGAACTTAGAGGAAT 
TTTATGAGAAATTCATGTCCGGAAAAGTTGTTGGCGGGTCCTGGTTTGACCATATGAAAGGATGGTGGGCTG 
C AAAAGACATG CAC CGG AT CCT CT ACCT CTT CT ACGAGGAT ATT AAAAAAAAT C CAAAAC AT GAG AT C CAC A 
AGGTGTTGGAATTCTTGGAGAAAACTTGGTCAGGTGATGTTATAAACAAGATTGTCCACCATACCTCATTTG 
ATGTAATGAAGGATAATCCCATGGCCAACCATACTGCGGTACCTGCTCACATATTCAATCACTCCATCTCAA 
AAT TTATGAGGAAAGGGATGC CTGG AG ACTGGAAG AAC C AC TT TAC TGTGGCTATGAATGAGAACTT TGATA 
AGCATTATGAAAAGAAGATGGCAGGGTCCACACTGAACTTCTGCCTGGAGATCTG AGAGGAACAACAACAAA 
CTAGGTGACAGAGACTATGCCAACTATTTCGCCTTTTATTCTGTTGAGCAAGGAACTGTGACTGAATGTGGA 
GCTTATGAGCTTCAGTCCATCTCCTATAGTGTGGCTAGTTTGCTATAATATTAAAACATGATTTAAAATATC 
AAC AAACCAGTT AC TCC AG CAAAT AAAAT AAGAG AATT AG AGAC CAAAAAAAAAAAAAAAAAAAAAAA AAAA 
AAAAAAAAAAAAGGG " 



In a search of public sequence databases, the NOV56b nucleic acid sequence, located 
on chromosome 2, has 649 of 919 bases (70%) identical to a gbrGENBANK- 
lD:AF033653|acc:AF033653.1 mRNA from Mus musculus (phenol sulfotransferase mRNA, 
complete cds) (E = 3.6e" 89 ). 

The disclosed NOV56b polypeptide (SEQ ID NO:206) encoded by SEQ ID NO:205 
has 295 amino acid residues and is presented in Table 56D using the one-letter amino acid 
code. Signal P, Psort and/or Hydropathy results predict that NOV56b has no signal peptide 
and is likely to be localized to the microbody (peroxisome) with a certainty of 0.7480. 
Alternatively, NOV56b may also localize to the mitochondrial membrane space with a 
certainty of 0. 1 000, or to the lysosome (lumen) with a certainty of 0. 1 000. 
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Table 56D. Encoded NOV56b protein sequence (SEQ ID NO:206). 

MGKESQELFNIMEVDGVPTLILSKEWWEKVCNFQAKPDDLILATYPKSGTTWMHEILDMILNDGDVEKCKRA 
QTLDRHAFLELKFPHKEKPDLEFVLEMSSPQLIKTHLPSHLIPPSIWKENCKIVYVARNPKDCLVSYYHFHR 
MASFMPDPQNLEEFYEKFMSGKVVGGSWFDHMKGWWAAKDMHRILYLFYEDIKKNPKHEIHKVLEFLEKTWS 
GD V I NK I VHHTS FD VMKDNPM ANHT AVP AH I FNH SIS K FMR KGM PGD WKNH FT VAMNEN FD KH YE K KMAGS T 
LNFCLEI 



A search of sequence databases reveals that the NOV56b amino acid sequence has 173 
of 283 amino acid residues (61%) identical to, and 220 of 283 amino acid residues (77%) 
similar to, the 304 amino acid residue ptnr:SWISSPROT-ACC:P50237 protein from Rattus 
5 norvegicus (Rat) (N-Hydroxyaryl amine Sulfotransferase (EC 2.8.2.-) (HAST-I)) (E = 4.0e"). 

NOV56b is predicted to be expressed in at least brain. . 

NOV56a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 56E 



Table 56E. BLAST results for NOV56a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi|l744634l|ref |XP 
065757 . 1 | 
(XM_065757) 


similar to 
sulfotransferase, 
phenol preferring 
2 ; Phenol 
sulfotransferase 
lcl (H. sapiens) 
[Homo sapiens] 


304 


299/324 
(92%) 


301/324 
(92%) 


e-166 


gi | 13929030 | ref |NP 
113920. l| 
(NM_031732) 


sulfotransferase, 
phenol preferring 
2; 

Phenol sul f o t rans f 
erase lcl [Rattus 
norvegicus] 


304 


171/303 
(56%) 


218/303 
(71%) 


le-94 


gi | 9055354 | ref |NP 0 
61221. l| 
(NM 018751) 


sulfotransferase , 
phenol preferring 
2 [Mus musculus] 


304 


172/303 
(56%) 


217/303 
(70%) 


4e-94 


gi | 163 0483 6 | emb | CAC 
95180.1] (AJ416889) 


sulfotransferase 
1C [Gallus 
gallus] 


307 


161/306 
(52%) 


214/306 
(69%) 


4e-88 


gi | 14731177 | ref | XP 

010849.3] 

(XM 010849) 


STJLT1C 

sulfotransferase 
[Homo sapiens] 


302 


161/285 
(56%) 


201/285 
(70%) 


2e-85 



10 ~ ~ ~ " ' 

The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 56F. In the Clustal W alignment of the NOV56 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 

15 functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 
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Table 56F. ClustalW Analysis of NOV56 

1) Novel NOV5 6a (SEQ ID NO: 2 04) 

2) Novel NOV56b (SEQ ID NO: 206) 

3) gi | 17446341 | ref |XP_065757 . 1 | <XM_065757) similar to sulfotransf erase , phenol 
preferring 2; Phenol sulfotransf erase lcl (h/ sapiens) [Homo sapiens] (SEQ ID 
NO: 582) 

4) gi | 13929030 | ref |NP_113920 . 1 | (NM_031732) sulfotransf erase , phenol preferring 2; 
Phenolsulf otransf erase lcl [Rattus norvegicus] (SEQ ID NO: 583) 

5) gi | 9055354 | ref |NP_061221 . 1 | (NM_0 18 75 1 ) sul f otransf erase , phenol preferring 2 
[Mus musculus] (SEQ ID NO: 584) 

6) gi|l6304836|emb|CAC95180.l| (AJ416889) sul f otransf erase 1C [Gallus gallus] (SEQ 
ID NO: 585) 

7) gi|l4731177|ref |XP_010849.3| (XM_010849) SULT1C sulfotransf erase [Homo sapiens] 
(SEQ ID NO: 586) 



NOV56a 
NOV56b 

gi | 17446341 | ref 
gi|13929030|ref 
gi j 9055354 | ref | 
gij 16304836 | emb 
gi j 14731177 j ref 



10 20 

I- ---I--- 1 

KI gKNAPTgpjKPELFNjM 

KESQ — - - ELFN§5 

^Kl|KNAPTpllKPELFNiTVI 
|LEp^KDLl€SiQDLQPETR 
|PLEyLKDLH^P^QNMQPETR 
OAliDgMENLSr ~ 

|alhdmedft: 
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NOV5 6a 
NOV5 6b 

gi|l744634l|ref 
gi 1 13929030 j ref 
gi | 9055354 | ref | 
gi | 16304836 | emb 
gi 1 14731177 | ref 



NOV5 6a 
NOV56b 

gi | 17446341 | ref 
gi | 13929030 | ref 
gi | 9055354 | ref | 
gi j 16304836 | emb 
gi j 14731177 | ref 



NOV5 6a 
NOV5 6b 

gi | 17446341 | ref 
gi 1 13929030 j ref 
gi j 9055354 | ref | 
gi j 16304836 | emb 
gij 14731177 | ref 



|LKFg HK 117 

-HK 8 8 
-HK 97 
SPL 9 8 
PPL 9 8 
SSPL 100 
SL 96 




GSW|DHgKGWWgAKDgHRILYLFYEDyKgNPKi 
GSWgDH^KGWwHAKDfflHRILYLFYEDMKgNPKl 

gsw§dhvkgwwsakdBhrilylfyedEkSnpki 
gswSdhvkgwwSBkdBhrilylfyedSkmp: 
gswqdhvkgww^kdBhrilylfyed|k^p: 
gsw^hvkgww^kdJBhrilylfyedIkSwpki 
gswSShvkgww§akdBhrilylfyed1kSnpkp 




250 



260 



270 



NOV5 6a 
NOV56b 

gi | 17446341 | ref 
gij 13929030 j ref 
gi I 9055354 | ref | 
gi 1 16304836 | emb 
gi 1 14731177 j ref 



290 



300 
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ji %~ 



310 320 



324 
295 
304 
304 
304 
307 
302 



Tables 56G lists the domain descriptions from DOMAIN analysis results against 
NOV56. This indicates that the NOV56 sequence has properties similar to those of other 
proteins known to contain this domain. 

15 



Table 56G Domain Analysis of NOV56a 

gnl | Pf am|pf am00685 , Sulfotransfer, Sulfotransf erase protein (SEQ ID 
NO: 855) 

CD-Length = 269 residues, 99.6% aligned 
Score = 260 bits (665) , Expect = 7e-71 



NOV5 6a 
NOV5 6b 

gi | 17446341 | ref 
gi | 13929030 | ref 
gi I 9055354 | ref | 
gi | 16304836 | emb 
gi j 14731177 j ref 




20 
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NOV56: 25 GVPTLILSKEWWEKVCN- FQAKPDDLIHASMLYLTLGKLPEEDHQAWLGNYPKSGTTWMH 83 

I II 1+ I Ill-Ill- + IIMIIII + 

.Sbjct: 2 GFWVDKFHLEGWRKIKNCFQARPDDV LIAGYPKSGTTWLQ 41 

NOV56 : 84 E I LDM I LNDGDVE KCKRAQTLDRHAFLEL KFPHKE KPDLE FVLEM S S PQL I KTHLP SHL I 143 

III + I II I I 1+ +11 1+ lll+lllllll 1+ 

Sbjct: 42 EILSLHPNVGDFEPSPSDPLLFRNPWLEYPKGEDWYETLKP- -MPSSPRLIKTHLPLELL 99 

NOV56: 144 PPSIWKENCKIVYVTRNPKDCLVSYYHFHRMASFMPDPQN-LEEFYEKFMSGKVVGGSWF ' 202 

I I I l + l I Mill llllll I -I Ml I I++III+ ll + l 

Sbjct: 100 P K S FL S S KAK 1 1 Y VLRNP KD VAVS Y YH F S R S H KDL P AD PGT FE E FLE A FLNGK VL YGS Y F 159 



NOV56 : 203 DHMKGWWAAKDMHRILYLFYEDIKKNPKHEIHKVLEFLEKTWSGDVINKIVHHTSFDVMK 

30 11+ III + ++I+I III+I++I II I- III I + ++I++ l+ll +|| 

Sbjct : 16 0 DHVLGWWELRPEPQVLFLDYEDLKEDPAGEIKKIAEFLGLPLSEEELDKLLDHSSFFLMK 
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NOV56: 263 DNPMANHTAVPAHI FNHS ISKFMRKGMPGDWKNHFTVAMNENFDKHYEKK 312 

II++I+ + I I I I I 1+ I I I I l + l I II III ++I 

35 Sbjct: 220 LNPLSNYETLCLGKSKGRKSPFMRKGLVGDWKNYFTPEQNEKFDKVIKEK 269 



This protein carries out sulfation of phenols and bioactivation of n-hydroxyarylamines. 
It is responsible for the formation of N-hydroxy-2-acetylaminofluorene, a reactive metabolite 
which exhibits toxicity by binding to DNA, RNA and protein. 

40 Hepatic sulfation of heterocyclic and non-heterocyclic arylamines was studied to 

assess enzymes responsible for their metabolisms. Both 2-amino-3-methylimidazo[4,5- 
fjquinoline (IQ)- and non-IQ-type (beta-carboline) heterocyclic amines were N-sulfated to 
form their sulfamates in cytosols of rat livers in the presence of S'-phosphoadenosine-S- 
phosphosulfate (PAPS). An arylsulfo-transferase, ST1 Al, whose cDNA was isolated from a 

45 rat cDNA library, was expressed in COS-1 cells. The expressed enzyme catalyzed N-sulfation 
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of IQ, but not appreciably those of 2-amino-l-methyl-6-phenylimidazo[4,5-b]pyridine (PhlP), 
2-amino-6-methyldipyrido[l,2-a:3 T ,2'-d] imidazole (GIu-P-I), and 3-amino-l-methyI-5H- 
pyrido[4,3-b]indole (Trp-P-2). N-Sulfation of heterocyclic amines except IQ was higher in 
hepatic cytosols of female rats than of male rats. These results suggest the involvement of at 
5 least plural forms of sulfotransferase on the N-sulfation. In addition, N-sulfation of IQ was 
also observed in cytosol of a human liver, suggesting that N-sulfation is one of the metabolic 
pathways of heterocyclic amines in humans as well as rats. Hepatic sulfotransferase also 
catalyzes metabolic activation of N-hydroxy derivatives of carcinogenic arylamines. Using 
anti-HAST (hydroxylarylamine sulfotransferase) antibodies and ST1A1 cDNA as screening 

10 probes, several cDNA clones were isolated from the cDNA library. A new member of 

arylsulfotransferase, ST1C1, whose cDNA shows considerable sequence similarity to ST1A1 
cDNA, was found to catalyze O-sulfation of N-hydroxy-2-acetylaminofluorence by the cDNA 
expression in COS-I cells. From the close similarity of ontogenic profile and sex-specific 
expression of STIC 1 and HAST, ST1C1 cDNA was shown to encode a major sulfotransferase 

15 (HAST) mediating the metabolic activation of N-hydroxyaryl amines in rat livers. In addition, 
properties of PAPS-dependent N-hydroxyarylamine activation and sulfotransferase in human 
livers are also discussed. 

The disclosed NOV56 nucleic acid of the invention encoding a N-Hydroxyarylamine 
Sulfotransferase-like protein includes the nucleic acid whose sequence is provided in Table 

20 56A, 56C or a fragment thereof. The invention also includes a mutant or variant nucleic acid 
any of whose bases may be changed from the corresponding base shown in Table 56A or 56C 
while still encoding a protein that maintains its N-Hydroxyarylamine Sulfotransferase -like 
activities and physiological functions, or a fragment of such a nucleic acid. The invention 
further includes nucleic acids whose sequences are complementary to those just described, 

25 including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of nonlimiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 

30 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 48 percent of the 
bases may be so changed. 
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The disclosed NOV56 protein of the invention includes the N-Hydroxyarylamine 
Sulfotransferase-like protein whose sequence is provided in Table 56B or 56D. The invention 
also includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 56B or 56D while still encoding a protein that 
5 maintains its N-Hydroxyarylamine Sulfotransferase-like activities and physiological functions, 
or a functional fragment thereof. In the mutant or variant protein, up to about 32 percent of 
the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 

10 The above disclosed information suggests that this N-Hydroxyarylamine 

Sulfotransferase-like protein (NOV56) is a member of a "N-Hydroxyarylamine 
Sulfotransferase family". Therefore, the NOV56 nucleic acids and proteins identified here may 
be useful in potential therapeutic applications implicated in (but not limited to) various 
pathologies and disorders as indicated below. The potential therapeutic applications for this 

15 invention include, but are not limited to: protein therapeutic, small molecule drug target, 

antibody target (therapeutic, diagnostic, drug target ing/cytotoxic antibody), diagnostic and/or 
prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue 
regeneration in vivo and in vitro of all tissues and cell types composing (but not limited to) 
those defined here. 

20 The NOV56 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in metabolic diseases and disorders, and/or other diseases 
and pathologies. 

NOV56 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV56 substances for use in 

25 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobicity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV56 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 

30 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV57 

A disclosed NOV57 nucleic acid of 953 nucleotides (also referred to as CG56829-01) 
encoding a Testis Specific Serine Kinase-3-like protein is shown in Table 57A. An open 
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reading frame was identified beginning with a ATG initiation codon at nucleotides 50-52 and 
ending with a TGA codon at nucleotides 854-856. The start and stop codons are shown in 
bold in Table 57A, and the 5' and 3' untranslated regions, if any, are underlined. 



Table 57 A. NOV57 nucleotide sequence (SEQ ID NO:207). 

CAGAGGCAGCATGAGCTGAGAGGGTGATAGGAAGGCGGCGCTAGACAGC ATGGAGGACTTTCTGCTCTCCAA 
TGGGTACCAGCTGGGCAAGACCATTGGGGAAGGGACCTACTCAAAAGTCAAAGAAGCATTTTCCAAAAAACA 
C C AAAGAAAAGTGG C AATT AAAGTT AT AGAC AAGATGGG AACTT C C T C AG AGTT T AT C C AG AGATT C CT C CC 
TCGGGAGCTCCAAATCGTCCGTACCCTGGACCACAAGAACATCATCCAGGTGTATGAGATGCTGGAGTCTGC 
CGACGGGAAAATCTGCCTGGTGATGGAGCTCGCTGAGGGAGGGGATGTCTTTGACTGCGTGCTGAATGGGGG 
GCCACTGCCTGAAAGCCGGGCCAAGGCCCTCTTCCGTCAGATGGTTGAGGCCATCCGCTACTGCCATGGCTG 
TGGTGTGGCCCACCGGGACCTCAAATGTGAGAACGCCTTGTTGCAGGGCTTCAACCTGAAGCTGACTGACTT 
TGGCTTTGCCAAGGTGTTGCCCAAGTCACACCGGGAGCTGAGCCAGACCTTCTGCGGCAGTACAGCCTATGC 
TGCCCCCGAGGTGCTGCAGGGCATTCCCCACGATAGCAAAAAAGGTGATGTCTGGAGCATGGGTGTGGTCCT 
GTATGTCATGCTCTGTGCCAGCCTACCTTTTGACGACACAGACATCCCCAAGATGCTGTGGCAGCAGCAGAA 
GGGGGTGTCCTTCCCCACTCATCTGAGCATCTCGGCCGATTGCCAGGACCTGCTCAAGAGGCTCCTGGAACC 
CGAT ATGATCCTCCGGCCTTC AATT GAAGAAGTT AGTT GGC AT CCATGGCTAGCAAGCACTTGAXAAAAGCA 
ATGGCAAGTGCTCTCCAATAAAGTAGGGGGAGAAAGCAAACCCAAAAACCCGCTTCTAAAATGGTGATATAT 
ATTTTACGCTTTAAGTT 



5 In a search of public sequence databases, the NOV57 nucleic acid sequence, located on 

chromosome 1, has 831 of 912 bases (91%) identical to a gb:GENBANK- 
ID:AF201734|acc:AF20 1734.1 mRNA from Mus musculus (testis specific serine kinase-3 
(Tssk-3) mRNA, complete cds) (E - 2.0e 165 ). 

A disclosed NOV57 polypeptide (SEQ ID NO:208) encoded by SEQ ID NO:207 has 
10 268 amino acid residues and is presented in Table 57B using the one-letter amino acid code. 
Signal P, Psort and/or Hydropathy results predict that NOV57 has no signal peptide and is 
likely to be localized to the cytoplasm with a certainty of 0.4500. Alternatively, NOV57 may 
also localize to the microbody (peroxisome) with a certainty of 0.1821, to the mitochondrial 
matrix space with a certainty of 0.1000, or to the lysosome (lumen) with a certainty of 0.1000. 

15 



Table 57B. Encoded NOV57 protein sequence (SEQ ID NO:208). 

MEDFLLSNGYQLGKTIGEGTYSKVKEAFSKKHQRKVAIKVIDKMGTSSEFIQRFLPRELQIVRTLDHKNIIQ 
VYEMLESADGKICLVMELAEGGDVFDCVLNGGPLPESRAKALFRQMVEAIRYCHGCGVAHRDLKCENALLQG 
FNLKLTDFGFAKVLPKSHRELSQTFCGSTAYAAPEVLQGIPHDSKKGDVWSMGWLYVMLCASLPFDDTDIP 
KMLWQQQKGVSFPTHLSISADCQDLLKRLLEPDMILRPSIEEVSWHPWLAST 

A search of sequence databases reveals that the NOV57 amino acid sequence has 240 
of 268 amino acid residues (89%) identical to, and 245 of 268 amino acid residues (91%) 
similar to, the 266 amino acid residue ptnr:SPTREMBL-ACC:Q9JL98 protein from Mus 
20 musculus (Mouse) (Testis Specific Serine Kinase-3) (E = 6.2e" 124 ). 

NOV57 is predicted to be expressed in at least lung, testis, B-cell, brain, head and 
neck. Expression information was derived from the tissue sources of the sequences that were 
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included in the derivation of the sequence of CuraGen Acc. No. CG56829-01 . The sequence is 
predicted to be expressed in testis because of the expression pattern of (GENBANK-ID: 
gb:GENBANK-ID:AF201734|acc:AF20 1734.1) a closely related Mus musculus testis specific 
serine kinase-3 (Tssk-3) mRNA, complete cds homolog. 
5 NOV57 also has homology to the amino acid sequences shown in the BLASTP data 

listed in Table 57C 



Table 57C. BLAST results for NOV57 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
<%) 


Positives 
(%) 


Expect 


gi | 16418343 |ref |NP_ 

4.4.-3 07^ -i | 

(NM_052841) 


testis -specific 
serine / threonine 
kinase 22 C [Homo 
sapiens] 


268 


265/268 
(98%) 


265/268 
(98%) 


e-157 


gi|l286020l|dbj | BAB 
31876. l| (AK019840) 


data source : MGD , 
source 

key:MGI : 1929914, 
evidence : ISS~puta 
tive- serine/ threo 
nine kinase 22C 
(spermiogenesis 
associated) [Mus 
musculus] 


268 


259/268 
(96%) 


261/268 
(96%) 


e-153 


gi | 10946880 | ref |NP_ 
067454 . 1 | 
(NM_021479) 


serine /threonine 
kinase 22C 

( spermiogenesisas 
sociated) ; testis 
specific serine 
kinase-3 (Tssk-3) 

[Mus musculus] 


266 


240/273 
(87%) 


245/273 
(88%) 


e-136 


gi | 165 0 7245 | ref | NP_ 
443732 . 1 | 
(NM_053006) 


serine/ threonine 
kinase 22B 
( spermiogenesisas 
sociated) ; testis 
specific serine 
threonine kinase 
2 [Homo sapiens] 


358 


127/266 
(47%) 


192/266 
(71%) 


ie-70 


gi | 147 7 6 972 | ref | XP_ 
033051 . 1 | 
(XM 033051) 


serine/ threonine 
kinase 22B 
(spermiogenesis 
associated) [Homo 
sapiens] 


358 


127/266 
(47%) 


192/266 
(71%) 


le-70 



The homology between these and other sequences is shown graphically in the 
10 ClustalW analysis shown in Table 57D. In the ClustalW alignment of the NOV57 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
regions of conserved sequence (i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
15 function. 
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Table 57D. ClustalW Analysis of NOV57 



1) Novel NOV57 {SEQ ID NO: 2 08) 

2) gi | 16418343 | ref | NP__443073 . 1 | (NM_0 52 84 1 ) test is-specif ic serine/threonine kinase 
22C [Homo sapiens] (SEQ ID NO: 587) 

3) gi | 12860201 | dbj | BAB31876 . 1 | (AK019840) data source : MGD , source key : MGI : 192 9 914 , 
evidence : ISS-putative-serine/threonine kinase 22C ( spermiogenesis associated) [Mus 
musculus] (SEQ ID NO: 588) 

4) gi | 10946880 | ref |NP_067454 . 1 | (NM_021479) serine/ threonine kinase 22C 
(spermiogenesisassociated) ; testis specific serine kinase-3 (Tssk-3) [Mus musculus] 
(SEQ ID NO: 58 9) 

5) gi | 165 07245 | ref | NP_44 3 732 . 1 | (NM_053006) serine/ threonine kinase 22B 
(spermiogenesisassociated) ; testis specific serine threonine kinase 2 [Homo sapiens] 
(SEQ ID NO: 59 0) 

6) gi | 14776972 | ref |XP_033051 . 1 | (XM_033051) serine/threonine kinase 22B 
(spermiogenesis associated) [Homo sapiens] (SEQ ID NO: 591) 



NOV57 

gi | 16418343 | ref 
gi|l286020ljdbj 
gij 10946880jref 
gi|l6507245|ref 
gi | 14776972 j ref 



NOV57 

gi|16418343|ref 
gi | 12860201 j db j 
gij 10946880 j ref 
gij 16507245 j ref 
gij 14776972 j ref 



NOV57 

gi | 16418343 | ref 
gi j 12860201 j db j 
gi j 10946880 j ref 
gij 16507245 j ref 
gi j 14776972 j ref 




LRI^FAIAMAV 



210 



220 



230 



240 



NOV57 

gi | 16418343 | ref 
gi j 12860201 j dbj 
gij 10946880 j ref 
gij 16507245 j ref 
gij 14776972 j ref 



NOV57 

gi|l6418343|ref 
gij 12860201 j dbj 
gij 10946880 j ref 
gi|l6507245jref 
gij 14776972 j ref 




(QPPKPKPTSSASFKREGEGKYRAECKLD 3 00 
PPKPKATS S AS FKREGEGKYRAECKLD 3 00 



NOV57 

gi|l6418343|ref 



310 



320 



330 



340 
..|.. 



350 
|....|.. 



268 
268 
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gi 1 12860201 I dbj 268 

gi j 10946880 j ref 266 

gi | 16507245 j ref TKTGLRPDHRPDHKLGAKTQHRLLWPENENRMEDRLAETSRAKDHHISGAEVGKAST 3 58 

gi j 1477 6972 j ref TKTGLRPDHRPDHKLGAKTQHRLLWPENENRMEDRLAETSRAKDHHISGAEVGKAST 3 58 



10 



15 



20 



25 



30 



Tables 57E-G lists the domain descriptions from DOMAIN analysis results against 
NOV57. This indicates that the NOV57 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 57E Domain Analysis of NOV57 

gnl | Smart | smart 0 0220 , STKc, Serine/Threonine protein kinases, 
catalytic domain; Phosphotransferases. Serine or threonine-specif ic 
kinase subfamily. (SEQ ID NO: 856) 
CD-Length = 256 residues, 100.0% aligned 
Score - 258 bits (659) , Expect = 3e-70 



NOV57 : 


10 


Sbjct : 


1 


NOV5 7 : 


70 


Sbjct : 


60 


NOV57 : 


130 


Sbjct : 


119 


NOV57 : 


189 


Sbjct : 


177 


NOV57 : 


246 


Sbjct : 


237 



V 

YQLGKTIGEGTYSKVKEAFSKKHQRKVAIKVIDKMGTSSEFIQRFLPRELQIVRTLDHKN 69 

l+l + +|+| + II I II + llllll I + +1 I II++I++ III I 

YE LL E VLGKG AFG K VYLARDKKTG KL VA I KVI KKEKLKKKKRERI L - REI KI LKKLDHPN 5 9 

IIQVYEMLESADGKICLVMELAEGGDVFDCVLNGGPLPESRAKALFRQMVEAIRYCHGCG 12 9 

I+ + +I++ I I 1+ I I I I llll + ll + I II 1+ II++ 1+ I I I 

IVKLYDVFED-DDKLYLVMEYCEGGDLFDLLKKRGRLSEDEARFYARQILSALEYLHSQG 118 

VAHRDLKC ENALL - QGFNL KLTD FGF AK VL PK S HRE L S QT F CGST AY AAP E VL QG I PHDS 18 8 

+ Mill II II ++II III II I I I II 1+ I Mill I + 

1 1 HRDLKP EN I LLDS DGHVKL AD FGL AKQL - D S GGTL LTT F VGT P E YM AP E VL LGKG YG - 17 6 

KKGDVWSMGWLYVMLCASLPFDDTDI PKMLWQQ QKGVS FPTHLS I S ADCQDLLKRL 24 5 

I l+ll+ll+ll +1 III I+++ I II + +II+I+I 

KAVDIWSLGVILYELLTGKPPFPGDDQLLALFKKIGKPPPPFPPPEWKISPEAKDLIKKL 23 6 



I I 1 + 



Table 57F Domain Analysis of NOV57 

gnl | Pfam |pfam0006 9 , pkinase, Protein kinase domain (SEQ ID NO: 8 57) 
CD-Length = 256 residues, 100.0% aligned 
Score = 234 bits (596) , Expect = 6e-63 



NOV57: 


10 


YQLGKTIGEGTYSKVKEAFSKKHQRKVAIKVIDKMGTSSEFIQRFLPRELQIVRTLDHKN 


69 






1+11+ +11+11+ 1 IIII++ 1 1 + ++ ll+ll+l 1 1 1 




Sbjct : 


1 


YELGEKLGSGAFGKVYKGKHKDTGEI VAIKILKKRSLSEK- - KKRFLREIQILRRLSHPN 


58 


NOV57: 


70 


I I Q VYEML E S ADG K I C L VME L AEGGD VFD C VL - NGG P L PE S RAKAL FRQM VE A I R Y CHGC 


128 






I+++ +11+ MM llll + M + II 1 1 II + I++ + 1 1 




Sbjct: 


59 


IVRLLGVFEE-DDHLYLVMEYMEGGDLFDYLRRNGLLLSEKEAKKIALQILRGLEYLHSR 


117 


NOV57 : 


129 


GVAHRDLKCENALL-QGFNLKLTDFGFAKVLPKSHRELSQTFCGSTAYAAPEVLQGIPHD 


187 






1+ Mill II II + +1+ III l+l II II 1+ 1 IMM + I 




Sbjct: 


118 


GIVHRDLKPENILLDENGTVKIADFGLARKLESSSYEKLTTFVGTPEYMAPEVLEG-RGY 


176 


NOV57 : 


188 


SKKGDVWSMGWLYVMLCASLPFDDTDIPKMLWQQQKGVSFPTHL- - SIS ADCQDLLKRL 


245 



I I 1 1 1 l + l l + l I +1 III I + I++ ++ 
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Sbjct: 177 SSKVDWSLGVILYELLTGKLPFPGIDPLEELFRIKERPRLRLPLPPNCSEELKDLIKKC 236 

NOV57 : 246 LEPDM I LRPS IEE VSWHPWL 265 

I I 11+ +1+ III 

Sbjct: 237 LNKDPEKRPTAKE I LNHPWF 256 



Table 57G Domain Analysis of NOV57 

gnl | Smart | smart0 0219 , TyrKc, Tyrosine kinase, catalytic domain; 
Phosphotransferases. Tyrosine -specific kinase subfamily (SEQ ID 
NO : 858) 

CD-Length = 258 residues, 97.7% aligned 
Score = 115 bits (289) , Expect = 2e-27 



10 



15 



20 



25 



NOV57 : 
Sbjct : 
NOV57 : 
Sbjct : 
NOV57 : 
Sbjct : 
NOV57 : 
Sbj Ct : 
NOV57 : 
Sbjct : 



12 LGKTIGEGTYSKVKEAF SKKHQRKVAIKVIDKMGTSSEFIQRFLPRELQIVRTLDHK 68 

III +111 + +! + + +1 l+l + I I + 1+ II II +++I III 

3 LGKKLGEGAFGEVYKGTLKGKGGVEVEVAVKTL- KEDASEQQIEEFL- REARLMRKLDHP 6 0 

69 NI IQVYEMLESADGKICLVMELAEGGDVFDCVLNGGP- -LPESRAKALFRQMVEAIRYCH 126 

II+ + + + + + + +||| I I I |+ I + I I I + 1+ + I 

61 NIVKLLGVC-TEEEPLMIVMEYMEGGDLLDYLRKNRPKELSLSDLLSFALQIARGMEYLE 119 

12 7 GCGVAHRDLKCENALL-QGFNLKLTDFGFAK- VLPKSHRELSQTFCGSTAYAAPEVLQGI 184 

I I I I I 1+ + +1+ III 1+ + + ++ + III 1 + 

12 0 SKNFVHRDLAARNCLVGENKTVKIADFGLARDLYDDDYYRKKKSPRLPIRWMAPESLKDG 179 

185 PHDSKKGDVWSMGVVLYVML- CASLPFDDTDI PKMLWQQQKGVSFPTHLS I SADCQDLLK 24 3 

I I I I II 11 + 1+ + 1+ ++I +11 I + + 11 + 

KFTS-KSDVWSFGVLLWEIFTLGESPYPGMSNEEVLEYLKKGYRLPQPPNCPDEIYDLML 23 8 



180 



244 



239 



RLLEPDMILRPSIEEV 259 

i 11+ 1+ 

QCWAEDPEDRPTFSEL 2 54 



Eukaryotic protein kinases (1) are enzymes that belong to a very extensive family of 

30 proteins which share a conserved catalytic core common with both serine/threonine and 

tyrosine protein kinases. Protein phosphorylation is a fundamental process for the regulation of 

cellular functions. The coordinated action of both protein kinases and phosphatases controls 

the levels of phosphorylation and, hence, the activity of specific target proteins. One of the 

predominant roles of protein phosphorylation is in signal transduction, where extracellular 

35 signals are amplified and propagated by a cascade of protein phosphorylation and 

dephosphorylation events. Two of the best characterized signal transduction pathways involve 

the cAMP-dependent protein kinase and protein kinase C (PKC). Each pathway uses a 

different second-messenger molecule to activate the protein kinase, which, in turn, 

phosphorylates specific target molecules. Extensive comparisons of kinase sequences defined 

40 a common catalytic domain, ranging from 250 to 300 amino acids. This domain contains key 

amino acids conserved between kinases and are thought to play an essential role in catalysis. 

In the N-terminal extremity of the catalytic domain there is a glycine-rich stretch of residues in 

the vicinity of a lysine residue, which has been shown to be involved in ATP binding. In the 
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central part of the catalytic domain there is a conserved aspartic acid residue which is 
important for the catalytic activity of the enzyme (2). 

Protein kinases and phosphatases regulate cell-cycle progression, transcription, 
translation, protein sorting and cell adhesion events that are critical to the inflammatory 
5 process. Two of the best-characterized immunosuppressants, cyclosporin and rapamycin, are 
also effective anti-inflammatory drugs. They act directly on protein phosphorylation and, as 
such, validate the concept that small-molecule modulators of phosphorylation cascades 
possess anti-inflammatory properties (3). 

Some examples of the role of serine/threonine protein kinases that are important in cell 

10 proliferation and disease include AKT, RAF1 and PIM1. Dudek et al. (4) demonstrated that 
AKT is important for the survival of cerebellar neurons. Thus, the 'orphan' kinase moved 
center stage as a crucial regulator of life and death decisions emanating from the cell 
membrane. Holland et al. (5) transferred, in a tissue-specific manner, genes encoding 
activated forms of Ras and Akt to astrocytes and neural progenitors in mice. These authors 

1 5 found that although neither activated Ras nor Akt alone was sufficient to induce glioblastoma 
multiforme (GBM) formation, the combination of activated Ras and Akt induced high-grade 
gliomas with the histologic features of human GBMs. These tumors appeared to arise after 
gene transfer to neural progenitors, but not after transfer to differentiated astrocytes. Increased 
activity of Ras is found in many human GBMs and Akt activity is increased in most of these 

20 tumors, implying that combined activation of these 2 pathways accurately models the biology 
of this disease (5). 

Another disease that involves yet another serine/threonine kinase is Peutz-Jeghers 
syndrome (PJS) , an autosomal dominant disorder characterized by melanocyte macules of the 
lips, buccal mucosa, and digits, multiple gastrointestinal hamartomatous polyps, and an 

25 increased risk of various neoplasms. Jenne et al. (6) identified and characterized the 

serine/threonine kinase STK1 1 and identified mutations in PJS patients. All 5 germline 
mutations were predicted to disrupt the function of the kinase domain. They concluded that 
germline mutations in STK1 1, probably in conjunction with acquired genetic defects of the 
second allele in somatic cells according to the Knudson model, caused the manifestations of 

30 PJS. These authors commented that PJS was the first cancer susceptibility syndrome identified 
that is due to inactivating mutations in a protein kinase and found mutations in the STK1 1 
gene in 1 1 of 12 unrelated families with PJS. Ten of the 1 1 were truncating mutations. All 
were heterozygous in the germline. Su et al. (7) found that of 53 PJS patients with cancer 
reported to that time, 6(11%) were diagnosed with pancreatic adenocarcinoma. Su et al. (7) 
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presented evidence that the STK1 1 gene plays a role in the development of both sporadic and 
familial (PJS) pancreatic and biliary cancers. They found that in sporadic cancers, the STK1 1 
gene was somatically mutated in 5% of pancreatic cancers and in at least 6% of biliary cancers 
examined. In the patient with pancreatic cancer associated with PJS, there was inheritance of a 
5 mutated copy of the STK1 1 gene and somatic loss of the remaining wildtype allele. 

The novel human serine/threonine protein kinase of the invention contains a protein 
kinase domain. Therefore it is anticipated that this novel protein has a role in the regulation of 
essentially all cellular functions and could be a potentially important target for drugs. Such 
drugs may have important therapeutic applications, such as treating numerous inflammatory 
1 0 diseases. 

The disclosed NOV57 nucleic acid of the invention encoding a Testis Specific Serine 
Kinase-3-like protein includes the nucleic acid whose sequence is provided in Table 57A or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 57A while still encoding a 

15 protein that maintains its Testis Specific Serine Kinase-3 -like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 
includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 

20 include chemical modifications. Such modifications include, by way of nonlimiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 

25 acids, and their complements, up to about 9 percent of the bases may be so changed. 

The disclosed NOV57 protein of the invention includes the Testis Specific Serine 
Kinase-3-like protein whose sequence is provided in Table 57B. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 57B while still encoding a protein that maintains its Testis Specific 

30 Serine Kinase-3-like activities and physiological functions, or a functional fragment thereof. 
In the mutant or variant protein, up to about 53 percent of the residues may be so changed. 

The invention further encompasses antibodies and antibody fragments, such as F ab or 
(F a b)2, that bind immunospecifically to any of the proteins of the invention. 
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The above disclosed information suggests that this Testis Specific Serine Kinase-3-like 
protein (NOV57) is a member of a "Testis Specific Serine Kinase-3 family". Therefore, the 
NOV57 nucleic acids and proteins identified here may be useful in potential therapeutic 
applications implicated in (but not limited to) various pathologies and disorders as indicated 
5 below. The potential therapeutic applications for this invention include, but are not limited to: 
protein therapeutic, small molecule drug target, antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene therapy (gene 
delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of all tissues 
and cell types composing (but not limited to) those defined here. 
10 The NOV57 nucleic acids and proteins of the invention are useful in potential 

therapeutic applications implicated in metabolic diseases and disorders, and/or other diseases 
and pathologies. 

NOV57 nucleic acids and polypeptides are further useful in the generation of 
antibodies that bind immuno-specifically to the novel NOV57 substances for use in 

15 therapeutic or diagnostic methods. These antibodies may be generated according to methods 
known in the art, using prediction from hydrophobic ity charts, as described in the "Anti- 
NOVX Antibodies" section below. The disclosed NOV57 protein has multiple hydrophilic 
regions, each of which can be used as an immunogen. These novel proteins can be used in 
assay systems for functional analysis of various human disorders, which will help in 

20 understanding of pathology of the disease and development of new drug targets for various 
disorders. 

NOV58 

NOV58 includes two Gap Junction Beta-5-like proteins, designated herein as NOV58a 
and NOV58b. Gap junctions are conduits that allow the direct cell-to-cell passage of small 
25 cytoplasmic molecules, including ions, metabolic intermediates, and second messengers, and 
that thereby mediate intercellular metabolic and electrical communication. Gap junction 
channels consist of connexin protein subunits, which are encoded by a multigene family. 

NOV58a 

30 The disclosed NOV58a (alternatively referred to herein as CG563 1 5-01) includes the 

728 nucleotide sequence (SEQ ID NO:209) shown in Table 58A. A NOV58a ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 28-30 and ends with a stop codon 
at nucleotides 697-699. The disclosed NOV58a maps to human chromosome 6. 
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Table 58A. NOV58a Nucleotide Sequence (SEQ ID 
NO:209) 

AGTCTTGCCTTCTTTTGAGCCTAAGTCATGAGTTGGATGTTCCTCAGAGATCTCCTGAGT 
GGAGTAAATAAATACTCCACTGGGACTGGATGGATTTGGCTGGCTGTCGTGTTTGTCTTC 
CGTTTGCTGGTCTACATGGTGGCAGCAGAGCACGTGTGGAAAGATGAGCAGAAAGAGTTT 
GAGTGCAACAGTAGACAGCCCGGTTGCAAAAATGTGTGTTTTGATGACTTCTTCCCCATT 
TCCCAAGTCAGACTTTGGGCCTTACAACTGATAATGGTCTCCACACCTTCACTTCTGGTG 
GTTTTACATGTAGCCTATCATGAGGGTAGAGAGAAAAGGCACAGAAAGAAACTCTATGTC 
AGCCCAGGTACAATGGATGGGGGCCTATGGTACGCTTATCTTATCAGCCTCATTGTTAAA 
ACTGGTTTTGAAATTGGCTTCCTTGTTTTATTTTATAAGCTATATGATGGCTTTAGTGTT 
CCCTACCTTATAAAGTGTGATTTGAAGCCTTGTCCCAACACTGTGGACTGCTTCATCTCC 
AAACCCACTGAGAAGACGATCTTCATCCTCTTCTTGGTCATCACCTCATGCTTGTGTATT 
GTGTTGAATTTCATTGAACTGAGTTTTTTGGTTCTCAAGTGCTTTATTAAGTGCTGTCTC 
CAAAAATATTTAAAAAAACCTCAAGTCCTCAGTGTGTGAGTGCCACAGCCTCAGATATGT 
TGAATGTG 



A NOV58a polypeptide (SEQ ID NO:210) encoded by SEQ ID NO:209 is 223 amino 
acids in length and is presented using the one-letter amino acid code in Table 58B. The Psort 
profile for NOV58a predicts that this sequence has a signal peptide and is likely to be 
localized at the plasma membrane with a certainty of 0.6000. In alternative embodiments, a 
NOV58a polypeptide is located to the mitochondrial inner membrane with a certainty of 
0.4358, to the endoplasmic reticulum (membrane) with a certainty of 0.3000, or to the Golgi 
with a certainty of 0.4000. The Signal P predicts a likely cleavage site for a NOV58a peptide 
is between positions 40 and 41 , i.e., at the dash in the sequence VAA-EH. 



Table 58B. NOV58a Polypeptide Sequence (SEQ ID 
NO:210) 

MSWMFLRDLLSGVNKYSTGTGWIWLAVVFVFRLLVYMVAAEHVWKDEQKEFECNSRQPGC 
KNVCFDDFFPISQVRLWALQLIMVSTPSLLWLHVAYHEGREKRHRKKLYVSPGTMDGGL 
WYAYLISLIVKTGFEIGFLVLFYKLYDGFSVPYLIKCDLKPCPNTVDCFISKPTEKTIFI 
LFLVITSCLCIVLNFIELSFLVLKCFIKCCLQKYLKKPQVLSV 

NOV58b 

The disclosed NOV58b (alternatively referred to herein as CG563 15-02) includes the 
727 nucleotide sequence (SEQ ID NO:21 1) shown in Table 58C. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 27-29 and ends with a stop codon at 
nucleotides 696-698. The disclosed NOV58b maps to human chromosome 10. 



Table 58C. NOV58b Nucleotide Sequence (SEQ ID 
NO:211) 

AGTCTTGCTTCTTTTGAGCCTAAGTCATGAGTTGGATGTTCCTCAGAGATCTCCTGAGTG 
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GAGTAAATAAATACTCCACTGGGATTGGATGGATTTGGCTGGCTGTCGTGTTTGTCTTCC 
GTTTGCTGGTCTACATGGTGGCAGCAGAGCACGTGTGGAAAGATGAGCAGAAAGAGTTTG 
AGTGCAACAGTAGACAGCCCGGTTGCAAAAATGTGTGTTTTGATGACTTCTTCCCCATTT 
CCCAAGTCAGACTTTGGGCCTTACAACTGATAATGGTCTCCACACCTTCACTTCTGGTGG 
TTTTACATGTAGCCTATCATGAGGGTAGAGAGAAAAGGCACAGAAAGAAACTCTATGTCA 
GCCCAGGTACAATGGATGGGGGCCTATGGTACGCTTATCTTATCAGCCTCATTGTTAAAA 
CTGGTTTTGAAATTGGCTTCCTTGTTTTATTTTATAAGCTATATGATGGCTTTAGTGTTC 
CCTACCTTATAAAGTGTGATTTGAAGCCTTGTCCCAACACTGTGGACTGCTTCATCTCCA 
AACCCACTGAGAAGACGATCTTCATCCTCTTCTTGGTCATCACCTCATGCTTGTGTATTG 
TGTTGAATTTCATTGAACTGAGTTTTTTGGTTCTCAAGTGCTTTATTAAGTGCTGTCTCC 
AAAAATATTTAAAAAAACCTCAAGTCCTCAGTGTGTGAGTGCCACAGCCTCAGATATGTT 
GAATGTG 



A NOV58b polypeptide (SEQ ID NO:212) encoded by SEQ ID NO:21 1 is 628 amino 
acids in length and is presented using the one-letter amino acid code in Table 58D. The Psort 
profile for NOV58b predicts that this sequence has a signal peptide and is likely to be 
5 localized at the plasma membrane with a certainty of 0.6000. The Signal P predicts a likely 
cleavage site for a NOV58b peptide is between positions 40 and 41, i.e., at the dash in the 
sequence VAA-EH. 



Table 58D. NOV58b Polypeptide Sequence (SEQ ID 
NO:212) 

MSWMFLRDLLSGVNKYSTGIGWIWLAWFVFRLL.VYMVAAEHVWKDEQKEFECNSRQPGC 
KNVC FDD F F P I S Q VRLWAL.QL I M V S T P S L LWLHVAY HE GRE KRHRK KL Y VS PGTMDGGL 
WYAYLISLIVKTGFEIGFLVLFYKLYDGFSVPYLIKCDLKPCPNTVDCFISKPTEKTIFI 
LFLVITSCLCIVLNFIELSFLVLKCFIKCCLQKYLKKPQVLSV 



10 A BLAST analysis of NOV58 was run against the proprietary PatP GENE SEQ Protein 

Patent database. It was found, for example, that the amino acid sequence of NOV58 had high 
homology to other proteins as shown in Table 58E. 



Table 58E. BLASTX results from PatP database for NOV58 










Smallest 










Sum 








High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp 


AAY32 07 9 


Human gap junction protein beta-4 


689 


1 .2e-67 


patp 


AAY36145 


Human secreted protein #17 - Homo sapiens 


689 


1.2e T 67 


patp 


AAY36192 


Human secreted protein #64 - Homo sapiens 


689 


1 .2e-67 


patp 


AAY70457 


Human membrane channel protein- 7 (MECHP-7) 


666 


3 .3e-65 


patp 


AAG74 001 


Human colon cancer antigen protein 


657 


3.0e-64 



15 In a search of public sequence databases, it was found, for example, that the nucleic 

acid sequence of this invention has 240 of 336 bases (71%) identical to a gb:GENBANK- 
ID:RATCXN31 1 A|acc:M76533.1 mRNA from Rattus norvegicus (Rattus norvegicus 
connexin (CXN-31 1) gene). The full amino acid sequence of the protein of the invention was 
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found to have 125 of 220 amino acid residues (56%) identical to, and 170 of 220 amino acid 
residues (77%) similar to, the 271 amino acid residue ptnr:SWISSNEW-ACC:Q02739 protein 
from Mus musculus (Mouse) (GAP JUNCTION BETA-5 PROTEIN (CONNEXIN 31.1) 
(CX31 .1)). NOV58 also has homology to the other proteins shown in the BLASTP data in 
5 Table 58F. 



Table 58F. NOV58 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%> 


Expect 


gi | 15990851 | e 
mb j CAC93845 . 1 
| (AJ414563) 


connexin25 
[Homo sapiens] 


223 


222/223 
(99) 


222/223 
(99) 


e-116 


gi | 10835079 | r 
ef |NP_005259 . 
1 | (NM 005268) 


gap junction protein, 
beta 5 (connexin 31.1) 
[Homo sapiens] 


273 


123/216 
(56) 


161/216 
(73) 


le-67 


gi | 6753996 | re 
f |NP_034421. 1 
| (NM_010291) 


gap junction membrane 
channel protein beta 5; 
connexin 31.1 
[Mus musculus] 


271 


123/214 
(57) 


166/214 
(77) 


le-67 


gi | 15029850 |g 
b|AAH11148 . 1 | 
AAH11148 
(BC011148) 


Similar to gap junction 
membrane channel protein 
beta 5 

[Mus musculus] 


271 


123/214 
(57) 


166/214 
(77) 


7e-67 


gi |4009522 | gb 
| AAC95472 . 1 | 
(AF099731) 


connexin 31.1 
[Homo sapiens] 


273 


122/216 
(56) 


160/216 
(73) 


7e-67 



This BLASTP data is displayed graphically in the ClustalW in Table 58G. A multiple 
sequence alignment is given, with the NOV58a and b protein being shown on line 1 and 2 in a 
10 ClustalW analysis comparing the protein of the invention with the related protein sequences 
shown in Table 58F. 



Table 58G. ClustalW Alignment of NOV58 



NOV5 8a 
NOV5 8b 
gi|l599085l| 
gi | 10835079 | 
gi|6753996| 
gi|l5029850| 
gi j 4009522 | 



NOV5 8a 

NOV58b 

gi|!599085l| 

gi | 10835079 j 

gi|6753996| 

gi|l5029850| 

gi|4009522 | 



(SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 



NO:210) 
NO:212) 
NO:592) 
NO: 593) 
NO:594) 
NO:595) 
NO: 5 96) 

10 




LLSGVNKYST 
LLSGVNKYST 
LLSGVNKYST 
LLSGVNKYST 
LLSGVNKYST 
LLSGVNKYST 
LLSGVNKYST 




496 



,J! *"T »* !. >'-*U >-~i 



70 



NOV5 8a 

NOV58b 

gi|l599085l| 

gi | 10835079 j 

gi|6753996| 

gi|l5029850 | 

gi|4009522| 



NOV5 8a 

NOV58b 

gi | 15990851 | 

gi|l0835079| 

gi|6753996| 

gi|l5029850 | 

gi|4009522| 



61 
61 
61 
61 
61 
61 
61 



kS 

Kg 



NOV5 8a 


114 


NOV5 8b 


114 


gi | 15990851 | 


114 


gi | 10835079 | 


121 


gi j 6753996 | 


119 


gi|l5029850 | 


119 


gi|4009522| 


121 


NOV5 8a 


174 


NOV5 8b 


174 


gi | 15990851 | 


174 


gi j 10835079 | 


181 


gi j 6753996 | 


179 


gi j 15029850 | 


179 


gi |4009522 | 


181 



I 



I 



80 
. I . . 



100 



110 



VCFDgFFPg 

vcfdSffpI 
vcfdSffp| 
vcfd|ffp| 
vcSSdIffp! 



JVRLWALQLIIm, 
VRLWALQLHj 

T .TXT R T .PlT . TRW 



iSfflVRLWALQLI 
|sjgVRLWALQLigV| 
iSgVRLWALQLlfflvi 
SvRLWALQLlSvj* 
SvRLWALQLlSv^ 



VCgjDj3FFP|S 
VCFDiaFFpS.c: 



130 




120 



150 



160 



AHGENSGR 
YQSglG--EGY 
r G — EGY 
[GENSGR 

170 




180 




190 



220 




SfCL 


CI 




LKT 


FI 


EL 


s| 


SjCL 


CI 




LN 


FI 


EL 


1 


§CL 


CI 




LN 


FI 


EL 




m$. 


CI 




LN 


LV 


EL 


I Y 




CI 




LN 


LV 


EL 


IY 




CI 




LN 




EL 


IY 


P4 


CI 




LN 




EL 


I Y 




230 
..| 



I 



CLQ&Y 

CLcfo 

jCLQKY 

KAQ AM CTGHH PHGTT S S 
RP PT AHAKNDPNWANS PS 
RP PT AHAKND PNWANS PS 
.QAMCTGHHPHGTTSS' 




I 



250 



260 



270 



218 PQVfgV 22 3 

218 PQV^V 223 

218 PQ t vffiv 22 3 

241 DDEBgGDLIFLGSDSHPPLLPDRPRDHVKKTIL 273 

23 9 KDfMcDLIFLGSDAHPPLLPDRPRAHVKKTIL 271 

23 9 KDF^HcDL I FLGSDAHP PLL PDRPRAHVKKT I L 271 

241 DDlEgDLIFLGSDSHPPLLPDRPRDHVKKTIL 27 3 



10 
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The presence of identifiable domains in NOV58 was determined by searches using 
software algorithms such as PROSITE, DOMAIN, Blocks, Pfam, ProDomain, and Prints, and 
then determining the Interpro number by crossing the domain match (or numbers) using the 
Interpro website (http:www.ebi.ac.uk/ interpro). DOMAIN results for NOV58 as disclosed in 
Table 58H, were collected from the Conserved Domain Database (CDD) with Reverse 
Position Specific BLAST analyses. This BLAST analysis software samples domains found in 
the Smart and Pfam collections. For Table 58H fully conserved single residues are indicated 
by the sign (|) and "strong" semi-conserved residues are indicated by the sign (+). The 
"strong" group of conserved amino acid residues may be any one of the following groups of 
amino acids: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, FYW. 

Table 58H lists the domain description from DOMAIN analysis results against 
NOV58. This indicates that the NOV58 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 58H. Domain Analysis of NO V58 




gnl | 


Pf am | pf am00029 , connexin, Connexin. SEQ ID NO: 


859 




Score = 


CD-Length = 218 residues, 100.0% aligned 
- 265 bits (678), Expect = 2e-72 




NOV5 8 : 
Sbjct : 


1 
l 


MSWMFLRDLLSGWKYSTGTGWIWLAWFVFRLLVY1WAAEHVWKDEQKEFECNSRQPGC 
M W FL LL GVNK+ST G IWL+V+F+FR+LV VAAE VW DEQ +F CN++QPGC 
MDWSFLGRLLEGVNKHSTAIGKIWLSVLFIFRILVLGVAAESVWGDEQSDFVCNTQQPGC 


60 
60 


NOV58 : 
Sbjct: 


61 
61 


KNVCFDDFFPISQVRLWALQLIMVSTPSLLWLHVAYHEGREKRHRKK LYVSP 

+NVC+D FFPIS VRLW LQLI VSTPSLL + HVAY RE+ + R+K LY 
ENVCYDQFFPISHVRLWVLQLIFVSTPSLLYLGHVAYRVRREEKLREKEEEHSKGLYSEE 


113 
120 


NOV58 : 
Sbjct : 


114 
121 


G TMDGGL W YAY L I S L I VKTG FE I G FL VL F Y KL YDG F S VP YL IKCDLKPC 

+ GGLW+ Y+ S+I K+ FE+GFL Y LY GF++ L+ C PC 
AKKRCGSEDGKVRI RGGLWWTYVFS 1 1 FKS I FEVGFLYGQYLLY - GFTMS PLWCSRAPC 


162 
179 


NOV5 8 : 
Sbj Ct : 


163 
180 


PNTVDCFI SKPTEKTIFILFLVITSCLCIVLNFIELSFL 2 01 
P+TVDCF+S+PTEKTIFI+F+++ S +C++LN EL +L 
PHTVDCFVSRPTEKTI FIVFMLWSAICLLLNLAELFYL 218 





Connexins are a family of integral membrane proteins that oligomerise to form 
intercellular channels that are clustered at gap junctions. These channels are specialized sites 
of cell-cell contact that allow the passage of ions, intracellular metabolites and messenger 
5 molecules (with molecular weight <l-2 kD) from the cytoplasm of one cell to its apposing 

neighbours. They are found in almost all vertebrate cell types, and somewhat similar proteins 
have been cloned from plant species. Invertebrates utilise a different family of molecules, 
innexins, that share a similar predicted secondary structure to the vertebrate connexins, but 
have no sequence identity to them. 

10 Vertebrate gap junction channels are thought to participate in diverse biological 

functions. For instance, in the heart they permit the rapid cell-cell transfer of action potentials, 
ensuring coordinated contraction of the cardiomyocytes. They are also responsible for 
neurotransmission at specialised 'electrical' synapses. In non-excitable tissues, such as the 
liver, they may allow metabolic cooperation between cells. In the brain, glial cells are 

15 extensively-coupled by gap junctions; this allows waves of intracellular Ca2+ to propagate 
through nervous tissue, and contribute to their ability to spatial ly-buffer local changes in 
extracellular K+ concentration. 

The connexin protein family is encoded by at least 13 genes in rodents, with many 
homologues cloned from other species. They show overlapping tissue expression patterns, 

20 most tissues expressing more than one connexin type. Their conductances, permeability to 

different molecules, phosphorylation and voltage-dependence of their gating, have been found 
to vary. Possible communication diversity is increased further by the fact that gap junctions 
may be formed by the association of different connexin isoforms from apposing cells. 
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However, in vitro studies have shown that not all possible combinations of connexins produce 
active channels. 

Hydropathy analysis predicts that all cloned connexins share a common 
transmembrane (TM) topology. Each connexin is thought to contain 4 TM domains, with two 
5 extracellular and three cytoplasmic regions. This model has been validated for several of the 
family members by in vitro biochemical analysis. Both N- and C-termini are thought to face 
the cytoplasm, and the third TM domain has an amphipathic character, suggesting that it 
contributes to the lining of the formed-channel. Amino acid sequence identity between the 
isoforms is -50-80%, with the TM domains being well conserved. Both extracellular loops 

10 contain characteristically conserved cysteine residues, which likely form intramolecular 

disulphide bonds. By contrast, the single putative intracellular loop (between TM domains 2 
and 3) and the cytoplasmic C-terminus are highly variable among the family members. Six 
connexins are thought to associate to form a hemi-channel, or connexon. Two connexons then 
interact (likely via the extracellular loops of their connexins) to form the complete gap 

15 junction channel. Two sets of nomenclature have been used to identify the connexins. The 
first, and most commonly used, classifies the connexin molecules according to molecular 
weight, such as connexin43 (abbreviated to Cx43), indicating a connexin of molecular weight 
close to 43 kD. However, studies have revealed cases where clear functional homologues 
exist across species that have quite different molecular masses; therefore, an alternative 

20 nomenclature was proposed based on evolutionary considerations, which divides the family 
into two major subclasses, alpha and beta, each with a number of members. 

Due to their ubiquity and overlapping tissue distributions, it has proved difficult to 
elucidate the functions of individual connexin isoforms. To circumvent this problem, 
particular connexin-encoding genes have been subjected to targeted-disruption in mice, and 

25 the phenotype of the resulting animals investigated. Around half the connexin isoforms have 
been investigated in this manner. Further insight into the functional roles of connexins has 
come from the discovery that a number of human diseases are caused by mutations in 
connexin genes. For instance, mutations in Cx32 give rise to a form of inherited peripheral 
neuropathy called X-linked dominant Charcot-Marie-Tooth disease. Similarly, mutations in 

30 Cx26 are responsible for both autosomal recessive and dominant forms of nonsyndromic 
deafness, a disorder characterised by hearing loss, with no apparent effects on other organ 
systems. 

The disclosed NOV58 is a connexin-like protein localized to gap junctions. Gap 
junctions were first characterized by electron microscopy as regionally specialized structures 
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on plasma membranes of contacting adherent cells. These structures were shown to consist of 
cell-to-cell channels. Proteins, called connexins, purified from fractions of enriched gap 
junctions from different tissues differ. The connexins are designated by their molecular mass. 
Another system of nomenclature divides gap junction proteins into 2 categories, alpha and 
5 beta, according to sequence similarities at the nucleotide and amino acid levels. For example, 
CX43 is designated alpha- 1 gap junction protein, whereas CX32 and CX26 are called beta-1 
and beta-2 gap junction proteins, respectively. This nomenclature emphasizes that CX32 and 
CX26 are more homologous to each other than either of them is to CX43. 

Willecke et al. (1990) used rat connexin gene probes in Southern blot analysis of 

10 human-mouse somatic cell hybrids to map the CX26 gene to chromosome 13. By means of 
somatic cell hybrids, Hsieh et al. (1991) assigned the GJB2 gene to chromosome 13 in man 
and chromosome 14 in the mouse. Haefliger et al. (1992) showed that the rat homologs of the 
CX26 and CX46 genes are tightly linked on chromosome 14. By isotopic in situ hybridization, 
Mignon et al. (1996) mapped GJB2 to 13ql l-ql2 and confirmed the assignment to mouse 

15 chromosome 14. Kelsell et al. (1997) studied a pedigree containing individuals with autosomal 
dominant deafness and identified a mutation in the CX26 gene: a 101T-C transition resulting 
in a met34-to-thr amino acid substitution. CX26 mutations resulting in premature stop codons 
were also found in 3 autosomal recessive nonsyndromic sensorineural deafness pedigrees, 
genetically linked to 13ql l-ql2, where the CX26 gene is localized. 

20 Immunohistochemical staining of human cochlear cells for CX26 demonstrated high 

levels of expression. Kelley et al. (1998) presented evidence that the 101T-C missense 
mutation identified by Kelsell et al. (1997) in individuals with autosomal dominant 
nonsyndromic deafness is not sufficient to cause hearing loss. Carrasquillo et al. (1997) 
performed linkage analysis in 2 interrelated inbred kindreds in a single Israeli-Arab village 

25 containing more than 50 individuals with nonsyndromic recessive deafness. Genetic mapping 
demonstrated that a gene located at 13ql 1 (DFNB1) segregated with the deafness in these 2 
kindreds. Haplotype analysis, using 8 microsatellite markers spanning 15 cM in 13ql 1, 
suggested the segregation of 2 different mutations in this extended kindred; affected 
individuals were homozygotes for either haplotype or compound heterozygotes. Carrasquillo 

30 et al. (1997) identified 2 distinct mutations, trp77 to arg and 35delG, in the CX26 gene, both of 
which were predicted to inactivate connexin 26. 

The recombination of marker alleles involving polymorphisms in 13ql 1, at known 
map distances from the mutations, allowed them to estimate the age of the mutations to be 3 to 
5 generations (75 to 125 years). The study demonstrated that in small populations with high 
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rates of consanguinity, as compared with large outbred populations, recessive mutations may 
have very recent origin and show allelic diversity. They pointed to the same phenomenon 
being observed for Hurler syndrome with 3 unique mutations and for metachromatic 
leukodystrophy with 5 distinct mutations, discovered among the Druze and Muslim Arab 
5 villages in Israel. In light of these findings, the authors commented that it is likely that 

homozygosity mapping studies in highly inbred communities may be compromised, as may be 
studies of mapping by linkage disequilibrium, unless the possibility of mutational diversity is 
taken into account. 

Lench et al. (1998) studied the role of CX26 mutations in singleton (sporadic) cases of 

10 nonsyndromal sensorineural deafness. Such mutations were identified in 4 of 43 U.K. and 2 of 
25 Belgian patients. Thus, about 10% of families presenting with a child sporadically affected 
with this disorder can be offered definitive mendelian recurrence risks. This was said to be the 
first genetic test available for screening such children. Kelley et al. (1998) analyzed 58 
multiplex families each having at least 2 affected children diagnosed with autosomal recessive 

15 nonsyndromic deafness. Mutations in both alleles of GJB2 were observed in 20 of the 58 

families. A 30delG allele occurred in 33 of the 1 16 chromosomes, for a frequency of 0.284. 
This mutation was observed in 2 of 192 control chromosomes, for an estimated gene 
frequency of 0.01 +/- 0.007. The homozygous frequency of the 30delG allele was then 
estimated at 0.0001, or 1 in 10,000. Given that the frequency of all childhood hearing 

20 impairment is 1 in 1,000 and that half of that is genetic, the specific mutation 30delG is 
responsible for 10% of all childhood hearing loss and for 20% of all childhood hereditary 
hearing loss. Six novel mutations were also observed in the affected population. 

Murgia et al. (1999) studied 53 unrelated individuals with nonsyndromic sensorineural 
hearing impairment and carried out CX26 mutation analysis. Mutations were found in 53% of 

25 cases, in 35.3% of those in whom autosomal recessive inheritance was thought likely and in 
60% of the presumed sporadic cases. Three novel mutations were found. The hearing deficit 
varied from mild to profound even within the same family. Among patients with profound 
hearing loss, 35.5% were found to have a mutation; among those severely impaired, 20%; and 
among those moderately impaired, 33.3%. 

30 Rabionet et al. (2000) analyzed the GJB2 gene in 576 families/unrelated patients with 

recessive or sporadic deafness from Italy and Spain, 193 of them being referred as autosomal 
recessive and the other 383 as apparently sporadic. Of the 1,152 unrelated GJB2 
chromosomes, 37% had GJB2 mutations. A total of 23 different mutations were detected. 
Mutation 35delG was the most common, accounting for 82% of all GJB2 deafness alleles. It 
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represented 88% of the alleles in Italian patients and only 55% in Spanish cases. Sobe et al. 
(2000) sequenced the entire coding region of the GJB2 gene in 75 hearing-impaired children 
and adults in Israel. Was both prelingual and postlingual, with hearing loss ranging from 
moderate to profound. Almost 39% of all persons tested harbored GJB2 mutations, most of 
5 which were 35delG and 167delT. A novel mutation, involving both a deletion and an insertion, 
51dell2insA, was identified in a family originating from Uzbekistan. All GJB2 mutations 
were associated with prelingual hearing loss, although severity ranged from moderate to 
profound, with variability even among hearing-impaired sibs. No significant difference in 
hearing levels was found between individuals with 35delG and 167delT mutations. 

10 Antoniadi et al. (2000) screened 26 unrelated Greek patients with prelingual 

sensorineural deafness in whom syndromic forms and environmental causes of deafness had 
been excluded. They detected the 35delG mutation in 28 chromosomes (53.8%); another 3 
sequence variations accounted for 7.6% of the alleles. Wilcox et al. (2000) performed 
mutation analysis of the GJB2 gene and audiology on 106 families presenting with at least 1 

15 child with congenital hearing loss. In 74 families (80 children), the etiology was consistent 
with nonsyndromic recessive hearing loss. Six different GJB2 mutations, including 1 novel 
mutation, were identified. They found that GJB2 mutations caused a range of phenotypes from 
mild to profound hearing impairment and that loss of hearing in the high-frequency range 
(4,000 to 8,000 Hz) is a characteristic feature in children with molecularly diagnosed CX26 

20 hearing impairment. They also demonstrated that high frequency hearing loss was found in a 
group of similar size of deaf children in whom a mutation could be found in only one of the 
GJB2 alleles. In their study, the M34T mutation was associated with hearing loss only when 
present in compound heterozygous state, suggesting autosomal recessive inheritance. 
Heathcote et al. (2000) reported a missense mutation in affected members of a family with 

25 autosomal dominant deafness and palmoplanar keratoderma. Rabionet et al. (2000) reviewed 
the molecular genetics of hearing impairment due to mutations in gap junction genes encoding 
beta-connexins. Among these genes, mutations in GJB2 account for about 50% of all 
congenital cases of hearing impairment. Three mutations in GJB2 are particularly common in 
specific populations: 35delG in Caucasoids, 167delT in Ashkenazi Jews, and 235delC in 

30 Orientals. Carrier frequencies in these populations vary between 1 and 30 and 1 in 75. Over 50 
mutations have been identified in the GJB2 gene, of which some missense changes have a 
dominant-negative action in hearing impairment, with partial to full penetrance. Functional 
studies for some missense mutations in connexins 26, 30, and 32 indicate abnormal gap 
junction conductivity. Expression patterns in mouse and rat cochlea indicate that connexin 26 
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and connexin 30 are expressed in the supporting cells of the cochlea, suggesting a potential 
role in endolymph potassium recycling. 

In the Japanese population, Kudo et al. (2000) sequenced the GJB2 gene in 39 patients 
with prelingual deafness, 39 patients with postlingual progressive sensorineural hearing loss, 
5 and 63 individuals with normal hearing. GJB2 mutations were found in 5 of the 39 patients 

(12%) with prelingual deafness. The most common mutation was 235delC observed in 7 of 10 
mutant alleles. There were no cases with the 30delG allele. No GJB2 mutation was found in 
patients in the postlingual hearing loss group. Nance et al. (2000) noted that recessive 
mutations at the connexin-26 gene locus account for nearly half of all cases of genetic 

10 deafness in many populations. They suggested that this high frequency is only seen in 

populations with a long tradition of intermarriage among deaf people. Available data are 
consistent with the hypothesis that such marriages might well have contributed to the high 
frequency of connexin-26 deafness in the U.S., and could represent a novel mechanism for 
maintaining specific genotypes at unexpectedly high frequencies. 

15 The NOV58 disclosed in this invention is predicted to be expressed in at least the 

following tissues: brain, lung, ovary, and colon. This information was derived by determining 
the tissue sources of the sequences that were included in the invention including but not 
limited to SeqCalling sources, public EST sources, literature sources, and/or RACE sources. 
Further expression data for NOV58 is provided in Example 2. 

20 The nucleic acids and proteins of NOV58 are useful in potential therapeutic 

applications implicated in various gap junction-related pathological disorders described further 
herein. The NOV58 nucleic acid encoding the connexin-like protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. The novel nucleic acid of the 

25 invention encoding a Connexin-like protein includes the nucleic acid whose sequence is 

provided in Table 58A or 58C, or a fragment thereof. The invention also includes a mutant or 
variant nucleic acid any of whose bases may be changed from the corresponding base shown 
in Table 58A or 58C while still encoding a protein that maintains its connexin-like activities 
and physiological functions, or a fragment of such a nucleic acid. The invention further 

30 includes nucleic acids whose sequences are complementary to the sequence shown in Table 
58A or 58C, including nucleic acid fragments that are complementary to any of the nucleic 
acids just described. The invention additionally includes nucleic acids or nucleic acid 
fragments, or complements thereto, whose structures include chemical modifications. Such 
modifications include, by way of non-limiting example, modified bases, and nucleic acids 
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whose sugar phosphate backbones are modified or derivatized. These modifications are carried 
out at least in part to enhance the chemical stability of the modified nucleic acid, such that 
they may be used, for example, as antisense binding nucleic acids in therapeutic applications 
in a subject. In the mutant or variant nucleic acids, and their complements, up to about 29% of 
5 the bases may be so changed. The novel protein of the invention includes the connexin-like 
protein whose sequence is provided in Table 58B or 58D. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 58B or 58D while still encoding a protein that maintains its connexin- 
like activities and physiological functions, or a functional fragment thereof. In the mutant or 

10 variant protein, up to about 44% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 

1 5 below. 

NOV59 

A eukaryotic translation initiation factor 5 (EIF5), disclosed herein as NOV59, 
interacts with the 40S initiation complex to promote hydrolysis of bound GTP with 

20 concomitant joining of the 60S ribosomal subunit to the 40S initiation complex. The resulting 
functional 80S ribosomal initiation complex is then active in peptidyl transfer and chain 
elongations. The disclosed NOV59 (alternatively referred to herein as CG56633-01) includes 
the 1328 nucleotide sequence (SEQ ID NO:213) shown in Table 59A. A NOV59 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 34-36 and ends with a TGA 

25 codon at nucleotides 1273-1275. The disclosed NOV59 maps to human chromosome 3. 



Table 59 A. NOV59 Nucleotide Sequence (SEQ ID 
NO:213) 

CTTTCCTCATCACCTTAAATTCGGGTGTCTTTTATGAGTAATCAAAAGCAGCAAAAGCCA 

ACGCTATCAGGCCCAGTATTTAAAACCAGAAAAAGAGATGAAAAAGAGAGGTTTGACCCT 
ACTCAGTTTCAGGACTACGTTATTCAAGGCTTAACTGAAACTGGTACTGATTTGGAAGCA 
GTAGCTAAGTTCCTTGATGCTTCTGGAACAAAACTTGATTACCGTCGATGTGCAGAAACA 
CTCTTTGACATTCTGGTGGGTGGTGGAATGCTGGCCCCAGGTGGTACACTGGCAGATGAC 
ATCATGCGTACAGATGTCTGCGTGTTTGCAGCCCAAGAAGACCTAGAGACCATGCAAGCA 
TTTGCTCAGGTTTTTAACAAGTTAATCAGGCACTACAAATACCTTGAGAAATGTTGTGAA 
GATGAAGTAAAAAGGCTGCTGGTGTTCGGAAAGGGTTTTTCAGACTCGGAGAGGAAAAAA 
CTGGCTATGTTGACTGGTGTTCTTCTGGCTAATGCATCCATTCTTAATAGCCTTTATAAT 
GAGAATTTGGTTAAAGAAGGGGTTTCAACAGCTTTTGCTGGAAAGCTATTTAAATCATGT 
ATAAATGAAAAAGATATCAATGCAGTAACTGCAAGGAAAGTCAGCATGGATAACAGCCTG 
ATGGAACTTTTTCCTGCCAATAAGCAAAGCGTTCAACACTTCACGAAGTATTTTACTGAG 
GCAGGCCTGAAAGAGCTTTCAGAATATGTTCGGAATCAGCAAACCATCAGAGCTTGTAAG 
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GAGCTGCAGAAAGAACTTCAAGAACAGATGTCCCGTGGGGATCCATTTAAGGTTATAATT 
TTATATGTCAAGGAGGAGATGAAAAAAAACAACATCCCAGAACCAGTTGTCATCGAAATA 
GTCTGGTCAAATGTAATGAGCGCTGTGGAATGGAACAAAAGAGAGGAGATTGTAGCAGAG 
CAAGCCATCAAACACTTGAAGCAACACAGCCCTCTACTTGCTGCCTTTACTACTCAAAGT 
CAGTCTGAGCTGACCCTGTTACTGAAGATTCAGGAGTATTGCTATGACAACATTCATTTC 
ATGAAAGCCTTACGGAAAATAGTGGTGCTTTTTTATAAAGCTGTAGTCCTGAGCAAAGAG 
ACCATTTTGAAGTGGTATAAAGGTACACATGTTGCAAAGGGGAAGAGTGTTTTCCTTGAG 
CAAATGAAAAAGTTTGGAGAGTGGCTCAAAAATGCTGAAGAAGAATCTGAATCTGAAGCT 
GAAGAAGGTGACTGAATTTTGAAACTACACCCTCAGTAAAGCAAACAGGAGTTGTAGATA 
AAATGTCC __ 



The NOV59 polypeptide (SEQ IDNO:214) encoded by SEQ ID NO:213 is 413 amino 
acids in length and is presented using the one-letter amino acid code in Table 59B. The Psort 
profile for NOV59 predicts that this sequence has no signal peptide and is likely to be 
localized to the nucleus with a certainty of 0.7600. In alternative embodiments, a NOV59 
polypeptide is located to lysosomes with a certainty of 0.10000. 



Table 59B. NOV59 Polypeptide Sequence (SEQ ID 
NO:214) 

MSNQKQQKPTLSGPVFKTRKRDEKERFDPTQFQDYVIQGLTETGTDLEAVAKFLDASGTK 
LDYRRCAETLFDILVGGGMLAPGGTLADDIMRTDVCVFAAQEDLETMQAFAQVFNKLIRH 
YKYLEKCCEDEVKRLLVFGKGFSDSERKKLAMLTGVLLANASILNSLYNENLVKEGVSTA 
FAGKLFKSCINEKDINAVTARKVSMDNSLMELFPANKQSVQHFTKYFTEAGLKELSEYVR 
NQQTIRACKELQKELQEQMSRGDPFKVIILYVKEEMKKNNIPEPWIEIVWSNVMSAVEW 
NKREEIVAEQAIKHLKQHSPLLAAFTTQSQSELTLLLKIQEYCYDNIHFMKALRKIWLF 
YKAWLS KETILKWYKGTHVAKGKSVFLEQMKKFGEWLKNAEEESESEAEEGD 



A BLAST analysis of NOV59 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV59 had high 
homology to other proteins as shown in Table 59C. 



Table 59C. BLASTX results from PatP database for NOV59 


Smallest 










Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp : AAB43883 Human cancer associated protein sequence 


1834 


5 . 6e-189 


patp : AAW93 950 Human regulatory molecule HRM-6 protein 


1403 


2 . 6e- 143 


patp : AAB92726 Human protein sequence 


1403 


2 . 6e-143 


patp : AAM3 8 7 64 Human polypeptide 


1403 


2 . 6e-143 


patp : AAM4 0550 Human polypeptide 


1403 


2 . 6e-143 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 778 of 824 bases (94%) identical to a gb:GENBANK- 
ID:HUMRSC419|acc:D13630.1 mRNA from Homo sapiens (Human mRNA for KIAA0005 
gene. The full amino acid sequence of the protein of the invention was found to have 372 of 
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419 amino acid residues (88%) identical to, and 385 of 419 amino acid residues (91%) similar 
to, the 419 amino acid residue ptnr:SPTREMBL-ACC:Q 15394 protein from Homo sapiens 
(Human) (KI AA0005 PROTEIN). NOV59 also has homology to the other proteins shown in 
the BLASTP data in Table 59D. 



Table 59D. NOV59 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 

(%> 


Expect 


gi | 7661850 | re 
f |NP_0554 85 . 1 
| (NM_014 670) 


basic leucine- zipper 
protein B ZAP 4 5 ; KIAA0 0 05 
gene product [Homo 
sapiens] 


419 


372/419 
(88) 


385/419 
(91) 


0 . 0 


gi | 7661744 | re 
f |NP_054757 . 1 
[ (NM_014038) 


HSPC028 protein 
[Homo sapiens] 


419 


264/406 
(65) 


355/406 
(82) 


e-143 


gi | 15341786 |g 
b | AAH13060 . 1 | 
AAH13060 
(BC013060) 


HSPC028 protein 
[Mus musculus] 


419 


264/406 
(65) 


334/406 
(82) 


e-143 


gi |4426565 |gb 
| AAD20436 . 1 | 
(AF031483 ) 


unknown 

[Rattus norvegicus] 


419 


264/406 
(65) 


334/406 
(82) 


e-143 


gi | 11640562 |g 
b| AAG3 92 7 8 . 1 | 
AF110323_1 
(AF110323) 


MSTP017 
[Homo sapiens] 


419 


263/406 
(64) 


334/406 
(81) 


e-143 



This BLASTP data is displayed graphically in the ClustalW in Table 59E. A multiple 
sequence alignment is given, with the NOV59 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
10 Table 59D. 



Table 59E. ClustalW Alignment of NOV59 



NOV59 

gi|7661850 | 
gi|7661744| 
gi | 15341786 | 
gi|4426565| 
gi|H640562| 



(SEQ ID NO: 214) 

(SEQ ID NO: 597) 

(SEQ ID NO: 5 98) 

(SEQ ID NO: 599) 

(SEQ ID NO: 600) 

(SEQ ID NO:601) 



NOV59 

gi|7661850| 
gi | 7661744 I 
gi|l5341786| 
gi|4426565 | 
gi|H640562] 




50 



60 



EHGnDLEAVAKFLDj||G 

e5g3dleavakfld^g] 
eagddleavakfldstgsr 
eagddleavakfldstgsr 
eagddleavakfldstgsr 
eagddleavakfldstgsr 



Si 60 

31 60 
58 
58 
58 
58 



NOV59 

gi|7661850 | 
gi [7661744 | 



110 



120 
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gi | 15341786 | 
gi|4426565| 
gi j 11640562 | 



NOV59 

gi | 7661850 | 
gi|7661744| 
gi 1 15341786 | 
gi|4426565| 
gi j 11640562 | 



59 
59 
59 



LDYRRYADTLFDILVAGSMLAPGGTRIDDGDKTKMTNHCVFSANEDHETIRNYAQVFNKL 
LDYRRYADTLFDILVAGSMLAPGGTRIDDGDKTKMTNHCVFSANEDHETIRNYAQVFNKL 
LDYRRYADTLFDI LVAGSMLAPGGTRIDDGDKTKNTNHCVF5ANEDHETIRNYAQVFNKL 



118 
118 
118 



NOV59 


118 


gi|7661850| 


118 


gi|7661744| 


119 


gij 15341786 | 


119 


gi | 4426565 | 


119 


gij 11640562 | 


119 


NOV59 


174 


gi|7661850| 


178 


gi|7661744| 


179 


gi j 15341786 | 


179 


gi|4426565| 


179 


gi | 11640562 | 


179 



232 
238 
239 
239 
239 
239 



NOV59 


292 


gi|7661850 | 


298 


gi|7661744| 


299 


gij 15341786] 


299 


gi|4426565| 


299 


gi | 11640562 | 


299 


NOV59 


352 


gi|7661850| 


358 


gi |7661744 j 


359 


gi | 15341786 | 


359 


gi|4426565| 


359 


gi|H640562 | 


359 




NOV59 

gi|7661850| 
gi |7661744 j 
gi | 15341786 | 
gi|4426565| 
gij 11640562 [ 



412 


GD 


413 


418 


GD 


419 


419 




419 


419 




419 


419 




419 


419 




419 



Table 59F lists the domain description from DOMAIN analysis results against NOV59. 
This indicates that the NOV59 sequence has properties similar to those of other proteins 
known to contain this domain. 
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Table 59F. Domain Analysis of NOV59 




gni 1 


Load 1 LOAD W2, W2, conserved protein-protein interaction domain in translation 


factors like eIF2B SEQ ID NO : 86 0 






CD-Length =116 residues, 96.6% aligned 






Score = 83.6 bits (205), Expect = 2e-17 


NOV59: 
Sbjct : 


290 
1 


WSNVMSAVEWNKRE E I VAEQAI KHLKQHS PLLAAF TTQS OS EL TLLLK1 QEYCYDN I HF 34 9 
V + + S + E A+K K+ PliLA + S + L LL + + E + C + 
VALVILSVASIELADNEPKEAAVKVFKKWGPLLAKYLKDEDSQLELLYALEEFCEELEEL 60 


NOV59: 
Sbjct: 


350 
61 


MKALRKIWLFYKAWLSKETILKWY- KGTHVAKGKSVFLEQMKKFGEWLKN 4 00 
+ K L KI+ Y VL + E ILKWY K + +GK L+ K F WL+ 
LKLLAKIIiKYLYDEDVLEEEAILKWYEKKSKAEEGKKKVLKSAKPFVTWLQE 112 



The NOV59 disclosed in this invention is predicted to be expressed in at least the 
following tissues: brain. This information was derived by determining the tissue sources of 
the sequences that were included in the invention including but not limited to SeqCalling 
5 sources, public EST sources, literature sources, and/or RACE sources. Further expression data 
for NOV59 is provided in Example 2. 

The nucleic acids and proteins of NOV59 are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 

10 hypercalcemia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, myasthenia gravis, neuroprotection, endocrine 
dysfunctions, diabetes, obesity, growth and reproductive disorders and other diseases, 
disorders and conditions of the like. The NOV59 nucleic acid encoding the translation 

15 initiation factor 5-like protein of the invention, or fragments thereof, may further be useful in 
diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed. The novel nucleic acid of the invention encoding a translation initiation factor 
5-like protein includes the nucleic acid whose sequence is provided in Table 59A, or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 

20 bases may be changed from the corresponding base shown in Table 59A while still encoding a 
protein that maintains its translation initiation factor 5-like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to those just described, including nucleic acid fragments 
that are complementary to any of the nucleic acids just described. The invention additionally 

25 includes nucleic acids or nucleic acid fragments, or complements thereto, whose structures 
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include chemical modifications. Such modifications include, by way of non-limiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
5 binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 6% of the residues may be so changed. 

The novel protein of the invention includes the translation initiation factor 5-like 
protein whose sequence is provided in Table 59B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
10 in Table 59b while still encoding a protein that maintains its translation initiation factor 5-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 12% of the bases may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
15 methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV60 

NOV60 includes two Lynxl-like proteins, designated herein as NOV60a and NOV60b, 
20 which differ by three amino acids and the relative length of their untranslated regions (UTR's). 

NOV60a 

The disclosed NOV60a (alternatively referred to herein as CG56894-01) includes the 
715 nucleotide sequence (SEQ ID NO:215) shown in Table 60A. A NOV60a ORF begins 
25 with a Kozak consensus ATG initiation codon at nucleotides 348-350 and ends with a stop 
codon at nucleotides 696-698. 



Table 60A. NOV60a Nucleotide Sequence (SEQ ID NO:215) 

AGCTTTGTTCTTGAGTGGGTCTGCCTCGGGGGCTTTAGAGGAGACCCCAGAGGGTGGCGATGCGGCACGG 
GTGCTGCGGGACACACAGACACGCCTACGATTAGACTCAGGCAGGCACCTACCGGCGAGCGGCCGCGGGT 
GACTCCCAGGCGCGGCGGTACCTCACGGTGGTGAAGGTCACAGGGTTGCAGCACTCCCAGTAGACCAGGA 
GCTCCGGGAGGCAGGGCCGGCCCCACGTCCTCTGCGCACCACCCTGAGTTGGATCCTCTGTGCGCCACCC 
CTGAGTTGGATCCAGGGCTAGCTGCTGTTGACCTCCCCACTCCCACGCTGCCCTCCTGCCTGCAGCCATG 
ACGCCCCTGCTCACCCTGATCCTGGTGGTCCTCATGGGCTTACCTCTGGCCCAGGCCTCGGACTGCCACG 
TGTGTGCCTACAACGGAGACAACTGCTTCAACCCCATGCGCTGCCCGGCTATGGTTGCCTACTGCATGAC 
CACGCGCACCTACTACACCCCCACCAGGATGAAGGTCAGTAAGTCCTGCGTGCCCCGCTGCTTCGAGACT 
GTGTATGATGGCTACTCCAAGCACGCGTCCACCACCTCCTGCTGCCAGTACGACCTCTGCAACGGCACCG 
GCCTTGCCACCCCGGCCACCCCGGCCCTGGCCCCCATCCTCCTGGCCACCCTCTGGGGTCTCCTCTAAAG 
CCCCCGAGGCAGACA 
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The NOV60a polypeptide (SEQ ID NO:216) encoded by SEQ ID NO:215 is 1 16 
amino acids in length and is presented using the one-letter amino acid code in Table 60B. The 
Psort profile for NOV60a predicts that this sequence has a signal peptide and is likely to be 
5 exported from the cell with a certainty of 0.8200. In alternative embodiments, a NOV60a 

polypeptide is located to lysosomes with a certainty of 0.1000, or to the endoplasmic reticulum 
(membrane) with a certainty of 0.1000. The Signal P predicts a likely cleavage site for a 
NOV60a peptide is between positions 34 and 35, i.e., at the dash in the sequence AQA-SD. 



Table 60B. NOV60a Polypeptide Sequence (SEQ ID 
NO:216) 

MTPLLTLILVVLMGLPLAQASDCHVCAYNGDNCFNPMRCPAMVAYCMTTRTYYTPTRMKV 
SKSCVPRCFETVYDGYSKHASTTSCCQYDLCNGTGLATPATPALAPILLATLWGLL 

io " " ' ~ " " ~ ~ ~ i 

NOV 60b 

The disclosed NOV60b (alternatively referred to herein as CG56894-02) includes the 
876 nucleotide sequence (SEQ ID NO: ) shown in Table 60C. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 348-350 and ends with a stop codon at 
15 nucleotides 696-698. 



Table 60C. NOV60b Nucleotide Sequence (SEQ ID 
NO:217) 

AGCTTTGTTCTTGAGTGGGTCTGCCTCGGGGGCTTTAGAGGAGACCCCAGAGGGTGGCGA 
TGCGGCACGGGTGCTGCGGGACACACAGACACGCCTACGATTAGACTCAGGCAGGCACCT 
ACCGGCGAGCGGCCGCGGGTGACTCCCAGGCGCGGCGGTACCTCACGGTGGTGAAGGTCA 
CAGGGTTGCAGCACTCCCAGTAGACCAGGAGCTCCGGGAGGCAGGGCCGGCCCCACGTCC 
TCTGCGCACCACCCTGAGTTGGATCCTCTGTGCGCCACCCCTGAGTTGGATCCAGGGCTA 
GCTGCTGTTGACCTCCCCACTCCCACGCTGCCCTCCTGCCTGCAGCCATGACGCCCCTGC 
TCACCCTGATCCTGGTGGTCCTCATGGGCTTACCTCTGGCCCAGGCCTCGGACTGCCACG 
TGTGTGCCTACAACGGAGACAACTGCTTCAACCCCATGCGCTGCCCGGCTATGGTTGCCT 
ACTGCATGACCACGCGCACCTACTACACCCCCACCAGGATGAAGGTCAGTAAGTCCTGCG 
TGCCCCGCTGCTTCGAGACTGTGTATGATGGCTACTCCAAGCACGCGTCCACCACCTCCT 
GCTGCCAGTACGACCTCTGCAACGGCACCGGCCTTGCCACCCCGGCCACCCTGGCCCTGG 
CCCCCATCCTCCTGGCCACCCTCTGGGGTCTCCTCTAAAGCCCCCGAGGCAGACCCACTC 
AAGAACAAAGCTCTCGAGACACACTGCTACACCCTCGCACCCAGCTCACCCTGCCTCACC 
CTCCACACTCCCTGCGACCTCCTCAGCCATGCCCAGGGTCAGGACTGTGGGCAAGAAGAC 
ACCCGACCTCCCCCAACCACCACACGACCTCACTTC 



The NOV60b polypeptide (SEQ ID NO:218) encoded by SEQ ID NO:217 is 1 16 

amino acids in length and is presented using the one-letter amino acid code in Table 60D. The 

20 Psort profile for NOV60b predicts that this sequence is a Type la membrane protein, has a 

signal peptide, and is likely to be localized at the plasma membrane with a certainty of 0.9190. 

In alternative embodiments, a NOV60b polypeptide is located to lysosomes with a certainty of 

0.2000, or to the endoplasmic reticulum (membrane) with a certainty of 0.1000. The Signal P 
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predicts a likely cleavage site for a NOV60b peptide is between positions 20 and 21, i.e., at the 
dash in the sequence AQA-SD. 



Table 60D. NOV60b Polypeptide Sequence (SEQ ID 
NO:218) 

MTPLLTLILVVLMGLPLAQASDCHVCAYNGDNCFNPMRCPAMVAYCMTTRTYYTPTRMKV 
SKSCVPRCFETVYDGYSKHASTTSCCQYDLCNGTGLATPATLALAPILLATLWGIiL 

5 

A BLAST analysis of NOV60 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV60 had high 
homology to other proteins as shown in Table 60E. 



Table 60E. BLASTX results from PatP database for NOV60 








Smallest 








Sum 






High 


Probability 


Sequences producing High-scoring Segment Pairs: Score 


P(N) 


patp 


AAY02738 


Human secreted protein encoded by gene 89 630 


2 .2e-61 


patp 


AAM39828 


Human polypeptide 63 0 


2 . 2e-61 


patp 


AAM41614 


Human polypeptide 630 


2 .2e-61 


patp 


AAB61131 


Human NOV3 protein 5 94 


1.4e-57 


patp 


AAY79325 


Mouse receptor ligand Lynxl 521 


7 . 7e-50 



10 

Tn a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 338 of 424 bases (79%) identical to a gbrGENBANK- 
ID:AF141377|acc:AF141 377.1 mRNA from Mus musculus (Ly-6/neurotoxin homolog 
(Lynxl) mRNA). The full amino acid sequence of the protein of the invention was found to 
1 5 have 92 of 1 16 amino acid residues (79%) identical to, and 96 of 1 16 amino acid residues 

(82%) similar to, the 1 16 amino acid residue ptnr:SPTREMBL-ACC:Q9WVC2 protein from 
Mus musculus (Mouse) (LY-6/NEUROTOXIN HOMOLOG). NOV60 also has homology to 
the other proteins shown in the BLASTP data in Table 60F. 



Table 60F. NOV60 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 

(%) 


Expect 


gi | 7106349 | re 
f |NP_035968. 1 
| (NM_011838) 


Ly6/neurotoxin 1 
[Mus musculus] 


116 


92/116 
(79%) 


96/116 
(82%) 


5e-39 
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gi | 12698684 |g 
b|AAK01642.l| 
AF321824_1 
(AF321824) 


Ly-6 neurotoxin- like 
protein Lynxl 
[Homo sapiens] 


80 


79/80 


79/80 


5e-27 


gi | 1519481 |gb 
| AAB07524 . 1 | 
(U66837) 


E4 8 antigen 
[Homo sapiens] 


79 


28/72 
(38%) 


34/72 
(46%) 


0.035 


gi|l072024l|s 
p| P57096 | PSCA 
_MOUSE E4 8 
antigen [Homo 
sapiens} 


E48 antigen 
[Homo sapiens] 


123 


30/104 
(28%) 


40/104 
(37%) 


0.038 


gi | 12845967 | d 
bj | BAB26976 . 1 
| (AK010485) 


PAR/Ly-6 domain 
containing protein 
[Mus musculus] 


154 


36/108 
(33%) 


47/108 
(43%) 


0 . 068 



This BLASTP data is displayed graphically in the ClustalW in Table 60G. A multiple 
sequence alignment is given, with the NOV60a and b protein being shown on lines 1 and 2 in 
a ClustalW analysis comparing the protein of the invention with the related protein sequences 
shown in Table 60F. 



Table 60G. ClustalW Alignment of NOV60 



NOV60a (SEQ ID NO: 216) 

NOV60b (SEQ ID NO: 218) 

gi|7106349| (SEQ ID NO:602) 

gi | 12698684 | (SEQ ID NO: 603) 

gi j 1519481 | (SEQ ID NO: 604) 

gi|l072024l| (SEQ ID NO:605) 

gijl2845967| (SEQ ID NO:606) 



NOV60a 

NOV60b 

gi|7106349| 

gi|l2698684| 

gi|l51948l| 

gi|l072024l| 

gi j 12845967 j 



Btf 

TP 
TH 
^TP 



LL 


Tgl 


LV 


LL 




LV 


LL 




LV 


LL 




LV 




Noveoa 

NOV60b 
gi | 7106349 | 
gi j 12698684 | 
gi | 1519481 | 
gi j 10720241 | 
gi j 12845967 j 




PATP 102 
PATL 102 
FA0PVTL 102 



'QLL 118 



130 



140 



150 



512 



NOV6 0a 
NOV6 0b 



gi | 7106349 | 
gi j 12698684 | 
gi | 1519481 | 
gi j 10720241 | 
gi I 12845967 j 



103 ALAP 

103 ALAP 

103 ALVP 

80 



I 

I 
■A 




116 
116 
116 
80 

79 



79 

104 TTLG 

119 KEQPGKASGRRHRYI 



--MtvlcsJIl: 



jLWGSSRL 12 3 

'ANGLSALCLL 154 



Elapid snake venom neurotoxins exert their effects through high-affinity interactions 
with specific neurotransmitter receptors. The lynxl-like gene disclosed herein as NOV60, is 
highly expressed in the brain and contains the cysteine-rich motif characteristic of this class of 
neurotoxins. Primary sequence and gene structure analyses reveal an evolutionary relationship 
between lynxl and the Ly-6/neurotoxin gene family. Lynxl is expressed in large projection 
neurons in the hippocampus, cortex, and cerebellum. In cerebellar neurons, lynxl protein is 
localized to a specific subdomain including the soma and proximal dendrites. Lynxl binding 
to brain sections correlates with the distribution of nAChRs, and application of lynxl to 
Xenopus oocytes expressing nAChRs results in an increase in acetylcholine-evoked 
macroscopic currents. These results identify NOV60 as a protein modulator for nAChRs in 
vitro, with important implications in the regulation of cholinergic function in vivo. 

The NOV60 disclosed in this invention is predicted to be expressed in at least the 
following tissues: brain. This information was derived by determining the tissue sources of 
the sequences that were included in the invention including but not limited to SeqCalling 
sources, public EST sources, literature sources, and/or RACE sources. Further expression data 
for NOV60 is provided in Example 2. 

The nucleic acids and proteins of NOV60 are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
the compositions of the present invention will have efficacy for the treatment of patients 
suffering from: Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous 
sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, 
Lesch-Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, and neurodegeneration, as well as other diseases, disorders 
and conditions. The NOV60 nucleic acid encoding the lynxl-like protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. The nucleic acid of the invention 
encoding a lynxl-like protein includes the nucleic acid whose sequence is provided in Table 
60A or 60C, or a fragment thereof. 
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The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 60A or 60C while still encoding a 
protein that maintains its Lynx 1 -like activities and physiological functions, or a fragment of 
such a nucleic acid. The invention further includes nucleic acids whose sequences are 
5 complementary to the sequence of Table 60A or 60C including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 
chemical modifications. Such modifications include, by way of non-limiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 

10 derivatized. These modifications are carried out at least in part to enhance the chemical 

stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 21% of the bases may be so changed. 

The novel protein of the invention includes the lynxl-like protein whose sequence is 

15 provided in Table 60B and 60D. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 60B and 
60D while still encoding a protein that maintains its lynxl-like activities and physiological 
functions, or a functional fragment thereof. In the mutant or variant protein, up to about 21% 
of the amino acid residues may be so changed. 

20 These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobic ity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

25 

NOV61 

NOV61 is a homolog of the adlican gene and belongs to the superfamily of cell 
adhesion molecules. The disclosed NOV61 (alternatively referred to herein as CG56453-01) 
includes the 5925 nucleotide sequence (SEQ ID NO: ) shown in Table 61 A. A NOV61 ORF 
30 begins with a Kozak consensus ATG initiation codon at nucleotides 16-18 and ends with a 

stop codon at nucleotides 5653-5655. The disclosed NOV 70 maps to human chromosome Y. 



Table 61A. NOV61 Nucleotide Sequence (SEQ ID 
NO:219) 

AGGCACCCCGACAAGATGCCCAAGCGCGCGCACTGGGGGGCCCTCTCTGTGGTGCTGATC 
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CTGCTTTGGGGTCATCCGCGAGTGGCGCTGGCCTGCCCTCATCCTTGTGCCTGCTACGTC 
CCCAGCGAGGTCCACTGCACGTTCCGATCCCTGGCTTCTGTGCCCGCTGGCATTGCTAAA 
CATGTGGAAAGAATCAATTTGGGGTTTGGAATTCTGAAGTGTAAAAAGGACAAAGCTTAT 
GAAGGCGGTCAGTTGTGTGCAATGTGCTTCAGTCCAAAGAAGTTGTACAAACATGAGATT 
CACAAGCTGAAGGACCTGACTTGTCTGAAGCCTTCCATAGAGTCTCCTCTGAGACAGAAC 
AGG AG CAGGAGT ATTG AGG AGGAG C AAAAAC AAGAAG AGAATGGTGAC AGC C AGCTC AT C 
CTGGAGAAAATCCAACTTCCCCAGTGGAGCATCTCTTTGAATATGACTGATGAGCACGGG 
AACCTGGTGAACTTGGTGTGTGACATCAAGAAACCAATGGATGTGTACAAAATTCACTTG 
AACCAAACAGATCCTCCAGATATTGACATAAATGCAATGGTTGCCTTGGACTTTGAGTAT 
C CAATGAC CC AGG AAAACT ATGAAAAT CT ATGG AAAT TGAT AG CAT AC T AC AGTG AAGTT 
CCCATGAAGCTACACAGAGAGCTCATGCTCAGCAAACACCCCAGAGTCAGCTACCAGTAC 
AGGCAAGATGCCGATGAAGAAGCTCTTTACTACACAGGTGTGAGAGCCCAGATTCTTGCA 
G AACC AG AATGGAT C ATGC AG C CAT C CAT AG AT AT CC AGC TGAAC CGACCT CAGAGTACG 
GCCAAGAAGGTGCTACTTTCCTACTACAACCAGTATTCTCAAACAATAGCCACCAAAGAT 
ACAAGGCAGGCTCGGGGCAGAAGCTGGGTAATGATTGAGCCTAGTAGAGCTGTGCAAAAA 
GATCAGACTGTCCTGGAAGGGGGTCGATGCCAGTTGAGCTGCAATGTGAAAGCTTCTGAG 
AGTCCATCTATCTTCTGGGTGCTTCCAGATGGCTCCATCCTGAAAGTGCCTGTGGATGAC 
CCAGACAGCAAGTTCTCCATTCTCAGCAGTGGCTGGCTGAGGATCAAGTCCATGGAGCCA 
TCTGACTCGGGCTTGTACCAGTGCATTGCTCAAGTGAGGGATGAAATGGACCGCATGGTA 
TATAGGGTACTTGTGCAGTCTCCCTCCACTCAGCCAGCCGAGAAAGACACAGTGACAATT 
GGCAAGAACCCAGGGGAGCCAGTGATGTTGCCTTGCAATGCTTTAGCTATACCCGAAGCC 
CACCTTAGCTGGATTCTTCCAAACAGAAGGATAATTAATGATTTGGCTAACACATCACAT 
GTATACATGCTGCCAAATGGAACTCTTTCCATCCCAAAGGTCCAAGTCAGTGACAGTGGT 
TACCACAGATGTGTGGCTGTCAACCAGCATGGGGCAGACCATATCACGGTGGGAATCACA 
GTGACCAAGAAAGGTTCTGGCTCGCCATCCAAAAGAGGCAGATGGCCAGGTCCAAAGGCT 
CTTT C C AGAATGAG AG AAG AC ATCGTGG AGGATGAAGGGGTCT C AGGC ACGGGAGATG AA 
GAGAACACTTCAAGGAGACTTCTACATCCAAAGCACCAAGAGGCGTTCCTCAAAACAAAG 
GATGATG C CAT C AATGG AGAT AAG AAAGC C AAGAAAGGG AGAAG AAAGCTG AAACTCTGG 
AAG C ATT C AGAAAAAG AAC CAGAG AC C AGTGT TGC AG AAGATCTC AGAGTGTTTGAAT C A 
AGACGAAGGATAAACGTGGCAAACAAACAGATTAATCCGGAGCACTGGGCTGATATTTTA 
GCCAAAGTCTTTGGGAAAAATCTCCCTACAGGCACAGAAGTATCCCCAATTATTAAAACC 
ACAAGTTCTCCATTCTTGAGCCTAGTAGTCACACCACCTTTGCCTGCTGTTTCTCCCCCC 
TTGGCATCTCCAATACAGACAGCAACAAGTGCTGAAGAATCCTCAGCAGATGTACCTCTA 
CTC AG CG AAGG AAAG C ACATT TTG AGT AC CAT TTC CT C AG C CAGC ATGGGACT AG AACAC 
C AC AAC AATGG AGTTATTCTTGTTGAACCTGAAGT AAC AAGCACACCTCTGG AAG AAGTT 
GTTGATGAGTATTCCAAGAAGACTGAGGAGATGACTT CCACTGAAGGCGAC CTGAAGGGG 
ACTGCAGCCTCTACACTTATATCTGAGCCTTATGAACAATCTCCTACTCTACACACCTTA 
GACACAGTCTATGAAGAGCCCACCCATGAAGAGACGGAAACAGAGGGTTGGTCTGCAGCA 
GATGTTGGATCCTCACCAGATCCCACATCCAGTGAGTATGAGCTTCCATTGGTTGTTGTC 
TCCTTGGCTGAGTCTAAGCCTGTGCAATACTTTGACCCAGATTTGGAGACTAATTCACAA 
CCACATGAGGATAACATAAAAGAATACAGTTTTGCACACCTTACTCCAACCGCCATCATC 
TGGTTTAATGACTCTAGTACATCACTGTCATTTGAGGATTCTACTGTAGGGGAACAAGGT 
GTCCCAGGCAAATCACATCTACAAGGACCGACAGAGAACATCCAGCTTGTGAAAAGTAGT 
TTTAGCACTCAAGACACCTTATTGATTAAAAAAGGTATGAAAGAGATGTCTCAGACACTA 
CAGGGAGGAAAT ATGCT AG AGGGAGAC CC T AC AC ACT C CAGAAGT T CTGAG AATGAGGG C 
CAAGAGAGCAAATCCATCACTTTACCTGACTCCACACTGGGTATAACGAGCAGTACGTCT 
C C AGTTAAGAAGC CTGCGG AAAC C AC AGT TGT C AC C C TGC T AC AC AAAGAC AC C AC AAC A 
GAAACAACTCCAAGGCAAAAAGTGGCTTCATCATCCACCATGAGCACTCACCCTTCTCGA 
AGG AGAC C CAATGGG AGAAAATT AC AC C CT CAC AAAT TC C ACC AC CGG C AC AAGC AAAC C 
CCACCCACAACTTTTGCTCCATTAGAGACTTTTTCTACTCAACCAACTCAAGCAACTGAC 
ATTAAGATTTCAAATCAAATGGAGAGTTCTCTGGTTCCTACATCTTGGGAGATTAACACA 
GTT AAT AC CCC C AAAC AGC TGG AAATGGAGAAGAATGTAG AGC T C ATATC AAAGGG AAC T 
C CACGGAG AAAAC ACGGGAAG AGGC CAAAC AAACAT CGAT AT AC C C CT T CT AC AGTGAGT 
T CAAG AGC ATC TG CAT CC AAG C CC AGC C C TT CT C C AG AAAAT AAAC AT AG AAAC ATTGTT 
ACTCCCAGTTCAGAAACTACACTTTTGCCTAGAAATGTTTCTCTGAAAACTGAGGGCGTT 
T ATGATTC CTT AG ATT AC ACG AC AAC CAC C AG AAAAAT AC ATT C AT CT C AC C AT AAAGT C 
CAAGACACACTTCCAGTCATGTATAAACCCACATCAGATGGAAAAGAAATTCAGGATGAT 
GTTGCCACAAATGTTGACAAACATAAAAGTGACATTTTAGTCCCTGGTGAGTCAATTACA 
AATGTCACACAAACTTCTCGCTCCTTGGTCTCCACTATGGGAGAATTTAAGGAAGAATCC 
TCTCCTGTGGGCTTTCCAGGAATTCCAACCTGGAATCCCTCAAGGAAAGCTCAGCCTGGG 
AGGCTACAGACAGACATACATGTTACCACTTCTGGGGAAACCCCTACAGACCCTCCCCTT 
GTTAACGAGCTTGAGGATGTGGATTTTACTTCTGAGTTTTTGTCCTCTGTGACAGTCTCC 
ACACCATTTCACCAGGAAGAAGCTGGTTTTTCCACAATTCTCTCAAGCATAAAAGTGGAG 
ATGGCTTCAAGTCAGGTAGAAACTACCACCCTTGGTCAAGATCATCATGAAACCACTGTG 
GCTATTCTCCACTCTGAAACTAGACCACAGAATCACATCCTTACTGCTGCCTGGATGAAG 
GAG CC AGC ATC TT TGT CCC CT C CC ATG AT T CT CCTGTCTT TGGG AC AAACC ACC ACC ACT 
AAG C C AG AACT TC T CAGT C C AAG AACAT C T C AAAT ATGT AAAGATT C C AAGGAAAATGT T 
TTCTTGAATTACATGGGGAATCCAGAAACAGAAGCAACCCCAGTGAAAAATGAAGGAACA 
CAGCGTATGTCAGGGCCAAATGAATTATCAACACCATCTTCTGACCACGATGCATTTAAC 
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TTGTCTACAAAGCTAGAATTGGAAAAGCAAGTATTTGATAGTAGGAGTCTAACACGTGGC 
C C AG AT AGC CAC C AC C AGG ATGG AAG AGT TC ATGCTTCT CATC AACT AACC AG AAT CC C T 
GCCAAACCCATCCTACCAACAGGAACAGTGAGGCTGCCTGAAATGTCCACACAAAGCACT 
TCCAGATACTTTGTAACTTTCCAGCCACCTCATCACGGGACCAACAAACCAGAAATAACT 
ACATATCCTTCTAGGGCTTTGCCAGAGAGCAAACAGTTTACAACTCCAAGAGTAGCAAGT 
ACAACTCCTCTCCTATCACACATGTCCAAACCCAGCATTTCTAGTAAGTTTGCTGACCTA 
AGAACTGACCAATCCAATGGCTCCTACAAAGTGTTTGGAAATAGCAACATCCCTGAGGCA 
AGAAACTCAGTTGGAAAGCCTCTCAGTCCAAGAATTTATCATTATTCCAATGGAAGACTC 
CCTTTCTTTACCAACAGGACTCTTTCTTTTTCACAGTTGGGAGTCACCCGGAGACCCCAG 
ATACCCTCTTCTCCTGTCC CAGTAATGAGAGAGAG AAAAGTTAAT CCAGGTTC CTACAAT 
AGGATATATTCCCATAGCACCTTCCATCTGGACTTTGGCCTTCCAGCACCTCCACTGTTG 
CACACTCCATGGACCATGGTATCACCCCCAACTAACTTACAGAATATCCCTATGGTCTCA 
TCCACCCAGAGTTCTGTCTCCTTTATAACATCTTCTGTCCAGTCCTCAGGAAGCATCCAC 
CAAAGCGGCTCAAAGTTCTTTGCAGGAGGACCGCCTGCATCCAAATTCTGGCCTCTTGGG 
GAAAAGCCCCAAATCCTCACCAAGTCCCCACAGACTGTGTCTGTCACTGCTGAAACGGAC 
GCT GTGTTC CCGTGTG AGGC AAT AGG AAAAC C AAAG CCT TT CGTT ACT TGG AC AAAAGT T 
TCCACAGGAGTTCTTATGACTCCGAATACCAGGATACAACGGTTTGAGGTTCTCAAGAAC 
GGTACCTTAGTGATAAGGAAGTTTCAAGTGCAAGATCGAGGCCAGTATATGTGCACCGCC 
AGCAACCTGTACGGCCTGGACAGGATGGTGGTCTTTCTCTGGGTCACCGTGCAGCAACCT 
CAAAT CCT AGCCT CC C ACT AC C AGGACGT CAC CGT CT AC CTGGGAG AC AC CAT TAC AATG 
GAGTGTCTGGCGAAAGGGACCCCAGCCCCCCAAATTTCCTGGATCTTCCGTGACAGGAGG 
GTGTGGC AAACT CTGT CCT CC GTGGAGGG CCGG AT CAC C CTGC AC C AAAAC CGGAC C CT T 
TCCATCAAGGAGGCGTCCTTCTCAGACAGAGGCGTCTATAAGTGCGTGGCCAGCAACGCA 
ACC CGGGCGGACAGCGTGT CC AT C CGC CT AC ACGT GGCGG CACTG CCC CC C ATTAT CC AC 
CAGGAGAAGCTGTAGAACATCTCGCTGCCCCCGGGGCTCAGCATTCACATTCACTGCACT 
GCCAAAGCTGCGCCCCTGCCCAGCGTGCTCTGGGTGCTCGGGGATGGTACCCAAATCCGC 
CCCTCGCATTTCCTCCACCGGAACTTGTTTGTTTTCCCCAACGGGACGCTCTACATCTGC 
AACCTCGCGCCCAAGGACAGCGGGCGCTATGAGTGCGTGGCCGCCAACCTGATCGGCTCC 
GCG CG C AGT ACGGTGC AGCTGAAC GTGC AGCG CGC AGC AGCG AAC 



The NOV61 polypeptide (SEQ ID NO:220) encoded by SEQ ID NO:219 is 1879 
amino acids in length and is presented using the one-letter amino acid code in Table 6 IB. The 
Psort profile for NOV61 predicts that this sequence has a signal peptide and is likely to be 
localized outside the cell with a certainty of 0.4371 . In alternative embodiments, a NOV61 
polypeptide is located to lysosomes with a certainty of 0.1900, to the endoplasmic reticulum 
(membrane) with a certainty of 0. 1 000, or to the nucleus with a certainty of 0.1 800. The 
Signal P predicts a likely cleavage site for a NOV61 peptide is between positions 26 and 27, 
i.e., at the dash in the sequence ALA-CP. 



Table 61B. NOV61 Polypeptide Sequence (SEQ ID 
NO:220) 

MPKRAHWGALSWLILLWGHPRVALACPHPCACYVPSEVHCTFRSLASVPAGIAKHVERI 
NLGFGILKCKKDKAYEGGQLCAMCFS PKKLYKHE I HKLKDLTCLKPS I ES PLRQNRSRS I 
EEEQKQEENGDSQLILEKIQLPQWSISLNMTDEHGNLVNLVCDIKKPMDVYKIHLNQTDP 
PDIDINAMVALDFEYPMTQENYENLWKLIAYYSEVPMKLHRELMLSKHPRVSYQYRQDAD 
EEALYYTGVRAQILAEPEWIMQPSIDIQLNRPQSTAKKVLLSYYNQYSQTIATKDTRQAR 
GRSWVMIEPSRAVQKDQTVLEGGRCQLSCNVKASESPSIFWVLPDGSILKVPVDDPDSKF 
SILSSGWLRIKSMEPSDSGLYQCIAQVRDEMDRMVYRVLVQSPSTQPAEKDTVTIGKNPG 
EPVMLPCNALAIPEAHLSWILPNRRI INDLANTSHVYMLPNGTLS IPKVQVSDSGYHRCV 
AVNQHGADHITVGITVTKKGSGSPSKRGRWPGPKALSRMREDIVEDEGVSGTGDEENTSR 
RLLHPKHQEAFLKTKDDAINGDKKAKKGRRKLKLWKHSEKEPETSVAEDLRVFESRRRIN 
VANKQINPEHWADILAKVFGKNLPTGTEVSPIIKTTSSPFLSLWTPPLPAVSPPIiASPI 
QTATSAEESSADVPLLSEGKHILSTISSASMGLEHHNNGVILVEPEVTSTPLEEWDEYS 
KKTEEMTSTEGDLKGTAASTLISEPYEQSPTLHTLDTVYEEPTHEETETEGWSAADVGSS 
PDPTSSEYELPLVWSLAESKPVQYFDPDLETNSQPHEDNIKEYSFAHLTPTAIIWFNDS 
STSLSFEDSTVGEQGVPGKSHLQGPTENIQLVKSSFSTQDTLLIKKGMKEMSQTLQGGNM 
LEGDPTHSRSSENEGQESKSITLPDSTLGITSSTSPVKKPAETTWTLLHKDTTTETTPR 
QKVASSSTMSTHPSRRRPNGRKLHPHKFHHRHKQTPPTTFAPLETFSTQPTQATDIKISN 
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QMESSLVPTSWEINTVNTPKQLEMEKNVELISKGTPRRKHGKRPNKHRYTPSTVSSRASA 
SKPSPSPENKHRNIVTPSSETTLLPRNVSLKTEGVYDSLDYTTTTRKIHSSHHKVQDTLP 
VMYKPTSDGKEIQDDVATNVDKHKSDILVPGESITNVTQTSRSLVSTMGEFKEESSPVGF 
PGIPTWNPSRKAQPGRLQTDIHVTTSGETPTDPPLVNELEDVDFTSEFLSSVTVSTPFHQ 
EE AGFSTI LS S I KVEMAS SQVETTTLGQDHHETTVAI LHSETRPQNH I LTAAWMKEPASL 
SPPMILLSLGQTTTTKPELLSPRTSQICKDSKENVFLNYMGNPETEATPVKNEGTQRMSG 
PNELSTPSSDHDAFNLSTKLELEKQVFDSRSLTRGPDSHHQDGRVHASHQLTRIPAKPIL 
PTGTVRLPEMSTQSTSRYFVTFQPPHHGTNKPEITTYPSRALPESKQFTTPRVASTTPLL 
SHMSKPSISSKFADLRTDQSNGSYKVFGNSNIPEARNSVGKPLSPRIYHYSNGRLPFFTN 
RT L S F S Q LG VTRR P Q I P S S P VP VMRE RKVNPG S YNR I Y S HST FHL D FGL P AP P LL HT P WT 
MVSPPTNLQNIPMVSSTQSSVSFITSSVQSSGSIHQSGSKFFAGGPPASKFWPLGEKPQI 
LTKSPQTVSVTAETDAVFPCEAIGKPKPFVTWTKVSTGVLMTPNTRIQRFEVLKNGTLVI 
RKFQVQDRGQYMCTASNLYGLDRMWFLWVTVQQPQILASHYQDVTVYLGDTITMECLAK 
GTPAPQISWIFRDRRVWQTLSSVEGRITLHQNRTLSIKEASFSDRGVYKCVASNATRADS 
VSIRIjHVAAIjPPI IHQEKL 



A BLAST analysis of NOV61 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV61 had high 
homology to other proteins as shown in Table 61C. 
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Table 61 C. BLASTX results from PatP database for NOV61 










Smallest 










Sum 








High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp 


AAM03157 


Peptide #183 9 encoded by probe 


2631 


2 . Oe-273 


patp 


AAMl'53 95 


Peptide #182 9 encoded by probe 


2631 


2 . Oe-273 


patp 


AAM27883 


Peptide #192 0 encoded by probe 


2631 


2 . Oe-273 


patp 


AAM55191 


Human brain expressed single exon probe 


2631 


2 . Oe-273 


patp 


AAM67586 


Human bone marrow expressed probe 


2631 


2 . Oe-273 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 4045 of 4330 bases (93%) identical to a gb:GENBANK- 
ID:AF245505|acc:AF245505.1 mRNA from Homo sapiens (adlican mRNA). The full amino 
10 acid sequence of the protein of the invention was found to have 1 598 of 1 81 8 amino acid 

residues (87%) identical to, and 1661 of 1818 amino acid residues (91%) similar to, the 2828 
amino acid residue ptnr:SPTREMBL-ACC:Q9NR99 protein from Homo sapiens (Human) 
(ADLICAN). NOV61 also has homology to the other proteins shown in the BLASTP data in 
Table 6 ID. 
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Table 61D. NOV61 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%> 


Positive 
(%) 


Expect 


gi | 14766612 | r 
ef |XP_035465 . 
1 | (XM 035465) 


hypothetical protein 
XP_035465 
[Homo sapiens] 


2828 


1590/1818 
(87) 


1654/1818 
(90) 


0.0 
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gi | 9280405 | gb 












| AAF86402 . 1 | A 


Adlican 


2828 


1591/1818 


1654/1818 


0.0 


F245505_l 


[Homo sapiens] 


(87) 


(90) 




(AF245505) 













This BLASTP data is displayed graphically in the ClustalW in Table 6 IE. A multiple 
sequence alignment is given, with the NOV61 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 6 ID. 



Table 61E. ClustalW Alignment of NOV61 
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(SEQ ID NO:220) 
(SEQ ID NO: 607) 
(SEQ ID NO: 608) 
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MPKRAHWGALSWLILLWGHPRVALACPHPCACYVPSEVHCTFRSLASVPAGIAKHVERI 
MPKRAHWGALSWLILLWGHPRVALACPHPCACYVPSEVHCTFRSLASVPAGIAKHVERI 
MPKRAHWGALSWLILLWGHPRVALACPHPCACYVPSEVHCTFRSLASVPAGIA^HVERI 
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sILGFNSIQALSETSFAGLTKLELLMIHGNE I PS I PDGALRDLSSLQVFKFSYNKLRVITG 
"LGFNSIQALSETSFAGLTKLELLMIHGNEIPSIPDGALRDLSSLQVFKFSYNKLRVITG 
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129 K 
301 
301 



310 

._J_. . 



[ 



320 
. . I . . 



330 
.. I .. 



340 
I .... I ■ . 



350 
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DGGSQLILEKFQLPQWSISLNMTDEHGNMVNLVCDIKKPMDVYKIHLNQTDPPDIDINAT 
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VALDFEaPMTgENYElJjLWKLIAYYSEVPiKLHRELMLSKjljPRVSYQYRQDADEEALYYTG 
VALDFECPMTRENYEKIiWKLIAYYSEVPVKLHRELMLSKDPRVSYQYRQDADEEALYYTG 
VALDFECPMTRENYEKLWKLIAYYSEVPVKLHRELMLSKDPRVSYQYRQDADEEALYYTG 
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VRAQILAEPEWgMQPSIDIQLNRgQSTAKKVLLSYYjJjQYSQTIgTKDTRQARGRSWVMIE 
VRAQILAEPEWVMQPSIDIQLNRRQSTAKKVLLSYYTQYSQTISTKDTRQARGRSWVMIE 
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RIKSMEPSDSGLYQCIAQVRDEMDRMVYRVLVQSPSTQPAEKDTVTIGKNPGESVTLPCN 
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HFTVGITVTKKGSGLPSKRGRRPGAKALSRVREDIVEDEGGSGMGDEENTSRRLLHPKDQ' 
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....|....|....|....|....|....|....|....|....|....|....|....| 

1879 1879 

2101 M^M^dWjjMgfl^ 2160 
2101 ^^^^^^^^^^^^^^^^^^p ^^^ ^S^SP^^W^^^^^^^^^^ 2X60 

2170 2180 2190 2200 2210 2220 
....|....|....|....|....|....|....|....|....|....|....|....| 
1879 1879 

2161 ^^^^^^^^gg^^^^^^^^^^^^^^33^^^^^^B^^B^SB2^BB3 2220 

2161 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 2220 

2230 2240 2250 2260 2270 2280 

....|....|....|....|....|....|....|....|....|....|....|....| 

1879 1879 

2221 Sgggg^roggTOOTgjgg^^^^^^^^^^^^^^^^^^^^^^^^^^g^^ gjg 2280 
2221 B^^^^^^^^^^^^^^^^^^^s^^^^^^^^^^^^^^^^^^^P^ 2280 

2290 2300 2310 2320 2330 2340 

....|....|....|....|....|....|....|....|....|.--.|....|....| 

1879 1879 

2281 l^^^^^^^^^^^^^^ra^^2^^B^^^^%M^^^^^^^^^^^^^^^^E 2340 
2281 ^^^^^^^fe^S^^^^^^^^^^^^^^^^^^^^^B'-S E^^^^ ^^^ 234 0 

2350 2360 2370 2380 2390 2400 
....|....|....|....|....|....|....|....|....|....|....|....| 
1879 1879 

2341 ^B^^^^^^^^^^^^^^^^^^^^^^^^^^^EJEEBSS||^^^SSSS*3 2400 

2341 ^^^^^^^^^^^^^^me^^^^^^^^^^^^^^^^^^^ S^^^ 2400 

2410 2420 2430 2440 2450 2460 
.... | .... | .... | .... | .... | .... | .... | .... | .... | .... | .... | .... | 
1879 1879 

24 01 ^^^^^^^^^^^^^^^^^^^^^^^^^^^H^^^^^SIS^^^^^^ 24 6 0 

2401 2460 

2470 2480 2490 2500 2510 2520 

....|....|....|....|....|....|....|....|....|....|....|....| 

1879 1879 

2461 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^g^HSggg 2520 

2461 plWS^j^^mg.'a ^^^^^^^^^WS^^^m^fm^^^ ^^^^BB!m.^M 2520 

2530 2540 2550 2560 2570 2580 
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RNEGGEARLlgQLTVLEPMEKPIFHDPISEKITAMAGHTISLNCSAAGTPTPSLVWVLPN 
RNEGGEARLlBQL'TVLEPMEKPIFHDPISEKITAMAGHTISLNCSAAGTPTPSLVWVLPN 



NOV61 

gi|l4766612 | 
gi | 9280405 | 



NOV61 

gi|l4766612 | 
gi j 9280405 | 



NOV61 

gi | 14766612 | 
gi|9280405| 



NOV61 

gi | 14766612 | 
gi|9280405| 



NOV61 

gi | 14766612 | 
gi|9280405| 



NOV61 

gi|!4766612 | 
gi|9280405| 



1879 
2521 
2521 

2590 2600 2610 2620 2630 2640 
....|....|....|....|....|....|....|....|....|....|....|....| 
1879 1879 

2581 ^^^^^^S^^^^^^^^MS^^BS^SSI^S^SSSS^SSSSS^S^^Si 2640 

2581 ^^^^M^^^^^^^^^^^^^^^^^^^^^^ ^^^^^^^^ 1 2640 

2650 2660 2670 2680 2690 2700 

I I I I I I I I I I I I 

1879 1879 

2641 BSK!1I^^ 2700 

2710 2720 2730 2740 2750 2760 

1879 1879 

2 7 01 ^^^SP^^^3!Si^^^^^^^S33^^^^^^^^^^^^^^^^^S^^^^^^^ 2 76 0 

2701 ^^^^^^^^^^ ^^^^^^^^ ^^^^^^^ ^■WBiaSSBSBHBBBBH 2760 

2770 2780 2790 2800 2810 2820 
. ... | .... | .... | .... | .... | .... | , ... | .... | .... | .... | .... | .... | 
1879 1879 

2761 jg^3HI53355^55^^^^2^B553 BS^^BEEI^^^S^^^^^^^^^^^^E 2820 

2761 ^^^§^^^^^^^^^^^^^^^^^^^^^^^?^^^^^^^SS^^@^^^^B 2820 

1879 1879 

2821 ^ggjg 2828 
2821 2828 



Table 61F lists the domain description from DOMAIN analysis results against NOV61. 
This indicates that the NOV61 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 61F. Domain Analysis of NOV61 



gnl 1 Smart j smart00409 , IG, Immunoglobulin SEQ ID NO: 861 

CD-Length = 86 residues, 100.0% aligned 
Score = 62.8 bits (151), Expect - 2e-10 



NOV61 : 1685 PQTVSVTAETDAVFPCEAIGKPKPFVTWTKVSTGVLMTPNTRIQRFEVLKNGTLVIRKFQ 1744 

p +V+V CEA GPP VTW K G L+ + R N TL I 

Sbjct : 1 PPSVTVKEGESVTLSCEASGNPPPTVTWYKQG-GKLLAESGRFSVSRSGGNSTLTISNVT 59 



NOV6 1 : 174 5 VQDRGQ YMCTASNL YGLDRMWFLWVT 

+D G Y C A+N G L V 

Sbjct : 60 PEDSGT YTCAATNS SGS AS SGTTLTVL 
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The gene of invention is a close homolog of the adlican gene and belongs to the 
superfamily of cell adhesion molecules. Cell adhesion molecules mediate key aspects of 
development, differentiation, cellular plasticity and physiological function in a variety of 
tissues. In addition, they are central to a number of disease processes such as cancer. Adlican 
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is a protein that has been described to be elevated in patients with osteoarthritis. Sequence 
analysis indicates that it is likely to be a secreted protein. A rat gene named mechanical stress- 
induced protein has been patented and shows significant similarity to adlican. This protein is 
elevated in osteoblasts subjected to mechanical stress and has been suggested to be effective in 
5 the prognosis, diagnosis or treatment of osteoarthritis. Since this family of proteins seems to 
have involvement in osteoarthritis, it follows that the protein of invention may share that 
characteristic. 

The disclosed NOV61 of invention has two significant attributes - it is truncated 
relative to its homolog, adlican, and secondly, it maps to the Y chromosome. The first attribute 

10 is significant in that it is possible that the truncated adlican -like protein may play a dominant - 
negative role in the function of adlican . It is therefore possible that it may be constitute an 
effective treatment for osteoarthritis. The chromosomal localization is relevant because it is 
known that osteoarthritis has a higher frequency in women. It is possible, therefore, that the 
truncated protein may be nullifying the effect of adlican, if any, in males. 

15 The NOV61 disclosed in this invention is predicted to be expressed in at least the 

following tissues: muscle, lymph. This information was derived by determining the tissue 
sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
expression data forNOV61 is provided in Example 2. 

20 The nucleic acids and proteins of NOV61 are useful in potential therapeutic 

applications implicated in various pathological disorders described further herein, for example, 
the compositions of the present invention will have efficacy for the treatment of patients 
suffering from: osteoarthitis, asthma, allergy, muscular dystrophy, Lesch-Nyhan syndrome, 
myasthenia gravis, hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, 

25 autoimmune disease, allergies, immunodeficiencies, transplantation, graft versus host disease 
(GVHD), lymphaedema, cancer, tissue degeneration as well as other diseases, disorders and 
conditions.. The NOV61 nucleic acid encoding the adlican-like protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. The novel nucleic acid of the 

30 invention encoding a adlican-like protein includes the nucleic acid whose sequence is provided 
in Table 61 A, or a fragment thereof. The invention also includes a mutant or variant nucleic 
acid any of whose bases may be changed from the corresponding base shown in Table 61 A 
while still encoding a protein that maintains its adlican-like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 

523 




whose sequences are complementary to the sequence of Table 61 A, including nucleic acid 
fragments that are complementary to any of the nucleic acids just described. The invention 
additionally includes nucleic acids or nucleic acid fragments, or complements thereto, whose 
structures include chemical modifications. Such modifications include, by way of non-limiting 
5 example, modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 7% of the bases may be so changed. 

10 The novel protein of the invention includes the adlican-like protein whose sequence is 

provided in Table 6 IB. The invention also includes a mutant or variant protein any of whose 
residues may be changed from the corresponding residue shown in Table 6 IB while still 
encoding a protein that maintains its adlican-like activities and physiological functions, or a 
functional fragment thereof. Tn the mutant or variant protein, up to about 13% of the amino 

15 acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 

20 below. 

NOV62 

NOV62 has homology to neuropsin, an extracellular matrix serine protease. The 
disclosed NOV62 (alternatively referred to herein as CG56781-01) includes the 834 nucleotide 
sequence (SEQ ID NO:221) shown in Table 62 A. A NOV62 ORF begins with a Kozak 
25 consensus ATG initiation codon at nucleotides 31-33 and ends with a TGA codon at 
nucleotides 808-810. The disclosed NOV62 maps to human chromosone 19. 



Table 62A. NOV62 Nucleotide Sequence (SEQ ID 
NO:221) 

AGTCTTGCCTTCTTTTGAGCCTAAGTCATGAGTTGGATGTTCCTCAGAGATCTCCTGAGT 
GGAGTAAATAAATACTCCACTGGGACTGGATGGATTTGGCTGGCTGTCGTGTTTGTCTTC 
CGTTTGCTGGTCTACATGGTGGCAGCAGAGCACGTGTGGAAAGATGAGCAGAAAGAGTTT 
GAGTGCAACAGTAGACAGCCCGGTTGCAAAAATGTGTGTTTTGATGACTTCTTCCCCATT 
TCCCAAGT C AGACT TTGGGC CTT ACAAC TG AT AATGGT C T C C AC ACCTT C AC TT CTGGTG 
GT TTT AC ATGTAG C C T ATCATGAGGGT AGAG AG AAAAGGC AC AGAAAGAAAC T C T ATGTC 
AGCCCAGGTACAATGGATGGGGGCCTATGGTACGCTTATCTTATCAGCCTCATTGTTAAA 
ACTGGTTTTGAAATTGGCTTCCTTGTTTTATTTTATAAGCTATATGATGGCTTTAGTGTT 
CCCTACCTTATAAAGTGTGATTTGAAGCCTTGTCCCAACACTGTGGACTGCTTCATCTCC 
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AAACCCACTGAGAAGACGATCTTCATCCTCTTCTTGGTCATCACCTCATGCTTGTGTATT 
GTGTTGAATTTCATTGAACTGAGTTTTTTGGTTCTCAAGTGCTTTATTAAGTGCTGTCTC 
CAAAAATATTTAAAAAAACCTCAAGTCCTCAGTGTGTGAGTGCCACAGCCTCAGATATGT 
TGAATGTG 

The NOV62 polypeptide (SEQ ID NO:222) encoded by SEQ ID NO:221 is 259 amino 
acids in length and is presented using the one-letter amino acid code in Table 62B. The Psort 
profile for NOV62 predicts that this sequence has a signal peptide and is likely to be localized 
5 to the endoplasmic reticulum (membrane) at the plasma membrane with a certainty of 0.5500. 
In alternative embodiments, a NOV62 polypeptide is located to lysosomes with a certainty of 
0.2353, or to the endoplasmic reticulum (lumen) with a certainty of 0.1000. The Signal P 
predicts a likely cleavage site for a NOV62 peptide is between positions 28 and 29, i.e., at the 
dash in the sequence TRA-QG. 

10 

Table 62B. NOV62 Polypeptide Sequence (SEQ ID 
NO:222) 

MGRPPPCAIQPWILLLLFMGAWAGVTRAQGSRSRKGQASKPHSQPWQAALFQGERLICGG 
VLVGDRWVLTAAHCKKQKYSVRLGDHSLQSRDQPEQEIQVAQSIQHPCYNNSNPEDHSHD 
IMLIRLQNSANLGDKVKPVQLANLCPKVGQKCIISGWGTVTSPQENFPNTLNCAEVKIYS 
QNKCERAYPGKITEGMVCAGSSNGADTCQGDSGGPLVCEGTLAGIVSGGSEPVFRPRRPA 
VYTNVFDYLEWIESTMEKN 



A BLAST analysis of NOV62 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV62 had high 
15 homology to other proteins as shown in Table 62C. 



Table 62C. BLASTX results from PatP database for NOV62 










Smallest 










Sum 








High 


Probability 


Sequences producing 


High- scoring Segment Pairs: 


Score 


P (N) 


patp 


:AAW10 6 94 Human 


recombinant neuropsin 


1247 


8 . 9e-127 


patp 


:AAW12 3 93 Mouse 


neuropsin protein 


1247 


8 . 9e-127 


patp 


•AAB21311 Human 


neuropsin - Homo sapiens , 2 75 aa. 


972 


1.2e-97 


patp 


AAY41744 Human 


PR0322 protein sequence 


965 


6 . 8e-97 


patp 


AAY3 2 8 52 Human 


serine protease protein sequence 


965 


6 . 8e-97 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 747 of 805 bases (92%) identical to a gb:GENBANK- 
20 ID:E12348|acc:E12348.1 mRNA from Mus sp. The full amino acid sequence of the protein of 
the invention was found to have 227 of 257 amino acid residues (88%) identical to, and 238 of 
257 amino acid residues (92%) similar to, the 260 amino acid residue ptnnSWISSNEW- 
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ACC:Q61955 protein from Mus musculus (Mouse) (NEUROPSIN PRECURSOR (EC 3.4.21 .) 
(NP)). NOV62 also has homology to the other proteins shown in the BLASTP data in Table 
62D. 



Table 62D. NOV62 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi | 6679487 | re 
f |NP_032966 . 1 
| (NM_0 0 8 94 0) 


protease, serine, 19 
(neuropsin) ; Brain 

Serine protease 1 
[Mus musculus] 


260 


227/257 
(88) 


238/257 
(92) 


e-131 


gi | 6093538 | Sp 
| 088 78 0 |NRPN_ 
RAT 


NEUROPSIN PRECURSOR (NP) 
(BRAIN SERINE PROTEASE 
1) 


260 


216/258 
(83) 


232/258 
(89) 


e-126 


gi | 4699764 | pd 
b | 1NPM | A 


Chain A, Neuropsin, A 
Serine Protease 
Expressed In The Limbic 
System Of Mouse Brain 


225 


197/223 
(88) 


206/223 
(92) 


e-112 


gi | 6005844 | re 
f |NP_0 0 912 7 . 1 
| <NM_007196) 


kallikrein 8 
(neuropsin/ovasin) ; 
protease, serine, 19 
(neuropsin/ ovasin) 
[Homo sapiens] 


260 


172/253 
(67) 


207/253 
(80) 


8e-99 


gi | 16162680 | r 
ef |XP_057595 . 
1 | (XM 057595) 


hypothetical protein 
XP_057595 
[Homo sapiens] 


260 


169/253 
(66) 


203/253 
(79) 


3e-96 



This BLASTP data is displayed graphically in the ClustalW in Table 62E. A multiple 
sequence alignment is given, with the NOV62 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 62D. 



Table 62E. ClustalW Alignment of NOV62 



NOV62 

gi| 6679487 | 
gij 6093538 I 
gi|4699764| 
gij 6005844 j 
gij 16162680 | 



(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 



NOV62 


1 


■mgrp 


gi 


6679487 | 


1 


Imgrp 


gi 


6093538| 


1 


Bmgrp 


gi 


4699764 j 


1 




gi 


6005844 j 


1 




gi 


16162680 | 


1 


BH3S 



ID NO: 22 2) 
ID NO: 60 9) 
ID NO: 610) 
ID NO: 611) 
ID NO: 612) 
ID NO: 613) 
10 




NOV62 

gi | 6679487 | 
gi | 6093538 j. 
gi|4699764| 



61 
61 
61 
29 



70 



80 
■ I - - 



90 



100 
. . I . - 



110 

. - 1 - . 



120 
. - I 



vlvgdrwvltaahckkfflkysvrlgdhslq|rd|peqeiqvaqsiqhpcyn|snpedhshd 
vlvgdrwvltaahckkBkysvrlgdhslqIrdSpeqeiqvaqsiqhpcynSsnpedhshd 

VLVGDRWVLTAAHCKKfflKYSVRLGDHSLQ|RD|pEQEIQVAgSIQHPC§NlsNPEDHSHD 
VLVGDRWVLTAAHCKK@KYSVRLGDHSLQ|RDgPEQEIQVAQSIQHPCYN|sNPEDHSHD 



gi|6005844| 


61 


gi j 16162680 | 


61 


NOV62 


121 


gi | 6679487 | 


121 


gi | 6093538 j 


121 


gi|4699764| 


89 


gi | 6005844 | 


121 


gi j 16162680 | 


121 


NOV 6 2 


181 


gi | 6679487 | 


181 


gi j 6093538 j 


181 


gi | 4699764 | 


14 9 


gi | 6005844 j 


181 


gi j 16162680 | 


181 


NOV62 


241 


gi | 6679487 | 


241 


gi|6093538| 


241 


gi|4699764| 


209 


gi j 6005844 | 


241 


gi|l6162680| 


241 





260 
I 

KN- 259 

N1&> 260 

Kit) 260 

--- 22 5 

pS&G 26 0 

!l£GSKG 26 0 



Table 62F lists the domain description from DOMAIN analysis results against NOV62. 
This indicates that the NOV62 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 62F. Domain Analysis of NOV62 



gnl 1 Smart | smartQQ02 0 , Tryp SPc, Tryps in- like serine protease; Many of these 

are synthesised as inactive precursor zymogens that are cleaved during 

limited proteolysis to generate their active forms. A few, however, are 

active as single chain molecules, and others are inactive due to 

substitutions of the catalytic triad residues. SEQ ID NO: 812 

CD-Length = 230 residues, 98.3% aligned 
Score = 237 bits (604), Expect = 7e-64 



NOV62: 36 GQASKPHSQPWQAALF - QGERL I CGGVLVGDRWVLTAAHC KKQKYSVRLGDHSLQS 90 

G + S PWQ + L +G R CGG L+ RWVLTAAHC ' VRLG H L S 

Sbjct: 5 GSEANIGSFPWQVSLQYRGGRHFCGGSLISPRWVLTAAHCVYGSAPSSIRVRLGSHDLSS 64 

NOV62: 91 RDQPEQE I QVAQS I QHPC YNNSNPEDHSHDIMLI RLQNS ANLGDKVKPVQLA- - NLCPKV 14 8 

+ q ++V++ I HP YN P + +DI L++L L D V+P + L 

Sbjct: 65 GE - ETQTVKVS KVI VHPNYN PSTYDNDIALLKLSEPVTLSDTVRPICLPSSGYNVPA 120 

NOV62 : 149 GQ K C 1 1 SG WGT VT S PQENF PNT LN C AE VK I Y S QNK C E RAY PG - - K I TE GM VC AG S SN - G A 205 

G C +SGWG + + P+TL VIS C RAY G IT+ M+CAG G 

Sbjct: 121 GTTCTVSGWGRTSESSGSLPDTLQEVNVPIVSNATCRRAYSGGPAITDNMLCAGGLEGGK 18 0 

NOV62 : 206 DTCQGDSGGPLVCE GTLAGIVSGGSEPVFRPRRPAVYTNVFDYLEWI 252 



527 



fi \i \\ Ii i; 



:i si 



ii 1! i. if" 



D CQGDSGGPLVC L GIVS GS RP +P VYT V YL+WI 
Sbjct: 181 D AC QGD S GG P L VCNDP RWVL VG I V S WG S YG CAR PNK PG VYT RVS S Y LP W I 230 

Neuropsin appears to act as a regulatory molecule in the early phase of LTP via its 
proteolytic function on extracellular matrix rather than affecting NMDA receptor-mediated 
calcium increase. The behavioral and electrographical abnormalities associated with seizures 
5 in epileptic (kindled) mice correspond with those of human epilepsy. In kindled mice, 
neuropsin was markedly increased in the hippocampus and cerebral cortices. A single 
intraventricular injection of monoclonal antibodies specific to neuropsin reduced or eliminated 
the epileptic pattern noted on electroencephalograms and, as a result markedly inhibited the 
progression of kindling. 

10 Therefore, neuropsin appears to be a key protein controlling pathogenic events in the 

hippocampus, and thus neuropsin inhibitors might be useful for treatment of epilepsy. 
Neuropsin has two isoforms, which have been reported to be involved in hippocampal 
plasticity. The amino acid sequences of the two types of human neuropsin were identical, 
except that type 2 carried an insert of 45 amino acids at the C-terminus of the leader sequence. 

15 The essential three amino acids in the active site triad, His, Asp, and Ser, and the single 
putative N-glycosylation site were conserved in human and mouse neuropsin. Sequence 
analysis of the 946 bp genomic DNA spanning the region encoding the insertion sequence 
revealed that two isoforms were generated in human brain by alternative splicing. However, 
the mouse genomic sequence did not conserve the 3' acceptor consensus sequence at the 

20 corresponding position, suggesting that type 2 neuropsin was a species-specific splice variant. 
When the open reading frames of human neuropsin were expressed in insect cells, both types 
of neuropsin were detected in the conditioned media by western blot analysis using anti- 
human neuropsin serum. 

Northern blot hybridization and reverse transcription-polymerase chain reaction 

25 showed predominant expression of type 1 neuropsin in pancreas. Type 2 neuropsin was 

preferentially expressed in human adult brain and hippocampus, although both types were 
expressed in fetal brain and placenta in comparable amounts. Dot blot hybridization showed 
that neuropsin was expressed in various regions of adult brain, including the hippocampus and 
cerebral cortex, and also in various fetal tissues. These results suggest that human type 2 

30 neuropsin may be important to the adult brain plasticity, although both types may be necessary 
for the development of the nervous system. 

The disclosed NOV62 is predicted to be expressed in at least the following tissues: 
brain. This information was derived by determining the tissue sources of the sequences that 
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were included in the invention including but not limited to SeqCalling sources, public EST 
sources, literature sources, and/or RACE sources. Further expression data for NOV62 is 
provided in Example 2. 

The nucleic acids and proteins of NOV62 are useful in potential therapeutic 
5 applications implicated in various pathological disorders described further herein, for example, 
the compositions of the present invention will have efficacy for treatment of patients suffering 
from: Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 

10 disorders, addiction, anxiety, pain, neuroprotection, osteoarthritis, and other diseases, 

disorders and conditions of the like. The NOV62 nucleic acid encoding the neuropsin-like 
protein of the invention, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. The 
novel nucleic acid of the invention encoding a neuropsin precursor-like protein includes the 

15 nucleic acid whose sequence is provided in Table 62A, or a fragment thereof. The invention 
also includes a mutant or variant nucleic acid any of whose bases may be changed from the 
corresponding base shown in Table 62A while still encoding a protein that maintains its 
neuropsin precursor-like activities and physiological functions, or a fragment of such a nucleic 
acid. The invention further includes nucleic acids whose sequences are complementary to 

20 those just described, including nucleic acid fragments that are complementary to any of the 
nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or . 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

25 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 8% of the residues 
may be so changed. 

30 The novel protein of the invention includes the neuropsin precursor-like protein whose 

sequence is provided in Table 62B. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 62B while 
still encoding a protein that maintains its neuropsin precursor-like activities and physiological 
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functions, or a functional fragment thereof. In the mutant or variant protein, up to about 12% 
of the bases may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV63 

NOV63 has homology to the WNT family of proteins. The Wnt gene family consists 
of at least 15 structurally related genes that encode secreted extracellular signaling factors. 
WNT proteins function in a range of critical developmental processes in both vertebrates and 
invertebrates and are implicated in regulation of cell growth and differentiation in certain adult 
mammalian tissues, including the mammary gland. The disclosed NOV63 (alternatively 
referred to herein as CG56054-02) includes the 1 128 nucleotide sequence (SEQ ID NO: ) 
shown in Table 63 A. A NOV63 ORF begins with a Kozak consensus ATG initiation codon at 
nucleotides 31-33 and ends with a stop codon at nucleotides 1 102-1 104. The disclosed 
NOV63 maps to human chromosome 1 . 



Table 63A. NOV63 Nucleotide Sequence (SEQ ID 
NO:223) 

TCCTCCCCGCAGCTTCTCGCTGAATTCCGAGGGGGCTGAGAGGATGGCCACCACCGGGAC 
CCCAACGGCCGACCGAGGCGACGCAGCCGCCACAGATGACCCGGCCGCCCGCTTCCAGGT 
GCAGAAGCACTCGTGGGACGGGCTCCGGAGCATCATCCACGGCAGCCGCAAGTACTCGGG 
CCTCATTGTCAACAAGGCGCCCCACGACTTCCAGTTTGTGCAGAAGACGGATGAGTCTGG 
GCCCCACTCCCACCGCCTCTACTACCTGGGAATGCCATATGGCAGCCGAGAGAACTCCCT 
CCTCTACTCTGAGATTCCCAAGAAGGTCCGGAAAGAGGCTCTGCTGCTCCTGTCCTGGAA 
G CAGATGCTGGATCAT T TC CAGGC CACGC C C C AC C ATGGGGT C TACT CT CGGGAGG AGGA 
GCTGCTGAGGGAGCGGAAACGCCTGGGGGTCTTCGGCATCACCTCCTACGACTTCCACAG 
CGAGAGTGGCCTCTTCCTCTTCCAGGCCAGCAACAGCCTCTTCCACTGCCGCGACGGCGG 
CAAGAACGGCTTCATGGTGTCCCCTATGAAACCGCTGGAAATCAAGACCCAGTGCTCAGG 
GCCCCGGATGGACCCCAAAATCTGCCCTGCCGACCCTGCCTTCTTCTCCTTCATCAATAA 
CAGCGACCTGTGGGTGGCCAACATCGAGACAGGCGAGGAGCGGCGGCTGACCTTCTGCCA 
CCAAGGTTTATCCAATGTCCTGGATGACCCCAAGTCTGCGGGTGTGGCCACCTTCGTCAT 
ACAGGAAGAGTTCGACCGCTTCACTGGGTACTGGTGGTGCCCCACAGCCTCCTGGGAAGG 
TTCAGAGGGCCTCAAGACGCTGCGAATCCTGTATGAGGAAGTCGATGAGTCCGAGGTGGA 
GGTCATTCACGTCCCCTCTCCTGCGCTAGAAGAAAGGAAGACGGACTCGTATCGGTACCC 
CAGGACAGGTAGCAAGAATCCCAAGATTGCCTTGAAACTGGCTGAGTTCCAGACTGACAG 
CCAGGGCAAGATCGTCTCGACCCAGGAGAAGGAGCTGGTGCAGCCCTTCAGCTCGCTGTT 
CCCGAAGGTGGAGTACATCGCCAGGGCCGGGTGGACCCGGGATGGCAAATACGCCTGGGC 
CATGTTCCTGGACCGGCCCCAGCAGTGGCTCCAGCTCGTCCTCCTCCCCCCGGCCCTGTT 
CATCCCGAGCACAGAGAATGAGGAGCAGCGGCTAGCCTCTGCCAGAGCTGTCCCCAGGAA 
TGTCCAGCCGTATGTGGTGTACGAGGAGGTCACCAACGTCTGGATCAATGTTCATGACAT 
CTTCTATCCCTTCCCCCAATCAGAGGGAGAGGACGAGCTCTGCTTTCTCCGCGCCAATGA 
ATGCAAGACCGGCTTCTGCCATTTGTACAAAGTCACCGCCGTTTTAAAATCCCAGGGCTA 
CGATTGGAGTGAG C CCTT CAG C C CCGGGG AAGATGAATTTAAGTGCC C CAT T AAGGAAG A 
GATTGCTCTGACCAGCGGTGAATGGGAGGTTTTGGCGAGGCACGGCTCCAAGATCTGGGT 
CAATGAGGAGACCAAGCTGGTGTACTTCCAGGGCACCAAGGACACGCCGCTGGAGCACCA 
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CCTCTACGTGGTCAGCTATGAGGCGGCCGGCGAGATCGTACGCCTCACCACGCCCGGCTT 

CTCCCATAGCTGCTCCATGAGCCAGAACTTCGACATGTTCGTCAGCCACTACAGCAGCGT 

GAGCACGCCGCCCTGCGTGCACGTCTACAAGCTGAGCGGCCCCGACGACGACCCCCTGCA 

CAAGCAGCCCCGCTTCTGGGCTAGCATGATGGAGGCAGCCAGCTGCCCCCCGGATTATGT 

TCCTCCAGAGATCTTCCATTTCCACACGCGCTCGGATGTGCGGCTCTACGGCATGATCTA 

CAAGCCCCACGCCTTGCAGCCAGAGAAGAAGCACCCCACCGTCCTCTTTGTATATGGAGG 

CCCCCAGGTGCAGCTGGTGAATAACTCCTTCAAAGGCATCAAGTACTTGCGGCTCAACAC 

ACTGGCCTCCCTGGGCTACGCCGTGGTTGTGATTGACGGCAGGGGCTCCTGTCAGCGAGG 

GCTTCGGTTCGAAGGGGCCCTGAAAAACCAAATGGGCCAGGTGGAGATCGAGGACCAGGT 

GGAGGGCCTGCAGTTCGTGGCCGAGAAGTATGGCTTCATCGACCTGAGCCGAGTTGCCAT 

CCATGGCTGGTCCTACGGGGGCTTCCTCTCGCTCATGGGGCTAATCCACAAGCCCCAGGT 

GTTCAAGGTGGCCATCGCGGGTGCCCCGGTCACCGTCTGGATGGCCTACGACACAGGGTA . 

CACTGAGCGCTACATGGACGTCCCTGAGAACAACCAGCACGGCTATGAGGCGGGTTCCGT 

GGCCCTGCACGTGGAGAAGCTGCCCAATGAACCCAACCGCTTGCTTATCCTCCACGGCTT 

CCTGGACGAAAACGTGCACTTTTTCCACACAAACTTCCTCGTCTCCCAACTGATCCGAGC 

AGGGAAACCTTACCAGCTCCAGATCTACCCCAACGAGAGACACAGTATTCGCTGCCCCGA 

GTCGGGCGAGCACTATGAAGTCACGTTGCTGCACTTTCTACAGGAATACCTCTGAGCCTG 

CCCACCGGGAGCCGCCACAT __ 

The NOV63 polypeptide (SEQ ID NO:224) encoded by SEQ ID NO:223 is 357 amino 
acids in length and is presented using the one-letter amino acid code in Table 63B. The Psort 
profile for NOV63 predicts that this sequence has a signal peptide and is likely to be exported 
from the cell with a certainty of 0.3700. In alternative embodiments, a NOV63 polypeptide is 
located to lysosomes with a certainty of 0.1000, or to the endoplasmic reticulum (membrane) 
with a certainty of 0.1000. The Signal P predicts a likely cleavage site for a NOV63 peptide is 
between positions 18 and 19, i.e., at the dash in the sequence ALG-SY. 



Table 63B. NOV63 Polypeptide Sequence (SEQ ID 
N0224) 

MATTGTPTADRGDAAATDDPAARFQVQKHSWDGLRSIIHGSRKYSGLIVNKAPHDFQFVQ 
KTDESGPHSHRLYYLGMPYGSRENSLLYSEIPKKVRKEALLLLSWKQMLDHFQATPHHGV 
YSREEELLRERKRLGVFGITSYDFHSESGLFLFQASNSLFHCRDGGKNGFMVSPMKPLEI 
KTQCSGPRMDPKICPADPAFFSFINNSDLWVANIETGEERRLTFCHQGLSNVLDDPKSAG 
VATFVIQEEFDRFTGYWWCPTASWEGSEGLKTLRILYEEVDESEVEVIHVPSPALEERKT 
DSYRYPRTGSKNPKIALKIAEFQTDSQGKIVSTQEKELVQPFSSLFPKVEYIARAGWTRD 
GKYAWAMFLDRPQQWLQLVLLPPALFIPSTENEEQRLASARAVPRNVQPYWYEEVTNVW 
INVHDIFYPFPQSEGEDELCFLRANECKTGFCHLYKVTAVLKSQGYDWSEPFSPGEDEFK 
CPIKEEIALTSGEWEVLARHGSKIWVNEETKLVYFQGTKDTPLEHHLYWSYEAAGEIVR 
LTTPGFSHSCSMSQNFDMFVSHYSSVSTPPCVHVYKLSGPDDDPLHKQPRFWASMMEAAS 
CPPDYVPPEIFHFHTRSDVRLYGMIYKPHALQPEKKHPTVLFVYGGPQVQLVNNSFKGIK 
YLRLNTLASLGYAVWIDGRGSCQRGLRFEGALKNQMGQVEIEDQVEGLQFVAEKYGFID 
LS R VA I HGW S YGG FL S LMGL I HK PQVF K VA I AG AP VT VWMAYDTG YT E R YMDV PENNQHG 
YEAGSVAIiHVEKLPNEPNRLLILHGFLDENVHFFHTNFLVSQLIRAGKPYQLQIYPNERH 
SIRCPESGEHYEVTLLHFLQEYL 



A BLAST analysis of NOV63 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV63 had high 
homology to other proteins as shown in Table 63C. 



Table 63C. BLASTX results from PatP database for NOV63 

_____ Smallest 

531 



.>«> .!'"»! ,J" I, f> " U .. "1. 

Si fi ii d Ai .:■ s!" '* :t ii ;t 







Sum 




High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp :AAY8 16 93 Human Wnt-6 protein sequence - Homo sapiens 


411 


1 . Oe-61 


patp :AAB4 9769 Amyloid-beta protein agglutination regulator 


411 


1 . Oe-61 


patp : AAB8 843 9 Human membrane or secretory protein 


411 


1 . Oe-61 


patp : AAB19786 Human Wnt-1 protein involved in kidney 


630 


2 .2e-6l 


patp:AAY94319 Murine Wnt-lOA protein - Mus musculus, 417 aa . 


398 


2 . 7e-61 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 633 of 795 bases (79%) identical to a gb:GENBANK- 
ID:AF031 168|acc:AF031 168.1 mRNA from Gallus gallus (Gallus gallus Wnt-14 protein 
5 (Wnt-14) mRNA). The full amino acid sequence of the protein of the invention was found to 
have 287 of 349 amino acid residues (82%) identical to, and 319 of 349 amino acid residues 
(91%) similar to, the 354 amino acid residue ptnr:SWISSPROT-ACC:O42280 protein from 
Gallus gallus (Chicken) (WNT-14 PROTEIN PRECURSOR). NOV63 also has homology to 
the other proteins shown in the BLASTP data in Table 63D. 
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Table 63D. NOV63 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

<%) 


Positive 

(%) ! 


Expect 


gi | 15082261 | r 
ef |NP_003386 . 
1 | (NM_003395) 


wingless -type MMTV 
integration site family, 
member 14 

[Homo sapiens] 


365 


339/356 
(95) 


343/356 
(96) 


0 . 0 


gi | 3915306 | sp 
| O42280 | WN14_ 
CHICK 


WNT-14 PROTEIN PRECURSOR 


354 


283/333 
(84) 


310/333 
(92) 


e-156 


gi | 16303264 | d 
bj | BAB70499 . 1 
| (AB063483) 


WNT14B 

[Homo sapiens] 


357 


215/352 
(61) 


263/352 
(74) 


e-107 | 


gi | 17017976 | r 
ef |NP_003387 . 
1 | <NM_003396) 


wingless -type MMTV 
integration site family, 
member 15 precursor 
[Homo sapiens] 


357 


215/352 
(61) 


263/354 
(74) 


e-107 


gi | 18181917 | d 
bj | BAB83866 . 1 
| (AB073819) 


Wntl4b 

[Mus musculus] 


359 


216/354 
(61) 


264/354 
(74) 


e-106 j 



This BLASTP data is displayed graphically in the ClustalW in Table 63E. A multiple 
sequence alignment is given, with the disclosed NOV63 protein being shown on line 1 in a 
ClustalW analysis comparing the protein of the invention with the related protein sequences 
15 shown in Table 63D. 



Table 63E. ClustalW Alignment of NOV63 


NOV63 


(SEQ 


ID 


NO: 224) 


gi | 15082261 | 


(SEQ 


ID 


NO: 614) 


gi | 3915306 | 


(SEQ 


ID 


NO: 615) 



532 



... fi r I '! "ii m ,H Si I: ii 3 ^ 

— - _ _ ... \ 




gi|l6303264| 
gi | 17017976 j 
gij 18181917 j 



NOV63 

gi | 15082261 | 
gi|3915306| 
gijl6303264| 
gij 17017976 | 
gij 18181917 j 



NOV63 

gi|l508226l| 
gij 3915306 | 
gijl6303264| 
gi j 17017976 j 
gij 18181917 j 



NOV63 

gi|l508226l| 
gi|3915306| 
gij 16303264 | 
gij 17017976 j 
gij 18181917 j 



NOV63 

gi|l508226l| 
gi|3915306| 
gij 16303264 | 
gij 17017976 j 
gij 18181917 j 



NOV63 

gi|l508226l| 
gi j3915306 | 
gij 16303264 | 
gijl7017976j 
gij 18181917 j 



NOV63 J 
gi|l508226l| 
gij 3915306 | 
gij 16303264 | 
gi j 17017976 j 
gij 18181917 j 



NOV63 



gi 
gi 
gi 
gi 
gi 



15082261 | 
3915306 | 
16303264 
17017976 
18181917 



(SEQ ID NO:616) 
(SEQ ID NO: 617) 
(SEQ ID NO:618) 



10 20 30 

| I ' • - - I i I • ' * J 

UjAgLGYFLfflLCS fflKQAL- -G-^YPI; 

MLDGSgLARWLAgAFGLTLSjAAL - - RP* 

~~ ^lr^lglla ct 

3pp^aTOglc — BIal 
SppEWa^glc — 
HaSSalc 





130 



140 



150 
: I - : 




160 



170 

. _J_. . 



WQWGgCGDNLKYS 



.CSAGRMERCTCDg|PgLE|jRi|AWQWGgCGDNLKYS 

.csagrmerctcd^pBle|r||awowgBcgdnlkys 
.csagrmerctcd^pSleBrSawqwgBcgpnlkys 
gsagrmerctcd ^ pBleIrIawowgBcgdnlkys 
.csagrmerctcdSSpBleSrBawowgScgdnlkys 



•SleSrS. 



CAWQWGffiCGDNLKYS 



180 

4 

1 176 

3 168 

2 170 

H 170 

$ 172 



220 
- I - • 



230 
-I 



240 



TTCKCHGVSGSCi 
TTCKCHGVSGSCj 
TTCKCHGVSGSCj 
TTCKCHGVSGSCi 
TTCKCHGVSGSC 
TTCKCHGVSGSC 



280 



290 

RASg^GGSDPLiaSllPi 

ras@|ggsdpl 
-ksipghsdqi 

;LTKGLA 
•TKGLA 
PAKGLA 




300 




287 




295 




284 




287 




287 




289 



310 



320 




330 
■ J . . 



340 



350 



I 



I 



CCGRGffiffiTQSR 

ccgrgStqsr 
ccgrg^Stqsr 
ccgrgBItosr 

ccgrg E tqsr 

ccgrgBStqsr 



CgCQVgWCCYVECgQC 

cHcqvHwccyvecSqc 

cfficQ VgWC C Y VE C|Q C 

cScqvSwccyvecSqc 
cScqvBwccyvecSqc 



cBcqvBwccyvecSqc 



360 
J 

r 
r 
r 
v 
v 

A 



347 
355 
344 
347 
347 
349 



348 
356 
345 
348 
348 
350 



Q| 


E 


gVYTCKj 


q| 


E 


SVYTCK 


q| 


E 


SVYTCKJ 




E 


1VYTCKJ 




E 


1VYTCK* 




E 


3vytck| 



370 
I 



357 
365 
354 
357 
357 
359 
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Table 63F lists the domain description from DOMAIN analysis results against NOV63. 
This indicates that the NOV63 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 63F. Domain Analysis of NO V63 


gnl | Pf 


am|pfam00110, wnt, wnt family. SEQ ID NO:862 








CD-Length = 313 residues, 99.7% aligned 




Score = 


283 bits (725), Expect = 9e-78 




NOV63 : 


51 


CDRLK-LERKQRRMCRRDPGVAETLVEAVSMSALECQFQFRFERWNCTLEGRYRASL 


106 






C L L +QR++CRR+P V ++ E ++ ECQ QFR RWNC+ R R 




Sbj Ct : 


2 


CRSLPGLSPRQRQLCRRNPDVMASVSEGAQLAIQECQHQFRGRRWNCSTLDRLRWFGKV 


61 


NOV63 : 


107 


LKRGFKETAFLYAI SS AGLTHALAKACSAGRMERCTCDE - APDLENREAWQWGGCGDNLK 


165 






LK+G +ETAF+ YAI SSAG+ HA+ +ACS G +E C CD + +WQWGGC DN++ 




Sbjct : 


62 


LKKGTRETAFVYAISSAGVAHAVTRACSEGELESCGCDYKKGPGGPQGSWQWGGCSDNVE 


121 


NOV63 : 


166 


YSSKFVKEFL-GRRSSKDLRARVDFHNNLVGVKVIKAGVETTCKCHGVSGSCTVRTCWRQ 


224 






+ +F +EF+ R +D R+ ++ HNN G K +K+ + CKCHGVSGSC+++TCW 




Sbjct: 


122 


FGIRFSREFVnDARERERDARSLMNLHNNEAGRKAVKSHMRRECKCHGVSGSCSMKTCWLS 


181 


NOV63 : 


225 


LAP FHE VGKHL KHK YE TAL KV - G S TTNE AAGE AGA ISP PRGRAS GAGG S D P L PRT P EL VH 


283 






L F VG LK KY+ A++V + G A + R SD LV+ 




Sbjct : 


182 


L PD FRAVGD AL KDK YDG A I RVE PNKRGMGQGS APRLVAKNPRFKPPTRSD LVY 


234 


NOV63 : 


284 


LDDSPS FCL - - AGRFS PGTAGRRC HREKNCE S I CCGRGHNTQSRWTRPCQCQVRW 


337 






L+DSP +C S GT GR C CE +CCGRG+NTQ T C C+ W 




Sbjct : 


235 


LEDSPDYCERDRSTGSLGTQGRVCNKTSKGLDGCELLCCGRGYNTQQVERTEKCNCKFHW 


294 


NOV63 : 


338 


CCYVECRQCTQREE VYTCK 3 5 6 








CCYV+C +C + EV+TCK 




Sbjct : 


295 


CCYVKCEECQEWEVHTCK 313 





5 

Wnt-1 (previously known as int-1) is a proto-oncogene induced by the integration of 
the mouse mammary tumor virus. It is thought to play a role in intercellular communication 
and seems to be a signalling molecule important in the development of the central nervous 
system (CNS). The sequence of wnt-1 is highly conserved in mammals, fish, and amphibians. 

10 Wnt-1 is a member of a large family of related proteins that are all thought to be 

developmental regulators. These proteins are known as wnt-2 (also known as irp), wnt-3, up to 
wnt-1 5. At least four members of this family are present in Drosophila. One of them, wingless 
(wg), is implicated in segmentation polarity. All these proteins share the following features 
characteristics of secretory proteins, a signal peptide, several potential N-glycosylation sites 

1 5 and 22 conserved cysteines that are probably involved in disulfide bonds. The Wnt proteins 

seem to adhere to the plasma membrane of the secreting cells and are therefore likely to signal 
over only few cell diameters. 

The NOV63 disclosed in this invention is predicted to be expressed in at least the 
following tissues: brain This information was derived by determining the tissue sources of the 
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sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV63 is provided in Example 2. 

The nucleic acids and proteins of NOV63 are useful in potential therapeutic 
5 applications implicated in various pathological disorders described further herein, for example, 
the compositions of the present invention will have efficacy for treatment of patients suffering 
from: Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalcemia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
10 disorders, addiction, anxiety, pain, neuroprotection, osteoarthritis, and other diseases, 
disorders and conditions of the like. The NOV63 nucleic acid encoding the WNT-14 
precursor-like protein of the invention, or fragments thereof, may further be useful in 
diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed. The novel nucleic acid of the invention encoding a WNT-14 precursor-like 
15 protein includes the nucleic acid whose sequence is provided in Table 63 A, or a fragment 

thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 63A while still encoding a protein 
that maintains its WNT-14 precursor-like activities and physiological functions, or a fragment 
of such a nucleic acid. The invention further includes nucleic acids whose sequences are 
20 complementary to the sequence indicated in Table 63A, including nucleic acid fragments that 
are complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
25 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 21% of the bases 
may be so changed. 

30 The novel protein of the invention includes the WNT-14 precursor-like protein whose 

sequence is provided in Table 63B. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 63B while 
still encoding a protein that maintains its WNT-14 precursor-like activities and physiological 
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functions, or a functional fragment thereof. In the mutant or variant protein, up to about 1 8% 
of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



10 
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NOV64 

NOV64 has homology to dipeptidyl peptidase. The disclosed NOV64 (alternatively 
referred to herein as CG56884-01) includes the 2660 nucleotide sequence (SEQ ID NO:225) 
shown in Table 64A. A NOV64 ORF begins with a Kozak consensus ATG initiation codon at 
nucleotides 44-46 and ends with a stop codon at nucleotides 2633-2635. The disclosed 
NOV64 maps to human chromosome 17. 



Table 64A. NOV64 Nucleotide Sequence (SEQ ID 
NO:225) 



TCCTCCCCGCAGCTTCTCGCTGAATTCCGAGGGGGCTGAGAGGATGGCCACCACCGGGAC 

CCCAACGGCCGACCGAGGCGACGCAGCCGCCACAGATGACCCGGCCGCCCGCTTCCAGGT 

GCAGAAGCACTCGTGGGACGGGCTCCGGAGCATCATCCACGGCAGCCGCAAGTACTCGGG 

CCTCATTGTCAACAAGGCGCCCCACGACTTCCAGTTTGTGCAGAAGACGGATGAGTCTGG 

GCCCCACTCCCACCGCCTCTACTACCTGGGAATGCCATATGGCAGCCGAGAGAACTCCCT 

CCTCTACTCTGAGATTCCCAAGAAGGTCCGGAAAGAGGCTCTGCTGCTCCTGTCCTGGAA 

GCAGATGCTGGATCATTTCCAGGCCACGCCCCACCATGGGGTCTACTCTCGGGAGGAGGA 

GCTGCTGAGGGAGCGGAAACGCCTGGGGGTCTTCGGCATCACCTCCTACGACTTCCACAG 

CGAGAGTGGCCTCTTCCTCTTCCAGGCCAGCAACAGCCTCTTCCACTGCCGCGACGGCGG 

CAAGAACGGCTTCATGGTGTCCCCTATGAAACCGCTGGAAATCAAGACCCAGTGCTCAGG 

GCCCCGGATGGACCCCAAAATCTGCCCTGCCGACCCTGCCTTCTTCTCCTTCATCAATAA 

CAGCGACCTGTGGGTGGCCAACATCGAGACAGGCGAGGAGCGGCGGCTGACCTTCTGCCA 

CCAAGGTTTATCCAATGTCCTGGATGACCCCAAGTCTGCGGGTGTGGCCACCTTCGTCAT 

ACAGGAAGAGTTCGACCGCTTCACTGGGTACTGGTGGTGCCCCACAGCCTCCTGGGAAGG 

TTCAGAGGGCCTCAAGACGCTGCGAATCCTGTATGAGGAAGTCGATGAGTCCGAGGTGGA 

GGTCATTCACGTCCCCTCTCCTGCGCTAGAAGAAAGGAAGACGGACTCGTATCGGTACCC 

CAGGAC AGGTAGCAAGAAT C C C AAGATTG C CTTG AAAC TGG C TGAGT T CCAGACTG ACAG 

CCAGGGCAAGATCGTCTCGACCCAGGAGAAGGAGCTGGTGCAGCCCTTCAGCTCGCTGTT 

CCCGAAGGTGGAGTACATCGCCAGGGCCGGGTGGACCCGGGATGGCAAATACGCCTGGGC 

CATGTTCCTGGACCGGCCCCAGCAGTGGCTCCAGCTCGTCCTCCTCCCCCCGGCCCTGTT 

CAT C C CGAG CACAGAGAATGAGGAGCAGCGG CTAGC CT C TG C C AGAG CTGTC CCC AGGAA 

TGTCCAGCCGTATGTGGTGTACGAGGAGGTCACCAACGTCTGGATCAATGTTCATGACAT 

CTTCTATCCCTTCCCCCAATCAGAGGGAGAGGACGAGCTCTGCTTTCTCCGCGCCAATGA 

ATGCAAGACCGGCTTCTGCCATTTGTACAAAGTCACCGCCGTTTTAAAATCCCAGGGCTA 

CGAT TGG AGTG AG C C C TTC AGC C C CGGGGAAGATGAATTT AAGTG CCC CATTAAGGAAG A 

GATTGCTCTGACCAGCGGTGAATGGGAGGTTTTGGCGAGGCACGGCTCCAAGATCTGGGT 

CAATGAGG AGACC AAG CTGGTGTACT TC CAGGGCAC C AAGGACACG C CG C TGGAGC AC C A 

CCTCTACGTGGTCAGCTATGAGGCGGCCGGCGAGATCGTACGCCTCACCACGCCCGGCTT 

CTCCCATAGCTGCTCCATGAGCCAGAACTTCGACATGTTCGTCAGCCACTACAGCAGCGT 

GAGCACGCCGCCCTGCGTGCACGTCTACAAGCTGAGCGGCCCCGACGACGACCCCCTGCA 

CAAGCAGCCCCGCTTCTGGGCTAGCATGATGGAGGCAGCCAGCTGCCCCCCGGATTATGT 

TCCTCCAGAGATCTTCCATTTCCACACGCGCTCGGATGTGCGGCTCTACGGCATGATCTA 

CAAGCCCCACGCCTTGCAGCCAGAGAAGAAGCACCCCACCGTCCTCTTTGTATATGGAGG 

CCCCCAGGTGCAGCTGGTGAATAACTCCTTCAAAGGCATCAAGTACTTGCGGCTCAACAC 

ACTGGCCTCCCTGGGCTACGCCGTGGTTGTGATTGACGGCAGGGGCTCCTGTCAGCGAGG 

GCTTCGGTTCGAAGGGGCCCTGAAAAACCAAATGGGCCAGGTGGAGATCGAGGACCAGGT 
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GGAGGGCCTGCAGTTCGTGGCCGAGAAGTATGGCTTCATCGACCTGAGCCGAGTTGCCAT 
CCATGGCTGGTCCTACGGGGGCTTCCTCTCGCTCATGGGGCTAATCCACAAGCCCCAGGT 
GTTCAAGGTGGCCATCGCGGGTGCCCCGGTCACCGTCTGGATGGCCTACGACACAGGGTA 
CACTGAGCGCTACATGGACGTCCCTGAGAACAACCAGCACGGCTATGAGGCGGGTTCCGT 
GGCCCTGCACGTGGAGAAGCTGCCCAATGAACCCAACCGCTTGCTTATCCTCCACGGCTT 
CCTGGACGAAAACGTGCACTTTTTCCACACAAACTTCCTCGTCTCCCAACTGATCCGAGC 
AGGGAAAC CTT AC C AG CTC C AGATCT AC C CCAACG AGAG AC AC AGT AT TCG CTGCC C CG A 
GTCGGGCGAGCACTATGAAGTCACGTTGCTGCACTTTCTACAGGAATACCTCTGAGCCTG 
CCC AC CGGGAGCCGCCACAT 



A NOV64 polypeptide (SEQ ID NO:226) encoded by SEQ ID NO:225 is 863 amino 
acids in length and is presented using the one-letter amino acid code in Table 64B. The Psort 
profile for NOV64 predicts that this sequence has no signal peptide and is likely to be 
localized at peroxisomal microbodies with a certainty of 0.6400. In alternative embodiments, 
a NOV64 polypeptide is located to lysosomes with a certainty of 0.1000, or to the cytoplasm 
with a certainty of 0.4500. 



Table 64B. NOV64 Polypeptide Sequence (SEQ ID 
NO:226) 

MATTGTPTADRGDAAATDDPAARFQVQKHSWDGLRS I IHGSRKYSGLI VNKAPHDFQFVQ 
KTDESGPHSHRLYYLGMPYGSRENSLLYSEIPKKVRKEALLLLSWKQMLDHFQATPHHGV 
YSREEELLRERKRLGVFGITSYDFHSESGLFLFQASNSLFHCRDGGKNGFMVSPMKPLEI 
KTQCSGPRMDPKICPADPAFFSFINNSDLWVANIETGEERRLTFCHQGLSNVLDDPKSAG 
VATFVIQEEFDRFTGYWWCPTASWEGSEGLKTLRILYEEVDESEVEVIHVPSPALEERKT 
DSYRYPRTGSKNPKIALKLAEFQTDSQGKIVSTQEKELVQPFSSLFPKVEYIARAGWTRD 
GKYAWAMFLDRPQQWLQLVLLPPALFIPSTENEEQRLASARAVPRNVQPYVVYEEVTNVW 
INVHDIFYPFPQSEGEDELCFLRANECKTGFCHLYKVTAVLKSQGYDWSEPFSPGEDEFK 
CPIKEEIALTSGEWEVLARHGSKIWVNEETKLVYFQGTKDTPLEHHLYWSYEAAGEIVR 
LTTPGFSHSCSMSQNFDMFVSHYSSVSTPPCVHVYKLSGPDDDPLHKQPRFWASMMEAAS 
CPPDYVP PE I FHFHTRSDVRLYGM I Y K PHAL QP E KKH P T VL F VYGG PQ VQLVNNS FKGIK 
YLRLNTLASLGYAWVIDGRGSCQRGLRFEGALKNQMGQVEIEDQVEGLQFVAEKYGFID 
LSRVAIHGWSYGGFLSLMGLIHKPQVFKVAIAGAPVTVWMAYDTGYTERYMDVPENNQHG 
YEAGSVALHVEKLPNEPNRLLILHGFLDENVHFFHTNFLVSQLIRAGKPYQLQIYPNERH 
S I RCPESGEHYEVTLLHFLQE YL 



A BLAST analysis of NOV64 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV64 had high 
homology to other proteins as shown in Table 64C. 



Table 64C. BLASTX results from PatP database for NOV64 










Smallest 










Sum 








High 


Probability 1 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp 


AAB41626 


Human ORFX ORF13 9 0 polypeptide sequence 


3403 


0 . 0 


patp 


AAM38724 


Human polypeptide 


1987 


0 . 0 


patp 


AAM40510 


Human polypeptide 


1823 


0.0 


patp 


AAB47187 


Human DPP8 - Homo sapiens, 8 82 aa . 


2868 


1.5e-298 


patp 


AAY90299 


Human peptidase, HPEP-16 protein sequence 


2547 


1 . 6e-264 
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In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 1601 of 2525 bases (63%) identical to a gb:GENBANK- 
ID:AF221634|acc:AF22 1634.1 mRNA from Homo sapiens (dipeptidyl peptidase 8 (DPP8) 
mRNA). The full amino acid sequence of the protein of the invention was found to have 521 
of 856 amino acid residues (60%) identical to, and 657 of 856 amino acid residues (76%) 
similar to, the 882 amino acid residue ptnr:TREMBLNEW-ACC:AAG29766 protein from 
Homo sapiens (Human) (DIPEPTIDYL PEPTIDASE 8). NOV64 also has homology to the 
other proteins shown in the BLASTP data in Table 64D. 



Table 64D. NOV64 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 

(%> 


Expect 


gi | 17865311 | g 
b|AAL47179 . 1 | 
AF452102_1 
(AF452102) 


dipeptidyl peptidase 9 
{Homo sapiens] 


863 


862/863 
(99) 


862/863 
(99) 


0.0 


gi | 17483229 | r 
ef |XP_035636 . 
2 | (XM 035636) 


hypothetical protein 
XP_035636 
[Homo sapiens] 


689 


688/689 
(99) 


688/689 
(99) 


0 . 0 


gi | 12855335 | d 
bj |BAB30295.1 
| (AK016546) 


DIPEPTIDYL PEPTIDASE 
8~putative [Mus 
musculus] 


883 


519/859 
(60) 


658/859 
(76) 


0 . 0 


gi | 11095188 |g 
b | AAG29766 . 1 | 
AF221634_1 
(AF221634) 


dipeptidyl peptidase 8 
[Homo sapiens] 


882 


516/840 
(61) 


650/840 
(76) 


0 . 0 


gi | 3513303 | gb 
| AAC3 3 801 . 1 | 
(AC005594) 


R269841 
[Homo sapiens] 


508 


492/543 
(90) 


495/543 
(90) 


o.o 1 



This BLASTP data is displayed graphically in the ClustalW in Table 64E. A multiple 
sequence alignment is given, with the NOV64 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 64D. 



Table 64E. ClustalW Alignment of NOV64 



NOV64 

gi 1 17865311 1 
gi j 17483229 | 
gi | 12855335 j 
gi | 11095188 | 
gi [3513303 | 



(SEQ ID NO: 226) 
(SEQ ID NO: 619) 
(SEQ ID NO: 620) 
(SEQ ID NO: 621) 
(SEQ ID NO: 622) 
(SEQ ID NO: 623) 



10 20 30 40 50 60 

NOV64 1 - pATT(^PTABRGDAApTffiDH - AAI^QffiQ^^jDG^@S^iHG^S|^SfflL^ 48 

gi | 17865311 | 1 y ATT ^ pT ^ RGDAA jg T g D g _ AARgQ^gf^DGggiSl JHGgg^SgLg 48 

gi|!7483229| 1 1 
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gi| 12855335 | 
gi|H095188| 
gi|3513303 | 



NOV6 4 

gi|l786531l| 
gi|l7483229| 
gi|l2855335| 
gi|H095188 | 
gi|3513303 | 



NOV64 

gi|l786531l| 
gi|l7483229| 
gi|l2855335| 
gi|11095188| 
gi|3513303 | 



NOV64 

gi| 17865311 | 
gij 17483229 | 
gi|l2855335| 
gi | 11095188 j 
gi|3513303 | 



NOV64 

gi|l786531l| 
gi|l7483229| 
gi|l2855335| 
gi|H095188| 
gi|3513303 | 



NOV64 

gi 1 17865311 1 
gi|l7483229| 
gi j 12855335 j 
gi|H095188| 
gi|3513303 | 



NOV64 

gi | 17865311 1 
gi | 17483229 | 
gi|l2855335| 
gi|H095188| 
gi|3513303| 



NOV64 

gi|l786531l| 
gi|17483229| 
gi|l2855335| 
gi|H095188| 
gi|3513303 | 




61 
60 
1 



109 
109 
1 

121 
120 
1 



YSEIPK 
YSEIPK 




160 



170 



180 




GlYSREEELLRERKRj 

gSysreeellrerkr 






260 



270 
■ ■ 1 . . 



280 



290 



300 



QEEFDRfgGYWWCF 

qeefdrSRgywwcp 
qeefdr!||gywwcp 

QEEFDrB|GYWWCP 
QEEFDR™GYWWCP 




LRILYEEMDESEVE 
LRILYEEfflDESEVE 

lrilyeeRdeseve 
lrilyeeBdeseve 

LRI LYEeSdESEVE 




310 



I 



289 
289 
115 
300 
299 
1 



HVgSPgLE 

hvHspBle 
hvHspBle 
hvHspSle 




350 



360 



I 



D^GglVSTQEKELVQPFSSLFPK 

d^gSivstqekelvqpfsslfpk 
d^gIi vstqe kelvqpfsslfpk 

d|§g| I gjj^KEL VQ P F^L F[ 
^SSGBl BMBS KELROPFtaiLFt 

IVSTQEKELVQPFSSLFPK 



348 
348 
174 
359 
358 
19 



370 
IRE 



380 




390 
. . I . . 



400 
. . I . . 



410 



gkyawamfldrpqqwlqlvllppalfipsteneeqrlasaravprnvq 
gkyawamfldrpqqwlqlvllppalfipsteneeqrlasaravprnvq 
gkyawamfldrpqqwlqlvllppalf i p ^^^^^ r ^^^^ vp ^ v ° 

Iawamfldrpqqwlqlvllppalfipsteneeqrlasaravprnvq 



420 

- 1 

408 
408 
234 
&BT 419 



| 



430 
. . | I 



418 
72 



440 
. . I . . 



I 



450 
. . I . . 



I 



460 
. . I . . 



I 



470 



480 
^1 



409 
409 
235 
420 
419 
73 



PYWYEEVTNVWINVHDIFYPFPQSEGEDELCFLRANECKTGFCHLYKVTAVLKSQGYDW 
PYWYEEVTNVWINVHDIFYPFPQSEGEDELCFLRANECKTGFCHLYKVTAVLKSQGYDW 
PYWYEEVTNVWINVHDIFYPFPQSEGEDELCFLRANECKTGFCHLYKVTAVLKSQGYDW 



3YEEBT(gI8wiNfiHDIFI^FPQ3SBEDEiigFE 



gECKTGFSHLYKflT^HljF 



p153 yee B t 1^^ 

PYWYEEVTNVWINVHDIFYPFPQSEGEDELCFLRANECKTGFCHLYKVTAVLKSQGYDW 



468 
468 
294 
478 
477 
132 



490 



500 



520 



530 



540 
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NOV64 

gi 1 17865311 1 
gi|l7483229| 
gi|l2855335| 
gi j 11095188 | 
gi|3513303 | 



NOV64 

gi|!786531l| 
gi|l7483229| 
gi|l2855335 j 
gi | 11095188 j 
gi|3513303 | 



NOV64 

gi | 17865311 | 
gi | 17483229 j 
gi | 12855335 j 
gi|H095188| 
gi|3513303 | 



NOV64 

gi | 17865311 | 
gi|l7483229| 
gi j 12855335 | 
gij 11095188 j 
gi|3513303 | 



NOV64 

gi|l786531l| 
gi|l7483229| 
gi|l2855335| 
gi|H095188| 
gi|3513303 | 



NOV64 

gi|l786531l| 
gijl7483229| 
gijl2855335| 
gi|H095188 | 
gi|3513303 | 



469 
469 
295 
479 
478 
133 



fkcpikeeialtsgewevlgrhgsglwvneetklvyfqgtkdtplehhly 
fkcpikeeialtsgewevlBrhgsHiwweetklvyfqgtkdtplehhly 
fkcpikeeialtsgewevlBrhgsHiwvneetklvyfqgtkdtplehhly 

EFKCPIKEElfflTSGEWEVLgRHGsSlWVffi^ 

S fkcpike eia^t sgewevl§rhgsiS iB v 1 e B2 lvyf I^ gtkd 1 plehhly 

SEPFSPGEfcfas^^^M^B^jT.T-J'M 1 i J 'gmaereiWVNEETKLVYFOGTKDTPLEHHLY 



600 



DEI 

\PS£ 
\PSE 




649 
649 
475 
659 
658 
285 



709 
709 
535 
719 
718 
345 



769 
769 
595 
779 
778 
405 



610 




620 
. . I . . 



630 



I 



I 



640 
■ - I . . 



650 



660 



PDYffiPPEIFHFHTRSDVRLYGMIYKPHALQPGKKHPTVLFVYGGPQ 
PDYfflPPEIFHF HTRSDVR LYGMIYKPHALQPGKKHPTVLFVYGGPQ 
PDYHPPEIFgF j^^e^ LYGMgYKPHfgLQPGKKgPTVLFjlYGGPQ' 

PDYUPPE I F§F^^^^5 LYGM i YKPH 3 L Q PGKK S PTVLF S YGGPQ 

31 FHFHTRSDVRLYGM I YKPHALQPGKKHPTVLFVYGGPQ 



648 
648 
474 
658 
657 
284 



I 



670 
. . I . . 



I 



680 
. . I . . 



i 



690 
. . I . . 



I 



700 



710 



I 



720 
■ - I 



VQLVNNS FKG I KYLRLNTLAS LG YAVW I DGRGS CQRGLRFEGALKNQMGQVE I EDQVEG 
VQLVNNS FKG I KYLRLNTL AS LG YAVW I DGRGS CQRGLRFEGALKNQMGQVE I EDQVEG 
VQLVNNS FKG I KYLRLNT LAS LG YAVW I DGRGS CQRGLRFEGALKNQMGQVE I EDQVEG 
VQLVNNgFKGgKYgRLNTLASLGYgvWIDgRGSCgRGLgFEGA{|K^MGQpEl(gDQVEG 
VQLVNNgFKG^KYgRLNTLASLGYgVVVID^RGSCgRGL^FEGAyK^^MGQyEIj^DQVEG 
VQLVNNS FKGI KYLRLNT LAS LGYAVWIDGRGSCQRGLRFEGALKNQMGQVEI EDQVEG 



708 
708 
534 
718 
717 
344 



730 
. - I . . 



740 



750 



760 
. . I . . 



770 



780 



LQFVAEKYGFIDLSRVAIHGWSYGGFLSLMGLIHKPQVFKVAIAGAPVTVWMAYDTGYTE 
LQFVAEKYGFIDLSRVAIHGWSYGGFLSLMGLIHKPQVFKVAIAGAPVTVWMAYDTGYTE 
LQFVAEKYGFIDLSRVAIHGWSYGGFLSLMGL IHKPQV FKVAIAGAPVTVWMAYDTGYTE 
LQSBa^YiJf I DLgRVgl HGWS YGGgLS LMgL^^g^FgVAI AGAPVTilwjjgYDTGYTE 
LqSBa^y'f I DLgR vjl HGWS YGG§LS LMgL jffi||0|Fj§VAI AGAPVTjSwf^YDTGYTE 
LQFVAEKYGFIDLSRVAIHGWSYGGFLSLMGLIHKPQVFKVAIAGAPVTVWMAYDTGYTE 



768 
768 
594 
778 
777 
404 



790 



I 



800 
. . I . . 



I 



810 



820 



830 
. - I . . 



840 
• - L 



RYMDVPENNQHGYEAGS VALHVE KLPNE PNRLL I LHGFLDENVHFFHTNFLVSQL I RAGK 
RYMDVPENNQHGYEAGS VALHVE KLPNE PNRLL I LHGFLDENVHFFHTNFLVSQL I RAGK 
RYMDVPENNQHGYEAGSVA LHV E KLPNE PNRLLILHGFLDENVHFFHTNFLVSQLI RAGK 
RYM^P^N^GYfflGS VA||^gE KgP§E PNRLLilLHGFLDENVHFgHT^LjlsgLERAGK 
RYM^Pi^&J^GY!HGSVA^SEKBpiEPNRLL|LHGFLDENVHF"HT^LlsBLlRAGK 
RYMDVPENNQHGYEAGSVALHVE KLPNE PNRLL I LHGFLDENVHFFHTNFLVSQL I RAGK 



828 
828 
654 
838 
837 
464 



850 
..|.. 



860 
. . I . . 



I 



870 



880 

■ J ■ ■ 



890 



NOV64 


829 


PYQL 


gi 


17865311 | 


829 


PYQL 


gi 


17483229 | 


655 


PYQL 


gi 


12855335 | 


839 


pySl 


gi 


11095188 | 


838 


PYgL 


gi 


3513303 | 


465 


PYQL 



QVALPPVSP 



QIYPNERHSIRCPESGEHYEVTLLHFLQEYL 
QIYPNERHSIRCPESGEHYEVTLLHFLQEYL 
QI YPNERHS I RCPESGEHYEVTLLHFLQEYL 
QIYPffiERHSIRffiPESGEHYE|gLLH|jjLQEjSL 
OIYP^ERHSIRfflPESGEHYE^LLHgjLOE^L 
QI YPNERHS I RCPESGEHYEVTLLHFLQEYL 



J 



863 

863 

689 

GSRIAALKVI 883 
GSRIAALKVI 882 
508 



Table 64F lists the domain description from DOMAIN analysis results against NOV64. 
This indicates that the NOV64 sequence has properties similar to those of other proteins 
known to contain this domain. 
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Table 64F. Domain Analysis of NOV64 

gnl 1 Pf am] pf am0093 0 , DPPIV_N_term / Dipeptidyl peptidase IV (DPP IV) N- terminal 

region. This family is an alignment of the region to the N- terminal side of 

the active site. The Prosite motif does not correspond to this Pfam entry. 

SEQ ID NO: 8 63 

CD-Length = 504 residues 
Score = 112 bits (280), Expect = 9e-26 

NOV64 : 200 FFSFINNSDLWVANIETGEERRLTFCHQGLSNVLDDPKSAGVATFVIQEEFDRFTG- YWW 258 

+ F+ +++L++ + +G ++T G SN + + G+ +V +EE * WW 

Sbjct: 122 KLAFVRDNNLYIQKLPSGPAIQITT- -DGKSNDIFN GIPDWVYEEEILSTDYALWW 175 

NOV64 : 259 CPTASWEGSEGLKTLRILYEEVDESEVEVIHVPSPALEE- -RKTDSYRYPRTGSKNPKIA 316 

P + Y ++SEV VI P + + +YP+ G NP + 

'Sbjct: 176 SPDGD FLAYLRFNDSEVPVI EYPFYTDDSQYPEDME I KYPKAGDPNPTVK 225 

NOV64 : 317 LKLAEFQTDSQGKIVSTQEKELVQPFSSLFPKVEYIARAGWTRDGKYAWAMFLDRPQQJtfL 3 76 

L + G VS + +SL YI R W + + A +L+R Q 

Sbjct : 226 LFWNLAD GASVSE IPLPASLASGDYYITRVAWVTNERLA- VQWLNRDQNIS 276 

NOV64 : 377 QL VLLPPALF I P S T ENE E QRL AS ARAVP RNVQ P YWYE E VTNVW I NVHD IFYPFPQSEGE 43 6 

L L A +S V +N +E + W+ + P +G 

Sbjct: 277 VLSLCDTA SSTWNWKN FEDSETGWVETFNPSLPVFPLDGL 317 

NOV64 : 437 DELCFLRANECKTGFCHLYKVTAVLKSQGYDWSEPFSPGEDEFKCPIKEEIALTSGEWEV 496 

+L ++ + G+ HL E + K PI ALT G WEV 

Sbjct : 318 S Y — YLDI SD - RDGYKHLAYY ELDGKEPI ALTKGNWEV 352 

NOV64 : 497 LARHG SKI WVNE ET KL VY FQGT KDT P L EHHLYWS Y EAAGEIVRLTTPGFSHSCSM 552 

+ + V+ +T VYF T+ + E HLY +S + G+ +S S 

Sbjct: 353 T NILGVDSKTDTVYFTATEEGSGERHLYSISLKGGKTTLSCQLDSERCGY- YSASF 407 

NOV64 : 553 SQNFDMFVSHYSSVSTPPCVHVYKLSGPDDDPLHKQPRFWASMME AASCPPDYV 606 

S N ++ YS P SDL +E A 

Sbjct: 408 S PNAKYYI LTYSGPGVP IQTLHSSNDTKELR TLEDNEALKKALKNYQLP 456 

NOV64: 607 PPE I FHFHTRSDVRLYGM I YKPHALQPEKKHPTVLFVYGGPQVQLV 652 

E + L + KP P KK+P + FVYGGP Q V 
Sbjct: 457 SKEFGKIKLADGITLNYQMIKPANFDPSKKYPVLFFVYGGPGSQQV 502 



NOV64 is a member of the family of dipeptidyl peptidases (DPPs). This group of 
enzymes catalyzes the removal of dipeptides from the N termini of polypeptides. This novel 
gene has greatest homology to a recently discovered protein, DPP8 (Abbott et al., Eur J 
Biochem 2000 Oct;267(20):6140-50). DPP8 in turn is related to DPP4, which is a cell surface 
peptidase involved in T-cell activation (Kahne et al., Int J Mol Med 1999 Jul;4(l):3-15). 
Other members of the peptidase family have been targeted as putative drug targets, for 
instance, in situations where they might cleave polypeptides beneficial in the prevention or 
reduction of a disease condition. 

The NOV64 disclosed in this invention is predicted to be expressed in at least the 
following tissues: bone, bone marrow, brain (cerebellum, substantia nigra, thalamus), 
bronchus, cartilage, cervix, chorionic villus, coronary artery, colon, breast, heart, kidney, liver, 
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lung, lymph node, lymphoid tissue, ovary, placenta, pituitary gland, respiratory bronchiole, 
retina, skeletal muscle, skin, small intestine,spinal cord, spleen, testis, thymus, thyroid, 
umbilical vein, urinary bladder, vulva, adrenal gland/suprarenal gland, synovium/synovial 
membrane, and uterus. This information was derived by determining the tissue sources of the 
5 sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV64 is provided in Example 2. 

The nucleic acids and proteins of NOV64 are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 

10 the compositions of the present invention will have efficacy for treatment of patients suffering 
from: Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, osteoarthritis, and other diseases, 

15 disorders and conditions of the like. A NOV64 nucleic acid encoding the dipeptidyl 
peptidase-like protein of the invention, or fragments thereof, may further be useful in 
diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed. 

The novel nucleic acid of the invention encoding a dipeptidyl peptidase-like protein 
20 includes the nucleic acid whose sequence is provided in Table 64A, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 64A while still encoding a protein that maintains 
its dipeptidyl peptidase-like activities and physiological functions, or a fragment of such a 
nucleic acid. The invention further includes nucleic acids whose sequences are complementary 
25 to the sequence of Table 64A, including nucleic acid fragments that are complementary to any 
of the nucleic acids just described. The invention additionally includes nucleic acids or nucleic 
acid fragments, or complements thereto, whose structures include chemical modifications. 
Such modifications include, by way of non-limiting example, modified bases, and nucleic 
acids whose sugar phosphate backbones are modified or derivatized. These modifications are 
30 carried out at least in part to enhance the chemical stability of the modified nucleic acid, such 
that they may be used, for example, as antisense binding nucleic acids in therapeutic 
applications in a subject. In the mutant or variant nucleic acids, and their complements, up to 
about 37% of the bases may be so changed. 
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The novel protein of the invention includes the dipeptidyl peptidase-like protein whose 
sequence is provided in Table 64B. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 64B while 
still encoding a protein that maintains its dipeptidyl peptidase-like activities and physiological 
functions, or a functional fragment thereof. In the mutant or variant protein, up to about 40% 
of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV65 

NOV65 includes two dual specificity phosphatase-like proteins, designated herein as 
NOV65a and NOV65b. 

NOV65a 

A disclosed NOV65a (alternatively referred to herein as CG56651-01) includes the 711 
nucleotide sequence (SEQ ID NO: ) shown in Table 65A. A NOV65a ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a stop codon at 
nucleotides 652-654. The disclosed NOV65a maps to human chromosome 1 . 



Table 65A. NOV65a Nucleotide Sequence (SEQ ID 
NO:227) 

ATGCTTCCCAAACGCGTGAGGGAGAAGATGGATGACACCAGCCTCTATAATACGCCCTGT 
GTCCTGGACCTACAGCGGGCCCTGGTTCAGGATCGCCAAGAGGCGCCCTGGAATGAGGTG 
GATGAGGTCTGGCCCAATGTCTTCATAGCTGAGAAGAGTGTGGCTGTGAACAAGGGGAGG 
CTGAAGAGGCTGGGAATCACCCACATTCTGAATGCTGCGCATGGCACCGGCGTTTACACT 
GGCCCCGAATTCTACACTGGCCTGGAGATCCAGTACCTGGGTGTAGAGGTGGATGACTTT 
CCTGAGGTGGACATTTCCCAGCATTTCCGGAAGGCGTCTGAGTTCCTGGATGAGGCGCTG 
CTGACTTACAGAGGGAAAGTCCTGGTCAGCAGCGAAATGGGCATCAGCCGGTCAGCAGTG 
CTGGTGGTCGCCTACCTGATGATCTTCCACAACATGGCCATCCTGGAGGCTTTGATGACC 
GTGCGTAAGAAGCGGGCCATCTACCCCAATGAGGGCTTCCTGAAGCAGCTGCGGGAGCTC 
AATG AGAAGTTG ATGAGGAG AGAG AAG AGGACT ATGGCCGGG AGGGGGG AT C AG C TG AGG 
CTGAGGAGGGCGAGGGCACTGGGAGCATGCTCGGGGCCAGAGTGCACGCCCTGACGGTGG 
AAGAGGAGGACGACAGCGCCAGCCACCTGAGTGGCTCCTCCCTGGGGAAGG ' 

The NOV65a polypeptide (SEQ ID NO:228) encoded by SEQ ID NO:227 is 217 
amino acids in length and is presented using the one-letter amino acid code in Table 65B. The 
Psort profile for NOV65a predicts that this sequence is likely to be a Type II membrane 
protein, and is likely to be localized at the plasma membrane with a certainty of 0.4400. In 
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alternative embodiments, a NOV65a polypeptide is located to the endoplasmic reticulum 
(membrane) with a certainty of 0.8500, or to the nucleus with a certainty of 0.7400. 



Table 65B. NOV65a Polypeptide Sequence (SEQ ID 
NO:228) 

MLPKRVREKMDDTSLYNTPCVLDLQRALVQDRQEAPWNEVDEVWPNVFIAEKSVAVNKGR 
LKRLGITHILNAAHGTGVYTGPEFYTGLEIQYLGVEVDDFPEVDISQHFRKASEFLDEAL 
LTYRGKVLVSSEMGISRSAVLWAYLMIFHNMAILEALMTVRKKRAIYPNEGFLKQLREL 
NEKLMRREKRTMAGRGDQLRLRRARALGACSGPECTP 



NOV65b 

The disclosed NOV65b (alternatively referred to herein as CG56652-02) includes the 
3212 nucleotide sequence (SEQ ID NO: ) shown in Table 65C. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a stop codon at 
nucleotides 3193-3195. The disclosed NOV65b maps to human chromosome 1 . 



Table 65C. NOV65b Nucleotide Sequence (SEQ ID 
NO:229) 

ATGCTTCCCAAACGCGTGAGGGAGAAGATGGATGACACCAGCCTCTATAATACGCCCTGT 

GTCCTGGACCTACAGCGGGCCCTGGTTCAGGATCGCCAAGAGGCGCCCTGGAATGAGGTG 

GATGAGGTCTGGCCCAATGTCTTCATAGCTGAGAAGAGTGTGGCTGTGAACAAGGGGAGG 

CTGAAGAGGCTGGGAATCACCCACATTCTGAATGCTGCGCATGGCACCGGCGTTTACACT 

GGCCCCGAATTCTACACTGGCCTGGAGATCCAGTACCTGGGTGTAGAGGTGGATGACTTT 

CCTGAGGTGGACATTTCCCAGCATTTCCGGAAGGCGTCTGAGTTCCTGGATGAGGCGCTG 

CTGACTTACAGAGGGAAAGTCCTGGTCAGCAGCGAAATGGGCATCAGCCGGTCAGCAGTG 

CTGGTGGTCGCCTACCTGATGATCTTCCACAACATGGCCATCCTGGAGGCTTTGATGACC 

GTGCGTAAGAAGCGGGCCATCTACCCCAATGAGGGCTTCCTGAAGCAGCTGCGGGAGCTC 

AATG AGAAGTT G ATGGAGG AG AGAGAAG AGGACT ATGGC CGGGAGGGGGG AT C AG C TG AG 

GCTGAGGAGGGCGAGGGCACTGGGAGCATGCTCGGGGCCAGAGTGCACGCCCTGACGGTG 

GAAGAGGAGGACGACAGCGCCAGCCACCTGAGTGGCTCCTCCCTGGGGAAGGCCACCCAG 

GCCTCCAAGCCCCTCACCCTCATAGACGAGGAGGAGGAGGAGAAACTGTACGAGCAGTGG 

AAGAAGGGGCAGGGCCTCCTCTCAGACAAGGTCCCCCAGGATGGAGGTGGCTGGCGCTCA 

GCCTCCTCTGGC C AGGGTGGGGAGG AG CTCG AGGACGAGG ACGTGGAG AGG ATC AT C C AG 

G AGTGGC AGAGC CG AAACG AG AGGT AC C AAGC AGAAGGGT AC C GG AGGTGGGG AAGGG AG 

GAGGAGAAGGAGGAGGAGAGCGACGCTGGCTCCTCGGTGGGGAGGCGGCGGCGCACCCTG 

AGCGAGAGC AG CG C C TGGG AG AGCGTGAGC AG CC ACG AC ATCTGGGT C CTG AAG C AG CAG 

CTGGAGCTGAACCGCCCGGACCACGGCAGGAGGCGCCGCGCAGACTCGATGTCCTCGGAG 

AGCACCTGGGACGCATGGAACGAGAGGCTGCTGGAGATTGAGAAGGAGGCTTCCCGGAGG 

T AC C ACGCC AAGAGC AAGAG AGAGGAGGCGGC AGAC AGGAGCT C AGAAGC AGGGAG C AGG 

GTGCGGGAGGATGATGAGGACAGCGTGGGCTCTGAGGCCAGTTCCTTCTACAACTTCTGC 

AGCAGGAACAAGGACAAGCTCACTGCCCTGGAAAGATGGAAGATCAAGAGAATCCAATTT 

GG ATTT C AC AAGAAAG AC T TGGGAGCGGG AGACAGC AGCGGTGAGC CCGGTGC AGAGGAG 

GCAGTAGGGGAGAAGAACCCCTCCGACGTCAGCCTGACAGCCTACCAGGCCTGGAAGCTG 

AAACACCAGAAGAAGGTGGGCAGTGAGAACAAGGAGGAGGTGGTGGAGCTCAGCAAGGGG 

GAGGACTCGGCCTTGGCTAAGAAGAGACAACGGAGGCTGGAGCTGCTGGAGAGAAGCCGG 

CAGACGCTGGAGGAGAGCCAGTCTATGGCAAGCTGGGAGGCGGACAGCTCCACGGCCAGC 

GGGAGCATTCCCCTGTCTGCGTTCTGGTCTGCAGACCCCTCAGTCAGCGCTGATGGGGAC 

ACGACGTCAGTACTGAGCACCCAGAGCCACCGCTCCCACCTGTCTCAGGCTGCAAGCAAC 

ATAGCGGGGTGTTCAACCTCCAACCCCACCACACCCCTGCCTAACCTGCCAGTGGGGCCT 

GGAGACACCATTTCCATTGCCAGTATCCAGAACTGGATTGCCAATGTAGTCAGTGAGACC 

CTTGCTCAGAAGCAAAATGAAATGCTGCTGTTGTCCCGCTCACCGTCTGTTGCAAGCATG 

AAGGCAGTACCAGCGGCTAGCTGCCTGGGGGATGACCAAGTCTCCATGCTTAGTGGACAC 

AGCAGCTCCTCCTTGGGTGGCTGCCTGTTGCCTCAGAGCCAGGCAAGACCCAGCTCTGAC 

ATGCAGTCTGTGCTGTCCTGCAACACCACACTGAGCTCACCCGCGGAAAGTTGCAGAAGC 

AAAGTGAGGGGGACCAGCAAGCCCATCTTCAGCCTCTTTGCTGACAATGTGGACCTAAAG 

GAACTTGGCCGGAAGGAGAAGGAGATGCAGATGGAGCTTAGGGAGAAGATGTCTGAGTAC 
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AAAATGGAAAAGCTGGCCTCAGACAACAAACGCAGCTCCCTCTTCAAGAAGAAGAAGGTC 
AAGGAAGATGAGGATGATGGTGTGGGTGATGGGGATGAGGACACTGACAGTGCCATAGGG 
AGCTTCCGATATTCTTCCCGCAGTAATTCCCAGAAACCTGAAACAGACACATGCTCCTCC 
CTGGCTGTCTGTGATCACTATGCAAGTGGCAGCAGAGTTGGCAAAGAGATGGATAGCAGT 
ATTAATAAGTGGCTCAGTGGCCTCAGGACGGAGGAAAAACCTCCTTTCCAAAGTGACTGG 
TCTGGAAGTTCCAGAGGGAAGTACACCAGATCGTCCCTGCTCAGGGAGACAGAGTCTAAA 
TCCTCCAGTTACAAGTTTTCCAAATCCCAGTCAGAGGAACAGGACACCTCCTCCTACCAC 
GAGGCAAATGGCAACTCTGTAAGAAGCACTTCACGGTTCTCATCTTCCTCCACCAGGGAG 
GGCAGAGAGATGCACAAGTTCTCCAGGTCCACGTACAACGAGACCTCAAGTTCCCGAGAG 
GAGAGCCCAGAGCCCTACTTCTTCCGCCGGACCCCAGAGTCCTCAGAAAGGGAAGAGTCC 
CCAGAACCACAGCGCCCAAATTGGGCCAGGTCCAGGGACTGGGAAGATGTGGAAGAGTCA 
TCCAAGTCAGACTTCTCTGAATTTGGAGCCAAGAGGAAGTTCACCCAGAGCTTTATGAGG 
TCTGAAGAAGAGGGAGAGAAAGAGAGGACAGAAAACAGAGAAGAAGGGAGGTTTGCATCT 
GG ACGGCGGT C C C AGT AT CGG AGAAGC ACTG AC AGGG AGG AAGAGGAAGAAATGGACG AT 
GAAG C CAT C ATTG CTG C T TGG AG ACGC CGGC AAGAAG AAAC C AGG AC C AAG CT GC AGAAA 
AGG AGGGAGGACTGAGCTGGGGAAAAT CTG AG 



The NOV65b polypeptide (SEQ ID NO:230) encoded by SEQ ID NO:229 is 1064 
amino acids in length and is presented using the one-letter amino acid code in Table 65D. The 
Psort profile for NOV65b predicts that this sequence is a Type II membrane protein, and is 
5 likely to be localized at the plasma membrane with a certainty of 0.7900. In alternative 

embodiments, a NOV65b polypeptide is located to Goligi bodies with a certainty of 0.3000 or 
to the nucleus with a certainty of 0.8200. 



MLPKRVREKMDDTSLYNTPCVLDLQRALVQDRQEAPWNEVDEVWPNVFIAEKSVAVNKGR 
LKRLGITHILNAAHGTGVYTGPEFYTGLEIQYLGVEVDDFPEVDISQHFRKASEFLDEAL 
LTYRGKVLVS SEMGI S RSAVLWAYLM I FHNMAI LEALMTVRKKRAI YPNEGFLKQL.REL 
NEKLMEEREEDYGREGGSAEAEEGEGTGSMLGARVHALTVEEEDDSASHLSGSSLGKATQ 
ASKPLTLIDEEEEEKLYEQWKKGQGLLSDKVPQDGGGWRSASSGQGGEELEDEDVERIIQ 
EWQSRNERYQAEGYRRWGREEEKEEESDAGSSVGRRRRTLSESSAWESVSSHDIWVLiKQQ 
LELNRPDHGRRRRADSMSSESTWDAWNERLLEIEKEASRRYHAKSKREEAADRSSEAGSR 
VREDDEDSVGSEASSFYNFCSRNKDKLTALERWKIKRIQFGFHKKDLGAGDSSGEPGAEE 
AVGEKNPSDVSLTAYQAWKLKHQKKVGSENKEEWELSKGEDSALAKKRQRRLELLERSR 
QTLEESQSMASWEADSSTASGS I PLS AFWS ADPS VSADGDTTSVLSTQSHRSHLSQAASN 
I AGCSTSNPTTPLPNLPVGPGDT I S I AS I QNW I ANWSETLAQKQNEMLLLSRS PS VASM 
KAVPAASCLGDDQVSMLSGHSSSSLGGCLLPQSQARPSSDMQSVLSCNTTLSSPAESCRS 
KVRGTSKPIFSLFADNVDLKELGRKEKEMQMELREKMSEYKMEKLASDNKRSSLFKKKKV 
KEDEDDGVGDGDEDTDSAIGSFRYSSRSNSQKPETDTCSSLAVCDHYASGSRVGKEMDSS 
INKWLSGLRTEEKPPFQSDWSGSSRGKYTRSSLLRETESKSSSYKFSKSQSEEQDTSSYH 
EANGNSVRSTSRFSSSSTREGREMHKFSRSTYNETSSSREESPEPYFFRRTPESSEREES 
PEPQRPNWARSRDWEDVEESSKSDFSEFGAKRKFTQSFMRSEEEGEKERTENREEGRFAS 
GRRSQYRRSTDREEEEEMDDEAIIAAWRRRQEETRTKLQKRRED 



A BLAST analysis of NOV65 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV65 had high 
homology to other proteins as shown in Table 65E. 



Table 65D. NOV65b Polypeptide Sequence (SEQ ID 
NO:230) 



10 
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Table 65E. BLASTX results from PatP database for NOV65 








Smallest 








Sum 






High 


Probability 


Sequences producing High- scoring Segment Pairs: 


Score 


P(N) 


patp 
patp 
patp 
patp 
patp 


AAE04 83 6 Human SGP018 phosphatase polypeptide 
AAB40919 Human ORFX ORF683 polypeptide sequence 
:AAE04837 Human SGP003 phosphatase polypeptide 
:AAY68779 Amino acid sequence of a human 
-AAB42334 Human ORFX ORF2098 polypeptide sequence 


858 
767 
410 
389 
389 


1.5e-85 

6 . 5e-76 
4 . 4e-38 
7 .4e-36 

7 . 4e-36 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 328 of 531 bases (61%) identical to a gb:GENBANK- 
ID:AB027004|acc:AB027004.1 mRNA from Homo sapiens (mRNA for protein phosphatase). 
The full amino acid sequence of the protein of the invention was found to have 80 of 174 
amino acid residues (45%) identical to, and 1 15 of 174 amino acid residues (66%) similar to, 
the 198 amino acid residue ptnr:SPTREMBL-ACC:Q9UII6 protein from Homo sapiens 
(Human) (PROTEIN PHOSPHATASE). NOV65 also has homology to the other proteins 
shown in the BLASTP data in Table 65F. 



Table 65F. NOV65 BLASTP results 




Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi | 14602535 | g 
b|AAH09778 . 1 | 
AAH09778 
(BC009778) 


protein phosphatase 
[Homo sapiens] 


198 


81/178 
(45) 


117/178 
(65) 


3e-37 


gi | 17454087 | r 
ef |XP 061191 . 
1 | (XM 061191) 


similar to protein 
phosphatase (H. sapiens) 
[Homo sapiens] 


370 


82/186 
(44) 


121/186 
(64) 


4e-37 


gi | 7705959 | re 
f |NP 057448.1 
| (NM 016364) 


protein phosphatase 
[Homo sapiens] 


198 


81/178 
(45) 


116/178 
(64) 


le-36 


gi | 7305011 | re 
f |NP_038877 . 1 
| (NM 013849) 


dual specificity 
phosphatase 13 
[Mus musculus] 


198 


80/178 
(44) 


115/178 
(63) 


3e-36 


gi|l283924l|d 
bj |BAB24480.1 
| (AK006247) 


dual specificity 
phosphatase 13 -putative 
[Mus musculus] 


198 


79/178 
(44) 


114/178 
(63) 


3e-35 



This BLASTP data is displayed graphically in the ClustalW in Table 65G. A multiple 
sequence alignment is given, with the NOV65a and b proteins being shown on lines 1 and 2 ir 
a ClustalW analysis comparing the protein of the invention with the related protein sequences 
shown in Table 65F. 

Table 65G. ClustalW Alignment of NOV65 
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NOV6 5a 

NOV65b 

gi | 14602535 | 

gi | 17454087 j 

gi j 7705959 | 

gi | 7305011 | 

gi j 12839241 | 



NOV6 5a 

NOV65b 

gi | 14602535 | 

gi | 17454087 j 

gi | 7705959 | 

gi | 7305011 | 

gi | 12839241 | 



NOV65a 
NOV6 5b 
gi | 14602535 | 
gi | 17454087 j 
gi j 7705959 | 
gi | 7305011 | 
gi j 1283924l| 



NOV6 5a 

NOV65b 

gi | 14602535 | 

gi | 17454087 | 

gi | 7705959 | 

gi | 7305011 j 

gi j 12839241 | 



NOV65a 

NOV65b 

gi | 14602535 | 

gi | 17454087 j 

gi j 7705959 | 

gi j 7305011 | 

gi | 12839241 | 



NOV65a 

NOV65b 

gi | 14602535 | 

gi j 17454087 j 

gi | 7705959 | 

gi | 7305011 j 

gi j 12839241 | 



NOV65a 
NOV6 5b 
gi | 14602535 | 
gi j 17454087 j 
gi j 7705959 | 
gi | 7305011 | 
gi j 12839241 1 



(SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 



1 
1 
1 
1 
1 
1 
1 



13 
13 
12 
61 
12 
12 
12 



NO : 22 8) 
NO:230) 
NO: 624) 
NO: 625) 
NO:626) 
NO: 627) 
NO: 628) 



3 20 30 40 

....|....|....|....|-...|.-..|..-.| 

-MljPKRV^ 

-MfcpKRVM 

ofjgi 

mlevdpvtvpviJpWkrpeleevkefaqvqryglgdvlfpladgslI 

D&£bi 

osm^ 

DBpm 




70 



80 



90 



100 



110 

..I.. 



120 
••I 



13 
13 
16 



llglpBlJpkwpkpvgfsqikekvgtkgvkskescrkerkslgskmtsgevktslknays 12 0 

16 



16 
16 



130 



14 0 



150 



160 



170 



180 




118 e^ltyrg; 

118 eHlTYRG. 

12 5 aBsVPQG 

23 9 rKs.DDHS. 

12 5 I>BlStIPR^ 

125 nH^if'Rs: 




185 
185 . 
185 ^vBn 



310 320 330 340 350 360 

RE K 189 

IREEDYGREGGSAE AEEGEGTGSMLGARVHALTVEEEDDSASHLSGSSLGK 2 3 7 

| F 

RRBSQRQDGEN 

\ F 



198 
GS 321 
198 
-- 198 
-- 198 
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si:' 1 



;1i OiP 



370 



NOV65a 

NOV6 5b 

gi | 14602535 | 

gi | 17454087 j 

gi j 7705959 | 

gi|730501l| 

gi | 12839241 | 



NOV65a 

NOV6 5b 

gi | 14602535 | 

gi | 17454087 j 

gi |7705959 | 

gi j 7305011 j 

gi|l283924l| 



NOV6 5a 

NOV65b 

gi | 14602535 | 

gi | 17454087 j 

gi | 7705959 | 

gi | 7305011 j 

gi j 12839241 | 



NOV65a 

NOV65b 

gi | 14602535 | 

gi | 17454087 | 

gi | 7705959 | 

gi | 7305011 | 

gi j 12839241 | 



NOV65a 

NOV65b 

gi | 14602535| 

gi j 17454087 j 

gi j 7705959 | 

gi [7305011 j 

gi j 12839241 | 



NOV65a 
NOV65b 
gi|l4602535| 
gi j 17454087 j 
gi j 7705959 | 
gi | 7305011 | 
gi j 12839241 | 



190 
238 
198 
322 
198 
198 
198 



200 
298 
198 
337 
198 
198 
198 



217 
358 
198 
370 
198 
198 
198 



217 
418 
198 
370 
198 
198 
198 



217 
478 
198 
370 
198 
198 
198 



217 
538 
198 
370 
198 
198 
198 



3 80 3 90 400 410 420 

RTMAGR GDQLR 200 

ATQASKPLTLIDEEEEEKLYEQWKKGQGLLSDKVPQDGGGWRSASSGQGGEELEDEDVER 297 
. 19 8 



RHEAGSDSF- 



-QGQESQT- 



337 
198 
198 
198 



430 



I 



440 



450 



460 470 480 

- - - - 1 1 I | ... J 

- LRRARALG AC S G P E CT P 217 

IIQEWQSRNERYQAEGYRRWGREEEKEEESDAGSSVGRRRRTLSESSAWESVSSHDIWVL 357 
198 

370 
-- 198 
-- 198 
-- 198 



- RQKPKRVAAVG- 



-DA- 



- GRQGPGMEMAWRNQNVI KAF- 



490 



500 



510 



520 
..|.. 



530 



540 
•-I 



217 

KQQLELNRPDHGRRRRADSMSSESTWDAWNERLLEIEKEASRRYHAKSKREEAADRSSEA 417 



370 
198 
198 
198 



I 



550 



560 



570 
..|.. 



580 
..|.. 



590 



600 
•-I 



217 

GSRVREDDEDSVGSEASSFYNFCSRNKDKLTALERWKIKRIQFGFHKKDLGAGDSSGEPG 477 

198 

37 0 



198 
198 
198 



610 



620 



I 



630 



640 



650 



I 



660 
-I 



- 217 

AEEAVGEKNPSDVSLTAYQAWKLKHQKKVGSENKEEWELSKGEDSALAKKRQRRLELLE 53 7 

198 

370 



198 
198 
198 



670 



680 
..|.. 



690 
..|.. 



700 



710 
..|.. 



720 
•-I 



217 

RSRQTLEESQSMASWEADSSTASGSIPLSAFWSADPSVSADGDTTSVLSTQSHRSHLSQA 597 
19 8 



370 
198 
198 
198 



NOV65a 


217 


NOV6 5b 


598 


gi | 14602535 | 


198 


gi j 17454087 | 


370 


gi j 7705959 | 


198 


gi|730501l| 


198 


gi | 12839241 | 


198 



730 



740 



750 



760 



770 



780 
•♦I 



217 

ASNI AGCSTSNPTTPLPNLPVGPGDT I S I AS I QNWI ANWSETLAQKQNEMLLLSRS PSV 657 

198 

370 

198 

198 

198 
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NOV6 5a 
NOV6 5b 
gi | 14602535 | 
gi I 17454087 j 
gi | 7705959 | 
gi | 7305011 j 
gi j 12839241 1 



NOV6 5a 

NOV6 5b 

gi | 14602535 | 

gi j 17454087 j 

gi | 7705959 | 

gi|730501l| 

gi | 12839241 | 



NOV65a 

NOV65b 

gi | 14602535 | 

gi | 17454087 j 

gi j 7705959 | 

gi | 7305011 | 

gi | 12839241 1 



NOV65a 

NOV65b 

gi | 14602535 | 

gi j 17454087 j 

gi j 7705959 | 

gi | 7305011 | 

gi j 12839241 | 



NOV65a 

NOV6 5b 

gi | 14602535 | 

gi j 17454087 | 

gi | 7705959 | 

gi|730501l| 

gi | 12839241 | 



NOV65a 

NOV65b 

gi | 14602535 | 

gi j 17454087 j 

gi j 7705959 | 

gi | 7305011 | 

gi | 12839241 | 



NOV65a 
NOV6 5b 
gi | 14602535 | 
gi j 17454087 | 
gi j 7705959 | 
gi | 7305011 | 



790 800 810 820 830 840 

217 : 217 

658 ASMKAVPAASCLGDDQVSMLSGHSSSSLGGCLLPQSQARPSSDMQSVLSCNTTLSSPAES 717 

198 198 

370 370 

198 7 198 

198 198 

198 198 

850 860 870 880 890 900 

217 217 

718 CRSKVRGTSKPIFSLFADNVDLKELGRKEKEMQMELREKMSEYKMEKLASDNKRSSLFKK 77 7 

198 198 

3 70 370 

198 198 

198 198 

198 198 

910 920 930 940 950 960 

— I — I — I — I — I — I — I — I — I — I 

217 217 

7 7 8 KKVKEDEDDGVGDGDEDTDSAIGSFRYSSRSNSQKPETDTCSSLAVCDHYASGSRVGKEM 83 7 

198 198 

370 370 

198 198 

198 198 

198 198 

970 980 990 1000 1010 1020 

217 217 

83 8 DSSINKWLSGLRTEEKPPFQSDWSGSSRGKYTRSSLLRETESKSSSYKFSKSQSEEQDTS 897 

198 198 

3 70 370 

198 198 

198 198 

198 198 

1030 1040 1050 1060 1070 1080 

217 Z1/ 

898 SYHEANGNSVRSTSRFSSSSTREGREMHKFSRSTYNETSSSREESPEPYFFRRTPESSER 957 

198 198 

370 370 

198 198 

198 198 

198 198 

1090 1100 1110 1120 1130 1140 

217 " 21 1 

958 EESPEPQRPNWARSRDWEDVEESSKSDFSEFGAKRKFTQSFMRSEEEGEKERTENREEGR 1017 

198 198 

370 370 

198 198 

198 198 

198 198 

1150 1160 1170 1180 

217 217 

1018 FASGRRSQYRRSTDREEEEEMDDEAIIAAWRRRQEETRTKLQKRRED 1064 

198 198 

37 0 370 

198 198 

198 198 



549 



T p ifl 



■JL fl_ 1 



gi | 12839241| 198 



Table 65H lists the domain description from DOMAIN analysis results against 
NOV65. This indicates that the NOV65 sequence has properties similar to those of other 
proteins known to contain this domain. 

5 



Table 65H. Domain Analysis of NOV65 


gnl | Pf am|pf am007 82, DSPc, Dual specificity phosphatase, catalytic domain. 


Ser/Thr 


and 


Tyr protein phosphatases. The enzyme's tertiary fold is 


highly 


similar 


to 


that of tyrosine-specif ic phosphatases, except for a "recognition" 


region. 


SEQ 


ID NO: 864 








CD-Length = 139 residues, 97.8% aligned 




Score = 


1 13 bits (282), Expect = le-26 




NOV65 : 


42 


EVWPNVFIAEKSVAVNKGRLKRLGITHILNAAHGTGVYTGPEFYTGLEIQYLGVEVDDFP 


101 






E+ p +++ + AN L +LGITH++N F YL + VDD 




Sbj Ct : 


4 


EILPHLYLGSYPTASNLAFLSKLGITHVINVTEEVPNSKNSGF LYLHIPVDDNH 


57 


NOV65 : 


102 


EVDISQHFRKASEFLDEALLTYRGKVLVSSEMGISRSAVLVVAYLMIFHNMAILEALMTV 


161 






E DIS + +A EF+++A GKVLV + GISRSA L++AYLM N+++ EA V 




Sbj Ct : 


58 


ETDI S P YLDEAVEFI EDAR - QKGGKVLVHCQAGI S RS ATL I I AYLMKTRNLSLNEAYS FV 


116 


NOV6 5 : 


162 


RKKR-AIYPNEGFLKQLRELNEK 18 3 








+ + + R I PN GF +QL E K 




Sbjct : 


117 


KERRPI ISPNFGFKRQLIEYERK 13 9 





The NOV65 gene of invention is a member of the family of dual specificity protein 
phosphatases (DSPs; Martell et al., Mol Cells 1998 Feb 28;8(1):2-1 1). DSPs recognize either 
serine/threonine (Ser/Thr) or tyrosine (Tyr) moieties as targets for dephosphorylation. These 

10 enzymes regulate mitogenic signal transduction and can thereby regulate the cell cycle. Some 
members of this family are effective tumor suppressors, for example, PTEN. PTEN is required 
during embryonic development and later in life, and mutations in this gene give rise to 
different kinds of inherited and sporadic cancers (Eng, Recent Prog Horm Res 1999;54:441- 
52; discussion 453). In Drosophila, members of the DSP family, such as puckered, have 

1 5 important roles in development (Martin-Blanco et al., Genes Dev 1998 Feb 15;12(4):557-70). 
The crystal structure of one member of the DSP family has been elucidated (Yuvaniyama at 
al., Science 1996 May 31 ;272(5266): 1328-31) and this family has been successfully targeted 
for small molecule drug development (Ducruet et al., Bioorg Med Chem 2000 Jun;8(6):1451- 
66). In addition, overexpression of a DSP has been demonstrated to be a potential therapy for 

20 cardiac hypertrophy (Bueno et al., Circ Res 2001 Jan 19;88(l):88-96). NOV65 has closest 

homology to a phosphatase that is differentially regulated in the testis during spermatogenesis 
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and is therefore thought to be involved in sperm development and maturation (Nakamura et 
al., Biochem. J. 344 Pt 3, 819-825 (1999)). 

The disclosed NOV65 is predicted to be expressed in at least the following tissues: 
heart, skeletal muscle, colon, fetal lung, head, and ovary. This information was derived by 
5 determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, public EST sources, literature sources, and/or RACE 
sources. Further expression data for NOV65 is provided in Example 2. 

The NOV65 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 

10 cardiomyopathy, atherosclerosis, hypertension, congenital heart defects, aortic stenosis, atrial 
septal defect (ASD), atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary 
stenosis, subaortic stenosis, ventricular septal defect (VSD), valve diseases, tuberous sclerosis, 
scleroderma, obesity, transplantation, fertility, polycystic ovarian syndrome, cancer, tissue 
degeneration, bacterial/viral/parasitic infection, systemic lupus erythematosus, autoimmune 

15 disease, asthma, emphysema, scleroderma, allergy, ARDS, muscular dystrophy, Lesch-Nyhan 
syndrome, myasthenia gravis, Hirschsprung's disease, Crohn's Disease, appendicitis as well as 
other diseases, disorders and conditions. The NOV65 nucleic acid encoding the phosphatase- 
like protein of the invention, or fragments thereof, may further be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein are to be 

20 assessed. The novel nucleic acid of the invention encoding a protein phosphatase-like protein 
includes the nucleic acid whose sequence is provided in Table 65 A or 65C, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 65A or 65C while still encoding a 
protein that maintains its protein phosphatase-like activities and physiological functions, or a 

25 fragment of such a nucleic acid. 

The invention further includes nucleic acids whose sequences are complementary to 
the sequence indicated in Table 65A or 65C, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. The invention additionally includes 
nucleic acids or nucleic acid fragments, or complements thereto, whose structures include 

30 chemical modifications. Such modifications include, by way of non-limiting example, 
modified bases, and nucleic acids whose sugar phosphate backbones are modified or 
derivatized. These modifications are carried out at least in part to enhance the chemical 
stability of the modified nucleic acid, such that they may be used, for example, as antisense 
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binding nucleic acids in therapeutic applications in a subject. In the mutant or variant nucleic 
acids, and their complements, up to about 39% of the bases may be so changed. 

The novel protein of the invention includes the protein phosphatase-like protein whose 
sequence is provided in Table 65B or 65D. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
65B or 65D while still encoding a protein that maintains its protein phosphatase-like activities 
and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 
up to about 54% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV66 

The disclosed NOV66 (alternatively referred to herein as CG56633-01) includes the 
1036 nucleotide sequence (SEQ ID NO:231) shown in Table 66A. A NOV66 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 28-30 and ends with a stop codon 
at nucleotides 913-915. The disclosed NOV66 maps to human chromosome 3. 



Table 66A. NOV66 Nucleotide Sequence (SEQ ID 
NO:231) 

CCCGTCCTCGCTGGGTTGTCCCGGTCCATGTATCTGGTCATGGTGCTGAGGAACCTGCTC 
ATCATCCTGGCTGTCAGCTCTGACTCCCACCTCCACACCCCCATGTACTTCTTCCTCTCC 
AACCTGTGCTGGGCTGACATCGGTTTCACCTCGGCCATGGTTCCCAAGATGATTGTGGAC 
ATGCAGTCTCATAGCAGAGTCATCTCTTATGCGGGCTGCCTGACACGGATGTCTTTCTTG 
GTCCTTTTTGCATGTATAGAAGACATGCTCCTGACTGCGATGGCCTATGACTGCTTTGTA 
GCCATCTGTCGCCCTCTGCACTACCCAGTCATCGTGAATCCTCACCTCTCTGTCTTCTTA 
GTTTTGGTGTCCTTTTTCCTTAGCCTGTTGGATTCCCAGCTGCACAGTTTGATTGTGTTA 
CAATTCACCTTCTTCAAGAATGTGGAAATCTCTAATTTTGTCTGTGAGCCATCTCAGCTT 
CTCAACCTTGCCTGTTCTGACAGCGTCATCAATAGCATATTCTTATATTTCGATAGTACT 
ATGTTTGGTTTTCTTCCCATTTCAAGGATCCTTTTGTCTTACTATAAAATTGTCCCCTCC 
ATTCTAAGGATTTCATCGTCAGATGGGAAGTATAAAGCCTTCTCCACCTGTGGCTCTCAC 
CTAGCAGTTGTTTGCTTATTTTATGGAACAGGCATTGGCGTGTACCTGACTTCAGCTGTG 
TCACCACCCCCCAGGAGTGGTGTGGTGGCGTCAGTGATGTACGCTGTGGTCACCCCCATG 
CTGAACCCTTTCATCTATAGCCTGAGAAACAGAGACATTCAAAGCGCCCTCTGGAGGCTG 
CGCAGCAGAACAGTCGAATCTCATGATCTGTTCCATCCTTTTTCTTGTGTGGGTAAGAAA 
GGGCAACCACATTAAATCCCTGCATCTGCAAATCCTGCTCCTTAGTCACATTATTTTTGT 
GGCTTGATGGCTTTTATTCCTTTCCGCATTTCCTATGTGAATATTGTTTTCTTCGTTATG 
CCTTTAACTGGAATGG 

A NOV66 polypeptide (SEQ ED NO:232) encoded by SEQ ID NO:231 is 295 amino 
acids in length and is presented using the one-letter amino acid code in Table 66B. The Psort 
profile for NOV66 predicts that this sequence is a Type Ilia membrane protein, has a signal 
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peptide and is likely to be localized at the plasma membrane with a certainty of 0.6400. In 
alternative embodiments, a NOV66 polypeptide is located to Golgi bodies with a certainty of 
0.4600, or to the endoplasmic reticulum (membrane) with a certainty of 0.3700. The Signal P 
predicts a likely cleavage site for a NOV66 peptide is between positions 17 and 1 8, i.e., at the 
dash in the sequence AVS-SD. 



Table 66B. NOV66 Polypeptide Sequence (SEQ ID 
NO:232) 

MYLVMVLRNLLIILAVSSDSHLHTPMYFFLSNLCWADIGFTSAMVPKMIVDMQSHSRVIS 

YAGCLTRMS FLVL FAC I E DML LT AMAYD C FV A ICR P LH Y P V I VNP HL S VFL VLVS FFLS L 
LDSQLHSLIVLQFTFFKNVEISNFVCEPSQLLNLACSDSVINSIFLYFDSTMFGFLPISR 
ILLSYYKIVPSILRISSSDGKYKAFSTCGSHLAWCLFYGTGIGVYLTSAVSPPPRSGW 
ASVMYAWTPMLNPFIYSLRNRDIQSALWRLRSRTVESHDLFHPFSCVGKKGQPH 

A BLAST analysis of NOV66 was run against the proprietary PatP GENE SEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV66 had high 
homology to other proteins as shown in Table 66C. 



Table 66C. BLASTX results from PatP database for NOV66 



Sequences producing High- scoring Segment Pairs: 

patp:AAG71875 Human olfactory receptor polypeptide 
patp:AAE04583 Human G-protein coupled receptor-3 9 
patp :AAU24 551 Human olfactory receptor AOLFR3 8 
patp:AAG718l6 Human olfactory receptor polypeptide 
patp:AAU24 54 9 Human olfactory receptor AOLFR3 6 

In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 973 of 1036 bases (93%) identical to a gbiGENBANK- 
ID:AF042089|acc:AF042089.1 mRNA from Homo sapiens (chromosome 3, olfactory receptor 
pseudogene cluster 1, complete sequence, and myosin light chain kinase (MLCK) pseudogene, 
partial sequence). The full amino acid sequence of the protein of the invention was found to 
have 192 of 265 amino acid residues (72%) identical to, and 221 of 265 amino acid residues 
(83%) similar to, the 264 amino acid residue ptnr:SPTREMBL-ACC:043789 protein from 
Homo sapiens (Human) (OLFACTORY RECEPTOR). NOV66 also has homology to the 
other proteins shown in the BLASTP data in Table 66D. 



Smallest 
Sum 

High Probability 
Score P (N) 

1466 5.6e-150 

1377 1.5e-140 

1377 1.5e-140 

1363 4.6e-139 

1354 4 . le-138 
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Table 66D. NOV66 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%) 


Expect 


gi | 17466082 | r 
ef j XP 07 0192. 
1 | (XM~070192) 


similar to olfactory- 
receptor, family 7 , 
subfamily C, member 3 
(H. sapiens) 
[ Homo s api en s ] 


349 


221/250 
(88) 


229/250 
(91) 


e-lll 


gi { 174 82057 J r 
ef |XP_064778 . 
l| (XM 064 77 8) 


similar to G protein- 
coupled receptor homolog 
clone G3 (H. sapiens) 
[Homo sapiens] 


251 


220/250 
(88) 


229/250 
(91) 


e-105 


gi | 17448458 | r 
ef |XP_070402 . 
1 | <XM_070402) 


similar to OLFACTORY 
RECEPTOR 7C2 (OLFACTORY 
RECEPTOR 19-18) (OR19- 
18) (R\ sapiens) [Homo 
sapiens] 


528 


221/242 
(87) 


221/242 
(91) 


e-101 


gi | 7443955 | pi 
r| |PC4369 


olfactory receptor, HT2 
- human (fragment) 


264 


192/265 
(72) 


221/265 
(82) 


6e-92 


gi | 4092819 | gb 
| AAD03353 . 1 | 
(AC006271) 


BC319430_5 
[Homo sapiens] 


263 


191/264 
(72) 


220/264 
(82) 


44-91 



This BLASTP data is displayed graphically in the ClustalW in Table 66E. A multiple 
sequence alignment is given, with the NOV66 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
5 Table 66D. 



Table 66E. ClustalW Alignment of NOV66 



NOV 6 6 

gi | 17466082 | 
gi I 17482057 | 
gi | 17448458 j 
gi | 7443955 | 
gi|4092819| 



NOV66 

gi | 17466082 | 
gi j 17482057 j 
gi|l7448458| 
gi|7443955| 
gi | 4092819 j 



NOV6 6 

gi|l7466082| 
gi | 17482057 | 
gi | 17448458 j 
gi | 7443955 | 
gi|4092819| 

13 0 140 150 160 170 180 




(SEQ ID NO:232) 
(SEQ ID NO : 629) 
(SEQ ID NO: 63 0) 
(SEQ ID NO:631) 
(SEQ ID NO: 632) 
(SEQ ID NO: 633) 

10 20 30 40 50 60 

X - 1 

± MVS D I RKE DGMN - 12 

x 1 

1 MYLVTVLRNVLI ILAVSSDSHLHTPMYFFLSSLCWADIGFTSATVPKMTVDMQSHSRVIS 60 

! 1 

1 ~~ 



1 



70 80 90 100 110 120 

! 1 

12 VLPLKYIPNVG VN FS 27 



1 



61 YLHSWIVLQFTLFKNVENSSFVCDPSQLLNLACSDSVINSIFIYFDRRTVDLFSWMAQQ 12 0 
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NOV6 6 

gi | 17466082 | 
gi j 17482057) 
gi j 17448458 j 
gi|7443955| 
gij 4092819] 



NOV66 

gi | 17466082 | 
gi j 17482057 j 
gij 17448458 | 
gi |7443955 | 
gi|4092819| 



NOV66 

gi|l7466082| 
gij 17482057 j 
gij 17448458 | 
gij 7443955 | 
gi j4092819 j 



NOV66 

gi|l7466082| 
gi j 17482057 j 
gi j 17448458 j 
gi (7443955 | 
gi|4092819| 



1 

27 
1 



- FAGVYL - 



-ASETLP- 



1 

39 
1 



121 RGRAVPSSEDPEQQPVLAGLFLSMCLVTVLGNLLI ILAVSPDSHLHTPMYLFLSNLSLPD 18 0 

1 1 

! ! 



190 



200 
..| 



210 



220 



230 



240 



39 GSFAHPEATSRGAVATGTTHLASAVEPN GDSWCK QRS P - - 77 

! ! 

181 IGFTSSMSLLDAQVHNLIALQMTCFKDVEIPNFFWEPSQLPHLACCDTFTNNIIMYSPAA 24 0 

1 1 

1 1 



250 



260 
..|.. 



270 



280 



290 



300 



-RVS - VQDPQLQPALALLSLSLS 



! 

77 

x 

241 IFGFLPISGTLFSYYKIVSSILRVSSSEDPELQSVLALLSLSLS 

! 

x 






16 


Fsm% 


114 




16 




300 




1 




1 



I 



310 
■ ■ I ■■ 



320 
..I 



L 



330 



340 



350 



360 



J 



17 

115 

17 

301 

1 

1 



LHTPMYFFLSNLCWADIGFTSAfflVPKMIVDMQSHSRVISg 



9GCLTjgMSFLVLFAC 

ssd^BlhtpmyfflsnlcwHdigftsaBvpkmivdHqshsrvis^SgcltqmsflSlBac 

SSDfflLHTPMYFFLSNLCwBDIGFTSAivPKMIVDQQSHSRVISjgGCLTQMSFLgLBAC 
S S D^BLHT PM YF FL S NL CWAD I G[Jt S AmVP Kjjj I [jDMQ S HS R V I SgSG CLTQM S FL VL FAC 



PMYFFLSNL 
jMYFFLSNL 



ADIGFTS 
AJ^IGFTSi 



gTSAI|VPKQI{|DMQSHSRVIS| 
FTSj^uVPKMIVDMQpHSRVISp 
FT sfflvPKM I VDMQliHS RVI SB 



5GCLTQMS FH VL FAC 
»GCLTQMSF§VLFAC 



76 

174 

76 

360 

52 

51 



370 



390 



400 



420 



NOV66 


77 


gi|l7466082| 


175 


gij 17482057 j 


77 


gij 17448458 j 


361 


gij 7443955 | 


53 


gij4092819 j 


52 




430 



470 



480 



NOV66 


137 


KNVEIS 


NFfi 


CDPSQLL 


gi 1 17466082 | 


235 


KNVEIS 


NTO 


CDPSQLL 


gij 17482057 j 


137 


KNVEIS 


nS j 


CDPSQLL 


gi j 17448458 j 


421 


KNVEIS 


NF j 


CDPSQLL 


gi j7443955 | 


113 


K 1 V 1 IS 


NF j 


CDPSQLL 


gij4092819 j 


112 


kIvIis 


NF j 


CDPSQLL 



490 
.. I .. 



NOV6 6 


197 


gi | 17466082 | 


295 


gijl7482057j 


197 


gijl7448458j 


481 


gi j7443955 | 


173 


gi|4092819 j 


172 




NOV6 6 

gi | 17466082 | 
gij 17482057 j 
gi j 17448458 j 
gi[7443955 [ 



S SDGKYKAFSTCGSHLAWC 

ssdgkykafstcgBhlawc 
s sdgkykafstcgshlawc 
s sdgkykafstcgshlawc 
|sdgkykafstcgshlawc 
ij'sdgkykafstcgshlavvc 



550 560 570 

....|....|....|....|....|....|....|.... 

257 YSLRNRDIQSALWRLRSRTVESHDLFHPFSCVGKKGQPH 2 95 

349 349 

251 251 

528 528 

23 3 YSLRNKDIQS ALCRIjHGRI I KSHHLHPFCYMG 2 64 



iLNPFI 231 
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i 



i I 4092819 | 232 YSLRNKDIQSALCRLHGRIIKSHHLHPFCYMG 
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Table 66F lists the domain description from DOMAIN analysis results against NOV66. 
This indicates that the NOV66 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 66F. Domain Analysis of NOV66 




gni| 


Pfam|pf amOOOOl, 7tm_l, 7 transmembrane receptor 




(rhodopsin family). SEQ ID NO: 810 








CD- Length = 254 residues, 9 9.6% aligned 








Score = 88.6 bits (218), Expect = 5e-l9 




NOV6 6 : 


9 


NLLIILAVSSDSHLHTPMYFFLSNLCWADIGFTSAMVPKMIVDMQSHSRVISYAGCLTRM 


68 






NLL+IL + Li TP FL NL AD+ F + P + + V AC 




Sbj Ct : 


2 


NLLVILVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLVG 


61 


NOV6 6 : 


69 


SFLVLFACIEDMLLTAMAYDCFVAICRPLHYPVIVNPHI#SVFI/VI,VSFFI,SLZJDSQLHSL 


128 






+ V+ +LLTA+ + D + +AI PL Y I P + L+L + + L+LL S L 




SbjCt : 


62 


ALFWNGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILLVWVLALLLSLPPLL 


121 


NOV66 : 


129 


IVLQFTFFKNVEISNFVCEPSQLLNLACSDSVINSIFLYFDSTMFGFLPISRILLSYYKI 


188 






T+ + p ++ ++++ LP+ IL + Y +1 




SbjCt: 


122 


FSWLRTVEEGNTTVCLIDFPEESV KRSYVLLSTLVGFVLPLLVILVCYTRI 


172 


NOV6 6 : 


189 


VP- SILRISSSDGKYKAFSTCGSHLAWCLFYGTGIGVYL TSAVSPPP 


235 






+ L+SS + A +V+I + L + + 




Sbjct : 


173 


LRTLRKRARSQRSLKRRSSSERKAAKMLLVWWFVLCWLPYHIVLLLDSLCLLSIWRVL 


232 


NOV66 : 


236 


RSG WAS VMYA WT PMLNP F I Y 257 








+ ++ ++ A V LNP IY 




SbjCt : 


233 


FTALLITLWLAYVNSCLNPIIY 2 54 





The olfactory system is able to distinguish several thousand odorant molecules. 
Olfactory receptors are believed to be encoded by an extremely large subfamily of G protein- 
coupled receptors. These receptors share a 7-transmembrane domain structure with many 
neurotransmitter and hormone receptors. They are responsible for the recognition and G 
protein-mediated transduction of odorant signals. The genes encoding these receptors are 
devoid of introns within their coding regions. Schurmans et al. (1993) cloned a member of this 
family of genes, OLFR1, from a genomic library by cross-hybridization with a gene fragment 
obtained by PCR. By isotopic in situ hybridization, they mapped the gene to 17pl3-pl2 with a 
peak at band 17pl3. A minor peak was detected on chromosome 3, with a maximum in the 
region 3ql3-q21 . After Mspl digestion, a RFLP was demonstrated. Using this in a study of 3 
CEPH pedigrees, they demonstrated linkage with D17S126 at 17pter-pl2; maximum lod = 3.6 
at theta = 0.0. Used as a probe on Southern blots under moderately stringent conditions, the 
cDNA hybridized to at least 3 closely related genes. Ben-Arie et al. (1994) cloned 16 human 
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OLFR genes, all from 17pl3.3. The intronless coding regions are mapped to a 350-kb 
contiguous cluster, with an average intergenic separation of 15 kb. The OLFR genes in the 
cluster belong to 4 different gene subfamilies, displaying as much sequence variability as any 
randomly selected group of OLFRs. This suggested that the cluster may be one of several 
5 copies of an ancestral OLFR gene repertoire whose existence may have predated the 

divergence of mammals. Localization to 17pl3.3 was performed by fluorescence in situ 
hybridization as well as by somatic cell hybrid mapping. 

The ability to distinguish different odors depends on a large number of different 
odorant receptors (ORs). Sullivan et al. (1996) noted that ORs are expressed by nasal olfactory 

10 sensory neurons; each neuron expresses only 1 allele of a single OR gene. In the nose, 

different sets of ORs are expressed in distinct spatial zones. Neurons that express the same OR 
gene are located in the same zone; however, in that zone they are randomly interspersed with 
neurons expressing other ORs. This distribution suggested to the authors that, when the cell 
chooses an OR gene for expression, it may be restricted to a specific zonal gene set, but it may 

15 select from that set by a stochastic mechanism. Proposed models of OR gene choice fall into 2 
classes: locus-dependent and locus-independent. Locus-dependent models posit that OR genes 
are clustered in the genome, perhaps with members of different zonal gene sets clustered at 
distinct loci. In contrast, locus-independent models do not require that OR genes be clustered. 
To assess the feasibility of these models, Sullivan et al. (1996) determined the expression 

20 zones, sequences, and chromosomal locations of a number of mouse OR genes. They mapped 
OR genes to 1 1 different regions on 7 chromosomes. These loci lie within paralogous 
chromosomal regions that appear to have arisen by duplications of large chromosomal 
domains followed by extensive gene duplication and divergence. These studies showed that 
OR genes expressed in the same zone map to numerous loci. Moreover, a single locus can 

25 contain genes expressed in different zones. These findings raised the possibility that OR gene 
choice is locus-independent or involved consecutive stochastic choices. 

Nekrasova et al. (1996) overexpressed human (OR17-4) and rat (olp4) olfactory 
receptor genes in insect cells, purified them, and characterized them biochemically. They 
identified monomeric, dimeric, and trimeric forms of the proteins corresponding to molecular 

30 weights of 32, 69, and 94 kD by electrophoresis. The oligomers were resistant to reduction and 
alkylation and were therefore thought to be held together by SDS-resistant hydrophobic 
interactions, consistent with observations of other G protein-coupled receptors. 

Glusman et al. (1996) described the results of complete sequencing of an OR-rich 
cosmid spanning the center of the OR gene cluster on 17pl3.3. The resulting 40-kb sequence 
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revealed 3 known OR coding regions, 2 OR genes which may have originated from a tandem 
duplication event, and a new OR pseudogene fused to another OR gene. 

Issel-Tarver and Rine (1996) characterized 4 members of the canine olfactory receptor 
gene family. The 4 subfamilies comprised genes expressed exclusively in olfactory epithelium. 
Analysis of large DNA fragments using Southern blots of pulsed field gels indicated that 
subfamily members were clustered together, and that 2 of the subfamilies were closely linked 
in the dog genome. Analysis of the 4 olfactory receptor gene subfamilies in 26 breeds of dog 
provided evidence that the number of genes per subfamily was stable in spite of differential 
selection on the basis of olfactory acuity in scent hounds, sight hounds, and toy breeds. 

Issel-Tarver and Rine (1997) performed a comparative study of 4 subfamilies of 
olfactory receptor genes first identified in the dog to assess changes in the gene family during 
mammalian evolution, and to begin linking the dog genetic map to that of humans. These 4 
families were designated by them OLF1, OLF2, OLF3, and OLF4 in the canine genome. The 
subfamilies represented by these 4 genes range in size from 2 to 20 genes. They are all 
expressed in canine olfactory epithelium but were not detectably expressed in canine lung, 
liver, ovary, spleen, testis, or tongue. The OLF1 and OLF2 subfamilies are tightly linked in the 
dog genome and also in the human genome. The smallest family is represented by the canine 
OLF1 gene. Using dog gene probes individually to hybridize to Southern blots of genomic 
DNA from 24 somatic cell hybrid lines. They showed that the human homologous OLF1 
subfamily maps to human chromosome 1 1 . The human gene with the strongest similarity to 
the canine OLF2 gene also mapped to chromosome 1 1. Both members of the human subfamily 
that hybridized to canine OLF3 were located on chromosome 7. It was difficult to determine to 
which chromosome or chromosomes the human genes that hybridized to the canine OLF4 
probe mapped. This subfamily is large in mouse and hamster as well as human, so the rodent 
background largely obscured the human cross-hybridizing bands. It was possible, however, to 
discern some human-specific bands in blots corresponding to human chromosome 19. They 
refined the mapping of the human OLF1 homolog by hybridization to YACs that map to 
1 lql 1 . In dogs, the OLF1 and OLF2 subfamilies are within 45 kb of one another (Issel-Tarver 
and Rine (1996)). Issel-Tarver and Rine (1997) demonstrated that in the human OLF1 and 
OLF2 homologs are likewise closely linked. By studying YACs, Issel-Tarver and Rine (1997) 
found that the human OLF3 homolog maps to 7q35. A chromosome 19-specific cosmid library 
was screened by hybridization with the canine OLF4 gene probe, and clones that hybridized 
strongly to the probe even at high stringency were localized to 19pl3.1 and 19pl3.2. These 
clones accounted, however, for a small fraction of the homologous human bands. 
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Rouquier et al. (1998) demonstrated that members of the olfactory receptor gene 
family are distributed on all but a few human chromosomes. Through fluorescence in situ 
hybridization analysis, they showed that OR sequences reside at more than 25 locations in the 
human genome. Their distribution was biased for terminal bands of chromosome arms. Flow- 
sorted chromosomes were used to isolate 87 OR sequences derived from 16 chromosomes. 
Their sequence relationships indicated the inter- and intrachromosomal duplications 
responsible for OR family expansion. Rouquier et al. (1998) determined that the human 
genome has accumulated a striking number of dysfunctional copies: 72% of these sequences 
were found to be pseudogenes. ORF-containing sequences predominate on chromosomes 7, 
16, and 17. 

Trask et al. (1998) characterized a subtelomeric DNA duplication that provided insight 
into the variability, complexity, and evolutionary history of that unusual region of the human 
genome, the telomere. Using a DNA segment cloned from chromosome 19, they demonstrated 
that the blocks of DNA sequence shared by different chromosomes can be very large and 
highly similar. Three chromosomes appeared to have contained the sequence before humans 
migrated around the world. In contrast to its multicopy distribution in humans, this 
subtelomeric block maps predominantly to a single locus in chimpanzee and gorilla, that site 
being nonorthologous to any of the locations in the human genome. Three new members of the 
olfactory receptor (OR) gene family were found to be duplicated within this large segment of 
DNA, which was found to be present at 3q, 15q, and 19p in each of 45 unrelated humans 
sampled from various populations. From its sequence, one of the OR genes in this duplicated 
block appeared to be potentially functional. The findings raised the possibility that functional 
diversity in the OR family is generated in part through duplications and interchromosomal 
rearrangements of the DNA near human telomeres. 

Mombaerts (1999) reviewed the molecular biology of the odorant receptor (OR) genes 
in vertebrates. Buck and Axel (1991) discovered this large family of genes encoding putative 
odorant receptor genes. Zhao et al. (1 998) provided functional proof that one OR gene encodes 
a receptor for odorants. The isolation of OR genes from the rat by Buck and Axel (1991) was 
based on 3 assumptions. First, ORs are likely G protein-coupled receptors, which 
characteristically are 7-transmembrane proteins. Second, ORs are likely members of a 
multigene family of considerable size, because an immense number of chemicals with vastly 
different structures can be detected and discriminated by the vertebrate olfactory system. 
Third, ORs are likely expressed selectively in olfactory sensory neurons. Ben-Arie et al. 
(1994) focused attention on a cluster of human OR genes on 17p, to which the first human OR 
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gene, OR1D2, had been mapped by Schurmans et al. (1993). According to Mombaerts (1999), 
the sequences of more than 150 human OR clones had been reported. The human OR genes 
differ markedly from their counterparts in other species by their high frequency of 
pseudogenes, except the testicular OR genes. Research showed that individual olfactory 
5 sensory neurons express a small subset of the OR repertoire. In rat and mouse, axons of 
neurons expressing the same OR converge onto defined glomeruli in the olfactory bulb. 

Gilad et al. (2000) reported the population sequence diversity of genomic segments 
within a 450-kb cluster of olfactory receptor (OR) genes on chromosome 17. They found a 
dichotomy in the pattern of nucleotide diversity between OR pseudogenes and introns on the 
10 one hand and the closely interspersed intact genes on the other. They suggested that weak 
positive selection is responsible for the observed patterns of genetic variation. This was 
inferred from a lower ratio of polymorphism to divergence in genes compared with 
pseudogenes or introns, high nonsynonymous substitution rates in OR genes, and a small but 
significant overall reduction in variability in the entire OR gene cluster compared with other 
1 5 genomic regions. The dichotomy among functionally distinct segments within a short genomic 
distance requires high recombination rates within this OR cluster. 

NOV66 is predicted to be expressed in at least the following tissues: lung, liver, ovary, 
spleen, testis. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
20 public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV66 is provided in Example 2. 

The nucleic acids and proteins of NOV66 are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
cardiomyopathy, atherosclerosis, hypertension, congenital heart defects, aortic stenosis, atrial 
25 septal defect (ASD), atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary 

stenosis, subaortic stenosis, ventricular septal defect (VSD), valve diseases, tuberous sclerosis, 
scleroderma, obesity, transplantation, fertility, polycystic ovarian syndrome, cancer, tissue 
degeneration, bacterial/viral/parasitic infection, systemic lupus erythematosus, autoimmune 
disease, asthma, emphysema, scleroderma, allergy, ARDS, muscular dystrophy, Lesch-Nyhan 
30 syndrome, myasthenia gravis, Hirschsprung's disease, Crohn's Disease, appendicitis as well as 
other diseases, disorders and conditions. The NOV66 nucleic acid encoding the GPCR-like 
protein of the invention, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. A 
NOV66 nucleic acid of the invention encoding a Olfactory receptor-like protein includes the 
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nucleic acid whose sequence is provided in Table 66A, or a fragment thereof. The invention 
also includes a mutant or variant nucleic acid any of whose bases may be changed from the 
corresponding base shown in Table 66A while still encoding a protein that maintains its 
Olfactory receptor -like activities and physiological functions, or a fragment of such a nucleic 
5 acid. 

The invention further includes nucleic acids whose sequences are complementary to 
the sequence disclosed in Table 66A, including nucleic acid fragments that are complementary 
to any of the nucleic acids just described. The invention additionally includes nucleic acids or 
nucleic acid fragments, or complements thereto, whose structures include chemical 

10 modifications. Such modifications include, by way of non-limiting example, modified bases, 
and nucleic acids whose sugar phosphate backbones are modified or derivatized. These 
modifications are carried out at least in part to enhance the chemical stability of the modified 
nucleic acid, such that they may be used, for example, as antisense binding nucleic acids in - 
therapeutic applications in a subject. In the mutant or variant nucleic acids, and their 

1 5 complements, up to about 7% of the bases may be so changed. 

The novel protein of the invention includes the olfactory receptor-like protein whose 
sequence is provided in Table 66B. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 66B while 
still encoding a protein that maintains its Olfactory receptor -like activities and physiological 

20 functions, or a functional fragment thereof In the mutant or variant protein, up to about 28% 
of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 

25 prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV67 

The disclosed NOV67 (alternatively referred to herein as CG56571-01) includes the 
30 1072 nucleotide sequence (SEQ ID NO:233) shown in Table 67A. A NOV67 ORF begins 

with a Kozak consensus ATG initiation codon at nucleotides 41-43 and ends with a stop codon 
at nucleotides 989-991 . The disclosed NOV67 maps to human chromosome 7. 
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Table 67 A. NOV67 Nucleotide Sequence (SEQ ID 
NO:233) 



CATTCCCTCCTAACCACCTAGATTGAAGAAGTGAGGTTCAATGTCCCAACTGGGAAGGGA 
CAACATAACCTGGGTGAGTGAGTTCATCCTAATGGGTCTCTCCAGTGACAGGCAGACCCA 
GGCTGGACTCTTTATCTTATTTGGGGCTGCCTACCTGCTGACCCTGCTGGGCAATGGGCT 
CATCCTGCTCCTGATCTGGCTGGACGTGAGACTCCACCTGCCCATGTATTTCTTCCTCTG 
CAACCTCTCACTTGTGGACATCTGCTACACCTCCAGCAGGGTCCCTCAGATGCTGGTGCA 
CTGCACCAGCAAAAGAAAGACCATCTCCTTTGCCCGATGTGGGACCCAGCTCTTTTTCTC 
CCTGGCCCTCGGAGGGACCGAGTTTTTGTTGCTGGCCGCAATGGCCTATGACCGCTACGT 
GGCTGTTTGCGACCCCCTGTGTTACATAGCAGTGATGAGCCCAAGGCTCTGCATGGCACT 
GGCAGCTGTCTCTTGGCTAGTGGGCCTGGCTAATTCTGCTATGGAGACGGCACTGACCAT 
GCACCTGCCCACCTGTGGGCACAACGTGCTGAACCATGTGGCCTGTGAGACACTGGCACT 
GGTCAGGTCGGCCTGCGTGGACATCACCTTCAATCAGGTGGTCATAGTGGCCTCCAGTGT 
GGTGGTGCTGCTGGTGCCCTGCTGCCTGGTCTCGCTGTCCTACACCCTCATTGTAGTTGC 
CGTCCTGCAGATCCACTCCACCCAGGGGCACCGCAAGGCCTTTGGGACCTGTGCCTCCCA 
CCTCACTGTGGTCTCCATATCCTATGGGATGGCCCTCTTTACCTACATGCAGCCTCGCTC 
CATGGCCTCAGCTGAGCAGGAAAAGGTGATGGTACTCTCTTATGCTGTGGTGACCCCCAT 
GTTGAATCCTTTCATCTACAGTCTGCGGAACAAGGATGTGAAGGCAGCTCTGAGTCGAGC 
TCTGATGAGGAGCTCTGAATTAAAACATTAGAGAGTGGTTTGAGTAACAAGAAGGCCTCA 
CT CTG AAAAC AGT GGG C ATTGG ACTGTGCT CT C C AGT ATAACGTGTGT ACG C 



A NOV67 polypeptide (SEQ ID NO:234) encoded by SEQ ID NO:233 is 312 amino 
acids in length and is presented using the one-letter amino acid code in Table 67B. The Psort 
profile for NOV67 predicts that this sequence is a Type Illb membrane protein, has a signal 
5 peptide, and is likely to be localized at the plasma membrane with a certainty of 0.6000. In 
alternative embodiments, a NOV67 polypeptide is located to Golgi bodies with a certainty of 
0.4000, to the endoplasmic reticulum (membrane) with a certainty of 0.3000, or to the 
mitrochondrial membrane with a certainty of 0.3522. The Signal P predicts a likely cleavage 
site for a NOV67 peptide is between positions 58 and 59, i.e., at the dash in the sequence 
10 VRL-HL. 



MSQL.GRDNITWVSEFILMGLSSDRQTQAGLFILFGAAYLLTLLGNGLILLLIWLDVRLHL 
PMYFFLCNLSLVDICYTSSRVPQMLVHCTSKRKTISFARCGTQLFFSLALGGTEFLLLAA 
MAYDRYVAVCDPLCYIAVMSPRLC^IJ^VSWL^ 

ACETLALVRSACVDITFNQWIVASSVWLLVPCCLVSLSYTLIWAVLQIHSTQGHRKA 
FGTCASHLTWSISYGMALFTYMQPRSMASAEQEKVMVLSYAWTPMLNPFIYSLRNKDV 

KAALSRALMRSSELKH 



A. BLAST analysis of NOV67 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV67 had high 
15 homology to other proteins as shown in Table 67C. 



Table 67B. NOV67 Polypeptide Sequence (SEQ ID 
NO:234) 



Table 67C. BLASTX results from PatP database for NOV67 



Smallest 
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Sequences producing High-scoring Segment Pairs: 

patp:AAG71514 Human olfactory receptor polypeptide 

patp:AAG72330 Human OR- like polypeptide query sequence 

patp:AAG72925 Human olfactory receptor data exploratorium 

patp:AAG72977 Olfactory receptor- 1 ike polypeptide 

patp:AAG714Q8 Human olfactory receptor polypeptide 

In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 536 of 649 bases (82%) identical to a gb:GENBANK- 
ID:AF073974|acc:AF073974.1 mRNA from Mus musculus domesticus {Mus musculus 
domesticus clone OR28M olfactory receptor gene). The full amino acid sequence of the 
protein of the invention was found to have 179 of 31 1 amino acid residues (57%) identical to, 
and 228 of 31 1 amino acid residues (73%) similar to, the 317 amino acid residue 
ptnr:SWISSNEW-ACC:Q13607 protein from Homo sapiens (Human) (OLFACTORY 
RECEPTOR 2F1 (OLFACTORY RECEPTOR-LIKE PROTEIN OLF3)). NOV67 also has 
homology to the other proteins shown in the BLASTP data in Table 67D. 



Table 67D. NOV67 BLASTP results 






Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

<%) 


Positive 
(%) 


Expect 


gi | 9297120 | sp 
|Q13607 |02F1_ 
HUMAN 


O L FACTORY RECEPTOR 2F1 
{ OLFACTORY RECEPTOR- LIKE 
PROTEIN OLF3) 


317 


179/311 
(57) 


228/311 
(72) 


7e-86 


gi | 6912558 | re 
f |NP_036501.1 
| (NM_012369) 


olfactory receptor, 
family 2, subfamily F, 
member 1; olfactory 
receptor, family 2, 
subfamily F, member 5 ,- 
olfactory receptor, 
family 2, subfamily F, 
member 4 [Homo sapiens] 


317 


179/311 
(57) 


228/311 
(72) 


9e-86 


gi | 2495055 | sp 
|Q95156 |OLF3_ 
CANFA 


OLFACTORY RECEPTOR- LIKE 
PROTEIN OLF3 


317 


176/305 
(57) 


228/305 
(72) 


2e-85 


gi | 14423778 | s 
p|O95006 |02F2 
_HUMAN 


OLFACTORY RECEPTOR 2F2 
(OLFACTORY RECEPTOR 7-1) 
(OR7-1) <PID:g2495051) 
[Homo sapiens] 


317 


175/308 
(56) 


220/308 
(70) 


7e-82 


gi | 5453066 | gb 
| AAD43423 . 1 | 
(AF073.974) 


olfactory receptor 
[Mus musculus] 


216 


167/216 
(77) 


189/216 
(87) 


3e-78 



This BLASTP data is displayed graphically in the ClustalW in Table 67E. A multiple 
sequence alignment is given, with the NOV67 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 67D. 
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Sum 

High Probability 

Score P(N) 

1576 1.2e-161 

924 1.5e-92 

924 1.5e-92 

924 1.5e-92 

923 1.9e-92 




Table 67E. ClustalW Alignment of NOV67 



NOV 6 7 

gi | 9297120 | 
gi | 6912558 j 
gi|2495055| 
gij 14423778 | 
gi|5453066| 



NOV67 

gi|9297120| 
gij 6912558 | 
gij 2495055 j 
gij 14423778 | 
gij5453066| 



NOV67 

gi| 9297120] 
gij 6912558 j 
gij2495055j 
gi|l4423778| 
gij5453066| 



(SEQ ID 

(SEQ ID 

{SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 



NO: 234) 
NO: 634) 
NO:635) 
NO:636) 
NO : 63 7) 
NO:638) 
10 



NOV67 


61 


gi| 9297120 | 


58 


gij6912558j 


58 


gij2495055 j 


58 


gij 14423778 | 


58 


gij 5453066 | 


1 



NOV67 


121 


gi | 9297120 | 


118 


gij 6912558 j 


118 


gi j 2495055 j 


118 


gijl4423778| 


118 


gij 5453066 | 


51 



NOV 6 7 


181 


gi | 9297120 | 


178 


gi j 6912558 j 


178 


gi|2495055 j 


178 


gijl4423778| 


178 


gij5453066| 


111 



241 
238 
238 
238 
238 
171 



NOV67 




301 


gi| 9297120 




298 


gij 6912558 




298 


gij 2495055 




298 


gi| 14423778 | 


298 


gij5453066| 


216 
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Table 67F lists the domain description from DOMAIN analysis results against NOV67. 
This indicates that the NOV67 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 67F. Domain Analysis of NOV67 




gnl| 


Pf am | pf amOOOOl , 7tm_l, 7 transmembrane receptor 




(rhodopsin family). SEQ ID NO:810 






Score = 


CD-Length = 254 residues, 93.3% aligned 
= 76.3 bits (186), Expect = 3e-15 




NOV67 : 
Sbjct : 


61 
18 


PMYFFLCNLSLVDICYTSSRVPQMLVHCTSKRKTISFARCGTQLFFSLALGGTEFLLL7AA 
P FL NL++ D+++PL+ AC + G LLL A 
PTNI FLLNLAVADIILFLLTLPPWALYYLVGGDWVFGDAL.CKLVGALFVVNGYAS I LLLTA 


120 
77 


NOV67 : 
Sbj ct : 


121 
78 


MAYDRYVAVCDPLC Y I AVMS PRLCMALAAVS WLVGLANS AMETALTMHLPT CGHNVLNHV 
++ DRY+A+ PL Y + +PR L + W++ L S + N + 
ISIDRYLAIVHPLRYRRIRTPRRAKVLILLVWVLALLLSLPPLLFSWLRTVEEGNTTVCL 


180 
137 


NOV67 : 
Sbjct : 


181 
138 


ACETLALVRS ACVDI T FNQ WI VASS VWLL VPCCL VSLSYTT, I WAVLQI HSTQGHRKA 
V+ + V ++ ++ V+++ L +L L+ S+ + A 
IDFPEESVKRSYVLLSTLVGFVLPLLVILVCYTRILRTLRKRARSQRSLKRRSSSERKAA 


240 
197 


NOV6 7 : 
Sbjct: 


241 
198 


FGTCASHLTWSISYG MAL FT YMQP R S MAS AEQE K VM VL S Y A WT PM LNP F I Y 2 93 

+ V +L+ ++LAV LNP IY 
KMLLVWWFVLCWLPYHIVLLLDSLCLLSIWRVLPTALLITLWLAYVNSCLNPIIY 254 



5 

The olfactory system is able to distinguish several thousand odorant molecules. 
Olfactory receptors are believed to be encoded by an extremely large subfamily of G protein- 
coupled receptors. These receptors share a 7-transmembrane domain structure with many 
neurotransmitter and hormone receptors. They are responsible for the recognition and G 

10 protein-mediated transduction of odorant signals. The genes encoding these receptors are 

devoid of introns within their coding regions. Schurmans et al. (1993) cloned a member of this 
family of genes, OLFR1, from a genomic library by cross-hybridization with a gene fragment 
obtained by PCR. By isotopic in situ hybridization, they mapped the gene to 17pl3-pl2 with a 
peak at band 17pl3. A minor peak was detected on chromosome 3, with a maximum in the 

15 region 3ql3-q21 . After Mspl digestion, a RFLP was demonstrated. Using this in a study of 3 
CEPH pedigrees, they demonstrated linkage with D17S126 at 17pter-pl2; maximum lod - 3.6 
at theta = 0.0. Used as a probe on Southern blots under moderately stringent conditions, the 
cDNA hybridized to at least 3 closely related genes. Ben-Arie et al. (1994) cloned 16 human 
OLFR genes, all from 17pl3.3. The intronless coding regions are mapped to a 350-kb 

20 contiguous cluster, with an average intergenic separation of 1 5 kb. The OLFR genes in the 

cluster belong to 4 different gene subfamilies, displaying as much sequence variability as any 
randomly selected group of OLFRs. This suggested that the cluster may be one of several 
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copies of an ancestral OLFR gene repertoire whose existence may have predated the 
divergence of mammals. Localization to 17pl3.3 was performed by fluorescence in situ 
hybridization as well as by somatic cell hybrid mapping. 

The ability to distinguish different odors depends on a large number of different 

5 odorant receptors (ORs). Sullivan et al. (1996) noted that ORs are expressed by nasal olfactory 
sensory neurons; each neuron expresses only 1 allele of a single OR gene. In the nose, 
different sets of ORs are expressed in distinct spatial zones. Neurons that express the same OR 
gene are located in the same zone; however, in that zone they are randomly interspersed with 
neurons expressing other ORs. This distribution suggested to the authors that, when the cell 

10 chooses an OR gene for expression, it may be restricted to a specific zonal gene set, but it may 
select from that set by a stochastic mechanism. Proposed models of OR gene choice fall into 2 
classes: locus-dependent and locus-independent. Locus-dependent models posit that OR genes 
are clustered in the genome, perhaps with members of different zonal gene sets clustered at 
distinct loci. In contrast, locus-independent models do not require that OR genes be clustered. 

15 To assess the feasibility of these models, Sullivan et al. (1996) determined the expression 

zones, sequences, and chromosomal locations of a number of mouse OR genes. They mapped 
OR genes to 1 1 different regions on 7 chromosomes. These loci lie within paralogous 
chromosomal regions that appear to have arisen by duplications of large chromosomal 
domains followed by extensive gene duplication and divergence. These studies showed that 

20 OR genes expressed in the same zone map to numerous loci. Moreover, a single locus can 

contain genes expressed in different zones. These findings raised the possibility that OR gene 
choice is locus-independent or involved consecutive stochastic choices. 

Nekrasova et al. (1996) overexpressed human (OR17-4) and rat (olp4) olfactory 
receptor genes in insect cells, purified them, and characterized them biochemically. They 

25 identified monomeric, dimeric, and trimeric forms of the proteins corresponding to molecular 
weights of 32, 69, and 94 kD by electrophoresis. The oligomers were resistant to reduction and 
alkylation and were therefore thought to be held together by SDS-resistant hydrophobic 
interactions, consistent with observations of other G protein-coupled receptors. 



30 cosmid spanning the center of the OR gene cluster on 17pl3.3. The resulting 40-kb sequence 
revealed 3 known OR coding regions, 2 OR genes which may have originated from a tandem 
duplication event, and a new OR pseudogene fused to another OR gene. 

Issel-Tarver and Rine (1996) characterized 4 members of the canine olfactory receptor 
gene family. The 4 subfamilies comprised genes expressed exclusively in olfactory epithelium 



Glusman et al. (1996) described the results of complete sequencing of an OR-rich 
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Analysis of large DNA fragments using Southern blots of pulsed field gels indicated that 
subfamily members were clustered together, and that 2 of the subfamilies were closely linked 
in the dog genome. Analysis of the 4 olfactory receptor gene subfamilies in 26 breeds of dog 
provided evidence that the number of genes per subfamily was stable in spite of differential 
5 selection on the basis of olfactory acuity in scent hounds, sight hounds, and toy breeds. 

Issel-Tarver and Rine (1997) performed a comparative study of 4 subfamilies of 
olfactory receptor genes first identified in the dog to assess changes in the gene family during 
mammalian evolution, and to begin linking the dog genetic map to that of humans. These 4 
families were designated by them OLF1, OLF2, OLF3, and OLF4 in the canine genome. The 

10 subfamilies represented by these 4 genes range in size from 2 to 20 genes. They are all 

expressed in canine olfactory epithelium but were not detectably expressed in canine lung, 
liver, ovary, spleen, testis, or tongue. The OLF1 and OLF2 subfamilies are tightly linked in the 
dog genome and also in the human genome. The smallest family is represented by the canine 
OLF1 gene. Using dog gene probes individually to hybridize to Southern blots of genomic 

15 DNA from 24 somatic cell hybrid lines. They showed that the human homologous OLF1 

subfamily maps to human chromosome 1 1 . The human gene with the strongest similarity to 
the canine OLF2 gene also mapped to chromosome 11. Both members of the human subfamily 
that hybridized to canine OLF3 were located on chromosome 7. It was difficult to determine to 
which chromosome or chromosomes the human genes that hybridized to the canine OLF4 

20 probe mapped. This subfamily is large in mouse and hamster as well as human, so the rodent 
background largely obscured the human cross-hybridizing bands. It was possible, however, to 
discern some human-specific bands in blots corresponding to human chromosome 19. They 
refined the mapping of the human OLF1 homolog by hybridization to YACs that map to 
1 lql 1. In dogs, the OLF1 and OLF2 subfamilies are within 45 kb of one another (Issel-Tarver 

25 and Rine (1996)). Issel-Tarver and Rine (1997) demonstrated that in the human OLF1 and 

OLF2 homologs are likewise closely linked. By studying YACs, Issel-Tarver and Rine (1997) 
found that the human OLF3 homolog maps to 7q35. A chromosome 19-specific cosmid library 
was screened by hybridization with the canine OLF4 gene probe, and clones that hybridized 
strongly to the probe even at high stringency were localized to 19pl3.1 and 19pl3.2. These 

30 clones accounted, however, for a small fraction of the homologous human bands. 

Rouquier et al. (1998) demonstrated that members of the olfactory receptor gene 
family are distributed on all but a few human chromosomes. Through fluorescence in situ 
hybridization analysis, they showed that OR sequences reside at more than 25 locations in the 
human genome. Their distribution was biased for terminal bands of chromosome arms. Flow- 



567 



1! Jl}\Jl ji 77" O l! jr' 1 *' 




j! 



10 



15 



20 



25 



sorted chromosomes were used to isolate 87 OR sequences derived from 16 chromosomes. 
Their sequence relationships indicated the inter- and intrachromosomal duplications 
responsible for OR family expansion. Rouquier et al. (1998) determined that the human 
genome has accumulated a striking number of dysfunctional copies: 72% of these sequences 
were found to be pseudogenes. ORF-containing sequences predominate on chromosomes 7, 
16, and 17. 

Trask et al. (1998) characterized a subtelomeric DNA duplication that provided insight 
into the variability, complexity, and evolutionary history of that unusual region of the human 
genome, the telomere. Using a DNA segment cloned from chromosome 19, they demonstrated 
that the blocks of DNA sequence shared by different chromosomes can be very large and 
highly similar. Three chromosomes appeared to have contained the sequence before humans 
migrated around the world. In contrast to its multicopy distribution in humans, this 
subtelomeric block maps predominantly to a single locus in chimpanzee and gorilla, that site 
being nonorthologous to any of the locations in the human genome. Three new members of the 
olfactory receptor (OR) gene family were found to be duplicated within this large segment of 
DNA, which was found to be present at 3q, 15q, and 19p in each of 45 unrelated humans 
sampled from various populations. From its sequence, one of the OR genes in this duplicated 
block appeared to be potentially functional. The findings raised the possibility that functional 
diversity in the OR family is generated in part through duplications and interchromosomal 
rearrangements of the DNA near human telomeres. 

Mombaerts (1999) reviewed the molecular biology of the odorant receptor (OR) genes 
in vertebrates. Buck and Axel (1991) discovered this large family of genes encoding putative 
odorant receptor genes. Zhao et al. (1998) provided functional proof that one OR gene encodes 
a receptor for odorants. The isolation of OR genes from the rat by Buck and Axel (1991) was 
based on 3 assumptions. First, ORs are likely G protein-coupled receptors, which 
characteristically are 7-transmembrane proteins. Second, ORs are likely members of a 
multigene family of considerable size, because an immense number of chemicals with vastly 
different structures can be detected and discriminated by the vertebrate olfactory system. 
Third, ORs are likely expressed selectively in olfactory sensory neurons. Ben-Arie et al. 
(1994) focused attention on a cluster of human OR genes on 17p, to which the first human OR 
gene, OR1D2, had been mapped by Schurmans et al. (1993). According to Mombaerts (1999), 
the sequences of more than 150 human OR clones had been reported. The human OR genes 
differ markedly from their counterparts in other species by their high frequency of 
pseudogenes, except the testicular OR genes. Research showed that individual olfactory 
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sensory neurons express a small subset of the OR repertoire. In rat and mouse, axons of 
neurons expressing the same OR converge onto defined glomeruli in the olfactory bulb. 

Gilad et al. (2000) reported the population sequence diversity of genomic segments 
within a 450-kb cluster of olfactory receptor (OR) genes on chromosome 17. They found a 
5 dichotomy in the pattern of nucleotide diversity between OR pseudogenes and introns on the 
one hand and the closely interspersed intact genes on the other. They suggested that weak 
positive selection is responsible for the observed patterns of genetic variation. This was 
inferred from a lower ratio of polymorphism to divergence in genes compared with 
pseudogenes or introns, high nonsynonymous substitution rates in OR genes, and a small but 

10 significant overall reduction in variability in the entire OR gene cluster compared with other 

genomic regions. The dichotomy among functionally distinct segments within a short genomic 
distance requires high recombination rates within this OR cluster. 

NOV67 is predicted to be expressed in at least the following tissues: brain, testis, 
ovary, skeletal muscle, neuronal tissue. This information was derived by determining the 

15 tissue sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
expression data for NOV67 is provided in Example 2. 

The NOV67 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein. The 

20 NOV67 nucleic acid encoding the GPCR-like protein of the invention, or fragments thereof, 

may further be useful in diagnostic applications, wherein the presence or amount of the nucleic 
acid or the protein are to be assessed. The novel nucleic acid of the invention encoding an 
olfactory receptor-like protein OLF3-like protein includes the nucleic acid whose sequence is 
provided in Table 67A, or a fragment thereof. 

25 The invention also includes a mutant or variant nucleic acid any of whose bases may 

be changed from the corresponding base shown in Table 67A while still encoding a protein 
that maintains its olfactory receptor-like protein OLF3-like activities and physiological 
functions, or a fragment of such a nucleic acid. The invention further includes nucleic acids 
whose sequences are complementary to the sequence disclosed in Table 67A, including 

30 nucleic acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 
thereto, whose structures include chemical modifications. Such modifications include, by way 
of non-limiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
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chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or 
variant nucleic acids, and their complements, up to about 1 8% of the bases may be so changed. 
The novel protein of the invention includes the olfactory receptor-like protein OLF3- 
5 like protein whose sequence is provided in Table 67B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 67B while still encoding a protein that maintains its olfactory receptor-like protein 
OLF3-like activities and physiological functions, or a functional fragment thereof. In the 
mutant or variant protein, up to about 43% of the amino acid residues may be so changed. 
10 These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

15 

NOV68 

The disclosed NOV68 (alternatively referred to herein as CG56844-01) includes the 
2580 nucleotide sequence (SEQ ID NO:235) shown in Table 68A. A NOV68 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 21-23 and ends with a TAG 
20 codon at nucleotides 1 896-1 898. The disclosed NOV68 maps to human chromosome 9. 



Table 68A. NOV68 Nucleotide Sequence (SEQ ID 
NO:235) 

CAGGCCCCCACGTGGACAGCATGGACCGCGGCACGCTCCCTCTGGCTGTTGCCCTGCTGC 
TGGCCAGCTGCAGCCTCAGCCCCACAAGTCTTGCAGAAACAGTCCATTGTGACCTTCAGC 
CTGTGGGCCCCGAGAGGGGCGAGGTGACATATACCACTAGCCAGGTCTCGAAGGGCTGCG 
TGGCTCAGGCCCCCAATGCCATCCTTGAAGTCCATGTCCTCTTCCTGGAGTTCCCAACGG 
GCCCGTCACAGCTGGAGCTGACTCTCCAGGCATCCAAGCAAAATGGCACCTGGCCCCGAG 
AGGTGCTTCTGGTCCTCAGTGTAAACAGCAGTGTCTTCCTGCATCTCCAGGCCCTGGGAA 
TCCCACTGCACTTGGCCTACAATTCCAGCCTGGTCACCTTCCAAGAGCCCCCGGGGGTCA 
ACACCACAGAGCTGCCATCCTCCTCCTCCTCCTCACTGTCCTTCTGCATGCTGGAAGCCA 
GC C AGGAC ATGGGC CGC AC GC T CG AGTGGCGG CCG CGT AC T C C AG CCT TGGT C CGGGGC T 
GCCACTTGGAAGGCGTGGCCGGCCACAAGGAGGCGCACATCCTGAGGGTCCTGCCGGGCC 
ACTCGGCCGGGCCCCGGACGGTGACGGTGAAGGTGGAACTGAGCTGCGCACCCGGGGATC 
TCGATGCCGTCCTCATCCTGCAGGGTCCCCCCTACGTGTCCTGGCTCATCGACGCCAACC 
ACAACATGCAGATCTGGACCACTGGAGAATACTCCTTCAAGATCTTTCCAGAGAAAAACA 
TT CGTGGCTT C AAG C T C C C AG AC AC AC C T C AAGGC CT CC TGGGGGAGGC C CGG ATG CT C A 
ATGCCAGCATTGTGGCATCCTTCGTGGAGCTACCGCTGGCCAGCATTGTCTCACTTCATG 
CCTCCAGCTGCGGTGGTAGGCTGCAGACCTCACCCGCACCGATCCAGACCACTCCTCCCA 
AGGACACTTGTAGCCCGGAGCTGCTCATGTCCTTGATCCAGACAAAGTGTGCCGACGACG 
CCATGACCCTGGTACTAAAGAAAGAGCTTGTTGCGCATTTGAAGTGCACCATCACGGGCC 
TGACCTTCTGGGACCCCAGCTGTGAGGCAGAGGACAGGGGTGACAAGTTTGTCTTGCGCA 
GTGCTTACTCCAGCTGTGGCATGCAGGTGTCAGCAAGTATGATCAGCAATGAGGCGGTGG 
TCAATATCCTGTCGAGCTCATCACCACAGCGGAAAAAGGTGCACTGCCTCAACATGGACA 
GCCTCTCTTTCCAGCTGGGCCTCTACCTCAGCCCACACTTCCTCCAGGCCTCCAACACCA 
TCGAGCCGGGGCAGCAGAGCTTTGTGCAGGTCAGAGTGTCCCCATCCGTCTCCGAGTTCC 
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TGCTCCAGTTAGACAGCTGCCACCTGGACTTGGGGCCTGAGGGAGGCACCGTGGAACTCA 
TCCAGGGCCGGGCGGCCAAGGGCAACTGTGTGAGCCTGCTGTCCCCAAGCCCCGAGGGTG 
ACCCGCGCTTCAGCTTCCTCCTCCACTTCTACACAGTACCCATACCCAAAACCGGCACCC 
TCAGCTGCACGGTAGCCCTGCGTCCCAAGACCGGGTCTCAAGACCAGGAAGTCCATAGGA 
CTGTCTTCATGCGCTTGAACATCATCAGCCCTGACCTGTCTGGTTGCACAAGCAAAGGCC 
TCGTCCTGCCCGCCGTGCTGGGCATCACCTTTGGTGCCTTCCTCATCGGGGCCCTGCTCA 
CTGCTGCACTCTGGTACATCTACTCGCACACGCGTTCCCCCAGCAAGCGGGAGCCCGTGG 
TGGCGGTGGCTGCCCCGGCCTCCTCGGAGAGCAGCAGCACCAACCACAGCATCGGGAGCA 
CCCAGAGCACCCCCTGCTCCACCAGCAGCATGGCATAGCCCCGGCCCCCCGCGCTCGCCC 
AGCAGGAGAGACTGAGCAGCCGCCAGCTGGGAGCACTGGTGTGAACTCACCCTGGGAGCC 
AGTCCTCCACTCGACCCAGAATGGAGCCTGCTCTCCGCGCCTACCCTTCCCGCCTCCCTC 
TCAGAGGCCTGCTGCCAGTGCAGCCACTGGCTTGGAACACCTTGGGGTCCCTCCACCCCA 
CAGAACCTTCAACCCAGTGGGTCTGGGATATGGCTGCCCAGGAGACAGACCACTTGCCAC 
GCTGTTGTAAAAACCCAAGTCCCTGTCATTTGAACCTGGATCCAGCACTGGTGAACTGAG 
CTGGGCAGGAAGGGAGAACTTGAAACAGATTCAGGCCAGCCCAGCCAGGCCAACAGCACC 
TCCCCGCTGGGAAGAGAAGAGGGCCCAGCCCAGAGCCACCTGGATCTATCCCTGCGGCCT 
CCACACCTGAACTTGCCTAACTAACTGGCAGGGGAGACAGGAGCCTAGCGGAGCCCAGCC 
TGGGAGCCCAGAGGGTGGCAAGAACAGTGGGCGTTGGGAGCCTAGCTCCTGCCACATGGA 
GCCCCCTCTGCCGGTCGGGCAGCCAGCAGAGGGGGAGTAGCCAAGCTGCTTGTCCTGGGC 
CTGCCCCTGTGTATTCACCACCAATAAATCAGACCATGAAACCAGTGAAAAAAAAAAAAA 



A NOV68 polypeptide (SEQ ID NO:236) encoded by SEQ ID NO:235 is 625 amino 
acids in length and is presented using the one-letter amino acid code in Table 68B. The Psort 
profile for NOV68 predicts that this sequence is a Type Ilia membrane protein, has a signal 
peptide, and is likely to be localized at the plasma membrane with a certainty of 0.6400. In 
alternative embodiments, a NOV68 polypeptide is located to Golgi bodies with a certainty of 
0.4600, or to the endoplasmic reticulum (membrane) with a certainty of 0.3700. The Signal P 
predicts a likely cleavage site for a NOV68 peptide is between positions 25 and 26, i.e., at the 
dash in the sequence SLA-ET. 



Table 68B. NOV68 Polypeptide Sequence (SEQ ID 
NO:236) 



MDRGT L P LAVAL LL AS C S L S P TS LAE T VH C DLQ P VG P E RGE VT YT TS Q VS KGC VAQ APNA 
ILEVHVLFLEFPTGPSQLELTLQASKQNGTWPREVLLVLSVNSSVFLHLQALGIPLHLAY 
NSSLVTFQEPPGVNTTELPSSSSSSLSFCMLEASQDMGRTLEWRPRTPALVRGCHLEGVA 
GHKEAHILRVLPGHSAGPRTVTVKVELSCAPGDLDAVLILQGPPYVSWLIDANHNMQIWT 
TGEYSFKIFPEKNIRGFKLPDTPQGLLGEARMLNASIVASFVELPLASIVSLHASSCGGR 
LQTSPAPIQTTPPKDTCSPELLMSLIQTKCADDAMTLVLKKELVAHLKCTITGLTFWDPS 
CEAEDRGDKFVLRSAYSSCGMQVSASMISNEAWNILSSSSPQRKKVHCLNMDSLSFQLG 
LYLSPHFLQASNTIEPGQQSFVQVRVSPSVSEFLLQLDSCHLDLGPEGGTVELIQGRAAK 
GNCVSLLSPSPEGDPRFSFLLHFYTVPIPKTGTLSCTVALRPKTGSQDQEVHRTVFMRLN 
IISPDLSGCTSKGLVLPAVLGITFGAFLIGALLTAALWYIYSHTRSPSKREPWAVAAPA 
SSESSSTNHSIGSTQSTPCSTSSMA „ 



A BLAST analysis of NOV68 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV68 had high 
homology to other proteins as shown in Table 68C. 



Table 68C. BLASTX results from PatP database for NOV68 

" " " .Smallest 

Sum ' 
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High 


Probability 


Sequences producing High- scoring Segment Pairs: 


Score 


P (N) 


patp: AAR54 82 8 Endoglin - Homo sapiens, 6 58 aa. 


2500 


1 . 5e-259 


patp : AAR99 8 02 Endoglin - Homo sapiens, 658 aa. 


2500 


1 - 5e-259 


patp : AAY8 2 19 0 Human endoglin SEQ ID NO: 2 - Homo sapiens 


2500 


1 . 5e-259 


patp.-AAR37808 Rat betaglycan - Synthetic, 853 aa. 


215 


2 . 2e-24 


patp : AAR74601 Rat betaglycan contg . transforming growth 


factor 215 


2 .2e-24 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 2128 of 2128 bases (100%) identical to a gb.GENBANK- 
ID:HUMENDO|acc:J05481.1 mRNA from Homo sapiens (Human endoglin mRNA, 3' end). 
5 The full amino acid sequence of the protein of the invention was found to have 509 of 624 
amino acid residues (81%) identical to, and 531 of 624 amino acid residues (85%) similar to, 
the 658 amino acid residue ptnr:SWISSNEW-ACC:Pl 781 3 protein from Homo sapiens . 
(Human) (ENDOGLIN PRECURSOR (CD105 ANTIGEN)). NOV68 also has homology to 
the other proteins shown in the BLASTP data in Table 68D. 

10 



Table 68D. NOV68 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 

(%) 


Expect 


gi | 3041681 1 sp 
| P17 813 |EGLN_ 
HUMAN 


ENDOGLIN PRECURSOR 
(CD10 5 ANTIGEN) 


658 


621/658 
(94) 


632/658 
(94) 


0 . 0 


gi | 15679936 |g 
b | AAH14271 . 1 | 
AAH14271 
(BC014271) 


endoglin (Osler-Rendu- 
Weber syndrome 1) 
[Homo sapiens] 


658 


620/658 
(94) 


621/658 
(94) 


0 . 0 


gi | 105920 |pir 
| (A36262 


endoglin precursor - 
human 


645 


607/644 
(94) 


608/644 
(94) 


0 . 0 


gi | 4557555 | re 
f |NP_000109 . 1 
| (NM0 00118) 


endoglin precursor; 
Endoglin 
[Homo sapi en s ] 


625 


581/618 
(94) 


582/618 
(94) 


0.0 


gi | 6679649 | re 
f|NP 031958.1 
| (NM 007932) 


endoglin 
[Mus musculus] 


653 


452/660 
(68) 


540/660 
(75) 


0 . 0 



This BLASTP data is displayed graphically in the ClustalW in Table 68E. A multiple 
sequence alignment is given, with the NOV68 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
15 Table 68D. 



Table 68E. ClustalW Alignment of NOV68 


NOV6 8 


(SEQ 


ID 


NO:236) 


gi | 3041681 1 


(SEQ 


ID 


NO:639) 


gi | 15679936 | 


(SEQ 


ID 


NO : 64 0) 
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gi I 105920 I 
gi |4557555 | 
gi|6679649| 



NOV68 

gi | 3041681 | 
gi|l5679936| 
gi | 105920 | 
gi |4557555 | 
gi|6679649| 



NOV68 

gi|304168l| 
gi |15679936 | 
gi|l05920| 
gi|4557555 | 
gi| 6679649 j 



NOV68 

gi | 3041681 | 
gi|15679936| 
gi 1 105920 | 
gi |4557555 | 
gi|6679649| 



NOV68 

gi|304168l| 
gi|15679936| 
gi | 105920 | 
gi|4557555 | 
gi | 6679649 j 



NOV68 

gi | 3041681 | 
gi|l5679936| 
gi|l05920| 
gi|4557555 | 
gi|6679649| 



NOV68 

gi | 3041681 1 
gi|l5679936| 
gi j 105920 | 
gi|4557555| 
gi|6679649| 



(SEQ ID NO:641) 
( SEQ ID NO:642) 
{SEQ ID NO:643) 



1 
1 
1 
1 

i spmB 

Am 





310 
I 



320 



I 



330 
. . I - - 



I 



340 
• • ! - - 



I 



350 
. . I . . 



360 



NOV68 


266 


gi| 3041681 | 


299 


gi|l5679936| 


299 


gi| 105920 | 


286 


gi j 4557555 | 


299 


gi| 6679649] 


299 



LLGEARMLNASIVASFVELPLASIVSLHASSCGGRLQTSPAPIQTTPPKDTCSPELLMSL 
LLGEARMLNASIVASFVELPLASIVSLHASSCGGRLQTSPAPIQTTPPKDTCSPELLMSL 
LLGEARMLNASIVASFVELPLASIVSLHASSCGGRLQTSPAPIQTTPPKDTCSPELLMSL! 
LLGEARMLNASIVASFVELPLASIVSLHASSCGGRLQTSPAPIQTTPPKDTCSPELLMSL 
LLGEARMLNASIVASFVELPLASIVSLHASSCGGRLQTSPAPIQTTPPKDTCSPELLMSL 
lEBe ArBlNAS I vfls F VEL PLWS^VS lS^ASS CGGB5qt3p AP^BtT P P KPT CSP^LLMSL 



325 
358 
358 
345 
358 
358 



370 



380 



390 



400 



410 



420 




430 



440 



450 



460 



470 



480 
■•I 
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NOV6 8 


386 


gi | 3041681 1 


419 


gi j 15679936 | 


419 


ai 105920 1 


406 


gi j 4557555 | 


419 


gi | 6679649 | 


419 


NOV68 


446 


gi | 3041681 | 


479 


gi|l5679936| 


479 


gi | 105920 | 


466 


gi|4557555 | 


479 


gi | 6679649 | 


479 


NOV68 


506 


gi | 3041681 | 


539 


gi | 15679936 | 


539 


gi | 105920 | 


526 


gi | 4557555 | 


539 


gi | 6679649 | 


539 


NOV68 


566 


gi|304168l| 


599 


gi|15679936| 


599 


gi j 105920 | 


586 


gi|4557555 | 


599 


gi| 6679649 j 


594 




NOV68 


625 


625 


gi|304168l| 


658 


658 


gi|15679936| 


658 


658 


gi j 105920 | 


645 


645 


gi|4557555| 


625 


625 


gi|6679649| 


653 


653 



Table 68F lists the domain description from DOMAIN analysis results against NOV68. 
This indicates that the NOV68 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 68F. Domain Analysis of NOV68 

,gnl | Smart 1 smart00241 / ZP, Zona pellucida (ZP) domain; ZP proteins are responsible for 
sperm- adhesion fo the zona pellucida. ZP domains are also present in multidomain 
transmembrane proteins such as glycoprotein GP2 , uromodulin and TGF-beta receptor type 
III (betaglycan) . SEQ ID NO:1391 

CD-Length = 253 residues, 96.4% aligned 



Score = 42.4 bits (98), Expect = 8e-05 



NOV68 : 


329 


KCADDAMTLVLKKELVAHLKCTITGLTFWDPSCEAEDRG DKFVLRSAYSSCGMQVSA 


385 






+C +D M + + +L+ + GLT DPSC G + CG + 




SbjCt : 


1 


QCGEDRMWSVSTDLLFPGGIYVKGLTLGDPSCRPVFVGANSAWSFEVPLNECGTRRQV 


60 


NOV68 : 


386 


S MISNEAWNILSSSSP QRKKVHCLNMD 


413 






+ + SN W+ + + L D 




SbjCt : 


61 


NPDGI VYSNTLWSPI FHPLFITRDDRANYHVQCFYPESEKVSLRADVSTI PPTPLSWS 


120 



574 




NOV68 : 
Sbjct : 


414 
121 


- - SLSFQLGLYLSPHF LQASNT I E PGQQS FVQVRVS P S VS E - FL.LQLDS CHLDLGP - 

+ LY F Q+++T++ G + + L + +C+ G 
EGPPTCTYSLYKDDSFGSPYQSADTVQLGDPVYHEWSCDGRDDPSLGLLVHNCYATPGSD 


466 
180 


NOV68 : 
Sb j Ct : 


467 
181 


--EGGTVELIQGRAAKGNCVSLLSPSPEGDPRFSFLLHFYTVPIPKTGTLSCTVALRPKT 
G +1 + +SP F + + C + L K+ 
P F S GP K Y F 1 1 DNGC P VDR Y LD S VS P Y S S P S H Y AR F S VK VF K F ADR S L V Y FHCQ I TL CD K S 


524 
240 


NOV68 : 


525 


GSQD 52 8 




SbjCt : 


241 


DGSS 244 





Transforming growth factor-beta (TGF-beta) plays an important role in angiogenesis 
and vascular function. Endoglin, a transmembrane TGF-beta binding protein, is highly 
expressed on vascular endothelial cells and is the target gene for the hereditary haemorrhagic 
5 telangiectasia type I (HHT1), a dominantly inherited vascular disorder. The specific function 
of endoglin responsible for HHT1 is believed to involve alterations in TGF-beta responses. 
The initial interactions on the cell surface between endoglin and TGF-beta receptors may be 
an important mechanism by which endoglin modulates TGF-beta signalling, and thereby 
responses. On human microvascular endothelial cells, endoglin is co-expressed and is 

10 associated with betaglycan, a TGF-beta accessory receptor with which endoglin shares limited 
amino acid homology. This complex formation may occur in either a ligand-dependent or a 
ligand-independent manner. In addition, three higher order complexes containing endoglin, 
type II and/or type I TGF-beta receptors, also can occur on these cells. Thus endoglin may 
modify TGF-beta signalling by interacting with both betaglycan and the TGF-beta signalling 

1 5 receptors at physiological receptor concentrations and ratios ( Wong et al., 2000, Eur J 
Biochem vol. 267 : 5550-60). 

Endoglin is a homodimeric membrane glycoprotein. In association with transforming 
growth factor (TGF)-ss receptors I and II, endoglin can also bind TGF-ssl and -ss3 and form a 
functional receptor complex. In human vascular tissue, endoglin immunolabeling is shown to 

20 be higher in endarterectomy specimens removed from diseased coronary arteries than in 

normal internal mammary arteries. In vitro, antisense oligonucleotides to endoglin is shown to 
decrease its expression and antagonized the TGF-ss-mediated inhibition of human and porcine 
SMC migration. Thus, upregulation of endoglin occurs during arterial repair and in established 
atherosclerotic plaques and may be required for modulation of SMC migration by TGF-ss (Ma 

25 X et al., 2000, Arterioscler Thromb Vase Biol vol. 20 :2546-52). 

Hereditary hemorrhagic telangiectasia (HHT) is an inherited autosomal dominant 
vascular dysplasia caused by mutations in either endoglin (HHT1) or activin-like kinase 
receptor-1 (ALK-1) (HHT2). The majority of the mutations in endoglin cause frameshifts and 
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premature stop codons. Although initial reports suggested a dominant-negative model for 
HHT1, more recent reports have suggested that mutations in endoglin lead to 
haploinsufficiency. Expression of the missense mutants alone revealed that they are misfolded 
and that most show no cell surface expression. When co-expressed with wild-type endoglin, 
5 the missense mutants are able to dimerize with the normal endoglin protein and are trafficked 
to the cell surface. Thus either dominant-negative protein interactions or haploinsufficiency 
can cause HHT1 (Lux et ah, 2000, Hum Mol Genet vol 9 : 745-55). 

NOV68 is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
10 nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
15 public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV68 is provided in Example 2. 

The NOV68 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
the compositions of the present invention will have efficacy for the treatment of patients 
20 suffering from: arterial injuries, cerebral arteriovenous malformalities, pregnancy 

complications, carcinomas such as breast and mammary carcinoma as well as other diseases, 
disorders and conditions. The NOV68 nucleic acid encoding the endoglin-like protein of the 
invention, or fragments thereof, may further be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. The novel nucleic 
25 acid of the invention encoding a Endoglin (CD105 antigen)-like protein includes the nucleic 
acid whose sequence is provided in Table 68A, or a fragment thereof. The invention also 
includes a mutant or variant nucleic acid any of whose bases may be changed from the 
corresponding base shown in Table 68A while still encoding a protein that maintains its 
Endoglin (CD105 antigen)-like activities and physiological functions, or a fragment of such a 
30 nucleic acid. The invention further includes nucleic acids whose sequences are complementary 
to the sequence disclosed in Table 68A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
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include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
5 In the mutant or variant nucleic acids, and their complements, up to about 0% of the bases may 
be so changed. 

The novel protein of the invention includes the Endoglin (CD 105 antigen)-like protein 

whose sequence is provided in Table 68B. The invention also includes a mutant or variant 

protein any of whose residues may be changed from the corresponding residue shown in Table 
10 68B while still encoding a protein that maintains its Endoglin (CD 105 antigen)-like activities 

and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 

up to about 1 9% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
15 methods. These antibodies may be generated according to methods known in the art, using 

prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 

below. 

NOV69 

NOV69 includes two ILl-like proteins, designated herein as NOV69a and NOV69b. 
20 These are splice variants of sequence accession number CG5 6950-01. 

NOV69a 

The disclosed NOV69a (alternatively referred to herein as CG56950-01) includes the 
414 nucleotide sequence (SEQ ID NO: ) shown in Table 69 A. A NOV69a ORF begins with a 
25 Kozak consensus ATG initiation codon at nucleotides 100-102 and ends with a TGA codon at 
nucleotides 412-414. The disclosed NOV69a maps to human chromosome 7. 



Table 69 A. NOV69a Nucleotide Sequence (SEQ ID 
NO:237) 

ATGGAAAAAGCATTGAAAATTGACACACCTCAGCAGGGGAGCATTCAGGATATCAATCAT 
CGGGTGTGGGTTCTTCAGGACCAGACGCTCATAGCAGTCCCGAGGAAGGACCGTATGTCT 
CCAGTCACTATTGCCTTAATCTCATGCCGACATGTGGAGACCCTTGAGAAAGACAGAGGG 
AACCCCACACTGCAGCTGAAGGAAAAGGATATAATGGATTTGTACAACCAACCCGAGCCT 
GTGAAGTCCTTTCTCTTCTACCACAGCCAGAGTGGCAGGAACTCCACCTTCGAGTCTGTG 
GCTTTCCCTGGCTGGTTCATCGCTGTCAGCTCTGAAGGAGGCTGTCCTCTCATCCTTACC 
CAAGAACTGGGGAAAGCCAACACTACTGACTTTGGGTTAACTATGCTGTTTTAA 
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A NOV69a polypeptide (SEQ ID NO:238) encoded by SEQ ID NO:237 is 137 amino 
acids in length and is presented using the one-letter amino acid code in Table 69B. The Psort 
profile for NOV69a predicts that this sequence has no signal peptide and is likely to be 
localized to the cytoplasm with a certainty of 0.4500. In alternative embodiments, a NOV69a 
polypeptide is located to lysosomes with a certainty of 0.1514, or to perioxisomal microbodies 
with a certainty of 0.2384. 

Table 69B. NOV69a Polypeptide Sequence (SEQ ID 
NO:238) 

MEKALKIDTPQQGSIQDINHRVWVLQDQTLIAVPRKDRMSPVTIALISCRHVETLEKDRG 
NPTLQLKEKDIMDLYNQPEPVKSFLFYHSQSGRNSTFESVAFPGWFIAVSSEGGCPLILT 

QE LGKANTTD FGLTML F „ _ 

NOV69b 

The disclosed NOV69b (alternatively referred to herein as CG56 136-02) includes the 
41 1 nucleotide sequence (SEQ ID NO: ) shown in Table 69C. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a stop codon at 
nucleotides 409-41 1. The disclosed NOV69b maps to human chromosome 2ql2-14.1. 



Table 69C. NOV69b Nucleotide Sequence (SEQ ID 
NO:239) 

ATGGAAAAAGCATTGAAAGTTGACACACCTCAGCGGGGGAGCATTCAGGATATCAATCAT 
CGGGTGTGGGTTCTTCAGGACCAGACGCTCATAGCAGTCCCGAGGAAGGACCGTATGTCT 
CCAGTCACTATTGCCTTAATCTCATGCCGACATGTGGAGACCCTTGAGAAAGACAGAGGG 
AACCCCATCTACCTGGGCCTGAATGGACTCAATCTCTGCCTGATGTGTGTTCAAGTCGGG 
GACCAGCCCACACTGCAGATGAACCAGAGTGGCAGGAACTCCACCTTCGAGTCTGTGGCT 
TTCCCTGGCTGGTTGATCGCTGTCAGCTCTGAAGGAGGCTGTCCTCTCATCCTTACCCAA 
GAACTGGGGAAAGCCAACACTACTGACTTTGGGTTAACTATGCTGTTTTAA 

A NOV69b polypeptide (SEQ ID NO:240) encoded by SEQ ID NO:239 is 136 amino 
acids in length and is presented using the one-letter amino acid code in Table 69D. The Psort 
profile for NOV69b predicts that this sequence has no signal peptide and is likely to be 
localized to the cytoplasm with a certainty of 0.6500. In alternative embodiments, a NOV69b 
polypeptide is located to lysosomes with a certainty of 0.2305. 

Table 69D. NOV69b Polypeptide Sequence (SEQ ID 
NO:240) 

MEKALKVDTPQRGSIQDINHRWVLQDQTLIAVPRKDRMSPVTIALISCRHVETLEKDRG 
NPIYLGLNGLNLCLMCVQVGDQPTLQMNQSGRNSTFESVAFPGWLIAVSSEGGCPLILTQ 

ELGKANTTDFGLTMLF . _ 
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A BLAST analysis of NOV69 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV69 had high 
homology to other proteins as shown in Table 69E. 



Table 69E. BLASTX results from PatP database for NOV69 



Sequences producing High-scoring Segment Pairs: 

patp:AAW86286 Rodent interleukin (IL)-l epsilon polypeptide 
patp : AAY24 04 9 Amino acid sequence of a murine SPOIL- II 
patp: AAE06662 Mouse interleukin- lepsilon { IL- lepsilon) 
patp :AAY7 0 217 Human Interleukin- 1 epsilon protein 
patp: AAY7Q218 Human Interleukin- 1 epsilon 



Smallest 
Sum 

High Probability- 
Score P (N) 



263 
263 
263 
414 
414 



1 . le-38 
l.le-38 
1 . le-38 
1.7e-38 
1 . 7e-38 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 258 of 263 bases (98%) identical to a gb:GENBANK- 
ID:AF201831|acc:AF20183Ll mRNA from Homo sapiens (FIL1 epsilon mRNA). The full 
amino acid sequence of the protein of the invention was found to have 82 of 89 amino acid 
residues (92%) identical to, and 87 of 89 amino acid residues (97%) similar to, the 158 amino 
acid residue ptnr : SPTREMBL- ACC : Q9UHA7 protein from Homo sapiens (Human) (FIL1 
EPSILON). NOV69 also has homology to the other proteins shown in the BLASTP data in 
Table 69F. 



Table 69F. NOV69 BLASTP results 




Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%> 


Positive 
(%) 


Expect 


gi | 7657092 | re 
f |NP_055255.1 
| (NM 014440) 


Interleukin- 1 
Super family l [Homo 
sapiens] 


158 


137/158 
(86) 1 


137/158 
(86) 


7e-73 


gi|950660l|re 
f |NP_062323 . 1 
| (NM 019450) 


interleukin 1 family, 
member 6 (epsilon) [Mus 
musculus] 


160 


77/156 
(49) 


104/105 
(66) 


4e-38 


gi| 9665234 | re 
f|NP 062564.1 
| (NM 019618) 


interleukin- 1 homolog 1 
[Homo sapiens] 


169 


76/147 
(51) 


97/147 
(65) 


2e-35 


gi | 7657090 | re 
f |NP_055253 . 1 
| (NM 014438) 


Interleukin- 1 
Super family □ [Homo 
sapi ens] 


157 


62/145 
(42) 


85/145 
(57) 


2e-27 


gi | 12844800 | d 
bj |BAB26505 . 1 
| (AK009787) 


homolog to FIL1 
ETA~putative 
[Mus musculus] 


183 


58/144 
(40) 


77/144 
(53) 


4e-23 ' 



This BLASTP data is displayed graphically in the ClustalW in Table 69G. A multiple 
sequence alignment is given, with the NOV69 protein being shown on line 1 in a ClustalW 
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analysis comparing the protein of the invention with the related protein sequences shown in 
Table 69F. 



Table 69G. ClustalW Alignment of NOV69 



NOV6 9a 
NOV6 9b 
gi|7657092 | 
gi|950660l| 
gij 9665234 j 
gi|7657090| 
gijl2 844800| 



NOV69a 
NOV 6 9b 
gi|7657092 | 
gi|950660l| 
gij 9665234 j 
gi|7657090 j 
gij 12844800 | 



NOV6 9a 
NOV69b 
gi|7657092 | 
gi j 9506601 1 
gi| 9665234 | 
gi j 7657090 | 
gij 12844800 | 



NOV6 9a 
NOV6 9b 
gi | 7657092 | 
gi| 9506601| 
gi| 9665234| 
gi|7657090 j 
gij 12844800 | 



NOV69a 
NOV6 9b 

gi 
gi 
gi 
gi 



7657092 I 
9506601) 
9665234 j 
7657090 



(SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 

(SEQ ID 

{SEQ ID 



NO:238) 
NO: 24 0) 
NO: 644) 
NO: 645) 
NO : 64 6) 
NO: 647) 
NO : 64 8) 




135 




155 S|E 157 



gi 12844800] 181 SEK 183 



Table 69H lists the domain description from DOMAIN analysis results against 
NOV69. This indicates that the NOV69 sequence has properties similar to those of other 
proteins known to contain this domain. 
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Table 69H. Domain Analysis of NOV69 



gnl lPfam|pfam0034 0 , ILl, Interleukin-1 / 18. This family includes 

interleukin-1 and interleukin-18 . SEQ ID NO: 1392 

CD-Length = 142 residues 
Score = 60.1 bits (144), Expect = 8e-ll 



NOV6 9 : 57 KDRGNPTLQLKEKDIMDLYNQPEPVKS FLFYHSQSGRNSTFESVAFPGWFIAVSSEGGCP 116 

K+ P LQL+ + E K F F ++ G FES A+P WFIA E P 

Sbjct : 62 KEGDEPVLQLEMVEPPKYIKNSEMDKRFFFEKTEIGSKVYFESAAYPNWFIATKQEEDRP 121 

NOV69: 117 LILTQELGKANTTDF 131 
+ L +++ TDF 

Sbjct: 122 VFLANGPPESDITDF 136 * 



There are two structurally distinct forms of IL1 : IL1 (alpha), which is the acidic form 
with pI5, and ILl(beta) (IL1B; 147720), the neutral form with pI7. Both are 17-kD proteins 
coded by separate genes. The ILIA gene has 10,206 bp with 7 exons and 6 introns (Furutani et 
5 al., 1986). By Southern transfer analysis of DNAs from human-rodent somatic cell hybrids, 
Modi et al. (1988) assigned the ILIA gene to chromosome 2. Regional localization to 2ql3- 
q21 was achieved by in situ hybridization. Lafage et al. (1989) confirmed assignment to 2ql3 
by in situ hybridization. 

The ILIA and IL1B proteins, which are synthesized by a variety of cell types including 

10 activated macrophages, keratinocytes, stimulated B lymphocytes, and fibroblasts, are potent 
mediators of inflammation and immunity. Lord et al. (1991) demonstrated that both the alpha 
and beta forms, but particularly the beta form, are transcribed in polymorphonuclear 
leukocytes stimulated with LPS. Both ILIA and IL1B stimulate osteoclast activity in vitro and 
are potent bone resorbing factors. Sabatino et al. (1988) studied the effects of 72-hour 

15 subcutaneous infusions of interleukins 1 -alpha and -beta on plasma, calcium, and bone 
morphology. Both interleukins 1 caused a marked, dose-dependent increase in plasma 
calcium. Increased numbers of osteoclasts and bone resorption surfaces were observed on 
quantitative histomorphometry of bone. The results suggest a role for IL1 in the modulation of 
extracellular fluid calcium homeostasis. Hogquist et al. (1991) demonstrated that interleukin-1 

20 is involved in apoptosis (cell death). Both the alpha and the beta forms are released as a 
consequence of cell injury regardless of the insult. 

Bailly et al. (1993) elucidated a polymorphism that consists of a variable number of 
repeats of a 46-bp sequence within intron 6 of the ILIA gene. Among 72 unrelated persons, 
they identified 6 different alleles ranging from 5 to 1 8 repeats; the most frequent allele, present 

25 in 62%, contained 9 repeats. They suggested that the polymorphism may be of significance in 
gene function, since each repeat contains 3 potential binding sites for transcription factors. 
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Gray et al. (1986) showed that in the mouse also there are at least 2 interleukin-1 
genes, Ill(alpha) and Ill(beta). Boultwood et al. (1989) used in situ chromosome hybridization 
to show that the 2 111 genes in the mouse are located in the F region of chromosome 2. It had 
previously been shown by studies in mouse-hamster somatic cell hybrids and in recombinant 
5 inbred strains that the 2 genes are tightly linked on murine chromosome 2, approximately 4.7 
cM distal to beta-2-microglobulin. By pulsed field gel electrophoresis, Silver et al. (1990) 
showed that the mouse Ilia and II lb genes are contained in a genomic fragment of about 70 
kb. Further studies suggested that II lb lies 5-prime to Ilia, that the 2 genes are oriented in the 
same direction, and that they are separated by about 50 kb. From restriction mapping of the 

10 human genomic region, Nicklin et al. (1 994) concluded that, relative to one terminal CpG 

island, the 3 genes mapped to the following intervals: ILIA was between +0 and +35 kb, IL1B 
between +70 and +1 10 kb, and IL1RN (147679) between +330 and +430 kb. Since the 
assignment of IL1RN to 2ql4.2 appears to be the most definitive localization, the ILIA and 
IL1B genes can be presumably be said to be also on 2ql4. Cox et al. (1998) carried out studies 

15 with multiallelic markers that grouped the 3 genes into a biallelic system for use in association 
studies. They identified a common, 8-locus haplotype of the IL1 gene cluster. 

Hurwitz et al. (1992) studied the role of IL1 in the ovary, using a solution 
hybridization/RNase protection assay to test for expression of the IL1 gene, its type I receptor 
(IL1R; 147810), and its receptor antagonist (IL1RN). They presented findings which, taken 

20 together, revealed the existence of a complete, highly compartmentalized, hormone-dependent 
intraovarian IL1 system. 

Since IL1 is an important cytokine in the control of the inflammatory response central 
to the pathology of rheumatoid arthritis (180300), Cox et al. (1999) used the combined sib- 
TDT (transmission/disequilibrium test) and TDT, in addition to parametric and nonparametric 

25 linkage methods, to investigate candidate genes of the IL1 gene cluster in the 2ql3 region. 
Several tightly linked IL1 cluster markers yielded suggestive evidence for linkage in the 
combined TDT in those families in which affected sibs did not share 2 HLA-DRB 1 alleles 
identical by descent. The evidence was significant in those with severe disease, as assessed by 
the presence of bone erosions. In contrast, there was no evidence of linkage using 

30 nonparametric linkage analysis, but parametric analysis revealed weak evidence of linkage 
when marker-trait disequilibrium was incorporated into the analysis. The data provided 
preliminary evidence for linkage of genes of the IL1 cluster to rheumatoid arthritis and 
suggested a possible role for this region in severe erosive disease. 
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Kornman et al. (1997) suggested that genetic polymorphisms of the ILIA and IL1B 
genes may be associated with severity of periodontitis in adult nonsmokers. The IL1B 
polymorphism was referred to as IL1B+3953 and the ILIA polymorphism was referred to as 
IL1 A-889. Nonsmokers aged 40 to 60 carrying the *2' allele (in either homozygous or 
5 heterozygous state) at both loci were observed to have nearly 19 times the risk of developing 
severe periodontitis compared to subjects homozygous for the T allele at either or both of 
these loci. Because of the implication of interleukin-1 in adult periodontitis, Diehl et al. (1999) 
undertook an evaluation of the role of these ILIA and IL1B polymorphisms in early-onset 
periodontitis (EOP; see 170650) in 28 African-American families and 7 Caucasian-American 

10 families with 2 or more affected members. The 2 major EOP subtypes, localized juvenile 
periodontitis and generalized early-onset periodontitis, encompassing rapidly progressive 
periodontitis and generalized juvenile periodontitis, were analyzed separately and together. 
They obtained highly significant evidence of linkage disequilibrium for both groups of 
generalized EOP subjects. A similar trend was noted for the localized form. The IL1 alleles 

1 5 associated with high risk of EOP had been suggested previously to be correlated with low risk 
for severe adult periodontitis. Linkage disequilibrium with generalized EOP was equally 
strong for smoking and nonsmoking subjects. ILIA and 1L1B polymorphisms were in strong 
linkage disequilibrium with each other in Caucasians but not in African Americans. Haplotype 
analyses evaluating both polymorphisms simultaneously indicated that the IL1B variant is 

20 likely to be more important for EOP risk. Sib pair linkage analyses, by contrast, provided only 
marginal support for a gene of very major effect on EOP risk attributable to these IL1 
polymorphisms. Diehl et al. (1999) interpreted their results as indicating that EOP is a 
complex, oligogenic disorder, with interleukin-1 genetic variation contributing an important 
but not exclusive influence on disease risk. 

25 NOV69 is predicted to be expressed in at least the following tissues: spleen, lymph 

node, thymus, tonsil and leukocyte tissues. This information was derived by determining the 
tissue sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
expression data for NOV69 is provided in Example 2. 

30 The NOV69 nucleic acids and proteins of are useful in potential therapeutic 

applications implicated in various pathological disorders described further herein, for example, 
the compositions of the present invention will have efficacy for the treatment of patients 
suffering from: inflammatory and immune system-related diseases such as rheumatoid arthritis 
and inflammatory bowel disease, periodontitis, hypothyroidism, congenital, due to thyroid 
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dysgenesis or hypoplasia; osteoarthritis of distal interphalangeal joints; selective T-cell defect; 
nephronophthisis, juvenile; purpura fulminans, neonatal; susceptibility to infections such as 
viral and bacterial; thrombophilia due to protein C deficiency; as well as other diseases, 
disorders and conditions. 
5 The NOV69 nucleic acid encoding the ILl-like protein of the invention, or fragments 

thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. The novel nucleic acid of the invention 
encoding a Interleukin 1 epsilon-like protein includes the nucleic acid whose sequence is 
provided in Table 69A or 69C, or a fragment thereof. The invention also includes a mutant or 

10 variant nucleic acid any of whose bases may be changed from the corresponding base shown 
in Table 69A or 69C while still encoding a protein that maintains its Interleukin 1 epsilon-like 
activities and physiological functions, or a fragment of such a nucleic acid. The invention 
further includes nucleic acids whose sequences are complementary to the sequence of Table 
69 A or 69C, including nucleic acid fragments that are complementary to any of the nucleic 

1 5 acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 

20 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 2% of the bases may 
be so changed. 

These materials are further useful in the generation of antibodies that bind 
25 immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV70 

30 NOV70 includes two OS9-like proteins, designated herein as NOV70a and NOV70b. 

NOV70a 
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The disclosed NOV70a (alternatively referred to herein as CG56878-01) includes the 
2739 nucleotide sequence (SEQ ID NO:241) shown in Table 70A. A NOV70a ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 86-88 and ends with a stop codon 
at nucleotides 2090-2092. The disclosed NOV70a maps to human chromosome 12ql3. 



Table 70A. NOV70a Nucleotide Sequence (SEQ ID 
NO:241) 

TTGCACTCTCCCACACCCTTTTCTTTTCGTCCGCTCTTCGCTTATTTCTCCCGCCGTCTC 
CTCTGCATAAGAAGGGGAACGAAAGATGGCGGCGGAAACGCTGCTGTCCAGTTTGTTAGG 
ACTGCTGCTTCTGGGACTCCTGTTACCCGCAAGTCTGACCGGCGGTGTCGGGAGCCTGAA 
CCTGGAGGAGCTGAGTGAGATGCGTTATGGGATCGAGATCCTGCCGTTGCCTGTCATGGG 
AGGGCAGAGCCAATCTTCGGACGTGGTGATTGTCTCCTCTAAGTACAAACAGCGCTATGA 
GTGTCGCCTGCCAGCTGGAGCTATTCACTTCCAGCGTGAAAGGGAGGAGGAAACACCTGC 
TTACCAAGGGCCTGGGATCCCTGAGTTGTTGAGCCCAATGAGAGATGCTCCCTGCTTGCT 
GAAG AC AAAGGAC TGGTGGAC ATATGAATT C TGTTATGGAC GC C ACAT CC AGC AATAC C A 
CATGGAAGATTCAGAGATCAAAGGTGAAGTCCTCTATCTCGGCTACTACCAATCAGCCTT 
CGACTGGGATGATGAAACAGC C AAGGCCT C C AAG C AGCAT CGT CTTAAACGCT AC C AC AG 
CCAGACCTATGGCAATGGGTCCAAGTGCGACCTTAATGGGAGGCCCCGGGAGGCCGAGGT 
TCGGTTCCTCTGTGACGAGGGTGCAGGTATCTCTGGGGACTACATCGATCGCGTGGACGA 
GCCCTTGTCCTGCTCTTATGTGCTGACCATTCGCACTCCTCGGCTCTGCCCCCACCCTCT 
CCTCCGGCCCCCACCCAGTGCTGCACCGCAGGCCATCCTCTGTCACCCTTCCCTACAGCC 
TGAGGAGTACATGGCCTACGTTCAGAGGCAAGCCGTAGACTCAAAGCAGTATGGAGATAA 
AAT C AT AGAGGAGC TGCAAGAT C T AGGC C C C CAAGTGTGG AGTGAGAC CAAGTCTGGGGT 
GGC ACCC CAAAAGATGG CAGGTGCGAGC C CGAC CAAGGATGACAGT AAGG ACTCAGATTT 
CTGGAAGATG CTT AATG AG C C AGAGGAC C AGG C C CCAGG AGGGG AGGAGGTG CCGG CTGA 
GGAGCAGGAC C CAAG C C CTGAGGCAGCAGAT T C AGCTT CTGGTG CT C C C AATGAT TTT CA 
GAACAACGTGCAGGTCAAAGTCATTCGAAGCCCTGCGGATTTGATTCGATTCATAGAGGA 
GCTGAAAGGTGGAACAAAAAAGGGGAAGCCAAATATAGGCCAAGAGCAGCCTGTGGATGA 
TG C TG CAGAAGT CC CT CAGAGGG AAC CAG AGAAGGAAAGGGGTGAT C C AG AACGGC AG AG 
AGAGATGGAAGAAGAGGAGGATGAGGATGAGGATGAGGATGAAGATGAGGATGAACGGCA 
GTTACTGGGAGAATTTGAGAAGGAACTGGAAGGGATCCTGCTTCCGTCAGACCGAGACCG 
GCTCCGTTCGGAGGTGAAGGCTGGCATGGAGCGGGAACTGGAGAACATCATCCAGGAGAC 
AG AG AAAGAGCTGGAC C CAGATGGGC TG AAGAAGGAGT CAGAGCGGG AT CGGGCAATG C T 
GG CTCTC AC AT CCACT CTCAACAAAC T CATC AAAAGAC TGGAGGAAAAACAGAGT CC AGA 
GCTGGTGAAGAAGCACAAGAAAAAGAGGGTTGTCCCCAAAAAGCCTCCCCCATCACCCCA 
ACCTACAGAGGAGGATCCTGAGCACAGAGTCCGGGTCCGGGTCACCAAGCTCCGTCTCGG 
AGGCC CT AAT CAGGAT CTGACTGT C C T CGAGATGAAACGGGAAAAC C C AC AGCTGAAAC A 
AATCGAGGGGCTGGTGAAGGAGCTGCTGGAGAGGGAGGGACTCACAGCTGCAGGGAAAAT 
TGAGATCAAAATTGTCCGCCCATGGGCTGAAGGGACTGAAGAGGGTGCACGTTGGCTGAC 
TGATGAGGACACGAGAAACCTCAAGGAGATCTTCTTCAATATCTTGGTGCCGGGAGCTGA 
AGAGGCCCAGAAGGAACGCCAGCGGCAGAAAGAGCTGGAGAGCAATTACCGCCGGGTGTG 
GGGCTCTCCAGGTGGGGAGGGCACAGGGGACCTGGACGAATTTGACTTCTGAGACCAACA 
CTACACTTGACCCTTCACGGAATCCAGACTCTTCCTGGACTGGCTTGCCTCCTCCCCACC 
TCCCCACCCTGGAACCCCTGAGGGCCAAACAGCAGAGTGGAGCTGAGCTGTGGACCTCTC 
GGGCAACTCTGTGGGTGTGGGGGCCCTGGGTGAATGCTGCTGCCCCTGCTGGCAGCCACC 
TTGAGACCTCACCGGGCCTGTGATATTTGCTCTCCTGAACTCTCACTCAATCCTCTTCCT 
CTCCTCTGTGGCTTTTCCTGTTATTGTCCCCTAATGATAGGATATTCCCTGCTGCCTACC 
TGG AGATTC AGT AGGAT C T TTTGAGTGGAGGTGGGT AGAG AGAG C AAGGAGGGCAGGACA 
CTTAGCAGGCACTGAGCAAGCAGGCCCCCACCTGCCCTTAGTGATGTTTGGAGTCGTTTT 
ACCCTCTTCTATTGAATTGCCTTGGGATTTCCTTCTCCCTTTCCCTGCCCACCCTGTCCC 
CTACAATTTGTGCTTCTGAGTTGAGGAGCCTTCACCTCTGTTGCTGAGGAAATGGTAGAA 
TGCTGCCTATCACCTCCAGCACAATCCCAGCGAAAAAGGTGTGAAGCACCCACCATGTTC 
TTGAAC AAT CAGGTTT CTAAATAAAC AACTGGACCAT CA 



A NOV70a polypeptide (SEQ ID NO:242) encoded by SEQ ID NO:241 is 668 amino 
acids in length and is presented using the one-letter amino acid code in Table 70B. The Psort 
profile for NOV70a predicts that this sequence has a signal peptide and is likely to be 
localized outside the cell with a certainty of 0.8200. In alternative embodiments, a NOV70a 
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polypeptide is located in the nucleus with a certainty of 0.1260. The Signal P predicts a likely 
cleavage site for a NOV70a peptide is between positions 25 and 26, z.e., at the dash in the 
sequence SLT-GG. 



Table 70B, NOV70a Polypeptide Sequence (SEQ ID 
NO:242) 



MAAETLLSSLLGLLLLGLLLPASLTGGVGSLNLEELSEMRYGIEILPLPVMGGQSQSSDV 
VIVSSKYKQRYECRLPAGAIHFQREREEETPAYQGPGI PELLS PMRDAPCLLKTKDWWTY 
EFCYGRHIQQYHMEDSEIKGEVLYLGYYQSAFDWDDETAKASKQHRLKRYHSQTYGNGSK 
CDLNGRPREAEVRFLCDEGAGISGDYIDRVDEPLSCSYVLTIRTPRLCPHPLLRPPPSAA 
PQAILCHPSLQPEEYMAYVQRQAVDSKQYGDKIIEELQDLGPQVWSETKSGVAPQKMAGA 
S P T KDD S KDS D FWKMLNE PEDQ AP GGE E VP AE E QD P S PE AAD S AS G APND FQNNVQ VK V I 
RSPADLIRFIEELKGGTKKGKPNIGQEQPVDDAAEVPQREPEKERGDPERQREMEEEEDE 
DEDEDEDEDERQLLGEFEKELEGILLPSDRDRLRSEVKAGMERELENI IQETEKELDPDG 
LKKESERDRAMLALTSTLNKLIKRLEEKQSPELVKKHKKKRWPKKPPPSPQPTEEDPEH 
RVRVRVTKLRLGGPNQDLTVLEMKRENPQLKQIEGLVKELLEREGLTAAGKIEIKIVRPW 
AEGTEEGARWLTDEDTRNLKEIFFNILVPGAEEAQKERQRQKELESNYRRVWGSPGGEGT 
GDLDEFDF 

5 

NOVlOb 

The disclosed NOV70b (alternatively referred to herein as CG56868-04) includes the 
2702 nucleotide sequence (SEQ ID NO: ) shown in Table 70C. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 86-88 and ends with a TGA codon at 
10 nucleotides 2036-2038. The disclosed NOV70b maps to human chromosome 12ql3. 



Table 70C. NOV70b Nucleotide Sequence (SEQ ID 
NO:243) 

TTGCACTCTCCCACACCCTTTTCTTTTCGTCCGCTCTTCGCTTATTTCTCCCGCCGTCTC 
CTCTGCATAAGAAGGGGAACGAAAGATGGCGGCGGAAACGCTGCTGTCCAGTTTGTTAGG 
ACTGCTGCTTCTGGGACTCCTGTTACCCGCAAGTCTGACCGGCGGTGTCGGGAGCCTGAA 
CCTGGAGGAGCTGAGTGAGATGCGTTATGGGATCGAGATCCTGCCGTTGCCTGTCATGGG 
AGGGCAGAGCCAATCTTCGGACGTGGTGATTGTCTCCTCTAAGTACAAACAGCGCTATGA 
GTGTCGCCTGCCAGCTGGAGCTATTCACTTCCAGCGTGAAAGGGAGGAGGAAACACCTGC 
TTACCAAGGGCCTGGGATCCCTGAGTTGTTGAGCCCAATGAGAGATGCTCCCTGCTTGCT 
GAAGACAAAGGACTGGTGGACATATGAATTCTGTTATGGACGCCACATCCAGCAATACCA 
CATGGAAGATTCAGAGATCAAAGGTGAAGTCCTCTATCTCGGCTACTACCAATCAGCCTT 
CGACTGGGATGATGAAACAGCCAAGGCCTCCAAGCAGCATCGTCTTAAACGCTACCACAG 
CCAGACCTATGGCAATGGGTCCAAGTGCGACCTTAATGGGAGGCCCCGGGAGGCCGAGGT 
TCGGTTCCTCTGTGACGAGGGTGCAGGTATCTCTGGGGACTACATCGATCGCGTGGACGA 
GCCCTTGTCCTGCTCTTATGTGCTGACCATTCGCACTCCTCGGCTCTGCCCCCACCCTCT 
CCTCCGGCCCCCACCCAGTGCTGCACCACAGGCCATCCTCTGTCACCCTTCCCTACAGCC 
TGAGGAGTACATGGCCTACGTTCAGAGGCAAGCCGACTCAAAGCAGTATGGAGATAAAAT 
CATAGAGGAGCTGCAAGATCTAGGCCCCCAAGTGTGGAGTGAGACCAAGTCTGGGGTGGC 
ACCCCAAAAGATGGCAGGTGCGAGCCCGACCAAGGATGACAGTAAGGACTCAGATTTCTG 
GAAGATGCTTAATGAGCCAGAGGACCAGGCCCCAGGAGGGGAGGAGGTGCCGGCTGAGGA 
GCAGGACCCAAGCCCTGAGGCAGCAGATTCAGCTTCTGGTGCTCCCAATGATTTTCAGAA 
CAACGTGCAGGTCAAAGTCATTCGAAGCCCTGCGGATTTGATTCGATTCATAGAGGAGCT 
GAAAGGTGGAACAAAAAAGGGGAAGCCAAATATAGGCCAAGAGCAGCCTGTGGATGATGC 
TGC AG AAGT C C CT C AG AGGGAAC C AGAG AAGG AAAGGGGTG AT C C AG AACGGC AGAGAG A 
GATGGAAGAAGAGGAGGATGAGGATGAGGATAAGATGAGGATGAACGGCAGTTACTGGGG 
AGAATTTGAGAAGGAACTGGAAGGGATCCTGCTTCCGTCAGACCGAGACCGGCTCCGTTC 
GGAGACAGAGAAAGAGCTGGACCCAGATGGGCTGAAGAAGGAGTCAGAGCGGGATCGGGC 
AATGCTGGCTCTCACATCCACTCTCAACAAACTCATCAAAAGACTGGAGGAAAAACAGAG 
TCCAGAGCTGGTGAAGAAGCACAAGAAAAAGAGGGTTGTCCCCAAAAAGCCTCCCCCATC 
ACCCCAACCTACAGAGGAGGATCCTGAGCACAGAGTCCGGGTCCGGGTCACCAAGCTCCG 
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TCTCGGAGGCCCTAATCAGGATCTGACTGTCCTCGAGATGAAACGGGAAAACCCACAGCT 
GAAACAAATCGAGGGGCTGGTGAAGGAGCTGCTGGAGAGGGAGGGACTCACAGCTGCAGG 
GAAAATTGAGATCAAAATTGTCCGCCCATGGGCTGAAGGGACTGAAGAGGGTGCACGTTG 
GCTGACTGATGAGGACACGAGAAACCTCAAGGAGATCTTCTTCAATATCTTGGTGCCGGG 
AGCTGAAGAGGCCCAGAAGGAACGCCAGCGGCAGAAAGAGCTGGAGAGCAATTACCGCCG 
GGT GTGGGGCT CT CC AGGTGGGG AGGG C AC AGGGG AC C TGG AC GAATT TG ACT T CTGAG A 
CCAACACTACACTTGACCCTTCACGGAATCCAGACTCTTCCTGGACTGGCTTGCCTCCTC 
CCCACCTCCCCACCCTGGAACCCCTGAGGGCCAAACAGCAGAGTGGAGCTGAGCTGTGGA 
CCTCTCGGGCAACTCTGTGGGTGTGGGGGCCCTGGGTGAATGCTGCTGCCCCTGCTGGCA 
GCCACCTTGAGACCTCACCGGGCCTGTGATATTTGCTCTCCTGAACTCTCACTCAATCCT 
CTTCCTCTCCTCTGTGGCTTTTCCTGTTATTGTCCCCTAATGATAGGATATTCCCTGCTG 
CCTACCTGGAGATTCAGTAGGATCTTTTGAGTGGAGGTGGGTAGAGAGAGCAAGGAGGGC 
AGGACACTTAGCAGGCACTGAGCAAGCAGGCCCCCACCTGCCCTTAGTGATGTTTGGAGT 
CGTTTTACCCTCTTCTATGGAATTGCCTGTGGATTCCTTCTCCCTTCCCTGCCCACCGTG 
TCCTACAATTGTGCTCTGAGTGAGAGCCTTCCTCTCTGCTAGGAAGGTTATGTGCCTTAC 
TCCGCAATCGGAAAGTTAGCCACGTTCTAATCGTTATACAAGGCTAAAAAAAATAAATAT 
TTATACCCGTTTTTCCCTGATTTATTTTTAAATATTATATTATTTTTAATATAATTTGTG 
GG 



A NOV70b polypeptide (SEQ ID NO:244) encoded by SEQ ID NO:243 is 650 amino 
acids in length and is presented using the one-letter amino acid code in Table 70D. The Psort 
profile for NOV70b predicts that this sequence has a signal peptide and is likely to be 
5 localized outside the cell with a certainty of 0.8200. In alternative embodiments, a NOV70b 
polypeptide is located to lysosomes with a certainty of 0.1900, or to the nucleus with a 
certainty of 0.1260. The Signal P predicts a likely cleavage site for a NOV70b peptide is 
between positions 25 and 26, i.e., at the dash in the sequence SLT-GG. 



Table 70D. NOV70b Polypeptide Sequence (SEQ ID 
NO:244) 

MAAETLLSSLLGLLL.LGLLLPASLTGGVGSL.NLEELSEMRYGIEILPLPVMGGQSQSSDV 
VIVSSKYKQRYECRLPAGAIHFQREREEETPAYQGPGIPELLSPMRDAPCLLKTKDWWTY 
EFCYGRHIQQYHMEDSEIKGEVLiYLGYYQSAFDWDDETAKASKQHRLKRYHSQTYGNGSK 
CDLNGRPREAEVRFLCDEGAGISGDYIDRVDEPLSCSYVLTIRTPRLCPHPLLRPPPSAA 
PQAILCHPSLQPEEYMAYVQRQADSKQYGDKIIEELQDLGPQVWSETKSGVAPQKMAGAS 
PTKDDSKDSDFWKMLNEPEDQAPGGEEVPAEEQDPSPEAADSASGAPNDFQNNVQVKVIR 
SPADLIRFIEELKGGTKKGKPNIGQEQPVDDAAEVPQREPEKERGDPERQREMEEEEDED 
EDKMRMNGSYWGEFEKELEGILLPSDRDRLRSETEKELDPDGLKKESERDRAMLALTSTL 
NKLIKRLEEKQSPELVKKHKKKRWPKKPPPSPQPTEEDPEHRVRVRVTKLRLGGPNQDL 
TVLEMKRENPQLKQIEGLVKELLEREGLTAAGKIEIKIVRPWAEGTEEGARWLTDEDTRN 
LKEIFFNILVPGAEEAQKERQRQKELESNYRRVWGSPGGEGTGDLDEFDF 

10 



A BLAST analysis of NOV70 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV70 had high 
homology to other proteins as shown in Table 70E. 

15 

Table 70E. BLASTX results from PatP database for NOV70 

Smallest 
Sum 

High Probability 
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Sequences producing High-s 


scoring Segment Pairs: 


Score 


P (N) 




patp 


AAG76 089 Human colon 


cancer antigen protein 


423 


3 . 4e- 


52 


patp 


AAG41826 Arabidopsis 


thaliana protein fragment 


298 


2 . 6e- 


25 


patp 


AAG41827 Arabidopsis 


thaliana protein fragment 


279 


2 . 9e- 


23 


patp 


AAG41828 Arabidopsis 


thaliana protein fragment 


279 


2 . 9e- 


23 


patp 


AAU3 2 2 55 Novel human 


secreted protein #2746 


232 


3 .4e- 


18 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 2735 of 2739 bases (99%) identical to a gbiGENBANK- 
ID:HSU41635|acc:U41635.1 mRNA from Homo sapiens (Human OS-9 precurosor mRNA, 
5 complete cds). The full amino acid sequence of the protein of the invention was found to have 
667 of 668 amino acid residues (99%) identical to, and 667 of 668 amino acid residues (99%) 

v. 

similar to, the 667 amino acid residue ptnr:SWISSPROT-ACC:Q 13438 protein from Homo 
sapiens (Human) (PROTEIN OS-9 PRECURSOR). The sequence of this invention has 1 
amino acid inserion, compared to ptnr:SWISSPROT-ACC:Q13438 protein from Homo 
1 0 sapiens (Human) (PROTEIN OS-9 PRECURSOR). NOV70 also has homology to the other 
proteins shown in the BLASTP data in Table 70F. 



Table 70F. NOV70 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%> 


Positive 

(%) 


Expect 


gi | 5803109 | re 
f|NP 006803.1 
| (NM_006812) 


amplified in 
osteosarcoma [Homo 
sapiens] 


667 


667/668 
(99) 


667/668 
(99) 


0 . 0 


gi | 12653521 | g 
b | AAH00532 . 1 | 
AAH00532 
(BC000532) 


Similar to amplified in 1 
osteosarcoma 
[Homo sapiens] 


612 


512/513 
(99) 


512/513 
(99) 


0 . 0 


gi | 17986213 |g 
b|AAC39523 .2 | 
(U81031) 


OS 9 

[Homo sapiens] 


474 


474/475 
(99) 


474/475 
(99) 


0 . 0 


gi | 13905114 | g 
b| AAH06844 . 1 | 
AAH06844 
(BC006844) 


Unknown (protein for 
IMAGE : 3 5984 53 ) 
[Mus musculus] 


571 


435/630 
(69) 


495/630 
(78) 


0 . 0 


gi | 13676348 |g 
b | AAH06506 . 1 | 
AAH06506 
(BC006506) 


Similar to amplified in 
osteosarcoma 
[Homo sapiens] 


376 


373/374 
(99) 


374/374 
(99) 


e-161 



This BLASTP data is displayed graphically in the ClustalW in Table 70G. A multiple 
15 sequence alignment is given, with the NOV70a and b proteins being shown on lines 1 and 2 in 
a ClustalW analysis comparing the protein of the invention with the related protein sequences 
shown in Table 70F. 



Table 70G. ClustalW Alignment of NOV70 
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NOV70a 


(SEQ 


ID 


NO 


242) 


NOV70b 


(SEQ 


ID 


NO 


244) 


gi| 5803109] 


(SEQ 


ID 


NO 


649) 


gi | 12653521 | 


(SEQ 


ID 


NO 


650) 


gi | 17986213 | 


(SEQ 


ID 


NO 


651) 


gi | 13905114 | 


(SEQ 


ID 


NO 


652) 


gi j 13676348 | 


(SEQ 


ID 


NO 


653) 










10 



20 



30 



40 



50 



NOV70a 

NOV7 Ob 

gi | 5803109 | 

gij 12653521 | 

gi j 17986213 | 

gij 13905114 | 

gi|l3676348| 



MAAETLLSSLLGLLLLGLLLPASLTGGVGSLNLEELSEMRYGIEIL 
MAAETLLSSLLGLLLLGLLLPASLTGGVGSLNLEELSEMRYGIEIL 
MAAETLLSSLLGLLLLGLLLPASLTGGVGSLNLEELSEMRYGIEIL 
MAAETLLSS LLGLLLLGLLLPASLTGGVGS LNLEELSEMRYGI E I L 



NOV70a 


61 


NOV70b 


61 


gi 


5803109| 


61 


gi 


12653521 


61 


gi 


17986213 


1 


gi 


13905114 


15 


gi 


13676348 


1 


NOV70a 


121 


NOV70b 


121 


gi 


5803109 | 


121 


gi | 12653521 


121 


gi|l7986213 


1 


gij 13905114 


75 


gi|l3676348 


1 



NOV70a 


181 


NOV70b 


181 


gx| 5803109 | 


181 


gi | 12653521 | 


181 


gij 17986213 | 


1 


gij 13905114] 


135 


gi|l3 676348| 


1 


NOV7 0a 


241 


NOV7 Ob 


241 


gi | 5803109 | 


241 


gi j 12653521 | 


241 


gi j 17986213 j 


48 


gi j 13905114 j 


195 


gijl3676348| 


1 


NOV70a 


301 


NOV70b 


300 


gi|5803109| 


300 


gijl265352l| 


300 


gij 17986213 | 


107 


gij 13905114 | 


254 



I 



PLPVMGGQSQ 
PLPVMGGQSQ 
PLPVMGGQSQ 
PLPVMGGQSQ 



PLPVMGGQSQ^ 



70 



80 
• I . . 



90 



100 



110 




81 

si 



SSKYKQRYECRLPAGAIHFQREREEETPAYQGPGI PELLS PMRDAPCLLKTKDWWTY 
SSKYKQRYECRLPAGAIHFQREREEETPAYQGPGI PELLS PMRDAPCLLKTKDWWTY 
*S SKYKQRYECRLPAGAIHFQRE RE EETPAYQGPGI PELLS PMRDAPCLLKTKDWWTY 
SSKYKQRYECRLPAGAIHFQREREEETPAYQGPG I PELLS PMRDAPCLLKTKDWWTY 



\7gVSSKYKQRYECRLPAGAIHFQREREEETPAYQGPGI PELLS PMRDAPCLLKTKDWWTY 



120 
120 
120 
120 
1 

74 

1 



130 
. . I . . 



140 



150 



160 
. . I 



180 



EFCYGRHIQQYHMEDSEIKGj|VLYLGgYQSgFgWDDETAKASKQHRLKRYHSQTYGNGSK 
EFCYGRHIQQYHMEDSEIKGgVLYLGyYQSgFigWDDETAKASKQHRLKRYHSQTYGNGSK 
E FC YGRHI QQYHMEDS E I KG|VL YLgS YQS |f|wDDETAKAS KQHRLKRYHS QT YGNGS K 
E FC YGRHI QQYHMEDS E I KGjfflVL YLGSYQS SfIwDDETAKAS KQHRLKRYHS QT YGNGS " 



EFCYGRHIQQYHMEDSEIKGffiVLYLGgjYQSaF^WDDETAKASKQHRLKRYHSQTYGNGSK 



180 
180 
180 
180 
1 

134 
1 



190 



200 



210 



240 




250 



260 



270 



280 



290 



300 




I 



310 
I ♦ 



320 



330 



340 



350 



360 



SPTKDDSKDSDF 
SPTKDDSKDSDF 
SPTKDDSKDSDF 
SPTKDDSKDSDF 
SPTKDDSKDSDF 
SP3K|I|flKl|s[ifl 
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ji }j is w if -si i: h is .1! -ir 



gi|l3676348| 9 



7KDDSKDSDFWKMLNEPEDQAPGGEEVPAEEQDPSI 



AAD S AS GAPND FQNNVQ VK\ 



67 



NOV70a 

NOV70b 

gi | 5803109 | 

gi | 12653521 | 

gijl7986213 j 

gijl3905114 j 

gijl3676348 j 



NOV7 0a 


417 


NOV7UD 


416 


gi|5803109| 


416 


gijl265352l| 


416 


gij 17986213 j 


223 


gi j 13905114 j 


371 


gi j 13676348 j 


125 


NOV70a 


473 


NOV / OJD 


455 


gi|5803109| 


472 


gi|l265352l| 


472 


gij 17986213 j 


279 


gij 13905114 | 


431 


gi|l3676348| 


181 


NOV7 0a 


533 


NOV7 0b 


515 


gi | 5803109 | 


532 


gijl265352l| 


532 


gi j 17986213 j 


339 


gij 13905114 j 


491 


gi|l3676348| 


241 


NOV7 0a 


593 


NOV70b 


575 


gi | 5803109 | 


592 


gi|l265352l| 


537 


gi j 17986213 | 


399 


gij 13905114 j 


496 


gij 13676348 j 


301 


NOV70a 


653 


NOV70b 


635 


gi| 5803109 | 


652 


gi jl265352l| 


597 


gi j 17986213 | 


459 


gi j 13905114 j 


556 


gi j 13676348 j 


361 



I 



370 
. . I . . 



I 



380 
. . I . . 



I 



390 
. . I . . 



400 



420 



I 



360 
359 
359 
359 
166 
312 
68 



IRSPADLIRFIEELKGGTKKGKPNIGQEQPVDDAAEVPQREPE 
IRSPADLIRFIEELKGGTKKGKPNIGQEQPVDDAAEVPQREPE 
IRSPADLIRFIEELKGGTKKGKPNIGQEQPVDDAAEVPQREPE 
IRSPADLIRFIEELKGGTKKGKPNIGQEQPVDDAAEVPQREPE 
IRSPADLIRFIEELiK GGTK KGKP NIGQ EQPVDDAAEVPQREPE 
IRSPADLIRpiEELK^^KGKP fa^ia.a EOPBDDTOE^PORE^E 
IRSPADLIRFIEELKGGTKKGKPNIGQEQPVDDAAEVPQREPE 



I. 



430 
. . I . . 



440 



450 
. . I . . 



460 
. . I . . 




I 



ERQREMEE 
ERQREMEE 
ERQREMEE 
ERQREMEE 
E RQRE MEE 

E SES MEE 

ERQREMEE 



416 
415 
415 
415 
222 
370 
124 



480 



DGDD 



EDEDED EDEDEDERQLL GEFEKELEGILLPSDRDRLR SEVKAGMERELENIIQ ET 

E DE DED[|JJ2J2^^^Q2GE FE KE LEG I L L P S DRDRLR^^BBBBB^I^I^E E T 

EDEDEDEDEDEDERQLLGEFEKELEGILLPSDRDRLRSEVKAGMERELENIIQET 
EDEDEDEDEDEDERQLLGEFEKELEGILLPSDRDRLRSEVKAGMERELENIIQET 
EDEDEDEDEDEDERQLLGEFEKELEGILLPSDRDRLRSEVKAGMERELENIIQET 
E|E|E|E|EDE^EgQLLGEFEKELEGS(LLPSDR||RLRSEVKAGMERELENIIQET 
EDEDEDEDEDEDERQLLGEFEKELEGILLPSDRDRLRSEVKAGMERELENIIQET 



472 
454 
471 
471 
278 
430 
180 



I 



490 



500 



510 



I 



520 
■ ■I.. 



530 
. . [ . . 



540 



I 



EKELDPDGLKKESERDRAMLALTSTLNKLI 
EKELDPDGLKKESERDRAMLALTSTLNKLI 
EKELDPDGLKKESERDRAMLALTSTLNKLI 
EKELDPDGLKKESERDRAMLALTSTLNKLI 
EKELDPDGLKKESERDRAMLALTSTLNKLI 
EKELDPiGLjKESERl^ABLALTSTLiSKLI 
EKELDPDGLKKESERDRAMLALTSTLNKLI 



KRLEEKQSPELVKKHKKKRWPKKPPPSPQ 

KRLEEKQSPELVKKHKKKRWPKKPPPSPQ 

KRLEEKQSPELVKKHKKKRWPKKPPPSPQ 

KRLEEKQSPELVKKHKKKRWPKKPPPSPQ 

KRLEEKQSPELVKKHKKKRWPKKPPPSPQ 

KRLgEjJjQSPELV§KgKK^RWPgKPPPSPl 

KRLEEKQSPELVKKHKKKRWPKKPPPSPQ 



532 
514 
531 
531 
338 
490 
240 



I 



550 
. . I . . 



560 
. . I . . 



I 



570 
. . I . . 



580 



590 



,600 



I 



I 



I 



PTEEDPEHRVRVRVTKLRLGGPNQDLTVLEMKRENPQLKQIEGLVKELLEREGLTAAGKI 
PTEEDPEHRVRVRVTKLRLGGPNQDLTVLEMKRENPQLKQIEGLVKELLEREGLTAAGKI 
PT EEDPEHRVRVRVTKLRLGGPNQDLTVLEMKRENPQLKQIEGLVKELLEREGLTAA GK I 

PT EEDPEHRVRVRVTKLRLGGPNQDLTVLEMKRENPQLKQIEGLVKELLEREGLTAA GK I 

PTj^^^BBm^^^^^m ^^^B i, >'- 'H^M^ i/ 1 

PTEEDPEHRVRVRVTKLRLGGPNQDLTVLEMKRENPQLKQIEGLVKELLEREGLTAAGKI 



I 



610 
■ ■ I ■ ■ 



I 



620 
■ ■ I 



630 



I 



I 



640 
■ ■ I • • 



650 



660 



EIKIVRPWAEGTEEGARWLTDEDTRNL 
EIKIVRPWAEGTEEGARWLTDEDTRNL 
E I K I VRPWAEGTE E GAR WLTDE DTRNL 
EIKIVRPWAEGTEEGARWLTDEDTRNL 
E I K I VRPWAEGTE E GAR WLTDE DTRNL 
EIKIVRPgAEGgEEggjRWLTDEDTRNL 
EIKIVRPWAEGTEEGARWLTDEDTRNL 



KEIFFNILVPGAEEAQKERQRQKELESNYRRVW 
KEIFFNILVPGAEEAQKERQRQKELESNYRRVW 
KEIFFNILVPGAEEAQKERQRQKELESNYRRVW 
KEIFFNILVPGAEEAQKERQRQKELESNYRRVW 
KEIFFNILVPGAEEAQKERQRQKELESNYRRVW 
KEIFFNILVgGAEEAfJjKERQRQgELESNYRRVW 
KEI FFNILVPGAEEAQKERQRQKELESNYRRVW 



652 
634 
651 
596 
458 
555 
360 



670 



I 



I 



GS PGGEGTGDLDE FDF 
GSPGGEGTGDLDEFDF 
GS PGGEGTGDLDE FDF 
GSPGGEGTGDLDEFDF 
GSPGGEGTGDLDEFDF 
GS PGGEGTGDLDE FDF 
GSPGGEGTGDLDEFDF 



668 
650 
667 
612 
474 
571 
376 



Amplification and overexpression of genes involved in cellular growth control occur 
frequently in human tumors. Using a chromosome microdissection-based hybrid-selection 
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strategy, the OS-9 gene has been identified within 12ql3-15, a region frequently amplified in 
human cancers. The full-length OS-9 cDNA sequence consists of 2785 bp from which an open 
reading frame (ORF) with 667 amino-acid residues has been deduced. The predicted 
polypeptide is water soluble and acidic. The OS-9 gene encodes a 2.8-kb mRNA transcribed in 
5 all 16 human tissues examined, suggesting that OS-9 is ubiquitously expressed in human 

tissues. OS-9 is co-amplified with CDK4 in three of five sarcoma tissues. Homology analysis 
of the amino-acid sequence has revealed significant similarities between OS-9 and two ORFs 
deduced from genomic sequences in Caenorhabditis elegans and Saccharomyces cerevisiae. 
The region of similarity extends over 200 residues (approximately one-third of each ORF), 
10 and eight cysteines were conserved in all three ORFs. These observations suggest that this 

region comprises a functional domain present in a novel evolutionarily conserved gene family 
defined by OS-9. 

The OS-9 genomic DNA has been isolated and characterized from a human BAC 
library. Sequencing of the genomic DNA has shown that the gene spanned approximately 30.4 

15 kbp and had 15 exons. The 1,010 bp sequence of the 5' upstream region has also been 
determined. The potential binding-sequence motifs TATA and CCAAT for general 
transcription factors have been found in the 5' upstream region. Primer extension analysis has 
revealed two putative transcription start sites. 

Three isoforms of OS-9 cDNA have been isolated from a myeloid leukemia HL-60 

20 cDNA library and characterized. Isoform 1 consisted of 2,700 bp, from which a 667 amino 
acid sequence was deduced and found to be identical with that of OS-9 cDNA from 
osteosarcoma cells. Isoform 2 cDNA lacked a 165 nucleotide sequence in the coding region. 
Isoform 3 cDNA had an additional 45 bp deletion in the coding region. Isoforms 2 and 3 
encode 612 and 597 amino acid polypeptides, respectively. Comparison of their cDNA 

25 sequences with the genomic structure has indicated that three isoforms are splice variants. 

Reverse transcription-polymerase chain reaction analysis has shown predominant expression 
of isoform 2 mRNA in myeloid leukemia HL-60 cells, osteosarcoma OsA-CL cells and 
rhabdomyosarcoma Rh30 cells. Northern blotting has revealed similar levels of expression of 
OS-9 gene in various tumor cell lines of sarcoma cells, carcinoma cells and myeloid leukemia 

30 cells, but 3-4 times higher expression in OsA-CL cells and Rh30 cells containing a 

homogeneously staining region of 12ql3-15. OS-9 expression decreased in differentiation- 
induced HL-60 cells. The above data suggests a possible involvement of OS-9 in cell growth 
and tumour development. 



591 




The NOV70 disclosed in this invention is predicted to be expressed in at least the 
following tissues: adrenal gland, bone marrow, brain - amygdala, brain - cerebellum, brain - 
hippocampus, brain - substantia nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, 
fetal liver, fetal lung, heart, kidney, lymphoma - Raji, mammary gland, pancreas, pituitary 
5 gland, placenta, prostate, salivary gland, skeletal muscle, small intestine, spinal cord, spleen, 
stomach, testis, thyroid, trachea, uterus, amnion, aorta, ascending colon, bone, bronchus, 
cartilage, cervix, colon, cornea, coronary artery, dermis, duodenum, epidermis, epididymis, 
hair follicles, hypothalamus, islets of langerhans, kidney cortex, liver, lung, lymph node, 
lymphoid tissue, esophagus, ovary, parathyroid gland, peripheral blood, pineal gland, 

10 respiratory bronchiole, retina, skin, thymus, tonsils, umbilical vein, urinary bladder, vulva, 
whole organism. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV70 is provided in Example 2. 

15 The NOV70 nucleic acids and proteins of are useful in potential therapeutic 

applications implicated in various pathological disorders described further herein, for example, 
the compositions of the present invention may have efficacy for the treatment of patients 
suffering from leukemias, sarcomas and other types of cancer, as well as other diseases, 
disorders and conditions. OS-9 was co-amplified with CDK4 in three of five sarcoma tissues 

20 (Mol Carcinog 1 996 Apr; 1 5(4):270-5). Three isoforms of OS-9 cDNA were found in a 

myeloid leukemia HL-60 cDNA library and reverse transcription-polymerase chain reaction 
analysis has shown predominant expression of isoform 2 mRNA in myeloid leukemia HL-60 
cells, osteosarcoma OsA-CL cells and rhabdomyosarcoma Rh30 cells. Northern blotting has 
revealed similar levels of expression of OS-9 gene in various tumor cell lines of sarcoma cells, 

25 carcinoma cells and myeloid leukemia cells (J Biochem (Tokyo) 1998 May;123(5):876-82). 

The NOV70 nucleic acid encoding the OS-9-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. The novel nucleic acid of the invention 
encoding a OS-9-like protein includes the nucleic acid whose sequence is provided in 70A or 

30 70C, or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in 70A or 70C while still 
encoding a protein that maintains its OS-9-like activities and physiological functions, or a 
fragment of such a nucleic acid. 
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The invention further includes nucleic acids whose sequences are complementary to 
the sequence of 70A or 70C, including nucleic acid fragments that are complementary to any 
of the nucleic acids just described. The invention additionally includes nucleic acids or nucleic 
acid fragments, or complements thereto, whose structures include chemical modifications. 
5 Such modifications include, by way of non-limiting example, modified bases, and nucleic 

acids whose sugar phosphate backbones are modified or derivatized. These modifications are 
carried out at least in part to enhance the chemical stability of the modified nucleic acid, such 
that they may be used, for example, as antisense binding nucleic acids in therapeutic 
applications in a subject. In the mutant or variant nucleic acids, and their complements, up to 
10 about 1% of the bases may be so changed. 

The novel protein of the invention includes the OS-9-like protein whose sequence is 
provided in Table 70B or 70D. The invention also includes a mutant or variant protein any of 
whose residues may be changed from the corresponding residue shown in Table 70B or 70D 
while still encoding a protein that maintains its OS-9-like activities and physiological 
15 functions, or a functional fragment thereof. In the mutant or variant protein, up to about 1% of 
the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
20 prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV71 

The disclosed NOV71 (alternatively referred to herein as CG56906-01) includes the 
25 2081 nucleotide sequence (SEQ ID NO:245) shown in Table 71 A. A NOV71 ORF begins 

with a Kozak consensus ATG initiation codon at nucleotides 55-57 and ends with a stop codon 
at nucleotides 1978-1980. The disclosed NOV71 maps to human chromosome Xpl 1 . 



GCGGCCGCGGCCTCGGCCTCCTCCTCTGGGGCGGCGGCCGAGGACAGCAGCGCCATGGAG 
GAGCTCGCTACTGAGAAGGAGGCGGAGGAGAGCCACCGGCAAGACAGCGTGAGCCTGCTC 
ACCTTCATCCTGCTGCTCACGCTCACCATCCTCACCATCTGGCTCTTCAAGCACCGCCGG 
GTGCGCTTTCTGCACGAGACCGGGCTGGCCATGATCTATGGGCTCATCGTTGGGGTGATC 
CTGAGGTATGGTACCCCTGCTACCAGTGGCCGTGACAAATCACTCAGCTGCACTCAGGAA 
GACAGGGCCTTCAGTACCTTATTAGTGAATGTCAGCGGAAAGTTCTTCGAATACACTCTG 
AAAGGAGAAAT C AGT CC TGG C AAGAT CAACAGCGTAGAGCAGAATGAT ATGCT ACGGAAG 
GTAACATTCGATCCAGAAGTATTTTTCAACATTCTTCTGCCTCCAATTATTTTTCATGCT 
GGATACAGCTTAAAGAAGAGACACTTTTTCAGAAATCTTGGATCTATACTGGCCTATGCC 
TTCTTGGGGACTGCTGTTTCATGCTTCATTATTGGAAATCTCATGTATGGTGTGGTGAAG 



Table 71A. NOV71 Nucleotide Sequence (SEQ ID 
NO:245) 
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CTCATGAAGATTATGGGACAGCTCTCAGATAAATTTTACTACACAGATTGTCTCTTTTTT 
GG AG C AATC AT CT CTG C C ACTG AC CC AGTG ACTGTG C TGG CG AT ATTT AATGAATTGC AT 
G C AG ACGTGGAT C TTT ACG CACTT CTT TTTGG AG AG AGCGT CC T AAATGATGCTGTTGCC 
ATTGTACTGTCCTCGTCTATTGTTGCCTACCAGCCAGCGGGACTGAACACTCACGCCTTT 
GATGCTGCTGCCTTTTTTAAGTCAGTTGGCATTTTTCTAGGTATATTTAGTGGCTCTTTT 
ACCATGGGAGCTGTGACTGGTGTTGTGACTGCTCTAATATCCTTTTTACAGAATGCCAAC 
GTGACTAAGTTTACCAAACTGCACTGCTTCCCCCTGCTGGAGACGGCGCTGTTCTTCCTC 
ATGTCCTGGAGCACGTTTCTCTTGGCAGAAGCCTGCGGATTTACAGGTGTTGTAGCTGTC 
CTTTTCTGTGGAATCACACAAGCTCATTACACCTACAACAATCTGTCGGTGGAATCAAGA 
AGTCGAACCAAGCAGCTCTTTGAGGTGTTACATTTCCTGGCAGAGAACTTCATCTTCTCC 
TACATGGGCCTGGCACTGTTTACCTTCCAGAAGCACGTTTTCAGCCCCATTTTCATCATC 
GGAGCTTTTGTTGCCATCTTCCTGGGCAGAGCCGCGCACATCTACCCGCTCTCCTTCTTC 
CTCAACTTGGGCAGAAGGCATAAGATTGGCTGGAATTTTCAACACATGATGATGTTTTCA 
GGCCT C AGGGG AG C AATGGC ATTTGCGT TGGCC AT C CGTG AC ACGG C ATC C TATGCTCG C 
CAGATGATGTTCACGACCACCCTTCTCATTGTGTTCTTCACTGTCTGGATCATTGGAGGA 
GGCACGACACCCATGTTGTCATGGCTTAACATCAGGTTGGACGGCCCAGATTCTGCCAGA 
GGAAACCGGACAAAACAGGAGAGCGCATGGATATTCAGGCTGTGGTACAGCTTTGATCAC 
AATTACCTGAAGCCCATCCTCACACACAGTGGTCCCCCACTAACCACCACGCTCCCCGCC 
TGGTGTGGCTTACTAGCTCGATGTCTGACCAGTCCCCAGGTGTACGATAACCAAGAGCCA 
CTGAGAGAGGAAGACTCTGATTTCATCCTGACCGAAGGCGACCTGACATTGACCTACGGG 
GACAGCACAGTGACTGCAAATGGCTCCTCAAGTTCGCACACCGCCTCCACGAGTCTGGAG 
GGCAGCCGGAGAACGAAGAGCAGCTCGGAGGAAGTGCTGGAGCGAGACCTGGGAATGGGA 
GACCAGAAGGTTTCGAGCCGGGGCACCCGCCTAGTGTTTCCCCTGGAAGATAATGCTTGA 
CTTTCCCCCCAAGCCCTGGCGCGATGGGGTAGGCTCCCGATGGGGTGAGGACAGCTGCAA 
GCCCTAGTGTTGTTGGAGGTGGGGCAGTGACTAGATTGAAC 

A NOV71 polypeptide (SEQ ID NO:246) encoded by SEQ ID NO:245 is 641 amino 
acids in length and is presented using the one-letter amino acid code in Table 71B. The Psort 
profile for NOV71 predicts that this sequence is likely to be a Type III 6 membrane protein, 
5 has a signal peptide, and is likely to be localized to the plasma membrane with a certainty of 
0.8200. In alternative embodiments, a NOV71 polypeptide is located to Golgibodies with a 
certainty of O.4600, or to the endoplasmic reticulm (membrane) with a certainty of 0.6850. 
The Signal P predicts a likely cleavage site for a NOV71 peptide is between positions 37 and 
38, i.e., at the dash in the sequence IWL-FK. 

10 

Table 71B. NOV71 Polypeptide Sequence (SEQ ID 
NO:246) 

MEELATEKEAEESHRQDSVSLLTFILLLTLTILTIWLFKHRRVRFLHETGLAMIYGLIVG 
VILRYGTPATSGRDKSLSCTQEDRAFSTLLVNVSGKFFEYTLKGEISPGKINSVEQNDML 
RKVTFDPEVFFNILLPPI IFHAGYSLKKRHFFRNLGSILAYAFLGTAVSCFI IGNLMYGV 
VKLMKIMGQLSDKFYYTDCLFFGAI ISATDPVTVLAI FNELHADVDLYALLFGESVLNDA 
VAI VLSS S I VAYQPAGLNTHAFDAAAFFKS VGI FLGI FSGS FTMGAVTGWTALI S FLQN 
ANVTKFTKLHCFPLLETALFFLMSWSTFLLAEACGFTGVVAVLFCGITQAHYTYNNLSVE 
SRSRTKQLFEVLiHFLAENFIFSYMGLALFTFQKHVFSPIFIIGAFVAIFLGRAAHIYPLS 
F FLNLGRRHK I G WNFQHMMM F S GLRG AMAF AL AI RDTAS YARQMMFTTTLLI VFFTWI I 
GGGTTPMLSWLNIRLDGPDSARGNRTKQESAWIFRLWYSFDHNYLKPILTHSGPPLTTTL 
PAWCGLLARCLTSPQVYDNQEPLREEDSDFILTEGDLTLTYGDSTVTANGSSSSHTASTS 
LEGSRRTKSSSEEVLERDLGMGDQKVSSRGTRLVFPLEDNA 



A BLAST analysis of NOV71 was run against the proprietary PatP GENE SEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV71 had high 
homology to other proteins as shown in Table 71C. 

15 



594 




Table 71 C. BLASTX results from PatP database for NOV71 



Smallest 
Sum 











High 


Probability 


Sequences producing 


High- scoring Segment 


Pairs : 


Score 


P (N) 


patp 


AAB90555 Human 


secreted protein, 




2410 


5 . le-250 


patp 


AAB90637 Human 


secreted protein, 




2410 


5 . le-250 


patp 


AAU02 8 83 Human 


HsNHE-6 polypeptide - 


Homo sapiens, 


1693 


5 .2e-201 


patp 


AAB90 5 90 Human 


secreted protein, 




902 


2 . le-169 


patp 


AAB90591 Human 


secreted protein, 




902 


2.1e-169 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 747 of 1 121 bases (66%) identical to a gb:GENBANK- 
ID:AF030409|acc:AF030409.1 mRNA from Homo sapiens (sodium-hydrogen exchanger 6 
5 (NHE-6) mRNA, nuclear gene encoding mitochondrial protein). The full amino acid sequence 
of the protein of the invention was found to have 391 of 518 amino acid residues (75%^ 
identical to, and 443 of 518 amino acid residues (85%) similar to, the 669 amino acid residue 
ptnr:SWISSNEW-ACC:Q92581 protein from Homo sapiens (Human) 
(SODIUM/HYDROGEN EXCHANGER 6 (NA(+)/H(+) EXCHANGER 6) (NHE-6)). 
10 NOV71 also has homology to the other proteins shown in the BLASTP data in Table 7 ID. 



Table 71D. NOV71 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%> 


Positive 
(%) 


Expect 


gi | 14211919 | r 
ef |NP_115980 . 
1 | <NM_032591) 


solute carrier family 9 
( sodium/hydrogen 
exchanger) , isoform 7; 
nonselective sodium 
potassium/proton 
exchanger [Homo sapiens] 


725 


631/681 
(92) 


632/681 
(92) 


0 . 0 


gi | 3319946 | em 
b |CAA18155 . 1 | 
(AL022165) 


dJ71L16.5 (KIAA0267 LIKE 
putative Na(+)/H(+) 
exchanger) [Homo 
sapiens] 


616 


575/625 
(92) 


576/625 
(92) 


0.0 


gi | 1665827 | db 
j |BAA13449 . 1 | 
(D87743) 


Similar to Human Na+/H+ 
exchanger 2 (A57 644) 
[Homo sapiens] 


666 


451/649 
(69) 


513/649 
(78) 


0 . 0 


gi | 5454070 | re 
f |NP_006350 . 1 
| (NM_006359) 


solute carrier family 9 
{ sodium/ hydrogen 
exchanger) , isoform 6 
[Homo sapiens] 


669 


451/649 
(69) 


513/649 
(78) 


0.0 


gi | 17474970 | r 
ef |XP_062645 . 
1 | (XM_062645) 


similar to solute 
carrier family 9 

(sodium/hydrogen 
exchanger), isoform 7; 
nonselective sodium 
potassium/proton 
exchanger (H. sapiens) 

[ Homo sap iens] 


485 


292/412 
(70) 


305/412 
(73) 


e-148 



595 



Is i! f! W 



This BLASTP data is displayed graphically in the ClustalW in Table 7 IE. A multiple 
sequence alignment is given, with the NOV71 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 7 ID. 



Table 71E. ClustalW Alignment of NOV71 



NOV71 

gi | 14211919 | 
gi|3319946| 
gi | 1665827 | 
gij 5454070 j 
gi|17474970| 



NOV71 

gi|l4211919| 
gi | 3319946 | 
gi|l665827| 
gi | 5454070 j 
gi 1 17474970 | 



NOV71 

gi|l4211919| 
gij 3319946 | 
gi | 1665827 j 
gi|5454070| 
gi 1 17474970 | 



(SEQ ID NO:246) 

(SEQ ID NO: 654) 

(SEQ ID NO: 655) 

(SEQ ID NO: 656) 

(SEQ ID NO: 657) 

(SEQ ID NO: 658) 



8 

61 
1 

58 
61 
8 



NOV71 


68 


gi| 14211919 | 


121 


gi|3319946| 


12 


gi | 1665827 j 


118 


gi|5454070 j 


121 


gij 17474970 | 


68 


NOV71 


128 


gi| 14211919 | 


181 


gi|3319946| 


72 


gi | 1665827 j 


171 


gi|5454070 j 


174 


gij 17474970 | 


121 


NOV71 


188 


gi| 14211919 | 


241 


gi | 3319946 | 


132 


gi | 1665827 


206 


gi j 5454070 j 


209 


gij 17474970 | 


121 


NOV71 


248 



MEPGDAARPGSGRATGAPPPRLLLLPLLLGWGLRVAAAASASSSGAAAEDSS. 



20 



30 
.|.. 



40 
.|.. 



1 

1 

! 

1 RGWRRAPLRRGVGSSPRARRLMRPLWLLLAVGVFDWAGASDGGGGEARAME 

1 MARRGWRRAPLRRGVGSS PRARRLMRPLWLLLAVGVFDWAGASDGGGGEARAME 

! M 



KgAEESHRQDS 
K1AEESHRQDS 




80 



90 



100 



FILLLTLTILTI WLFKHRR 
FILLLTLTILTI WLFKHRRi 





180 



E I S PGK INS VraQNDML 



IKVTFDP 12 7 

SKVTFDP 180 

KVTFDP 71 

171 

174 

121 



190 



200 



210 



220 



evffglllppiifhagyslkkrhffrn 
evffSillppiifhagyslkkrhffrn 
evffSillppiifhagyslkkrhff: 




230 

I 



240 



SCF 




IG 


NL 


MYG 




V 


K 


LMK 


SCF 




IG 


jp 


MYG 




V 


K 


LMK 


SCF 




IG 


NL 


MYG 




V 


K 


LMK 


SCF 




IG 


SI 


MYG 






T 


LMK 


SCF 




IG 


SI 


MYG 


c 


1 




LMK 



|M 187 

|M 240 

IM 131 

uh" 205 

-- 121 



250 




FY 


§ 


TDCL 


FY 


I 


TDCL 


FY 


s 


TDCL 


FY 


1 


TDCL 


FY 




TDCL 




290 



300 



LYALLFGESVLNDAVAI VLSS 
LYALLFGESVLNDAVAIVLSS 
LYALLFGESVLNDAVAI VLSS 
s(L YALL FGE S VLND AVA I VL S S 
LYALLFGESVLNDAVAIVLSS 



247 
300 
191 
265 
268 
121 



310 

248 



320 

L 



330 



340 
. . I . . 



350 



360 
• - I 



FKSVGI FLGIFSGSFTMGAVTGWTAL 



ISFLQNANS 



307 



596 



ii n 



;J| si ii ;i " 



gi] 14211919 | 
gi | 3319946 | 
gijl665827 j 
gi I 5454070 | 
gi j 17474970 | 



NOV71 

gi | 14211919 | 
gi j 3319946 | 
gi 1 1665827 j 
gij 5454070 | 
gi j 17474970 | 



NOV71 

gi | 14211919 | 
gi|3319946 | 
gij 1665827 j 
gij 5454070 j 
gij 17474970 | 



NOV71 

gi | 14211919 | 
gi|3319946| 
gi|l665827 j 
gi j 5454070 j 
gij 17474970 | 



301 
192 
266 
269 
121 



NOV71 


308 


gi | 14211919 | 


352 


gi|3319946| 


243 


gi j 1665827 j 


318 


gi j 5454070 | 


321 


gij 17474970 | 


153 



428 
472 
363 
438 
441 
268 



NOV71 


488 


gi 


14211919 | 


532 


gi 


3319946 | 


423 


gi 


1665827 j 


498 


gi 


5454070 j 


501 


gi 


17474970 | 


328 



SIVAYQPAG 
SIVAYQPAG 
SIVAYQPAG 
SIVAYQPAG 




JFKSVGI FLGIFSGS FTMGAVTGBvgAjJJgfflggMVTKFT 
|FKSVGI FLGI FSGS FTMGAVTGMv"A i7^Wli|W vTKFT 

|fksRgi flgi fsgs fSmgaStgwtal^^HHHvtkft 

'fKsBgI FLGI FSGS fBmGABTGWTAL ^^^^B vTKFT 

Iks vg i flg i f s gHftmgavtgwtalBSSBHvt k ft 



351 
242 
317 
320 
152 



370 



380 



390 



400 
. . I . . 



410 
I I I 



420 



klgcfplletalfflmswstfllaeacgftgwavlfcgitqahytynnlsvesrsrtkq 
klBcfplletalfflmswstfllaeacgftgwavlfcgitqahytynnlsvesrsrtkq 
klBcfplletalfflmswstfllaeacgftgwavlfcgitqahytynnlsvesrsrtkq 

KL^FfflLLETffiLFFLMSWSTFLLAEAffiGFTGWAVLFCGITQAHYTYNNLS0ESffljRTKQ 



KL^ FgLLETgL FFLMS W S T FLL AE AMGFTG WA VL FCG I TQ AH YT YNNL SUE SS JRTKQ 
KL^FgLLETgLFFLMSWSTFLLAEA^GFTGVVAVLFCGITQAHYTYNNLsHESggRTKQ 



KLjgCFPLLETALFFLMSWSTFLLAEACGFTGWAVLFCGITQAHYT 



mi 



367 
411 
302 
377 
380 
212 



NOV71 


368 


LFE 


gi| 14211919 | 


412 


LFE 


gi j3319946 | 


303 


LFE 


gij 1665827 j 


378 


LFE 


gi j 5454070 j 


381 


LFE 


gij 17474970] 


213 


LFE 




490 



500 



510 



520 



530 



I 



540 
• - I 



rhkigwnfqhmmmfsglrgamafalairdtasyarqmmfyttllivfftvwi igggttpm 
rhkigwnfqhmmmfsglrgamafalairdtasyarqmmfmttllivfftvwiigggttpm 
rhkigwnfqhmmmfsglrgamafalairdtasyarqmmfgjttllivfftvwi igggttpm 
rSkigSnfqhmmmf^glrgamafalairdta^yarqmmfSttllivfftvw^Sgggtt^m 
r§kig^fqhmmmf|glrgamafalairdtagyarqmmf|ttl^ivfftvw^gggttgm 
rhkigwnfqhT" 



^MFSGLRGAMAFALAImDI 



DYARQMiiFlaTTI3aIVFFTiiWT IGGGTTPM 



487 
531 
422 
497 
500 
327 




560 
..|.. 



570 
..|.. 



580 
..|.. 



590 



I 



600 



|SIKEPSKEDHNEHHRQYFRV< 




Bpppnndsfqvlqg I 

^PPPNNDSFQVLQGg 

HLG' 

& HLG 

[|pppnndsfqvlqg[3s 



GN 


R 


TK 


507 
591 


GN 




TK 




TK 


482 


RT 




TK 


525 


RT 




TK 


528 


GN 


W 


TK 


387 



610 



NOV71 


508 




gi| 14211919 | 


592 


■ 


gi j 3319946 | 


483 


| 


gi j 1665827 j 


526 




gij 5454070 j 


529 


A 


gij 17474970 | 


388 






670 



680 



690 



710 



720 



568 
652 
543 
586 
589 
412 



SDFILTEGDLTLTYGDSTVTANG 
SDF I LTEGDLTLT YGDSTVTANG 
SDFILTEGDLTLTYGDSTV TANG 
SDffll L^GDlIjjjLTYGDST V 

SDgi l^gd §| ltygdstv 

SDFI LTEGDLTLTYGDSTVTANG 




730 



740 




641 

725 

616 

EPPLNLLDNTRHGPA 6 66 
EPPLNLLDNTRHGPA 6 69 
485 
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Table 71F lists the domain description from DOMAIN analysis results against NOV71. 
This indicates that the NOV71 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 71F. Domain Analysis of NOV71 



gnl | Pf am| pf am009 99 , NaHExchanger , Sodium/ hydrogen exchanger family. Na/H 

antiporters are key transporters in maintaining the pH of actively 

metobolizing cells. The molecular mechanisms of antiport are unclear. These 

antiporters contain 10-12 transmembrane regions (M) at the amino- terminus and 

a large cytoplasmic region at the carboxyl terminus . The transmembrane 

regions M3-M12 share identity with other members of the family. The M6 and M7 

regions are highly conserved. Thus, this is thought to be the region that is 

involved in the transport of sodium and hydrogen ions . The cytoplasmic region 

has little similarity throughout the family. SEQ ID NO: 8 65 

CD-Length = 400 residues, 88.5% aligned 
Score - 276 bits (706), Expect = 3e-75 



NOV71 : 
Sbjct : 
NOV71 : 
Sbjct : 
NOV71 : 
Sbjct: 
NOV7 1 : 
Sbjct : 
NOV71 : 
Sbjct: 
NOV71 : 
Sbjct : 
NOV71 : 
Sbjct : 



123 

46 

183 

106 

242 

158 

302 

208 

362 

265 

422 

325 

480 

385 



VTFDPEVFFNILLPPI IFHAGYSLKKRHFFRNLGS ILAYAFLGTAVSCFIIGNLMYGVVK 18 2 
V D EVFF ILLPPI+F AG L R FRNLGSIL A LG + IG LMY +V 

VDLDSEVFFEILLPPILFEAGLELDLRELFRNLGSILLLALLGVLIPALGIGGLMYALVP 10 5 

LMKIMGQLSDKFYYTDCLFFGAI ISATDPVTVLAI FNEL-HADVDLYALLFGESVLNDAV 241 
+ + + + L FGAI+SATDPV VLA+ EL L L+FGES+LND V 

ILGL DFLAALLFGAILSATDPVAVLAVLKELGRVPKRLGTLIFGESLLNDGV 15 7 

AI VLSS S I VAYQPAGLNTHAFDAAAFFKS VGI FLG I FSGS FTMGAVTGWTALI SFLQNA 3 01 

A+VL + ++++ G A +A F + FL +F G +G V G + +LI 

AWLLAVL I S FALGG AVE AFDI FLG I LS FLWFLGG I L I GLVLGYLL SLI 207 

NVTKFTKLHCFPLLETALFFLMSWSTFLLAEACGFTGVVAVLFCGITQAHYTYNNLSVES 3 61 

T+FT L+E L L+ + + +LLAE G +G++AV G+ ++Y N+S +S 

- -TRFTFRE-DRLIEPLLVLLLAYLAYLLAEILGLSGILAVFAAGLALSNYVEANISEKS 2 64 

RSRTKQLFEVLHFLAENFIFSYMGLALFTFQKHVFSPIFIIGAFVAIFLGRAAHIYPLSF 421 
R+ K + +VL FL E IF +GL+L H ++ 1+ A V I L RA + + L+ 

RTTEKYFWKVLSFLFEPLIFVLLGLSLDLSVLHNWNIALILLAIVLILLARAIGVFLLTL 324 

FLNLGRRHKIGWNFQHMMMFSGLRGAMAFALAIRDTASY- -ARQMMFTTTLLIVFFTVWX 4 7 9 

LN RR KI + Q ++ + GLRGA+A ALA+ + AR ++ TT + + +V TV + 

LLN F FRRE KIP FGDQL V I G WGGLRGAVALALAL SG PLT S G P ARDL I LT TAT I WLVTVLV 384 

IGGGTTPMLSWLNIR 4 94 

G P++ L ++ 

QGITLKPLVKKLRVK 3 9 9 



10 



Na+/H+ exchangers are integral membrane ion transporter proteins that exchange 
extracellular Na+ for intracellular H+ with a stoichiometry of one for one. They have multiple 
cellular functions, including maintenance of intracellular pH, cell volume control, and 
reabsorption of sodium across renal, intestinal, and other epithelia. Multiple Na+/H+ 
exchanger isoforms (NHE1-NHE6) exist, exhibiting considerable differences in their 
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membrane localization, biochemical and pharmacologic properties, and responsiveness to 
various stimuli. For example, NHE1, the most predominant isoform expressed in heart, 
contributes significantly to myocardial intracellular pH. Hyperactivation of NHE1 during 
episodes of cardiac ischemia and reperfusion has been shown to disrupt the intracellular ion 
5 balance, leading to cardiac dysfunction and damage in several animal models, which can be 
prevented by pharmacological antagonists of NHEH. Increased activity of sodium/hydrogen 
exchange also provides a potentially important mechanism for the development of 
hypertension. 

In 1998, Numata et al. identified the gene encoding a novel isoform of 

10 sodium/hydrogen exchanger that they called NHE6 or SLC9A6 (3). The NHE6 protein has 
similar topology to the other NHEs in that it has 12 putative membrane-spanning segments 
within the N-terminal region and a hydrophilic C terminus. However, NHE6 also has a 
putative mitochondrial inner membrane-targeting signal at its N terminus. The NOV71 protein 
is homologous to the NHE6 protein, except that it is predicted to localize to the plasma 

15 membrane instead of the mitochondrial inner membrane. The NHE6-like gene maps to human 
chromosome Xpl 1. Based on its expression pattern, NOV71 may play a role in renal or 
metabolic diseases and immune function through its sodium/hydrogen exchange activity at the 
plasma membrane. ) 

NOV71 is predicted to be expressed in at least the following tissues: kidney, tonsils, 

20 germinal B cells, uterus, pituitary gland, brain, skeletal muscle, heart, lung, liver, pancreas, 
small intestine, colon, kidney, spleen, thymus, peripheral blood leukocytes, testis, ovary, 
placenta and prostate. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 

25 NOV71 is provided in Example 2. 

The NOV71 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, cancer, 
trauma, bacterial and viral infections, in vitro and in vivo regeneration, endometriosis, fertility, 
hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, immunodeficiencies, 

30 graft versus host disease, diabetes, autoimmune disease, renal artery stenosis, interstitial 

nephritis, glomerulonephritis, polycystic kidney disease, systemic lupus erythematosus, renal 
tubular acidosis, IgA nephropathy, hypercalceimia, Lesch-Nyhan syndrome, endocrine 
dysfunctions, diabetes, obesity, growth and reproductive disorders, tonsillitis as well as other 
diseases, disorders and conditions. 
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The NOV71 nucleic acids encoding the sodium/hydrogen exchanger NHE6-like 
protein of the invention, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. The 
novel nucleic acid of the invention encoding a Sodium/Hydrogen Exchanger 6-like protein 
5 includes the nucleic acid whose sequence is provided in Table 71 A, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 71 A while still encoding a protein that maintains 
its Sodium/Hydrogen Exchanger 6-like activities and physiological functions, or a fragment of 
such a nucleic acid. The invention further includes nucleic acids whose sequences are 

10 complementary to the sequence of Table 71 A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

15 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 34% of the bases 
may be so changed. 

20 The novel protein of the invention includes the Sodium/Hydrogen Exchanger 6-like 

protein whose sequence is provided in Table 71 B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 71B while still encoding a protein that maintains its Sodium/Hydrogen Exchanger 6- 
like activities and physiological functions, or a functional fragment thereof. In the mutant or 

25 variant protein, up to about 25% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 

30 below. 

NOV72 

The disclosed NOV72 (alternatively referred to herein as CG56910-01) includes the 
1094 nucleotide sequence (SEQ ID NO:247) shown in Table 72 A. A NOV72 ORE begins 
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Kozak consensus ATG initiation codon at nucleotides 14-16 and ends with a stop codon at 
nucleotides 1082-1084. The disclosed NOV72 maps to human chromosome 6. 



Table 72A. NOV72 Nucleotide Sequence (SEQ ID 
NO:247) 

ACTTCTATAAGACATGGATAGATGCAAACATGTAGGGCGGTTAGCCCCAGGCGTCACGGG 
CCTGCGCAACCTGGGCAACACCTGCTACATGAACTCCATCCTCCAGGTGCTCAGCCACCT 
CCAGAAGTTCCGAGAATGTTTCCTCAACCTTGACCCTTCCAAAACGGAACATCTGAGTTC 
AAAGCACATTTCCCTCTGCCGTGAACTGCACACCCTCTTCCGAGTCATGTGGTCCGGGAA 
GTGGGCCCTAGTGTCGCCCTTCGCCATGCTCCACTCAGTGTGGAGCCTGATCCCTGCCTT 
CCGCGGCTACGACCAACAGGACGCGCAGGAATTTCTCTGCGAGCTGCTGCACAAGGTGCA 
GC AGG AACT CG AGTCT GAGGG C AC C AC ACGC C GG AT C CT C ATC CC CTT CT C C C AG AGG AA 
GCTCACCAAACAGGTCTTAAAGGTGGTGAATACCATATTTCATGGGCAGCTGCTCAGTCA 
GGTATGTGTGGTCACATGTATATCATGCAATTACAAATCCAATACCATTGAGCCCTTTTG 
GGACCTATCCCTGGAATTCCCTGAACGCTATCACTGCATAGAAAAGGGGTTTGTCCCTTT 
GAATCAAACAGAGTGCTTGCTCACTGAGATGCTGGCCAAATTCACAGAGACAGAGGCCCT 
GGAAGGGAGAATCTACGCTTGTGACCAGTGTAACAGTGAGTGCTGTGTGAAGCAGTTAAT 
GAT CT AC AGAC T AC CT C AGGTTC T CCGGCTG C AC C TT AAAAGATT CCAT CG AGAGAAGAT 
TGGGGTCCATGTCGTCTTTGACCAGGTATTAACCATGGAACCTTACTGCTGCAGGGACAT 
GCTCTCCTCTCTTGACAAAGAGACCTTTGCCTATGATCTCTCCGCAGTGGTCATGCATCA 
CGGGAAAGGGTTTGGCTCAGGACACTACACAGCCTATTGCTACAACACAGAGGGAGGTTT 
TTGGGTCCACTGCAATGACTCAAAGCTGAATGTATGCAGTGTCGAGGAAGTGTGCAAAAC 
CCAGGCCTACATCCTTTTTTACGCGCTGACTGAGATGGCGCTGAGTGAATGTGGAAGGTG 
CTAAGAC C CAGT C T 



A NOV72 polypeptide (SEQ ID NO:248) encoded by SEQ ID NO:247 is 356 amino 
acids in length and is presented using the one-letter amino acid code in Table 72B. The Psort 
profile for NOV72 predicts that this sequence has no signal peptide and is likely to be 
localized in the cytoplasm with a certainty of 0.5050. In alternative embodiments, a NOV72 
polypeptide is located to lysosomes with a certainty of 0.1000, to the mitochondrial matrix 
space with a certainty of 0.1000, or to paroxisomal microbodies with a certainty of 0.3547. 



Table 72B. NOV72 Polypeptide Sequence (SEQ ID 
NO:248) 

MDRCKHVGRLAPGVTGLRNLGNTCYMNSILQVLSHLQKFRECFLNLDPSKTEHLSSKHIS 
LCRELHTLFRVMWSGKWALVSPFAMLHSVWSLIPAFRGYDQQDAQEFLCELLHKVQQELE 
SEGTTRR I L I PFS QRKLTKQVLKWNT I FHGQLL S QVCWTC I S CNYKSNT I EPFWDLSL 
EFPERYHCIEKGFVPLNQTECLLTEMLAKFTETEALEGRIYACDQCNSECCVKQLMIYRL 
PQVLRLHLKRFHREKIGVHWFDQVLTMEPYCCRDMLSSLDKETFAYDLSAWMHHGKGF 
GSGHYTAYCYNTEGGFWVHCNDSKLNVCSVEEVCKTQAYILFYALTEMALSECGRC 



A BLAST analysis of NOV72 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV72 had high 
homology to other proteins as shown in Table 72C. 



Table 72C. BLASTX results from PatP database for NOV72 



Smallest 
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l! SI" 



• 











Sum 








High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P (N) 


patp 


AAB92670 


Human protein sequence 


1133 


2 . 0e-12l 


patp 


AAB4 22 5 9 


Human ORFX ORF2 023 polypeptide sequence 


497 


2 . 7e-47 


patp 


-AAY13115 


Human secreted protein encoded by 5 • EST 


492 


9 . le-47 


patp 


AAM79194 


Human protein 


487 


3 . le-46 


patp 


AAB74672 


Human protease and protease inhibitor 


464 


8 .4e-44 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 215 of 342 bases (62%) identical to a gb:GENBANK- 
ID:AF073344|acc:AF073344.1 mRNA from Homo sapiens (ubiquitin-specific protease 3 
5 (USP3) mRNA). The full amino acid sequence of the protein of the invention was found to 
have 124 of 345 amino acid residues (35%) identical to, and 183 of 345 amino acid residues 
(53%) similar to, the 353 amino acid residue ptnr:SWISSNEW-ACC:088623 protein from 
Mus musculus (Mouse) (UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 2 (EC 
3.1.2.15) (UBIQUITIN THIOLESTERASE 2) (UBIQUITIN-SPECIFIC PROCESSING 
10 PROTEASE 2) (DEUBIQUITINATING ENZYME 2) (41 KDA UBIQUITIN-SPECIFIC 

PROTEASE)). NOV72 also has homology to the other proteins shown in the BLASTP data in 
Table 72D. 



Table 72D. NOV72 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 

(%) 


Expect 


gi | 15208127 | d 
bj | BAB63088 . 1 
| (AB070143) 


hypothetical protein 
[Macaca f ascicularis] 


550 


282/317 
(88) 


287/317 
(89) 


e-157 


gi | 16174134 | r 
ef |XP_057397 . 
1 | (XM_057397) 


similar to Unknown 
(protein for MGC:2074l) 
(H. sapiens) 


640 


255/289 
(88) 


260/289 
(89) 


e-140 


gi | 14149817 | r 
ef |NP_115523 . 
1 | (NM 032147) 


hypothetical protein 
DKFZp434D0127 
[ Homo sapi en s ] 


712 


221/307 
(71) 


247/307 
(79) 


e-119 


gi | 15451368 |d 
bj |BAB64488.1 
| (AB071094) 


hypothetical protein 
[Macaca f ascicularis] 


585 


205/239 
(85) 


210/239 
(87) 


e-108 


gi | 16041104 |d 
bj |BAB69719. 1 
| (AB072750) 


hypothetical protein 
[Macaca f ascicularis] 


497 


111/213 
(52) 


123/213 
(57) 


2e-45 



15 This BLASTP data is displayed graphically in the ClustalW in Table 72E. A multiple 

sequence alignment is given, with the NOV72 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 72D. 
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Table 72E. ClustalW Alignment of NOV72 



NOV72 (SEQ ID NO: 24 8) 

gi | 15208127 | (SEQ ID NO:659> 

gi | 16174134 I (SEQ ID NO:660) 

gijl4149817| (SEQ ID NO:661) 

gi | 15451368 | {SEQ ID NO:662) 

gi j 16041104 j (SEQ ID NO:663) 



NOV72 

gi | 15208127 | 
gi | 16174134 | 
gi | 14149817 | 
gi | 15451368 j 
gi j 16041104 j 



NOV7 2 

gi | 15208127 | 
gi | 16174134 | 
gi j 14149817 j 
gi j 15451368 | 
gi j 16041104 j 



NOV7 2 

gi | 15208127 | 
gi | 16174134 j 
gi | 14149817 j 
gi | 15451368 j 
gi j 16041104 | 



NOV72 

gi | 15208127 | 
gi j 16174134 j 
gi j 14149817 j 
gi j 15451368 j 
gi | 16041104 j 



NOV7 2 

gi | 15208127 | 
gi j 16174134 j 
gi j 14149817 j 
gi | 15451368 | 
gi j 16041104 j 



NOV72 

gi | 15208127 | 
gi j 16174134 j 
gi j 14149817 j 
gi | 15451368 | 
gi j 16041104 j 



NOV7 2 



10 



20 
•I 



30 



50 
•I 



ML£ 
ML! 



MDgCKHVGgLgLAQDHSnLNPQKW 

mdSckhvgIlIlaqdhsHlnpqkw 
^idBckhvgBlBlaqdhsHlnpqkw 



MD0CKHVGS 



60 



|LAQDHS@LNPQKW 



70 




TTES 


1 


WACL 


TTES 


u 


WACL 


TTES 




WACL 


TTES 


e 


WACL 



80 



90 





GDLKLLR 
GDLKLLR 
GDLKLLR 
GDLKLLR 



110 



120 



130 



140 



150 
■ * I 




210 



220 



230 
..|.. 



240 



250 



RpQtlLE 


EL 








PPRKS 


rkiIle 


EL 




S 




PPRKS 


y&Hka 


EL 


E 


S 




PPRKS 


RRIjLE 


EL 




s 




PPRKS 




EL 




s 




PPRKS 



jQHT 

JqGLAQST IIEIVSgQ 

l L |ht 

JQGLAQST HE I VP^Q 1 




260 



270 



280 



I 



290 



300 




jAPGVTGLRNLGNTCYMNSILQVLSHLQKFR 
VAPGVTGLRNLGNTCYMNS ILQVLSHLQKFR 
JjAPGVTGLRNLGNTCYMNSILQVLSHLQKFR 

vQpgvtglrnlgntcymnsJ5lqvlshl^fr 
vapgvtglrnlgntcymns ilqvlshlqkfr 
vSpgvtglrnlgntcymnsSlqvlshlSWfr 



320 

.1.. 



330 

..I.. 



340 



350 
-•I 



ECFLNLDPSKTEHL 
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gi | 15208127 | 
gi | 16174134 j 
gi | 14149817 | 
gi | 15451368 j 
gi | 16041104 j 




FPKATNGKTQLSGK 
FPKATNGKTQLSGK 



LirqWLA^ TASEKTRS -CKHPF 
FPKATNGKTQLSGK: 
ASEKKRSFSKHPP 




^ ilLRSp^AEACEgE 
kl^LRND^AEACE@E 
I^MrfECQfeSDTGFVCS 

;cqS|dtge 



360 



370 



380 



390 



400 



NOV72 

gi | 15208127 
gi | 16174134 
gi | 14149817 
gi | 15451368 
gi | 16041104 




GFC wjsi 

GFC Wp 

RQSSLSSGI/S 

GFC Wg 

RQSSLSSGLS 



NOV72 

gi | 15208127 | 
gi | 16174134 j 
gi | 14149817 j 
''gi | 15451368 | 
gi | 16041104 | 



410 



420 



430 



440 



450 




NOV72 

gi | 15208127 | 
gi j 16174134 j 
gi | 14149817 | 
gi | 15451368 | 
gi | 16041104 j 



460 470 480 490 



ILIPFSQRKLTKQVLKVVNTIFHGQLLSQ 
I LI P FSQRKLTKQVLKVVNT I FHGQLLS Q 
I L I P FSQRKLTKQVLKVVNT I FHGQLLS Q 

L I P2JsQRKL0KQVLj£jvVNj£jl FHGQLLS Q 
ILIPFSQRKLTKQVLKVVNTIFHGQLLSQ 

L I pHs QRKL0KQVLjS!vVNjSjI FHGQLLS Qg 




SVEG- 



NOV72 

gi | 15208127 | 
gi | 16174134 j 
gi | 14149817 j 
gi | 15451368 j 
gi | 16041104 j 



510 



520 



530 



540 
I .... I ■ . 



LSLEFPERYHCIEKGFVPLNQTECLLTEMLAKFTETEALEG 
LSLEFPERYHCIEKGFVPLNQTECLLTEMLAKFTETEALEG 
LSLEFPERYHCIEK GFVPLNQTE CLLTEMLAKFTETEALEG 
LSLEFPERY@C^K^Hg|^^gCLBjjTEMLAKFTETEALEG 
LSLEFPERYHCIEKGFVPLNQTECLLTEMLAKFTETEALEG 



KIYVCDQCN 



NOV72 

gi | 15208127 | 
gi | 16174134 j 
gi | 14149817 | 
gi j 15451368 j 
gi | 16041104 j 



NOV72 

gi | 15208127 | 
gi | 16174134 j 
gi | 14149817 j 
gi|l5451368| 
gi | 16041104 j 



560 570 580 

I — I • • - - L 

-cvE 

iRKSNPKPl' 



590 



600 




650 



YDLSAWMHHGKGFGSG i 
YDLSAWMHHGKGFGSG; 
YDLSAWMHHGKGFGSGi 
YDLSAWMHHGKGFGSG! 



NOV72 

gi | 15208127 | 
gi j 16174134 | 
gi j 14149817 | 
gi j 15451368 | 
gi j 16041104 | 




700 
♦I 



670 680 690 

I I I I I I I 

FWVHCNDSKLNVCSVEEVCKTQAYILFY 

|FWVHCNDS KLNVCSVEEVCKTQAYI LFYTQRTVQGNAR 

CALLCGVGDTERG 

FWVHCNDS KLSMCTMDEVCKAQAYI LFYTQRVTENGHS 
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710 720 
....|....|....|....|....|.. 

NOV7 2 ALTEMALSECGRC 

gi|l5208127| I SETHLQAQVQS SNNDEGRPQT FS 

gi|l6174134| 

gi j 14149817 | KLLPPELLLGSQHPNEDADTSSNEILS 

gi|l5451368| 

gi|l6041104| 



Table 72F lists the domain description from DOMAIN analysis results against NOV72. 
This indicates that the NOV72 sequence has properties similar to those of other proteins 
known to contain this domain. 

5 



Table 72F. Domain Analysis of NOV72 




gnl | Pfam|pf am00443 , UCH-2, Ubiquitin carboxyl 


-terminal 


hydrolase family 2. SEQ ID NO: 86 6 






CD-Length = 68 residues, 91-2% aligned 
Score = 59.7 bits (143), Expect - 3e-10 




NOV72 : 
Sbjct : 


287 YDLSAVVMHHGKGFGSGHYTAYCYNTEGGFWVHCNDSKLNVCSVEEVCK TQAY 33 9 

Y+L AW+H G GHY AY G W +D K++ + EEV + + AY 
5 YELYAWVHSG-SLSGGHYIAYVKKENDG-WYKFDDDKVSRVTEEEVLEFSGGGETSSAY 62 


NOV72: 
Sbjct: 


340 ILFY 343 

ILFY 
63 ILFY 66 





Ubiquitin is a highly conserved 76-amino acid protein involved in the regulation of 
intracellular protein breakdown, cell cycle regulation, and stress response. Ubiquitin is 
released from degraded proteins by disassembly of the polyubiquitin chains, which is mediated 

10 by ubiquitin-specific proteases (USPs). The ubiquitin-specific proteases are a family of 

largely dissimilar enzymes with two major conserved sequence regions, containing either a 
conserved cysteine residue or two conserved histidine residues, respectively. The murine Unp 
oncoprotein and its human homologue, Unph, both contain regions similar to the conserved 
Cys and His boxes common to all the Ubps. 

1 5 Unp and Unph have been shown to be active deubiquitinating enzymes, able to cleave 

ubiquitin from both natural and engineered linear ubiquitin-protein fusions, including the 
polyubiquitin precursor. Mutation of the conserved Unp Cys and His residues abolishes this 
activity, and identifies the likely His residue in the catalytic triad. Unp is tumorigenic when 
overexpressed in mice, leading to the suggestion that Unp may play a role in the regulation of 

20 ubiquitin-dependent protein degradation. It was demonstrated that the high-level expression of 
Unp in yeast does not disrupt the degradation of the N-end rule substrate Tyr-beta- 
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galactosidase (betagal), the non-N-end rule substrate ubiquitin-Pro-betagal, or the degradation 
of abnormal, canavanine-containing proteins. 

Data suggests that Unp is not a general modulator of ubiquitin-dependent proteolysis. 
However, Unp may have a role in the regulation of the degradation of a specific, as yet 
5 undescribed, substrate(s). The novel human Ubiquitin-Specific Protease-like Proteins of the 
invention contains two Ubiquitin carboxyl-terminal hydrolase domains. Therefore, it is 
anticipated that NOV72 has a role in regulation of specific ubiquitins and could be a 
potentially important target for drugs. Such drugs may have important therapeutic 
applications, such as treating numerous tumors. 
10 NOV72 is predicted to be expressed in at least the following tissues: bladder and 

cervix. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, public EST 
sources, literature sources, and/or RACE sources. Further expression data for NOV72 is 
provided in Example 2. 

15 The NOV72 nucleic acids and proteins are useful in potential therapeutic applications 

implicated in various pathological disorders described further herein, for example, the 
compositions of the present invention will have efficacy for the treatment of patients suffering 
from: cystitis, incontinence, fertility, systemic lupus erythematosus, autoimmune disease, 
asthma, emphysema, scleroderma, allergy, ARDS, Von Hippel-Lindau (VHL) syndrome, 

20 Alzheimer's disease, stroke, tuberous sclerosis, hypercalceimia, Parkinson's disease, 

Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, 
ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, anxiety, pain, 
neurodegeneration. as well as other diseases, disorders and conditions. The NOV72 nucleic 
acid encoding the ubiquitin specific protease-like protein of the invention, or fragments 

25 thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a ubiquitin-specific protease-like 
protein includes the nucleic acid whose sequence is provided in Table 72A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 

30 be changed from the corresponding base shown in Table 72A while still encoding a protein 
that maintains its ubiquitin-specific protease-like activities and physiological functions, or a 
fragment of such a nucleic acid. 

The invention further includes nucleic acids whose sequences are complementary to 
the sequence of Table 72 A, including nucleic acid fragments that are complementary to any of 




the nucleic acids just described. The invention additionally includes nucleic acids or nucleic 
acid fragments, or complements thereto, whose structures include chemical modifications. 
Such modifications include, by way of non-limiting example, modified bases, and nucleic 
acids whose sugar phosphate backbones are modified or derivatized. These modifications are 
5 carried out at least in part to enhance the chemical stability of the modified nucleic acid, such 
that they may be used, for example, as antisense binding nucleic acids in therapeutic 
applications in a subject. In the mutant or variant nucleic acids, and their complements, up to 
about 38% of the bases may be so changed. 

The novel protein of the invention includes the ubiquitin-specific protease-like protein 

10 whose sequence is provided in Table 72B. The invention also includes a mutant or variant 

protein any of whose residues may be changed from the corresponding residue shown in Table 
72B while still encoding a protein that maintains its ubiquitin-specific protease-like activities 
and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 
up to about 65% of the amino acid residues may be so changed. 

15 These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

20 

NOV73 

The disclosed NOV73 (alternatively referred to herein as CG56822-01) includes the 
967 nucleotide sequence (SEQ ID NO:249) shown in Table 73A. A NOV73 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 26-28 and ends with a stop codon at 
25 nucleotides 935-937. The disclosed NOV73 maps to human chromosome 2. 



Table 73A. NOV73 Nucleotide Sequence (SEQ ID 
NO:249) 

GCAGGTTCTTTTACAGGGAGCCACCATGGCTGATAAATCCAAATTTATTGAATACATTGA 
CGAAGCTTT AGAAAAATC AAAAGAAACTGCACTCT CTCATTTATTTTT CACCTATC AGGG 
GATTCCTTACCCCATCACCATGTGCACCTCAGAAACTTTCCAAGCGCTGGACACCTTCGA 
AGCCAGACATGATGACATCGTGCTAGCATCTTATCCAAAGTGCGGTTCAAACTGGATTCT 
CCACATTGTCAGTGAATTAATATATGCTGTTTCTAAAAAAAAGTATAAATATCCAGAATT 
CCCAGTTCTTGAATGTGGGGATTCAGAAAAATATCAGAGAATGAAAGGCTTTCCATCACC 
AAGGATTTTGGCAACTCACCTCCACTATGACAAATTACCTGGGTCTATCTTCGAGAATAA 
AGCCAAGATATTGGTGATATTTCGAAACCCTAAAGATACAGCAGTATCTTTTTTGCATTT 
CCACAACGATGTCCCCGATATTCCAAGCTATGGCTCTTGGGATGAATTCTTCAGACAGTT 
CATGGTGTTTTTAGTTTCTTGGGGAAGGTATTTTGATTTTGCAATCAATTGGAACAAACA 
TCTTGATGGCGACAATGTTAAGTTCATATTATATGAAGACCTGAAAGAGGTGAGATTATT 
AGGAATAAAACAGATTGCTGAGTTCTTGGGATTCTTTCTAACTGGGGAGCAAATTCAAAC 
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TATCTCAGTCCAGAGCACCTTCCAAGCCATGCGTGCGAAGTCTCAGGACACACACGGTGC 
TGTCGGCCCATTCCTTTTCCGCAAAGGTAAAGTCGCAGATTGGAAAAATTTGTTCAGTGA 
AATTCAGAACCAGGAAATGGATGAAAAATTCAAAGAGTGCTTAGCAGGCACCTCCCTCGG 
AGCAAAGTTGAAGTATGAATCATATTGCCAGGGTTGATTCCAGTCAATTCAGCAGGCCTA 
GATTTAT _ 

A NOV73 polypeptide (SEQ ID NO:250) encoded by SEQ ID NO:249 is 303 amino 
acids in length and is presented using the one-letter amino acid code in Table 73B. The Psort 
profile for NOV73 predicts that this sequence has no signal peptide and is likely to be 
5 localized to peroxisomal microbodies with a certainty of 0.7480. In alternative embodiments, 
a NOV73 polypeptide is located to lysosomes with a certainty of 0.1000 9 or to the 
mitochondrial matrix space with a certainty of 0.1000. 



Table 73B. NOV73 Polypeptide Sequence (SEQ ID 
NO:250) 

MADKSKFIEYIDEALEKSKETALSHLFFTYQGIPYPITMCTSETFQALDTFEARHDDIVL 
ASYPKCGSNWILHIVSELIYAVSKKKYKYPEFPVLECGDSEKYQRMKGFPSPRILATHLH 
YDKLPGS I FENKAKILVI FRNPKDTAVSFLHFHNDVPDI PSYGSWDEFFRQFMVFLVSWG 
RYFDFAINWNKHLDGDNVKFILYEDLKEVRLLGIKQIAEFLGFFLTGEQIQTISVQSTFQ 
AMRAKSQDTHGAVGPFLFRKGKVADWKNLFSEIQNQEMDEKFKECLAGTSLGAKLKYESY 
CQG 



10 A BLAST analysis of NOV73 was run against the proprietary PatP GENESEQ Protein 

Patent database. It was found, for example, that the amino acid sequence of NOV73 had high 
homology to other proteins as shown in Table 73C. 



Table 73C. BLASTX results from PatP database for NOV73 










Smallest 










Sum 








High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P (N) 


patp 


AAU07 75 8 


Human novel transferase protein, 


1562 


3 .7e-160 


patp 


AAU07760 


Human novel transferase protein, 


1366 


2 . 2e-139 


patp 


-AAU07765 


Human novel transferase protein, 


1013 


5 . 6e-102 


patp 


:AAU07763 


Human novel transferase protein, 


930 


3 .5e-93 


patp 


: AAU07761 


Human novel transferase protein, 


734 


2.1e-72 



15 In a search of sequence databases, it was found, for example, that the nucleic acid 

sequence of this invention has 427 of 643 bases (66%) identical to a gb:GENBANK- 
ID:AF033189|acc:AF033 189.1 mRNA from Gallus gallus (Gallus gallus sulfotransferase 
mRNA). The full amino acid sequence of the protein of the invention was found to have 151 
of 307 amino acid residues (49%) identical to, and 212 of 307 amino acid residues (69%) 

20 similar to, the 3 12 amino acid residue ptnr:SPTREJVlBL-ACC:057338 protein from Gallus 
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gallus (Chicken) (SULFOTRANSFERASE). NOV73 also has homology to the other proteins 
shown in the BLASTP data in Table 73D. 



Table 73D. NOV73 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%) 


Expect 


gi|l7447308|r 
ef |XP_065865 . 
1 | (XM_065865) 


similar to 
sulfotransf erase (H. 
sapiens) 
[Homo sapiens] 


303 


294/303 
(97) 


295/303 
(97) 


e-168 


gi |2687360 | gb 
| AAB88818 . X | 
(AF033189) 


sulfotransf erase 
[Gallus gallus] 


312 


151/307 
(49) 


212/307 
(68) 


4e-84 


gi | 12229955 | s 
p | Q9WUW8 | STK1 
RAT 


SULFOTRANSFERASE Kl 
(RSULT1C2) 


296 


94/294 
(31) 


146/294 
(48) 


3e-35 


gi | 18079235 | r 
ef |NP_081211 . 
1 | (NM_026935) 


sulfotransf erase family, 
cytosolic, 1C, member 1 
[Mus musculus] 


296 


92/292 
(31) 


147/292 
(49) 


5e-35 


gi | 11262122 |p 
irj | JC7283 


hy d r oxy a r y 1 am i ne 
sulfotransf erase (EC 
2.8.2) 2A - rat 


296 


93/294 
(31) 


149/294 
(50) 


2e-34 



5 This BLASTP data is displayed graphically in the ClustalW in Table 73E. A multiple 

sequence alignment is given, with the NOV73 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 73D. 



Table 73E. ClustalW Alignment of NOV73 



NOV73 

gi | 17447308 | 
gi |2687360| 
gi | 12229955 | 
gi j 18079235 j 
gi j 11262122 j 



NOV7 3 

gi | 17447308 | 
gi | 2687360 | 
gi j 12229955 
gi j 18079235 
gi | 11262122 



NOV7 3 

gi|l7447308| 
gi j 2687360 | 
gi j 12229955 | 
gi | 18079235 j 
gi j 11262122 j 



(SEQ ID NO: 250) 

(SEQ ID NO: 664) 

(SEQ ID NO:665) 

(SEQ ID NO:666) 

(SEQ ID NO: 667) 

(SEQ ID NO: 66 8) 

10 20 30 40 

— I — t .I 

MAD-KSKFIEYjXpEgLgK^EgAfflSHjdFFTYQGIPYR 

mad- kskfiey|PdeBl§k@k^a3si$|Fftyqgipy| 

MEKSRKKFI DVpjDKgl V^GNAMDRDMl FS YKGVL^! 

* M^Sp^^R^jKgKgVAG 1 

"*\g 1 

^-G 10 




110 



120 



130 



140 



150 
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NOW 3 

gi | 17447308 | 
gi j 2687360 | 
gi | 12229955 | 
gi | 18079235 j 
gi | 11262122 j 



NOV 7 3 


Ekd 


gi | 17447308 | 


Bkd 


gi (2687360 | 


Ekd 


gi j 12229955 | 


Rkd 


gi j 18079235 | 


Bkd 


gi | 11262122 j 


E kd 



NOV73 

gi | 17447308 | 
gi | 2687360 | 
gi | 12229955 | 
gi | 18079235 | 
gi | 11262122 j 



NOV7 3 

gi | 17447308 | 
gi | 2687360 | 
gi j 12229955 | 
gi | 18079235 j 
gi j 11262122 j 



fpvlecgdsekyqrmkgf 
fpvlecgdsekyqSmkg^ 
fp^leigdtekySImkkI 

IEWARPPQPSGv||| 
I E||ARP PQP SGVMANEi 

ie|arppqpsgvB^ 



160 

r Agg^ (H^NDVPDat| 

|ndvpd : 

SNRWSA 

Irmsq^ 

^y8Hy;RMSQ/ 
210 






220 
.|.. 



230 



VRLLGOKCpi 5 

Hqtlg^^asB 

PKHEMO^VjyiC 
PKREfc 



240 
I 



250 



rGgQSQTjjsppsf 

spRsyf iR~S| 

DEpv^Kjj^g 

de[3vSdkB^|^; 




260 270 

p|AKSQDTHGAgG P FLF 

\KSQDTHGAgG P FLF 

"Bs||KTHGaBg S MLF 

Bp||tnrst3pksvldqsispfm 

BpBtNRSTAPKS ILDQS I S PFM 
JpSBtNF STpSPKT I MDQS ISPFK 



280 290 




310 320 

NOV7 3 pCpAg^^GAKy|YESYCQG 

gi | 174473 08 | ^ c ^aH^HgAK^8yE S Y C QG 

gi | 2687360 | fel|A^JrffcTK^YEVYCKA 

gi | 12229955 | bS|G^^S^FCRgL 

gi j 18079235 | fe^G^B^FS^L 

gi | 11262122 j bi^oSBlNFCiSL 



Table 73F lists the domain description from DOMAIN analysis results against NOV73. 
This indicates that the NOV73 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 73F. Domain Analysis of NOV73 


gnl|Pfam|pfam00685, Sulfotransfer, Sulfotransf erase protein SEQ ID 


NO:867 






CD-Length - 269 residues, 93.3% aligned 






Score = 


176 bits (446), Expect = 2e-45 




NOV73 : 


49 


DTFEARHDDIVLASYPKCGSNWILHIVSELIYAVSKKKYKYPE FPVLECGDSEK- 


102 






+ F+AR DD+++A YPK G+ W+ I+S L V + + P LE E 




Sbjct 


18 


NCFQARPDDVLIAGYPKSGTTWLQEILS-LHPNVGDFEPSPSDPLLFRNPWLEYPKGEDW 


76 


NOV73 


103 


YQRMKGFPS-PRILATHLHYDKLPGSIFENKAKILVIFRNPKDTAVSFLHFHNDVPDIPS 


161 






Y+ +K PS PR++ THL + LP S +KAKI+ + RNPKD AVS+ HF D+P+ 




Sbjct 


77 


YETLKPMPSSPRLIKTHLPLELLPKSFLSSKAKIIYVLRNPKDVAVSYYHFSRSHKDLPA 


136 


NOV73 


; 162 


Y-GSWDEFFRQFMVFLVSWGRYFDFAINWNKHLDGDNVKFILYEDLKEVRLLGIKQIAEF 


220 
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F+ V +G YFD + W + V F+ YEDLKE IK+IAEF 



Sbjct : 


137 


DPGTFEEFLEAFLNGKVLYGSYFDHVLGWWELRPEPQVLFLDYEDLKEDPAGEIKKIAEF 


196 


NOV73 : 


221 


LGFFLTGEQIQTISVQSTFQAMRA KSQDTHGAVGPFLFRKGKVADWKNLFS 


271 




LG L+ E++ + S+F M+ + G PF RKG V DWKN F+ 




Sbj Ct : 


197 


LGLPLSEEELDKLLDHSSFFLMKLNPLSNYETLCLGKSKGRKSPF-MRKGLVGDWKNYFT 


255 


NOV73 : 


272 


EIQNQEMDEKFKE 2 84 








QN+ + D+ KE 




Sbj Ct : 


256 


PEQNEKFDKVIKE 2 68 





This family includes a range of sulfotransferase proteins including flavonyl 3- 
sulfotransferase, aryl sulfotransferase, alcohol sulfotransferase, estrogen sulfotransferase and 
phenol-sulfating phenol sulfotransferase. These enzymes are responsible for the transfer of 

5 sulphate groups to specific compounds. 

NOV73 is predicted to be expressed in at least the following tissues: epithelial, 
endothelial, muscle, and neuronal tissues. This information was derived by determining the 
tissue sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 

10 expression data for NOV73 is provided in Example 2. 

The novel nucleic acid of the invention encoding a sulfotransferase-like protein 
includes the nucleic acid whose sequence is provided in Table 73 A, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 73A while still encoding a protein that maintains 

1 5 its sulfotransferase-like activities and physiological functions, or a fragment of such a nucleic 
acid. 

The invention further includes nucleic acids whose sequences are complementary to 
the sequence of Table 73 A, including nucleic acid fragments that are complementary to any of 
the nucleic acids just described. The invention additionally includes nucleic acids or nucleic 

20 acid fragments, or complements thereto, whose structures include chemical modifications. 
Such modifications include, by way of non-limiting example, modified bases, and nucleic 
acids whose sugar phosphate backbones are modified or derivatized. These modifications are 
carried out at least in part to enhance the chemical stability of the modified nucleic acid, such 
that they may be used, for example, as antisense binding nucleic acids in therapeutic 

25 applications in a subject. In the mutant or variant nucleic acids, and their complements, up to 
about 34% of the bases may be so changed. 

The novel protein of the invention includes the Sulfotransferase-like protein whose 
sequence is provided in Table 73B. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 73B while 





still encoding a protein that maintains its Sulfotransferase-like activities and physiological 
functions, or a functional fragment thereof. In the mutant or variant protein, up to about 51% 
of the amino acid residues may be so changed. 

The NOV73 nucleic acids and proteins are useful in potential therapeutic applications 
5 implicated in various pathological disorders described further herein, for example, the 

compositions of the present invention will have efficacy for the treatment of patients suffering 
from: cystitis, incontinence, fertility, systemic lupus erythematosus, autoimmune disease, 
asthma, emphysema, scleroderma, allergy, ARDS, Von Hippel-Lindau (VHL) syndrome, 
Alzheimer's disease, stroke, tuberous sclerosis, hypercalcemia, Parkinson's disease, 

10 Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, 
ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, anxiety, pain, 
neurodegeneration. as well as other diseases, disorders and conditions. The NOV73 nucleic 
acid encoding the sulfotransferase-like protein of the invention, or fragments thereof, may 
further be useful in diagnostic applications, wherein the presence or amount of the nucleic acid 

15 or the protein are to be assessed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 

20 below. . 



The disclosed NOV74 (alternatively referred to herein as CG56775-01) includes the 
732 nucleotide sequence (SEQ ID NO:251) shown in Table 74A. A NOV74 ORF begins with 
25 a Kozak consensus ATG initiation codon at nucleotides 52-54 and ends with a stop codon at 
nucleotides 673-675. The disclosed NOV74 maps to human chromosome 15. 



CAGACTCCCCCTTGCTGGCTCCTGCACAGAATGCCGGGCCCCACTGCTGCCATGACAGGCCCTTTCAAAC 
GCTCCATGGAAGATCTCAGTGACCTGCTCTCAGACAGCAGTGGCTGCTACAGCCTCCCAAGCCAGCCCTG 
CAATGAGGTCACCCTGAAGATTTACATGGGCAACACATCTGTGGATCAGGATATCCCCAAGCTTCAGAAA 
CTAGGCAGTATTCATGCCCTGAATACCACTGAGGGCAGGTCTTTCATGCACATAAACAATGCCAACTTCT 
CCAAGGACTCTGGCATCACCTACCTGGGCATCAAGGCCAATGAAGTGCAGGAGTTCAACCTCAGCACCTA 
CTTTGAAAGGGCTACAGACTTCACTGACCAGGCCTTGGCTCAAAATGGCCAGGTGCTCGTCCAGTGCTGG 
GAAGGTTACAGCCATCTCCAGCTCGTTATCATGTACCTTATGATTGTCAGAAGTTGGACATCAAGTCATC 
TGAGTATCATGAGGCAGAACTGTGAGATCAGCCCCAATGATGGGTTCCTGGCTCAGCTTTGCCATCTCAA 
TGACAAACTAGCCAAGGAGGGGAAGGTGAAACCCTGGGGTGCCCCTACCACCTTTGCTCGAGAGGTTCAG 
TGGGAGAGGCCCTGGTTGAAGGTATCCTGTGACACTGTACCCTGATCCCAGCATCACGAGCCACTTGCCC 
TCAAGTCTGTCTCAACAAGTCCTGGGCCACTT 



NOV74 



Table 74A. NOV74 Nucleotide Sequence (SEQ ID NO:251) 
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A NOV74 polypeptide (SEQ IDNO:252) encoded by SEQ ID NO:251 is 214 amino 
ids in length and is presented using the one-letter amino acid code in Table 74B. The Psort 
profile for NOV74 predicts that this sequence has no signal peptide, and is likely to be 
localized at the cytoplasm with a certainty of 0.4500. In alternative embodiments, a NOV74 
polypeptide is located to peroxisomal microbodies with a certainty of 03625, or, to the 
mitochonrial matrix space with a certainty of 0.1000. 



Table 74B. NOV74 Polypeptide Sequence (SEQ ID 
NO:252) 



MPGPTAAMTGPFKRSMEDLSDLLSDSSGCYSLPSQPCNEVTLKIYMGNTSVDQDIPKLQK 
LGSIHALNTTEGRSFMHINNANFSKDSGITYLGIKANEVQEFNIiSTYFERATDFTDQALA 
QNGQVLVQCWEGYSHLQLVIMYLMI VRSWTS SHLS IMRQNCE I S PNDGFLAQLCHLNDKL 
AKEGKVKPWGAPTTFAREVQWERPWLKVSCDTVP 



A BLAST analysis of NOV74 was run against the proprietary PatP GENE SEQ Protein 
10 Patent database. It was found, for example, that the amino acid sequence of NOV74 had high 
homology to other proteins as shown in Table 74C. 



Table 74C. BLASTX results from PatP database for NOV74 



Sequences producing High- scoring Segment Pairs: 

patp:AAR56 968 Human phosphatase VHR - Homo sapiens, 185 aa . 
patp : AAW35330 Human cdc25B vaccinia HI related phosphatase 
patp:AAB42 8 73 Human ORFX ORF26 3 7 polypeptide sequence 
patp : AAG6744 9 Amino acid sequence of a human polypeptide 
patp : AAG67628 Amino acid sequence of a human protein 



Smallest 
Sum 

High Probability 
Score P (N) 

664 5.4e-65 

664 5.4e-65 

664 5.4e-65 

664 5.4e-65 

664 5.4e-65 



In a search of public sequence databases, it was found, for example, that the nucleic 
1 5 acid sequence of this invention has 347 of 420 bases (82%) identical to a gb:GENBANK- 
ID:HUMDSPHS|acc:L05 147.1 mRNA from Homo sapiens (Human dual specificity 
phosphatase tyrosine/serine mRNA). The full amino acid sequence of the protein of the 
invention was found to have 135 of 185 amino acid residues (72%) identical to, and 150 of 
185 amino acid residues (81%) similar to, the 185 amino acid residue ptnnSWISSNEW- 
20 ACC:P5 1452 protein from Homo sapiens (Human) (DUAL SPECIFICITY PROTEIN 
PHOSPHATASE 3 (EC 3.1.3.48) (EC 3.1 .3.16) (DUAL SPECIFICITY PROTEIN 
PHOSPHATASE VHR)). NOV74 also has homology to the other proteins shown in the 
BLASTP data in Table 74D. 
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Table 74D. NOV74 BLASTP results 




Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 

c%) 


Expect 


gi | 17458321 | r 
ef|XP 063782. 
1 | (XM_063782) 


similar to dual 
specificity phosphatase 
3 - vaccinia virus 
phosphatase VH1- related ,- 
protein tyrosine 
phosphatase ; 
serine/ threonine 
specific protein 
phosphatase (H. sapiens) 
[Homo sapiens] 


597 


180/207 
(86) 


180/207 
(86) 


4e-92 


gi |4758208 | re 
f |NP_004081 . 1 
| (NM_004090) 


dual specificity 
phosphatase 3; vaccinia 
virus phosphatase VHl- 
related; protein 
tyrosine phosphatases- 
serine/ threonine 
specific protein 
phosphatase 
[Homo sapiens] 


185 


135/185 
(72) 


150/185 
(80) 


5e-71 


gi | 1633321 | pd 
b | 1VHR | A 


Chain A ; Human Vhl- 
Related Dual -Specif icity 
Phosphatase 


184 


137/184 
(72) 


149/184 
(80) 


2e-70 


gi | 18158941 |p 
db | 1 J4X | A 


Chain A, Human Vhl- 
Related Dual - Specif icity 
Phosphatase C124s 
Mutant- Peptide Complex 


184 


133/184 
(72) 


148/184 
(80) 


3e-69 


gi | 12843112 | d 
bj |BAB25864 . 1 
| (AK008734) 


DUAL SPECIFICITY PROTEIN 
PHOSPHATASE 3 (EC 
3.1.3.48) {EC 3.1.3.16) 

(DUAL SPECIFICITY 
PROTEIN PHOSPHATASE 
VHR) -putative 

[Mus musculus] 


185 


126/184 
(68) 


146/184 
(78) 


3e-65 



This BLASTP data is displayed graphically in the ClustalW in Table 74E. A multiple 
sequence alignment is given, with the NOV74 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
5 Table 74D. 



Table 74E. ClustalW Alignment of NOV74 



NOV74 (SEQ ID NO: 252) 

gi|l745832l| (SEQ ID NO:669) 

gi|4758208| (SEQ ID NO:670) 

gi|l63332l| (SEQ ID NO:67l) 

gi|l815894l| (SEQ ID NO:672) 

gijl2843112| (SEQ ID NO:673) 

10 20 30 40 50 

....|....|....|....|....|....|....|....|....|....| 

NOV74 

gi | 1745 8321 | MGGQEETVEEIQRMNSRSWWRIVICCDFTITLRTEIWKPEQRGEALTLQ 

gi [4758208 | 

gi | 1633321 j 

gi|l815894l| 

gi|l2843112| " 
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NOV74 

gi | 17458321 1 
gi | 4758208 | 
gi | 1633321 | 
gi j 18158941 1 
gi j 12843112 | 



NOV74 

gi|l745832l| 
gi (4758208 | 
gi j 1633321 j 
gi|l815894l| 
gi | 12843112 | 



NOV74 

gi | 17458321 1 
gi j 4758208 | 
gi | 1633321 | 
gi j 18158941 1 
gi|l2843112| 



NOV74 

gi | 17458321 1 
gi | 4758208 | 
gi | 1633321 j 
gi|l815894l| 
gi | 12843112 j 



NOV74 

gi | 17458321 | 
gi |4758208 | 
gi j 1633321 j 
gi|l815894l| 
gi j 12843112 | 



NOV74 

gi | 17458321 | 
gi |4758208 ) 
gi j 1633321 j 
gi|l815894l| 
gi j 12843112 | 



NOV74 

gi | 17458321 | 
gi | 4758208 | 
gi | 1633321 j 
gi j 18158941 | 
gi | 12843112 j 



NOV74 

gi | 17458321 | 
gi [4758208 | 
gi | 1633321 | 



70 

l----l----t--"l----l---'l---;l;;;;!;:;:!:____ 

EFSSLSLCPPMPGGPADNKEYPRGQGRKAVPQPPLGTVWEQVCGGWHSAR 



80 



90 



100 
•I 



110 



120 

GTSCTKSQCTLDQERGERSGTHRQNLSETSPNLPTAILGLLHPDLDVCKK 



130 
..|.. 



140 



150 
-I 



160 



170 



180 



190 



200 



KRCNRIYSGMEKTHPQR7ALVRTQNGINEKNQKWDKRRCLGPGCHSAGLMP 



210 



TAAYQLLAVLPGSPAPPDHSLRGSEEVXjAHTESTGDENMRHPQTPGLSKA 



220 



230 

..I.. 



240 



250 
♦I 



270 
..|.. 



280 
..|.. 



290 



300 
-I 



I I I 1 I I I 

LTAMQGAAREVGGHWELSPRLPRTSPGTSSELSPHPLVPHPVHPPPINNQ 



310 

LRERQDSRPAGKKAPVATWPVSNLREKGPGLRRRGGSVPSIPDAAIMAIT 



320 

..I.. 



330 



340 



350 



360 

I--.. I 

-MPGPT. 
GALSASHGHLLNMPGPT. 



370 



380 



390 



400 
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EVTPRI YVGNAS VAQDI PKLQKLGI THVLNAAEGRS FMHVNTNANFYKDS 
EVaPRgjYVGNASVAQDli^LQKLGITHVLNAAEGRSFMHVNTiAgFYlDS 



gi | 18158941 | 
gi j 12843112 j 



NOV74 

gi | 17458321 | 
gi|4758208| 
gi j 1633321 j 
gi | 18158941 | 
gi | 12843112 | 



NOV74 

gi | 17458321 | 
gi | 4758208 | 
gi j 1633321 j 
gi | 18158941 | 
gi j 12843112 j 



NOV74 

gi | 17458321 | 
gi j 4758208 | 
gi | 1633321 | 
gi|l815894l| 
gi | 12843112 j 



NOV74 

gi | 17458321 | 
gi j 4758208 | 
gi | 1633321 j 
gi | 18158941 | 
gi j 12843112 j 



460 



480 



490 



500 




550 



PTLVIAYLMMRQKMDVKS 
PTLVIAYLMMRQKMDVKS 
PTLVIAYLMMRQKMDVKS 
PTLVIAYLMMRQKMDVKS 



560 



570 



590 



600 




580 

|....|....|....|....|....|....|...-| 

GAPTTFAREVQWERPWLK- -VS CDTVP 

GAPTTFAREVQWERPWLKPLLSSEPEHHGCETLVLSSGPQHPQP 
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Table 74F lists the domain description from DOMAIN analysis results against NOV74. 
This indicates that the NOV74 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 74F. Domain Analysis of NOV74 



qnl |Pfam|pfam00782 / DSPc, Dual specificity phosphatase, catalytic domain. Ser/Thr and 
Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of 
tyrosine-specif ic phosphatases, except for a "recognition" region. SEQ ID NO: 868 

CD-Length = 139 residues, 100.0% aligned 

Score =90.5 bits (223), Expect = 9e-20 



NOV74 : 


36 


Sbjct : 


1 


NOV74 : 


96 


Sbj Ct : 


53 


N0V74 : 


154 


Sbjct : 


113 



PCNEVTLKIYMGNTSVDQDIPKLQKLGSIH7ALNTTEGRSFMHINNANFSKDSGITYLGIK 
+ E+ +Y+G+ ++ L KLG H +N T SK+SG YL I 



95 



- EEVPNSKNSGFLYLHI P 5 2 



GPSEILPHLYLGSYPTASNLAFLSKLGITHVINVT- 

ANEVQEFNLSTYFERATDFTDQALAQNGQVLVQCWEGYSHLQLVIM-YLMIVRSWTSSH- 
++ E ++S Y + A +F + A + G+VLV C G S +1+ YLM R+ + + 



153 



53 VDDNHE TD I S P YLDE AVE FIE D ARQ KGGKVL VHC Q AG I S R S ATL 1 1 AY LM KT RNL S LNE A 112 

LSIMRQNCE- ISPNDGFLAQLCHLNDK 179 

S +++ ISPN GF QL K 
YSFVKERRPIISPNFGFKRQIiIEYERK 13 9 
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Mitogen-activated protein (MAP) kinase phosphatases constitute a growing family of 
dual specificity phosphatases thought to play a role in the dephosphorylation and inactivation 
of MAP kinases and are therefore likely to be important in the regulation of diverse cellular 
processes such as proliferation, differentiation, and apoptosis. For this reason it has been 
5 suggested that MAP kinase phosphatases may be tumor suppressors. DUSP6 (alias PYST1), 
one of the dual-specificity tyrosine phosphatases, is localized on 12q21, one of the regions of 
frequent allelic loss in pancreatic cancer. This gene is composed of three exons, and two forms 
of alternatively spliced transcripts are ubiquitously expressed. Although no mutations were 
observed in 26 pancreatic cancer cell lines, reduced expressions of the full-length transcripts 
10 were observed in some cell lines, which may suggest some role for DUSP6 in pancreatic 

carcinogenesis. The mitogen-induced gene, DUSP2, encodes a nuclear protein, PAC1, that 
acts as a dual-specific protein phosphatase with stringent substrate specificity for MAP kinase. 
MAP kinase phosphorylation and consequent enzymatic activation is a central and often 
obligatory component in signal transduction initiated by growth factor stimulation or resulting 
1 5 from various types of oncogenic transformation. DUSP2 downregulates intracellular signal 
transduction through the dephosphorylation/inactivation of MAP kinases. 

NOV74 is predicted to be expressed in at least the following tissues: retina. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, literature 
20 sources, and/or RACE sources. Further expression data for NOV74 is provided in Example 2. 
The NOV74 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
brain disorders including epilepsy, eating disorders, schizophrenia, ADD, and cancer; heart 
disease; blood disorders, kidney disorders, liver diseases, inflammation and autoimmune 
25 disorders including Crohn's disease, 1BD, allergies, rheumatoid and osteoarthritis, 

inflammatory skin disorders, allergies, blood disorders; psoriasis; colon-, ovarian-, testicular-, 
lymphatic-, brain-, and pancreatic cancers; leukemia AIDS; thalamus disorders; metabolic 
disorders including diabetes and obesity; lung diseases such as asthma, emphysema, cystic 
fibrosis, and cancer; pancreatic disorders including pancreatic insufficiency; and prostate 
30 disorders including prostate cancer and other diseases, disorders and conditions of the like. 

The NOV74 nucleic acid encoding the phosphatase-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 
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The novel nucleic acid of the invention encoding a dual specificity phosphatase-like 
protein includes the nucleic acid whose sequence is provided in Table 74A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 74A while still encoding a protein 
5 that maintains its dual specificity phosphatase-like activities and physiological functions, or a 
fragment of such a nucleic acid. 

The invention further includes nucleic acids whose sequences are complementary to 
the sequence of Table 74 A, including nucleic acid fragments that are complementary to any of 
the nucleic acids just described. The invention additionally includes nucleic acids or nucleic 

10 acid fragments, or complements thereto, whose structures include chemical modifications. 
Such modifications include, by way of non-limiting example, modified bases, and nucleic 
acids whose sugar phosphate backbones are modified or derivatized. These modifications are 
carried out at least in part to enhance the chemical stability of the modified nucleic acid, such 
that they may be used, for example, as antisense binding nucleic acids in therapeutic 

15 applications in a subject. In the mutant or variant nucleic acids, and their complements, up to 
about 18% of the bases may be so changed. 

The novel protein of the invention includes the dual specificity phosphatase-like 
protein whose sequence is provided in Table 74B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 

20 in Table 74B while still encoding a protein that maintains its dual specificity phosphatase-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 28% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 

25 methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV75 

The disclosed NOV75 (alternatively referred to herein as CG56783-01) includes the 
30 840 nucleotide sequence (SEQ ID NO:253) shown in Table 75A. A NOV75 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 31-33 and ends with a stop codon at 
nucleotides 769-771 . The disclosed NOV75 maps to human chromosome 1 . 
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Table 75A. NOV75 Nucleotide Sequence (SEQ ID NO:253) 

AGAACCCAAGGCTCCCTGGATTTGCAGTCCATGAGCAACAAGCCCTGCTTACAGACCCCTGGGAGGAGCA 
GATTCCACGAGGGGCTGGACCAAGTCTACCTTCCAAATGTGGCTGGACTCTCTGCAGCCCCCACCCAGAG 
ACTGCCCATCAGGGAAGAGATGGTGCCATCAAGAGGCTATGGGGAAGAGGTGGATGAGGTCTGGCCCAAT 
GTCTTCATAGCTGAGAAGAGTGTGGCTGTGAACAAGGGGAGGCTGAAGAGGCTGGGAATCACCCACATTC 
TGAATGCTGCGCATGGCACCGGCGTTTACACTGGCCCCGAATTCTACACTGGCCTGGAGATCCAGTACCT 
GGGTGTAGAGGTGGATGACTTTCCTGAGGTGGACATTTCCCAGCATTTCCGGAAGGCGTCTGAGTTCCTG 
GATGAGGCGCTGCTGACTTACAGAGGACGTTTGACCAACGTGGGATTGAATGGGTCTGTCGGTCGCCTGC 
GGCGTAAGGAGTGTGTCCCACCTCGCTCGCAGGTCCTGGAGCGCACCGGCAGACCTCGAGGCGGAGCGGG 
GAAAGTCCTGGTCAGCAGCGAAATGGGCATCAGCCGGTCAGCAGTGCTGGTGGTCGCCTACCTGATGATC 
TTCCACAACATGGCCATCCTGGAGGCTTTGATGACCGTGCGTAAGAAGCGGGCCATCTACCCCAATGAGG 
GCTTCCTGAAGCAGCTGCGGGAGCTCAATGAGAAGTTGATGGAGGAGAGAGAAGAGGACTATGGCCGGTA 
GGGGGGATCAGCTGAGGCTGAGGAGGGCGAGGGCACTGGGAGCATGCTCGGGGCCAGAGTGCACGCCCTG 

A NOV75 polypeptide (SEQ ID NO:254) encoded by SEQ ID NO:253 is 246 amino 
acids in length and is presented using the one-letter amino acid code in Table 75B. The Psort 
profile for NOV75 predicts that this sequence has no signal peptide and is likely to be 
5 localized at the plasma membrane with a certainty of 0.7000. In alternative embodiments, a 
NOV75 polypeptide is located, to the endoplasmic reticulum (membrane) with a certainty of 
0.1000, or to the nucleus with a certainty of 0.2000. 



Table 75B. NOV75 Polypeptide Sequence (SEQ ID 
NO:254) 



MSNKPCLQTPGRSRFHEGLDQVYLPNVAGLSAAPTQRLPIREEMVPSRGYGEEVDEVWPN 
VFIAEKSVAVNKGRLKRLGITHILNAAHGTGVYTGPEFYTGLEIQYLGVEVDDFPEVDIS 
QHFRKASEFLDEALLTYRGRLTNVGLNGSVGRLRRKECVPPRSQVLERTGRPRGGAGKVL 
VSSEMGISRSAVLWAYLMIFHNMAILEALMTVRKKRAIYPNEGFLKQLRELNEKLMEER 
EEDYGR 



10 A BLAST analysis of NOV75 was run against the proprietary PatP GENE SEQ Protein 

Patent database. It was found, for example, that the amino acid sequence of NOV75 had high 
homology to other proteins as shown in Table 75 C. 



Table 75C. BLASTX results from PatP database for NOV75 










Smallest 










Sum 








High 


Probability 


Sequences producing High- scoring Segment Pairs: 


Score 


P(N) 


patp 


AAE04836 


Human SGP018 phosphatase polypeptide 


389 


1 . 6e-67 


patp 


AAB40919 


Human ORFX ORF6 83 polypeptide sequence 


457 


1 . Oe-55 


patp 


AAE04837 


Human SGP0 03 phosphatase polypeptide 


218 


2 . 2e-34 


patp 


AAY85620 


Human dual specificity phosphatase- 9 


210 


1.0e-32 


patp 


•AAE04839 


Human SGP060 phosphatase polypeptide 


210 


1 . Oe-32 



15 In a search of sequence databases, it was found, for example, that the nucleic acid 

sequence of this invention has 167 of 257 bases (64%) identical to a gbiGENBANK- 

ID:AB027004|acc:AB027004.1 mRNA from Homo sapiens (mRNA for protein phosphatase). 

619 




The full amino acid sequence of the protein of the invention was found to have 39 of 89 amino 
acid residues (43%) identical to, and 58 of 89 amino acid residues (65%) similar to, the 198 
amino acid residue ptnr:SPTREMBL-ACC:Q9QYJ7 protein from Mus musculus (Mouse) 
(PROTEIN PHOSPHATASE). NOV75 also has homology to the other proteins shown in the 
5 BLASTP data in Table 75D. 



Table 75D. NOV75 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%) 


Expect 


gi | 17454087 | r 
ef |XP_061191 . 
1 | (XM_061191) 


similar to protein 
phosphatase (H. sapiens) 
[ Homo sa.pl en s ] 


370 


73/185 
(39) 


104/185 
(55) 


4e-30 


gi | 13128968 | r 
ef |NP_076930. 
1 | (NM_024025) 


hypothetical protein 
MGC1136; hypothetical 
protein MGC2627 
[ Homo s api en s ] 


211 


72/187 
(38) 


105/187 
(55) 


2e-29 


gi | 15072533 | g 
b | AAK77966 . 1 | 
(AY040091) 


branching- enzyme 
interacting dual- 
specificity protein 
phosphatase BEDP [Homo 
sapiens] 


188 


75/201 
(37) 


108/201 
(53) 


69e-29 


gi | 7305011 | re 
f |NP_038877 . 1 
| (NM013849) 


dual specificity 
phosphatase 13 
[Mus musculus} 


198 


73/187 
(39) 


106/187 
(56) 


9e-29 


gi | 17390456 |g 
b|AAH18204.l| 
AAH18204 
(BC018204) 


Similar to hypothetical 
protein MGC1136 
[Mus musculus] 


211 


72/187 
(38) 


104/187 
(55) 


le-28 



This BLASTP data is displayed graphically in the ClustalW in Table 75E. A multiple 
sequence alignment is given, with the NOV75 protein being shown on line 1 in a ClustalW 
10 analysis comparing the protein of the invention with the related protein sequences shown in 
Table 75D. 



Table 75E. ClustalW Alignment of NOV75 



NOV7 5 

gi | 17454087 | 
gi | 13128968 | 
gi j 15072533 j 
gi | 7305011 | 
gi j 17390456 | 



NOV7 5 

gi | 17454087 | 
gi | 13128968 j 
gi 1 15072533 j 
gi|730501l| 
gi j 17390456 | 
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(SEQ ID NO: 2 54) 

(SEQ ID NO:674) 

(SEQ ID NO: 6 75) 

(SEQ ID NO : 676) 

(SEQ ID NO: 677) 

(SEQ ID NO:678) 



10 



20 



30 



40 
.|.. 



50 
-I 



-MSNKPCI 
MLEVDPVTVPVI MPNVKRDEX 
MCPGNWLWASMTI 

MDS 

MCPGNWLWASMTI 



QTPG 

jEEVKEFAQVQ 
.FSR--SSS 
ITS 

:FSR--GSS| 



-SR- 



GLGDVLFPLADGSLQTi 

Sp|| 




60 



I 



70 



80 



100 



NOV75 

gi | 17454087 | 
gi | 13128968 | 
gi j 15072533 | 
gi | 7305011 | 

gil 



-I 

- FHE 



90 

|....|....| 
iDQVYLPN 

pqawnvrnllglpilgpkwpkpvgfsqikekvgtkBvksKescrkerk 

TkgTLE^lP 

ELgGEDl 

R P S-IHG^ 



17390456 | § T^SLElMP- 



NOV7 5 

gi | 17454087 | 
gi | 13128968 | 
gi j 15072533 | 
gi j 7305011 | 
gi j 17390456 | 



NOV7 5 

gi | 17454087 | 
gi j 13128968 | 
gi | 15072533 j 
gi | 7305011 | 
gi j 17390456 | 



NOV75 

gi | 17454087 | 
gi j 13128968 | 
gi | 15072533 | 
gi | 7305011 | 
gi | 17390456 | 



NOV7 5 

gi | 17454087 | 
gi | 13128968 | 
gi j 15072533 | 
gi [7305011 | 
gi j 17390456 | 



NOV7 5 

gi | 17454087 | 
gi j 13128968 | 
gi j 15072533 j 
gi |7305011| 
gi | 17390456 | 



NOV7 5 


YTg 


gi | 17454087 | 


DTg 


gi j 13128968 | 


RGT 


gi j 15072533 j 


QGg 


gi | 7305011 1 


DT@ 


gi j 17390456 | 


RGT 




360 370 380 390 400 

DYGR 

SQRQDGENGSRHEAGSDSFQGQESQTRQKPKRVAAVGDAGRQGPGMEMAW 



NOV7 5 

gi | 17454087 | RNQNVI KAF 

gi|l3128968| 

gi|!5072533| 
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v 




gi | 7305011 | 
gi | 17390456 | 



Table 75F lists the domain description from DOMAIN analysis results against NOV75. 
This indicates that the NOV75 sequence has properties similar to those of other proteins 
known to contain this domain. 

5 



Table 75F. Domain Analysis of NO V75 


gnl | Pf a 


m | pf am00782 , DSPc, Dual specificity phosphatase, catalytic domain. 


Ser/Thr 


and 


Tyr protein phosphatases. The enzyme's tertiary fold is 


highly 


similar 


to 


that of tyrosine-specif ic phosphatases, except for a "recognition" 


region . 


SEQ 


ID NO: 869 




Score = 


CD-Length = 139 residues, 92.8% aligned 
98.2 bits (243), Expect = 5e-22 




NOV75 : 
Sbjct : 


56 
4 


E VWPNVF I AEKS VAVNKGRLKRLG I TH I LNAAHGTGVYTGPE F YTGLE I Q YLGVE VDDFP 
E+ P+ + + + AN L +LGITH++N F YL + VDD 
EILPHLYLGSYPTASNLAFLSKLGITHVINVTEEVPNSKNSGF LYLHI PVDDNH 


115 
57 


NOV75 : 
Sbjct : 


116 
58 


EVDISQHFRKASEFLDEALLTYRGRLTNVGLNGSVGRLRRKECVPPRSQVLERTGRPRGG 
E DIS + +A EF+++A R 
ETDISPYLDEAVEFIEDA RQK 


175 
78 


NOV75 : 
Sbjct : 


176 
79 


AGK VL VS S EMG I S R S AVLWAYLM I FHNMA I L E ALMT VRKKR - A I Y PNEG FL KQ 22 8 
GKVLV + GISRSA L++AYLM N+++ EA V+ ++R I PN GF +Q 

GGKVLVHCQAG ISRSATL 1 1 AYLMKTRNLSLNEAYS FVKE RRP IIS PNFGFKRQ 132 





Mitogen-activated protein (MAP) kinase phosphatases constitute a growing family of 
dual specificity phosphatases thought to play a role in the dephosphorylation and inactivation 
of MAP kinases and are therefore likely to be important in the regulation of diverse cellular 

10 processes such as proliferation, differentiation, and apoptosis. For this reason it has been 

suggested that MAP kinase phosphatases may be tumor suppressors. DUSP6 (alias PYST1), 
one of the dual-specificity tyrosine phosphatases, is localized on 12q21, one of the regions of 
frequent allelic loss in pancreatic cancer. This gene is composed of three exons, and two forms 
of alternatively spliced transcripts are ubiquitously expressed. Although no mutations were 

15 observed in 26 pancreatic cancer cell lines, reduced expressions of the full-length transcripts 
were observed in some cell lines, which may suggest some role for DUSP6 in pancreatic 
carcinogenesis. The mitogen-induced gene, DUSP2, encodes a nuclear protein, PAC1, that 
acts as a dual-specific protein phosphatase with stringent substrate specificity for MAP kinase. 
MAP kinase phosphorylation and consequent enzymatic activation is a central and often 

20 obligatory component in signal transduction initiated by growth factor stimulation or resulting 
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from various types of oncogenic transformation. DUSP2 downregulates intracellular signal 
transduction through the dephosphorylation/inactivation of MAP kinases. 

NOV75 is predicted to be expressed in at least the following tissues: heart, breast and 
ovarian tissue, pancreas, brain, liver, kidney, spleen, testis, ovary, and peripheral blood 
leukocytes. This information was derived by determining the tissue sources of the sequences 
that were included in the invention including but not limited to SeqCalling sources, public 
EST sources, literature sources, and/or RACE sources. Further expression data for NOV75 is 
provided in Example 2. 

The NOV75 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
brain disorders including epilepsy, eating disorders, schizophrenia, ADD, and cancer; heart 
disease; blood disorders, kidney disorders, liver diseases, inflammation and autoimmune 
disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 
inflammatory skin disorders, allergies, blood disorders; psoriasis; colon-, ovarian-, testicular-, 
lymphatic-, brain-, and pancreatic cancers; leukemia AIDS; thalamus disorders; metabolic 
disorders including diabetes and obesity; lung diseases such as asthma, emphysema, cystic 
fibrosis, and cancer; pancreatic disorders including pancreatic insufficiency; and prostate 
disorders including prostate cancer and other diseases, disorders and conditions of the like. 
The NOV75 nucleic acid encoding the phosphatase-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a dual specificity phosphatase-like 
protein includes the nucleic acid whose sequence is provided in Table 75 A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 75A while still encoding a protein 
that maintains its dual specificity phosphatase-like activities and physiological functions, or a 
fragment of such a nucleic acid. 

The invention further includes nucleic acids whose sequences are complementary to 
the sequence of Table 75 A, including nucleic acid fragments that are complementary to any of 
the nucleic acids just described. The invention additionally includes nucleic acids or nucleic 
acid fragments, or complements thereto, whose structures include chemical modifications. 
Such modifications include, by way of non-limiting example, modified bases, and nucleic 
acids whose sugar phosphate backbones are modified or derivatized. These modifications are 
carried out at least in part to enhance the chemical stability of the modified nucleic acid, such 
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that they may be used, for example, as antisense binding nucleic acids in therapeutic 
applications in a subject. In the mutant or variant nucleic acids, and their complements, up to 
about 36% of the bases may be so changed. 

The novel protein of the invention includes the dual specificity phosphatase-like 
5 protein whose sequence is provided in Table 75B. The invention also includes a mutant or 

variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 75B while still encoding a protein that maintains its dual specificity phosphatase-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 57% of the amino acid residues may be so changed. 
10 These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

15 NOV76 

NOV76 ncludes two phospatase-like proteins, designated herein as NOV76a and 
NOV76b. 



NOV76o 

20 The disclosed NOY76a (alternatively referred to herein as CG56789-01) includes the 

2200 nucleotide sequence (SEQ ID NO:255) shown in Table 76 A. A NOV76a ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 61-63 and ends with a stop codon 
at nucleotides 2101-2103. The disclosed NOV76 maps to human chromosome 12. 



Table 76A. NOV76a Nucleotide Sequence (SEQ ID NO:255) 

ACCATTACATCATCGTGGCAAATTAAAGAAGGAGGTGGGAAAAGAGGACTTATTGTTGTCATGGCCCATG 
AGATGATTGGAACTCAAATTGTTACTGAGAGGTTGGTGGCTCTGCTGGAAAGTGGAACGGAAAAAGTGCT 
GCTAATTGATAGCCGGCCATTTGTGGAATACAATACATCCCACATTTTGGAAGCCATTAATATCAACTGC 
TCCAAGCTTATGAAGCGAAGGTTGCAACAGGACAAAGTGTTAATTACAGAGCTCATCCAGCATTCAGCGA 
AACATAAGGTAAACGCTCAGGTTGACATTGATTGCAGTCAGAAGGTTGTAGTTTACGATCAAAGCTCCCA 
AGATGTTGCCTCTCTCTCTTCAGACTGTTTTCTCACTGTACTTCTGGGTAAACTGGAGAAGAGCTTCAAC 
TCTGTTCACCTGCTTGCAGGTTTATTCTTAGGTGGGTTTGCTGAGTTCTCTCGTTGTTTCCCTGGCCTCT 
GTGAAGGAAAATCCACTCTAGTCCCTACCTGCATTTCTCAGCCTTGCTTACCTGTTGCCAACATTGGGCC 
AACCCGAATTCTTCCCAATCTTTATCTTGGCTGCCAGCGAGATGTCCTCAACAAGGAGCTGATGCAGCAG 
AATGGGATTGGTTATGTGTTAAATGCCAGCAATACCTGTCCAAAGCCTGACTTTATCCCCGAGTCTCATT 
TCCTGCGTGTGCCTGTGAATGACAGCTTTTGTGAGAAAATTTTGCCGTGGTTGGACAAATCAGTAGATTT 
CATTGGTAAGTTGACTTATACAGAGAAAGCAAAAGCCTCCAATGGATGTGTTCTAGTGCACTGTTTAGCT 
GGGATCTCCCGCTCCGCCACCATCGCTATCGCCTACATCATGAAGAGGATGGACATGTCTTTAGATGAAG 
CTTACAGGAGATTTGTGAAAGAAAAAAGACCTACTATATCTCCAAACTTCAATTTTCTGGGCCAACTCCT 
GGACTATGAGAAGAAGATTAAGAACCAGACTGGAGCATCAGGGCCAAAGAGCAAACTCAAGCTGCTGCAC 
CTGGAGAAGCCAAATGAACCTGTCCCTGCTGTCTCAGAGGGTGGACAGAAAAGCGAGACGCCCCTCAGTC 
CACCCTGTGCCGACTCTGCTACCTCAGAGGCAGCAGGACAAAGGCCCGTGCATCCCGCCAGCGTGCCCAG 
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CGTGCCCAGCGTGCAGCCGTCGCTGTTAGAGGACAGCCCGCTGGTACAGGCGCTCAGTGGGCTGCACCTG 
TCCGCAGACAGGCTGGAAGACAGCAATAAGCTCAAGCGTTCCTTCTCTCTGGATATCAAATCAGTTTCAT 
ATTCAGCCAGCATGGCAGCATCCTTACATGGCTTCTCCTCATCAGAAGATGCTTTGGAATACTACAAA.ee 
T T C C AC T ACT C TGG ATGGG ACC AAC AAGCT ATGC C AGTT CT C C C CTGTT C AGGAACT AT CGG AGC AG AC T 
CCCGAAACCAGTCCTGATAAGGAGGAAGCCAGCATCCCCAAGAAGCTGCAGACCGCCAGGCCTTCAGACA 
GCCAGAGCAAGCGATTGCATTCGGTCAGAACCAGCAGCAGTGGCACCGCCCAGAGGTCCCTTTTATCTCC 
ACTGCATCGAAGTGGGAGCGTGGAGGACAATTACCACACCAGCTTCCTTTTCGGCCTTTCCACCAGCCAG 
CAGCACCTCACGAAGTCTGCTGGCCTGGGCCTTAAGGGCTGGCACTCGGATATCTTGGCCCCCCAGACCT 
CTACCCCTTCCCTGACCAGCAGCTGGTATTTTGCCACAGAGTCCTCACACTTCTACTCTGCCTCAGCCAT 
CTACGGAGGCAGTGCCAGTTACTCTGCCTACAGCTGCAGCCAGCTGCCCACTTGCGGAGACCAAGTCTAT 
TCTGTGCGCAGGCGGCAGAAGCCAAGTGACAGAGCTGACTCGCGGCGGAGCTGGCATGAAGAGAGCCCCT 
TTGAAAAGCAGTTTAAACGCAGAAGCTGCCAAATGGAATTTGGAGAGAGCATCATGTCAGAGAACAGGTC 
ACGGGAAGAGCTGGGGAAAGTGGGCAGTCAGTCTAGCTTTTCGGGCAGCATGGAAATCATTGAGGTCTCC 
TGAGAAGAAAGACACTTGTGACTTCTATAGACAATTTTTTTTTCTTGTTCACAAAAAAATTCCCTGTAAA 
T CTGAAATAT AT AT ATGTAC ATACATATAT 



A NOV76a polypeptide (SEQ ID NO:256) encoded by SEQ ID NO:255 is 680 amino 
acids in length and is presented using the one-letter amino acid code in Table 76B. The Psort 
profile for NOV76a predicts that this sequence has no signal peptide and is likely to be 
5 localized at the nucleus with a certainty of 0.8800. In alternative embodiments, a NOV76a 
polypeptide is located to peroxisomal microbodies with a certainty of 0.3000, to the 
mitochondrial matrix space with a certainty of 0.1000, or to lysosomes with a certainty of 
0.1000. 



Table 76B. NOV76a Polypeptide Sequence (SEQ ID 
NO:256) 

MAHEMIGTQIVTERLVALLESGTEKVLLIDSRPFVEYNTSHILEAININCSKLMKRRLQQ 
DKVLITELIQHSAKHKVNAQVDIDCSQKWVYDQSSQDVASLSSDCFLTVLLGKLEKSFN 
SVHLLAGLFLGGFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTRILPNLYLGCQR 
DVLNKELMQQNG I G Y VLNASNTC P KPDF I PE SHFLRVPVNDS FCE K I L P WLDKS VDF I GK 
LTYTEKAKASNGCVLVHCLAGISRSATIAIAYIMKRMDMSLDEAYRRFVKEKRPTISPNF 
NFLGQLLDYEKKI KNQTGASGPKSKLKL.LHLEKPNEPVPAVSEGGQKSETPLS PPCADS A 
TSEAAGQRPVHPASVPSVPSVQPSLLEDSPLVQALSGLHLSADRLEDSNKL.KRSFSLDIK 
SVSYSASMAASLHGFSSSEDALEYYKPSTTLDGTNKIiCQFSPVQELSEQTPETSPDKEEA 
SIPKKLQTARPSDSQSKRLHSVRTSSSGTAQRSLLSPLHRSGSVEDNYHTSFLFGLSTSQ 
QHLTKSAGLGLKGWHSDILAPQTSTPSLTSSWYFATESSHFYSASAIYGGSASYSAYSCS 
QLPTCGDQVYSVRRRQKPSDRADSRRSWHEESPFEKQFKRRSCQMEFGESIMSENRSREE 
LGKVGSQSSFSGSMEIIEVS 

10 

NOV76b 

The disclosed NOV76b (alternatively referred to herein as CG56789-02) includes the 
2071 nucleotide sequence (SEQ ID NO:257 ) shown in Table 76C. A SEC2 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 61-63 and ends with a stop codon at 
15 nucleotides 2047-2049. The disclosed NOV76b maps to human chromosome 12. 



Table 76C. NOV76b Nucleotide Sequence (SEQ ID 
NO:257) 

ACCATTACATCATCGTGGCAAATTAAAGAAGGAGGTGGGAAAAGAGGACTTATTGTTGTC 
ATGGCCCATGAGATGATTGGAACTCAAATTGTTACTGAGAGGTTGGTGGCTCTGCTGGAA 
AGTGGAACGGAAAAAGTGCTGCTAATTGATAGCCGGCCATTTGTGGAATACAATACATCC 
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CACATTTTGGAAGCCATTAATATCAACTGCTCCAAGCTTATGAAGCGAAGGTTGCAACAG 

GACAAAGTGTTAATTACAGAGCTCATCCAGCATTCAGCGAAACATAAGGTTGACATTGAT 

TGCAGTCAGAAGGTTGTAGTTTACGATCAAAGCTCCCAAGATGTTGCCTCTCTCTCTTCA 

GACTGTTTTCTCACTGTACTTCTGGGTAAACTGGAGAAGAGCTTCAACTCTGTTCACCTG 

CTTGCAGGTGGGTTTGCTGAGTTCTCTCGTTGTTTCCCTGGCCTCTGTGAAGGAAAATCC 

ACTCTAGTCCCTACCTGCATTTCTCAGCCTTGCTTACCTGTTGCCAACATTGGGCCAACC 

CGAATTCTTCCCAATCTTTATCTTGGCTGCCAGCGAGATGTCCTCAACAAGGAGCTGATG 

CAGCAGAATGGGATTGGTTATGTGTTAAATGCCAGCAATACCTGTCCAAAGCCTGACTTT 

ATCCCCGAGTCTCATTTCCTGCGTGTGCCTGTGAATGACAGCTTTTGTGAGAAAATTTTG 

CCGTGGTTGGACAAATCAGTAGATTTCATTGAGAAAGCAAAAGCCTCCAATGGATGTGTT 

CTAGTGCACTGTTTAGCTGGGATCTCCCGCTCCGCCACCATCGCTATCGCCTACATCATG 

AAGAGGATGGACATGTCTTTAGATGAAGCTTACAGATTTGTGAAAGAAAAAAGACCTACT 

ATATCTCCAAACTTCAATTTTCTGGGCCAACTCCTGGACTATGAGAAGAAGATTAAGAAC 

CAGACTGGAGCATCAGGGCCAAAGAGCAAACTCAAGCTGCTGCACCTGGAGAAGCCAAAT 

GAACCTGTCCCTGCTGTCTCAGAGGGTGGACAGAAAAGCGAGACGCCCCTCAGTCCACCC 

TGTGCCGACTCTGCTACCTCAGAGGCAGCAGGACAAAGGCCCGTGCATCCCGCCAGCGTA 

CCCAGCGTGCAGCCGTCGCTGTTAGAGGACAGCCCGCTGGTACAGGCGCTCAGTGGGCTG 

CACCTGTCCGCAGACAGGCTGGAAGACAGCAATAAGCTCAAGCGTTCCTTCTCTCTGGAT 

ATCAAATCAGTTTCATATTCAGCCAGCATGGCAGCATCCTTACATGGCTTCTCCTCATCA 

GAAGATGCTTTGGAATACTACAAACCTTCCACTACTCTGGATGGGACCAACAAGCTATGC 

CAGTTCTCCCCTGTTCAGGAACTATCGGAGCAGACTCCCGAAACCAGTCCTGATAAGGAG 

GAAGCCAGCATCCCCAAGAAGCTGCAGACCGCCAGGCCTTCAGACAGCCAGAGCAAGCGA 

TTGCATTCGGTCAGAACCAGCAGCAGTGGCACCGCCCAGAGGTCCCTTTTATCTCCACTG 

CATCGAAGTGGGAGCGTGGAGGACAATTACCACACCAGCTTCCTTTTCGGCCTTTCCACC 

AGCCAGCAGCACCTCACGAAGTCTGCTGGCCTGGGCCTTAAGGGCTGGCACTCGGATATC 

TTGGCCCCCCAGACCTCTACCCCTTCCCTGACCAGCAGCTGGTATTTTGCCACAGAGTCC 

TCACACTTCTACTCTGCCTCAGCCATCTACGGAGGCAGTGCCAGTTACTCTGCCTACAGC 

TGCAGCCAGCTGCCCACTTGCGGAGACCAAGTCTATTCTGTGCGCAGGCGGCAGAAGCCA 

AGTGACAGAGCTGACTCGCGGCGGAGCTGGCATGAAGAGAGCCCCTTTGAAAAGCAGTTT 

AAACGCAGAAGCTGCCAAATGGAATTTGGAGAGAGCATCATGTCAGAGAACAGGTCACGG 

GAAGAGCTGGGGAAAGTGGGCAGTCAGTCTAGCTTTTCGGGCAGCATGGAAATCATTGAG 

GTCTCCTGAGAAGAAAGACACTTGTGACTTC __ 



A NOV76b polypeptide (SEQ ID NO:258) encoded by SEQ ID NO:257 is 662 amino 
acids in length and is presented using the one-letter amino acid code in Table 76D. The Psort 
profile for NOV76b predicts that this sequence has no signal peptide and is likely to be 
localized to the nucleus with a certainty of 0.8800. In alternative embodiments, a NOV76b 
polypeptide is located to peroxisomal microbodies with a certainty of 0.3000. 



Table 76D. NOV76b Polypeptide Sequence (SEQ ID 
NO:258) 

MAHEMIGTQIVTERLVALLESGTEKVLLIDSRPFVEYNTSHILEAININCSKLMKRRLQQ 
DKVLITELIQHSAKHKVDIDCSQKVWYDQSSQDVASLSSDCFLTVLLGKLEKSFNSVHL 
LAGGFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTRILPNLYLGCQRDVLNKELM 
QQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCEKILPWLDKSVDFIEKAKASNGCV 
LVHCLAGISRSATIAIAYIMKRMDMSLDEAYRFVKEKRPTISPNFNFLGQLLDYEKKIKN 
QTGASGPKSKLKLLHLEKPNEPVPAVSEGGQKSETPLSPPCADSATSEAAGQRPVHPASV 
PSVQPSLLEDSPLVQALSGLHLSADRLEDSNKLKRSFSLDIKSVSYSASMAASLHGFSSS 
EDALEYYKPSTTLDGTNKLCQFSPVQELSEQTPETSPDKEEASIPKKLQTARPSDSQSKR 
LHSVRTSSSGTAQRSLLSPLHRSGSVEDNYHTSFLFGLSTSQQHLTKSAGLGLKGWHSDI 
LAPQTSTPSLTSSWYFATESSHFYSASAIYGGSASYSAYSCSQLPTCGDQVYSVRRRQKP 
SDRADSRRSWHEESPFEKQFKRR'SCQMEFGESIMSENRSREELGKVGSQSSFSGSMEIIE 

VS - _ 



A BLAST analysis of NOV76 was run against the proprietary PatP GENE SEQ Protein 

Patent database. It was found, for example, that the amino acid sequence of NOV76 had high 

homology to other proteins as shown in Table 76E. 
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Table 76E. BLASTX results from PatP database for NOV76 



Sequences producing High-scoring Segment Pairs: 

patp: AAE04834 Human SGP002 phosphatase polypeptide 
patp : AAU09016 Human dual specificity phosphatase 21117 
patp: AAB20325 Human protein phosphatase and kinase protein 
patp : AAM25744 Human protein sequence 

patp:AAW29150 Dual - specific murine thr-tyr phosphatase 



Smallest 
Sum 

High Probability 
Score P (N) 

3360 0.0 

3360 0.0 

2963 1.3e-308 

2860 l.le-297 

1088 1.5e-125 
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In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 1 149 of 1 150 bases (99%) identical to a gb:GENBANK- 
lD:AB052156|acc:AB052 156.1 mRNA from Homo sapiens (MKP-7 mRNA for MAPK 
phosphatase-7). The full amino acid sequence of the protein of the invention was found to 
have 662 of 665 amino acid residues (99%) identical to, and 662 of 665 amino acid residues 
(99%) similar to, the 665 amino acid residue ptnr:SPTREMBL-ACC:Q9BY84 protein from 
Homo sapiens (Human) (MAPK PHOSPHATASE-7). NOV76 also has homology to the 
other proteins shown in the BLASTP data in Table 76F. 



Table 76F. NOV76 BLASTP results 




Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 

(%) 


Expect 


gi | 12697945 |d 
bj | BAB21791 . 1 ; 
| (AB051487) 


KIAA1700 protein 
[Homo sapiens] 


690 


665/680 
(97) 


665/680 
(97) 


0 . 0 


gi | 14756395 | r 
ef |XP_039106. 
1 | {XM 039106) 


MAPK phosphatase-7 [Homo 
sapiens] 


665 


665/680 
(97) 


665/680 
(97) 


0.0 


gi | 16550836 |d 
bj | BAB71060 . 1 
| (AK055973) 


unnamed protein product 
[Homo sapiens] 


665 


664/680 
(97) 


664/680 
(97) 


0 . 0 


gi | 13990989 |d 
bj |BAB47240.1 
| (AB052157) 


MAP kinase phosphatase-7 
[Mus musculus] 


660 


601/680 
(88) 


628/680 
(91) 


0.0 


gi | 13625393 |g 
b|7AAK35052 . 1 | 
AF345951_1 
(AF345951) 


map kinase phosphatase-M 
Al isoform 
[Mus musculus] 


677 


533/644 
(82) 


568/644 
(87) 


0.0 
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This BLASTP data is displayed graphically in the ClustaiW in Table 76G. A multiple 
sequence alignment is given, with the NOV76 protein being shown on line 1 in a ClustaiW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 76F. 
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Table 76G. ClustalW Alignment of NOV76 



NOV7 6a (SEQ ID NO: 25 6) 

NOV76b {SEQ ID NO: 258) 

gi|l2697945| (SEQ ID NO:679) 

gi|l4756395| (SEQ ID NO:680) 

gi|l6550836| (SEQ ID NO:681) 

gi|l3990989| (SEQ ID NO:682) 

gijl3625393| (SEQ ID NO:683) 

10 20 30 40 50 

— I — I — I - - - - i — i .... i .... | --- - | - - - I I 

NOV7 6a 

NOV7 6b 

gi | 126 97945 | AFEHTTITSSWQIKEGGGKRGLIWjj 

gi|l4756395| 

gi|l6550836| 

gi|l3990989| 

gi | 13625393 I 

60 70 80 90 100 
I .... 1 .... I I I I • I | ■ - . - | , , - , | 



mahemigtqivterlvallesgtek 
•mahemigtqivterlvallesgtek 
mahemigtqivterlvallesgtek 
mahemigtqivterlvallesgtek 
mahemigtqivterlvallesgtek 
mahemigtqivteSlvallesgtek 
1mahemigtqivte§lvallesgtek 



NOV7 6a 
NOV7 6b 
gi|l2697945| 
gi | 14756395 j 
gi j 16550836 j 
gi j 13990989 j 
gi | 13625393 j 



NOV7 6a 

NOV 7 6b 

gi | 12697945 | 

gi | 14756395 j 

gi j 16550836 j 

gi | 13990989 | 

gi j 13625393 | 



NOV7 6a 

NOV7 6b 

gi | 12697945 | 

gi | 14756395 | 

gi j 16550836 j 

gi|l3990989| 

gi | 13625393 | 



NOV76a 

NOV7 6b 

gi | 12697945| 

gi|l4756395| 

gi | 16550836 | 

gi j 13990989 j 

gi | 13625393 j 



vllidsrpfveyntshileainincsklmkrrlqqdkvliteliqhsakh 
vllidsrpfveyntshileainincsklmkrrlqqdkvliteliqhsakh 
vllidsrpfveyntshileainincsklmkrrlqqdkvliteliqhsakh 
vllidsrpfveyntshileainincsklmkrrlqqdkvliteliqhsakh 
vllidsrpfveyntshileainincsklmkrrlqqdkvliteliqhsakh 
vllidsrpfveyntshileainincsklmkrrlqqdkvliteliqhsakh 
vllidsrpfveyntshileainincsklmkrrlqqdSvliteliqhsakh 



110 



120 



130 



140 



I 



150 

^1 



staqv 



didcsqkvwydqssqdvaslssdcfltvllgkleksfnsvhll 
didcsqkvwydqssqdvaslssdcfltvllgkleksfnsvhll 
didcsqkvwydqssqdvaslssdcfltvllgkleksfnsvhll 
didcsqkvwydqssqdvaslssdcfltvllgkleksfnsvhll 
didcsqkvwydqssqdvaslssdcfltvllgkleksfnsvhll 
didc^qkvwydqssqdvSslssdcfltvllgkle^sfnsvhll 

nTr>r|QKVWYDOSSODV§SLSSDCFLTVLLGKLE|g sFNSVHLL 



160 



170 



180 



I 



t 



190 

■ I . • 



200 



SLFLG 



G 



1 



GFAEFSRCFPGLCEGKSTLVPTCTSQPCLPVANIGPTRILPNLY 
GFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTRILPNLY 
GFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTRILPNLY 
GFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTRILPNLY 
GFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTjjlLPNLY 
GFAEFSRCFPGLCEGKSTLVPTCISQPCLPVANIGPTRILPNLY 
GFAEFSRCFPGLCEGKSTLVPTCISQPCLPV ANIGPTRILPNLY 



210 
. . I . . 



220 
I I . . 



230 
. . I . . 



240 



I 



I 



250 

- - L 



LGCQRDVLNKELMQQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCE 
LGCQRDVLNKELMQQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCE 
LGCQRDVLNKELMQ QNG I G Y VLNASNTC PKPDFIPE SHFLRVP VNDS FC E 
LGCQRDVLNKELMQQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCE 
LGCQRDVLNKELMQQNG I GY VLNASNTCPKPDF I PE SHFLRVPVNDS FC E 
LGCQRDVLNKj|LMQQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCE 



5CQRDVLNKB-LMQC 



JYVLNASNTCPKPDFIPESHFLRVPVNDSFCE 



260 270 280 290 300 

NOV7 6 a P^iM!4IiJlfiM^HGKLT YTE 

NOV7 6b 
gi | 12697945 | 
gi j 14756395 j 
gi | 16550836 | 
gi | 13990989 | 



KILPWLDKSVDFlH^^jEKAKASNGCVLVHCLAGISRSATIAIAYIMK 

kilpwldksvdfiB^HekakasngcvlvhclagisrsatiaiayimkI 
kilpwldksvdfiHSHHekakasngcvlvhclagisrsatiaiayimk 
kilpwldksvdfiB^^Hekakasngcvlvhclagisrsatiaiayimk 
kilpwldksvdfi^S^Bekakasngcvlvhclagisrsatiaiayimk 
kilpwldksvdfi^^^HekakasngcvlIhclagi^rsatiaiayimk 
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gi | 13625393 | 



KILPWLDKSVDFlj 



iATIAIAYIMK 



310 



320 



330 



340 



350 



NOV7 6a 

NOV76b 

gi | 12697945| 

gi|l4756395| 

gi | 16550836 | 

gi | 13990989 | 

gi | 13625393 j 



NOV76a 
NOV7 6b 
gi | 12697945 | 
gi | 14756395 | 
gi | 16550836 | 
gi | 13990989 j 
gi j 13625393 | 



NOV76a 
NOV7 6b 
gi | 12697945 | 
gi j 14756395 j 
gi | 16550836 j 
gi | 13990989 | 
gi | 13625393 j 



NOV7 6a 
NOV7 6b 
gi | 12697945 | 
gi | 14756395 j 
gi | 16550836 | 
gi j 13990989 j 
gi|l3625393 j 



NOV7 6a 

NOV7 6b 

gi | 12697945| 

gi | 14756395 j 

gi|l6550836| 

gi | 13990989 j 

gi | 13625393 | 



NOV7 6a 
NOV7 6b 
gi |12697945| 
gi j 14756395] 
gi j 16550836 j 
gi 1 13990989 j 
gi j 13625393 j 



NOV7 6a 

NOV76b 

gi | 12697945 | 

gi j 14756395 | 

gi | 16550836 | 




610 



I 



620 
. . I . . 



630 



I 



640 
. . I . . 



650 
■^1 



TESSHFYSASAIYGGSASYSAYSCSQLPTCGDQVYSVRRRQKPSDRADSR 
TESSHFYSASAIYGGSASYSAYSCSQLPTCGDQVYSVRRRQKPSDRADSR 
TESSHFYSASAIYGGSASYSAYSCSQLPTCGDQVYSVRRRQKPSDRADSR 
TESSHFYSASAIYGGSASYSAYSCSQLPTCGDQVYSVRRRQKPSDRADSR 
TESSHFYSASAIYGGSASYSAYSCSQLPTCGDQVYSVRRRQKPSDRADSR 
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gi | 13990989 | 
gi j 13625393 j 



NOV7 6a 
NOV7 6b 
gi | 12697945 
gi j 14756395 
gi | 16550836 
gi |13990989 
gi | 13625393 



NOV7 6a 
NOV7 6b 
gi | 12697945 | 
gi | 14756395 j 
gi 1 16550836 | 
gi | 13990989 | 
gi j 13625393 | 



SYSAYSCgQLPTCgDQgYSVRRRQKPgDRADSR 

sysayscBqlptcsdqBysvrrrqkpisdradsr 



660 670 
. - I I I • • 



680 



690 



700 



heespfekqfkrrscqmefgesimsenrsreelg 
heespfekqfkrrscqmefgesimsenrsreelg 
heespfekqfkrrscqmefgesimsenrsreelg 
heespfekqfkrrscqmefgesimsenrsreelg 
heespfekqfkrrscqmefgesimsenrsreelg 

HEESPFEKQFKRRSCQMEFGESIMSENRSREELG 



^GMi^RA§L^SSLNAEA^AKWNLER^liCRRTGPGRSWARW7AASP. 





SLLLTVLFPVHKK 



Table 76H lists the domain description from DOMAIN analysis results against 
NOV76. This indicates that the NOV76 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 76H. Domain Analysis of NOV76 



gnl | Pf am| pf am007 82 , DSPc, Dual specificity phosphatase, catalytic domain. 
Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly 
similar to that of tyrosine-specif ic phosphatases, except for a "recognition" 
region. SEQ ID NO: 87 0 

CD-Length = 13 9 residues, 10 0.0% aligned 
Score = 172 bits (436), Expect = 6e-44 



NOV7 6 : 
Sbjct : 

NOV76 : 
Sbjct: 

N0V7 6 : 
Sbj Ct : 



166 GPTRILPNLYLGCQRDVLNKELMQQNGIGYVLNASNTCPKPDFIPESHFLRVPVNDSFCE 22 5 

GP+ ILP+LYLG N + + GI +V+N + P +L +PV+D+ 

1 GPSEILPHLYLGSYPTASNLAFLSKLGITHVINVTEEVPN-SKNSGFLYLHIPVDDNHET 59 

o o o o o o 

226 KILPWLDKSVDFIGKLTYTEKAKASNGCVLVHCLAGISRSATIAIAYIMKRMDMSLDEAY 28 5 

I P+LD++V+FI E A+ G VLVHC AGISRSAT+ IAY+MK ++SL+EAY 

60 DISPYLDEAVEFI EDARQKGGKVLVHCQAGISRSATLIIAYLMKTRNLSLNEAY 113 



286 RRFVKEKRPTISPNFNFLGQLLDYEKK 312 

FVKE+RP ISPNF F QL++YE+K 
114 - SFVKERRP I I SPNFGFKRQLI EYERK 13 9 



10 



Mitogen-activated protein kinases (MAPKs) are inactivated via dephosphorylation of 

either the threonine or tyrosine residue or both in the P-loop catalyzed by protein phosphatases 

which include serine/threonine phosphatases, tyrosine phosphatases, and dual specificity 

phosphatases. Nine members of the dual specificity phosphatases specific for MAPKs, termed 

MKPs, have been reported. Each member has its own substrate specificity, tissue distribution, 
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and subcellular localization. MKP-7 is most similar to hVH5, a member of previously known 
MKPs, in the primary structure. MKP-7 is predominantly localized in the cytoplasm when 
expressed in cultured cells, whereas hVH5 is both in the nucleus and the cytoplasm. MKP-7 
binds to and inactivates p38 MAPK and JNK/SAPK, but not ERK. Furthermore, MKPs have 
5 the substrate specificity toward the isoforms of the p38 family (alpha, beta, gamma, and delta). 
MKP-7 binds to and inactivates p38 alpha and -beta, but not gamma or delta. MKP-5 and 
CL1 00/MKP-1 also bind to p38 alpha and -beta, but not gamma or delta. 

NOV76 is predicted to be expressed in at least the following tissues: blood, brain, 
CNS, colon, heart, kidney, lung, and stomach. This information was derived by determining 

10 the tissue sources of the sequences that were included in the invention including but not 

limited to SeqCalling sources, public EST sources, literature sources, and/or RACE sources. 
Further expression data for NOV76 is provided in Example 2. 

The NOV76 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 

15 cardiomyopathy, atherosclerosis, hypertension, congenital heart defects, aortic stenosis, atrial 
septal defect (ASD), atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary 
stenosis, subaortic stenosis, ventricular septal defect (VSD), valve diseases, tuberous sclerosis, 
scleroderma, obesity, transplantation, hypercalceimia, ulcers, Hirschsprung's disease, Crohn's 
Disease, anemia, ataxia-telangiectasia, autoimmune disease, immunodeficiencies, Von Hippel- 

20 Lindau (VHL) syndrome, Alzheimer's disease, stroke, Parkinson's disease, Huntington's 

disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, leukodystrophies, 
behavioral disorders, addiction, anxiety, pain, neurodegeneration, neuroprotection, systemic 
lupus erythematosus, asthma, emphysema, allergy, ARDS, diabetes, renal artery stenosis, 
interstitial nephritis, glomerulonephritis, polycystic kidney disease, renal tubular acidosis, IgA 

25 nephropathy, as well as other diseases, disorders and conditions. NOV76 nucleic acids 

encoding the MAP kinase-like protein of the invention, or fragments thereof, may further be 
useful in diagnostic applications, wherein the presence or amount of the nucleic acid or the 
protein are to be assessed. 

The novel nucleic acid of the invention encoding a phosphatase-like protein includes 

30 the nucleic acid whose sequence is provided in Table 76A or 76C, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 76A or 76C while still encoding a protein that 
maintains its phosphatase-like activities and physiological functions, or a fragment of such a 
nucleic acid. The invention further includes nucleic acids whose sequences are complementary 
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to the sequence of Table 76A or 76C, including nucleic acid fragments that are complementary 
to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
5 include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1% of the bases may 

10 be so changed. 

The novel protein of the invention includes the phosphatase-like protein whose 
sequence is provided in Table 76B or 76D. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
76B or 76D while still encoding a protein that maintains its phosphatase-like activities and 

15 physiological functions, or a functional fragment thereof. In the mutant or variant protein, up 
to about 1% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 

20 prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV77 

The disclosed NOV77 (alternatively referred to herein as CG56804-01) includes the 
25 881 nucleotide sequence (SEQ ID NO:259) shown in Table 77A. A NOV77 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 61-63 and ends with a stop codon at 
nucleotides 769-771. The disclosed NOV77 maps to human chromosome 14. 



Table 77 A. NOV77 Nucleotide Sequence (SEQ ID NO:259) 

GAGGGTCGGCCGGCTGTGTAACACTCTCCCACCCCACCCACCAGCCCGCGGGCCAGCACCATGGAGGACG 
TGAAGCTGGAGTTCCCTTCCCTTCCACAGTGCAAGGAAGACGCCGAGGAGTGGACCTACCCTGAGTGGAC 
CTACCCTATGAGACGAGAGATGCAGGAAATTTTACCTGGATTGTTCTTAGGCCCATATTCATCTGCTATG 
AAAAGCAAGGTACTACCTGTACTACAGAAACATGGAATAACCCATATAATATGCATACGACAAAATATTG 
AAGCAAACTTTATTAAACCAAACTTTCAGCAGTTATTTAGGTATTTAGTCCTGGATATTGCAGATAATCC 
AGTTGAAAATATAATACGTTTTTTCCCTATGTTTTGCCTCCAGACTAAGGAATTTATTGATGGGAGCTTA 
CAAATGGGAGGTAAAGTTCTTGTGCATGGAAATGCAGGGATCTCCAGAAGTGCAGCCTTTGTTATTGCAT 
ACATTATGGAAACATTTGGAATGAAGTACAGGTTCAGAGATGCTTTTGCTTATGTTCAAGAAAGAAGATT 
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TTGTATTAATCCTAATGCTGGATTTGTCCATCAACTTCAGGAATATGAAGCCATCTACCTAGCAAAATTA 
ACAATACAGATGATGTCACCACTCCAGATAGAAAGGTCATTATCTGTTCATTCTGGTACCACAGGTGGCA 
GTTTGAAGAGAACACATGAAGAAGAGGATGATTTTGGAACCATGCAAGTGGCGACTGCACAGAATGGCTG 
ACTTGAAGAGCAACATCATAGAGTGTGAATTTCTATTTGGGAAGGAGAAAATACAAGAGAAAATTATAAT 
GTAAAATGGTAAAAACATAAGTAGTTTTTTTTTCAATTACA , 

A NOV77 polypeptide (SEQ ID NO:260) encoded by SEQ ID NO:259 is 236 amino 
acids in length and is presented using the one-letter amino acid code in Table 77B. The Psort 
profile for NOV77 predicts that this sequence has no signal peptide and is likely to be 
5 localized at the cytoplasm with a certainty of 0.6036. In alternative embodiments, a NOV77 
polypeptide is located to lysosomes with a certainty of 0.2040. 



Table 77B. NOV77 Polypeptide Sequence (SEQ ID 
NO:260) 

MEDVKLEFPSLPQCKEDAEEWTYPEWTYPMRREMQEILPGLFLGPYSSAMKSKVLPVLQK 
HGITHIICIRQNIEANFIKPNFQQLFRYIiVLDIADNPVENIIRFFPMFCLQTKEFIDGSL 
QMGGKVL VHGNAG I S RS AAFV I AY I METFGMKYRFRDAFAYVQERRFC INPNAGFVHQLQ 
EYEAIYLAKLTIQMMSPLQIERSLSVHSGTTGGSLKRTHEEEDDFGTMQVATAQNG 

A BLAST analysis of NOV77 was run against the proprietary PatP GENE SEQ Protein 
10 Patent database. It was found, for example, that the amino acid sequence of NOV77 had high 
homology to other proteins as shown in Table 77C. 



Table 77C. BLASTX results from PatP database for NOV77 










Smallest 










Sum 








High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp 


AAM39734 


Human polypeptide 


1099 


4 . 3e-lil 


patp 


AAM41520 


Human polypeptide 


1099 


4 . 3e-lll 


patp 


.AAE08552 


Human phosphatase protein - Homo sapiens 


1099 


4 .3e-lll 


patp 


:AAU09017 


Human dual specificity phosphatase 3 8692 


1099 


4 .3e-lH 


patp 


:AAY68795 


Amino acid sequence of a human protein 


210 


6 - 9e-17 



In a search of sequence databases, it was found, for example, that the nucleic acid 
15 sequence of this invention has 228 of 249 bases (91%) identical to a gb:GENBANK- 

ID:MMU34973|acc:U34973.1 mRNA from Mus musculus (protein tyrosine phosphatase-like 
mRNA, unspliced c-terminal product and spliced c-terminal end STYX). The full amino acid 
sequence of the protein of the invention was found to have 214 of 236 amino acid residues 
(90%) identical to, and 221 of 236 amino acid residues (93%) similar to, the 223 amino acid 
20 residue ptnr:SPTREMBL-ACC:Q60970 protein from Mus musculus (Mouse) (PROTEIN 

TYROSINE PHOSPHATASE-LIKE). NOV77 also has homology to the other proteins shown 
in the BLASTP data in Table 77D. 
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Table 77D. NOV77 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi | 174 767 93 | r 
ef |XP_058659 . 
1 | (XM 058659) 


similar to putative (H. 
sapiens) [Homo sapiens] 


223 


223/236 
(94) 


223/236 
(94) 


e-118 


gi | 12833088 | d 
bj |BAB22384 . 1 
| (AK002822) 


phosphoserine/ threonine/ 
tyrosine interaction 
protein-putative 
[Mus musculus] 


223 


215/236 
(91) 


221/236 
(93) 


e-116 


gi | 2137698 |pi 
r | |I49365 


protein tyrosine 
phosphatase - mouse 


233 


214/236 
(90) 


221/236 
(92) 


e-116 


gi| 978998l|re 
f|NP 062611.1 
| (NM_019637) 


phosphoserine/ threonine/ 
tyrosine interaction 
protein; STNS 
(alternatively spliced 
intron of Styx) ; protein 
tyrosine phosphatase - 
like unspliced c- 
terminal product and 
spliced c-terminal end 
STYX [Mus musculus] 


205 


163/180 
(90) 


167/180 
(92) 


3e-87 


gi | 1842088 | gb 
| AAB47561 . 1 | 
(U87169) 


tyrosine phosphatase- 
like protein homolog 
hSTYXb [Homo sapiens] 


66 


66/68 
(97) 


66/68 
(97) 


8e-30 



This BLASTP data is displayed graphically in the ClustalW in Table 77E. A multiple 
sequence alignment is given, with the NOV77 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 77D. 



Table 77E. ClustalW Alignment of NOV77 



NOV77 

gi I 17476793 | 
gi | 12833088 j 
gi I 2137698 | 
gi | 9789981 | 
gi | 1842088 | 



NOV7 7 

gi | 17476793 | 
gi | 12833088 j 
gi j 2137698 | 
gi | 9789981 | 
gi j 1842088 j 



(SEQ ID NO : 2 6 0 ) 

(SEQ ID NO: 684) 

(SEQ ID NO:685) 

(SEQ ID NO:686) 

(SEQ ID NO: 68 7) 

(SEQ ID NO: 688) 



10 



| 



MEDVKLEFPSiiPQCK^DAEEWTY 



20 30 
PEWTY 



40 
■[■■ 



50 



PMRREMQEMLPGLFLGPYSSAM 



medvklefps^pqck|daeewty^^mpmrremqeglpglflgpyssam 
medvklefps5pqck§daeewtyH^Hpmrremqe|lpglflgpyssam 
medvklefps|pqck|daeewtyM1Mpmrremqe|lpglflgpyssam 
medvklefps*pqck§daeewty^^S pmrrem Q e Slpgl flgpyssam 



pmrremqeglpglflgpyssam 
pmrremqeRlpglflgpyssam 



100 



NOV77 


KSK 


LP 


gi | 17476793 | 


KSK 


LP 


gi j 12833088 j 


KSK 


LP 


gi | 2137698 | 


KSK 


LP 


gi | 9789981 | 


KSK 


LP 
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gi | 1842088 | 



NOV7 7 

gi | 17476793 | 
gi j 12833088 j 
gi |2137698 | 
gi j 9789981 j 
gi j 1842088 j 



NOV7 7 

gi | 17476793 | 
gi | 12833088 | 
gi j 2137698 | 
gi | 9789981 | 
gi | 1842088 j 



NOV77 

gi | 17476793 [ 
gi j 12833088 j 
gi | 2137698 | 
gi | 9789981 j 
gi | 1842088 j 



Table 77F lists the domain description from DOMAIN analysis results against NOV77. 
This indicates that the NOV77 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 77F. Domain Analysis of NOV77 

qnl | Smart I smart00195 , DSPc, Dual specificity phosphatase, catalytic domain 
SEQ ID NO: 871 

CD-Length = 139 residues, 98.6% aligned 
Score = 142 bits (358), Expect = 2e-35 



NOV77: 33 EMQEILPGLFLGPYSSAMKSKVLPVLQKHGITHIICIRQNIEANFIKPNFQQLFRYLVLD 92 

EILP L+LG YS A L +L+K GITH+I + + + + F YL + 

Sbjct : 1 GPSEILPHLYLGSYSDASN L ALL KKLG I THV I NV TEEVPNSNKSGFLYLGIP 52 

NOV7 7 : 93 I ADNP VEN 1 1 R F F PM FCLQT KE F I DG S LQMGG K VL VHGNAG I S RS AAF V I AY I M E T FGM K 152 

+ DN I + P EFI + + + GGKVLVH AG+SRSA +IAY+M+ M 

Sbj ct : 53 VDDNTETKISPYLPE AVEFIEDAEKKGGKVLVHCQAGVSRSATLI IAYLMKYRNMS 108 

NOV77: 153 YRFRDAFAYVQERRFCINPNAGFVHQLQEYE 18 3 

DA+ +V+ERR I+PN GF+ QL EYE 
Sbjct: 109 L--NDAYDFVKERRPIISPNFGFLRQLIEYE 137 



Mitogen-activated protein (MAP) kinase phosphatases constitute a growing family of 
dual specificity phosphatases thought to play a role in the dephosphorylation and inactivation 
of MAP kinases and are therefore likely to be important in the regulation of diverse cellular 



no 

I - - • - I I 

~ Jfclc 



120 
. . I . . 



130 

. . I . . 



140 
. . I . . 



150 
. - I 



IIRFFPM 
IIRFFPM 



tkefidgslq^ggkvlvhgnagisrsaafviayimetfg: 
tkefidgslqSJggkvlvhgnagisrsaafviayimetfg 
tkefidgslqSggkvlvhgnagisrsaafviayimetfg 
tkefidgslqSggkvlvhgnagisrsaafviayimetfg 
tkefidgslqSggkvlvhgnagisrsaafviayimetfg 
vl vhgnag i srsaafvi ay imetfg 



160 



180 



200 
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processes such as proliferation, differentiation, and apoptosis. For this reason it has been 
suggested that MAP kinase phosphatases may be tumor suppressors. 

NOV77 is predicted to be expressed in at least the following tissues: lung, lymphoid 
tissue, spleen, tonsils, whole organism. This information was derived by determining the 
5 tissue sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
expression data for NOV77 is provided in Example 2. 

The NOV77 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
10 brain disorders including epilepsy, eating disorders, schizophrenia, ADD, and cancer; heart 
disease; blood disorders, kidney disorders, liver diseases, inflammation and autoimmune 
disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 
inflammatory skin disorders, allergies, blood disorders; psoriasis; colon-, ovarian-, testicular-, 
lymphatic-, brain-, and pancreatic cancers; leukemia AIDS; thalamus disorders; metabolic 
15 disorders including diabetes and obesity; lung diseases such as asthma, emphysema, cystic 
fibrosis, and cancer; pancreatic disorders including pancreatic insufficiency; and prostate 
disorders including prostate cancer and other diseases, disorders and conditions of the like. 
The NOV77 nucleic acid encoding the MAP kinase-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
20 the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a dual specificity phosphatase-like 
protein includes the nucleic acid whose sequence is provided in Table 76A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 76A while still encoding a protein 
25 that maintains its dual specificity phosphatase-like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to the sequence of Table 76A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
30 complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
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In the mutant or variant nucleic acids, and their complements, up to about 9% of the bases may 
be so changed. 

The novel protein of the invention includes the dual specificity phosphatase-like 
protein whose sequence is provided in Table 76B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 76B while still encoding a protein that maintains its dual specificity phosphatase-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 10% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV78 

The disclosed NOV78 (alternatively referred to herein as CG568 10-01) includes the 
777 nucleotide sequence (SEQ ID NO:261) shown in Table 78A. A NOV78 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 61-63 and ends with a stop codon at 
nucleotides 768-770. The disclosed NOV78 maps to human chromosome 2. 



Table 78A. NOV78 Nucleotide Sequence (SEQ ID NO:261) 

TCACCTGAGCCTAGGAGTTCAAGATTGCAGTGGCCTATGATTGCATCATTGCACTCCAGCCTGGGTGACA 
TCACCTGAGCCTAGGAGTTCAAGATTGCAGTGGCCTATGATTGCATCATTGCACTCCAGCCTGGGTGACA 
GAAAG AGAC CCTGT C T CTG C ATGATG AT AAT AAT AAT TAAAAG AG AGAG AGAGAG AGAGAAAAC AT AT AG 
TAGAAGGAGCACTTCTGGTGTGGGATTGAAGCAGTATTATCATTCAAGTGAGTCGCATTTAAATTTTTTA 
CCTTT CTTGTT AC CG AT AT C AAC AC C CCAGT C T CTT T TT AGGT AC C CT AAT T C ATGGG ATT T AAAAC AAT 
CATCTTTTTTCTTTCTCTTTAAGGTGACTCATATTCTTAATGTTGCATATGGAGTTGAAAATGCTTTCCT 
CAGTGACTTTACATATAAGAGCATTTCTATATTGGATCTGCCTGAAACCAACATCCTGTCTTATTTTCCA 
GAATGTTTTGAATTTATTGAAGAAGCAAAAAGAAAAGTGAGTTTTGTTTTGATCCATAGTTCTGCAGGAG 
TGGTTCTTGTTCATTGTAATGCAGGCGTTTCCAGGGCTGCTGCAATTGTAATAGGTTTCCTGATGAATTC 
TGAACAAACCTCATTTACCAGTGCTTTTTCTTTGGTGAAAAATGCAAGACCTTCCATATGTCCAAATTCT 
GGCTTCATGGAGCAGCTTCGTACATATCAAGAGGGCAAAGAAAGCAATAAGTGTGACAGAATACAGGAGA 
ACAGTTCATGAGTTGCATTGTAGCAGACAATGGACAACTGTAGTTTCTGAATTGACTTCTATAGCCATCT 
TTTCCCT ____ . - 

A NOV78 polypeptide (SEQ ID NO:262) encoded by SEQ ID NO:261 is 224 amino 
acids in length and is presented using the one-letter amino acid code in Table 78B. The Psort 
profile for NOV78 predicts that this sequence has no signal peptide and is likely to be 
localized to the endoplasmic reticulum (membrane) with a certainty of 0.6400. In alternative 
embodiments, a NOV78 polypeptide is located to the plasma membrane with a certainty of 
0.4960, or to the nucleus with a certainty of 0.2420. 
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Table 78B. NOV78 Polypeptide Sequence (SEQ ID 
NO:262) 

MIASLHSSLGDRKRPCLCMMI I I IKREREREKTYSRRSTSGVGLKQYYHSSESHLNFLPF 
LLPISTPQSLFRYPNSWDLKQSSFFFLFKVTHILNVAYGVENAFLSDFTYKSISILDLPE 
TNILSYFPECFEFIEEAKRKVSFVLIHSSAGWLVHCNAGVSRAAAIVIGFLMNSEQTSF 
TSAFSLVKNARPSICPNSGFMEQIiRTYQEGKESNKCDRIQENSS 

A BLAST analysis of NOV78 was run against the proprietary PatP GENE SEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV78 had high 
5 homology to other proteins as shown in Table 78C. 



Table 78C. BLASTX results from PatP database for NOV78 








Smallest 








Sum 






High 


Probability 


Sequences producing High-scoring Segment Pairs: Score 


P(N) 


patp 


AAB29109 


Human cellular proliferative response protein 618 


4 . 0e-60 


patp 


AAB73224 


Human phosphatase AI031656_h - Homo sapiens 618 


4 . 0e-60 


patp 


AAB73215 


Murine phosphatase AA2744 5 7_m 5 08 


1 . 8e-48 


patp 


AAM42211 


Human polypeptide 2 90 


2 .3e-25 


patp 


AAB94018 


Human protein sequence 2 01 


6 .2e-16 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 103 of 160 bases (64%) identical to a gb:GENBANK- 

10 ID:HS106C24|acc:Z83313.1 mRNA from Homo sapiens (Human DNA sequence from PAC 
106C24, between markers DXS294 and DXS730 on chromosome X). The full amino acid 
sequence of the protein of the invention was found to have 71 of 172 amino acid residues 
(41%) identical to, and 99 of 172 amino acid residues (57%) similar to, the 203 amino acid 
residue ptnr:SPTREMBL-ACC:Q9NGLl protein from Drosophila melanogaster (Fruit fly) 

1 5 (MAP KINASE PHOSPHATASE- 1 ). NOV78 also has homology to the other proteins shown 
in the BLASTP data in Table 78D. 



Table 78D. NOV78 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 

(%> 


Expect 


gi | 18146956 | d 
bj |BAB82499. 1 
| (AB038770) 


protein phosphatase 
[Homo sapiens] 


217 


126/136 
(92) 


126/136 
(92) 


6e-65 


gi|l3277360|r 
ef |NP_077758 . 
1 | (NM_024438) 


dua 1 - spec i f i c i t y 
phosphatase 
[Mus musculus] 


220 


103/131 
(78) 


113/131 
(85) 


3e-54 



638 




gi | 12845353 | d 
bj j BAB26718 . 1 
| (AK010127) 


Dual specificity protein 
phosphatase containing 
protein [Mus musculus] 


220 


102/131 
(77) 


113/131 
(85) 


le-53 


gi | 12858039 | d 
bj |BAB31181 . 1 
| (AK018369) 


Dual specificity protein 
phosphatase containing 
protein 


162 


103/131 
(78) 


113/131 
(85) 


le-52 


gi | 18148911 |d 
bj j BAB83499 . 1 
| (AB063187) 


SKRP1 [Homo sapiens] 


166 


74/74 
(100) 


74/74 
(100) 


7e-37 



This BLASTP data is displayed graphically in the ClustalW in Table 78E. A multiple 
sequence alignment is given, with the NOV78 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
5 Table 78D. 



Table 78E. ClustalW Alignment of NOV78 



NOV7 8 

gi | 18146956 | 
gi | 13277360 | 
gi | 12845353 j 
gi j 12858039 j 
gi | 18148911 j 



NOV7 8 

gi | 18146956 | 
gi | 13277360 | 
gi | 12845353 | 
gi j 12858039 j 
gi j 18148911 | 



NOV7 8 

gi | 18146956 | 
gi j 13277360 | 
gi | 12845353 | 
gi j 12858039 | 
gi | 18148911 j 



NOV7 8 

gi | 18146956 | 
gi j 13277360 | 
gi j 12845353 | 
gi|l2858039| 
gi | 18148911 j 



NOV7 8 

gi | 18146956 | 
gi j 13277360 | 
gi j 12845353 | 
gi j 12858039 | 
gi j 18148911 j 



(SEQ ID NO:262) 

(SEQ ID NO: 689) 

(SEQ ID NO: 6 90) 

(SEQ ID NO:691) 

(SEQ ID NO:692) 

(SEQ ID NO:693) 





DLQVGVIKPWLLLGSQDAAHDL^LgKgKVTHILN^ 



DLQVGVIKPWLLLG SQDAAHDLg 
SQDAAHDLjaj 



LSKgKVTHILNV 

lIkskvthilnv 



DLQVGVIKPWLLLGSQDAAHDL£HLB3Ki 



ISILD 
ISILD 
ISILD 
ISILD 
ISILD 



160 
■ - I 



HSSA 



GVVLVHC 
GVVLVHC 
GVVLVHC 
GVVLVHC 
GVVLVHC 
GVVLVHC 
















4 







RPSICP 
RPSICP 
RPSICP 
RPSICP 
RPSICP 
RPSICP 
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NOV 7 8 

gi | 18146956 | 
gi j 13277360 j 
gi | 12845353 | 
gi j 12858039 j 
gi j 18148911 j 



210 



GFMEQLRTYQgGKES 

gfmeqlrtyqHgkes 

GFMEQLRTYQfflGKES 

gfmeqlrtyqSgkes 
gfmeqlrtyqBgkes 
gfmeqlrtyqHgkes 



220 230 
..Ivv.-|....|.. 

S KC S^l QENS i 

SKCig|QElsrs| 

Sgg i §9paed||tggl 

RgG » K^PAEDplTGGL 
fflGG ■ fej P AE DpTGGL 
gKC^R^QENSg 



Table 78F lists the domain description from DOMAIN analysis results against NOV78. 
This indicates that the NOV78 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 78F. Domain Analysis of NOV78 


gnl | Pf am | pf am007 82 , DSPc, Dual specificity phosphatase, catalytic 


domain . 


Ser/Thr 


and 


Tyr protein phosphatases. The enzyme's tertiary fold i 


s highly 


similar 


to 


that of tyrosine- specif ic phosphatases, except for a "recognition" 


region. 


SEQ 


ID NO: 872 








CD-Length = 139 residues, 80.6% aligned 




Score = 


112 bits (280), Expect = 2e-26 




NOV78 : 


88 


FKVTH I LNVAYGVENAFLSDFT YKS I S I LDLPETNI LS YFPECFEFI EEAKRKVS FVLIH 


147 






+TH++NV VN+ SFY I+D ET+I Y E EFIE+A++K 




Sbjct : 


26 


LGITHVINVTEEVPNSKNSGFLYLHIPVDDNHETDISPYLDEAVEFIEDARQK 


78 


NOV 7 8 : 


148 


SSAGWLVHCNAGVSRAAAIVIGFLMNSEQTSFTSAFSLVKNARPSICPNSGFMEQLRTY 


207 






G VLVHC AG+SR+A ++I +LM + S A+S VK RP I PN GF QL Y 




Sbjct : 


79 


- - GGKVLVHCQAGI SRSATLI IAYLMKTRNLSLNEAYS FVKERRPIISPNFGFKRQLIEY 


1*3 6 


NOV7 8 : 


208 


Q 208 




Sbjct : 


137 


+ 

E 137 





Mitogen-activated protein (MAP) kinase phosphatases constitute a growing family of 
dual specificity phosphatases thought to play a role in the dephosphorylation and inactivation 
of MAP kinases and are therefore likely to be important in the regulation of diverse cellular 
processes such as proliferation, differentiation, and apoptosis. For this reason it has been 
suggested that MAP kinase phosphatases may be tumor suppressors. 

NOV78 is predicted to be expressed in at least the following tissues: parathyroid gland, 
peripheral blood, whole organism. This information was derived by determining the tissue 
sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
expression data for NOV78 is provided in Example 2. 

The NOV78 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
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brain disorders including epilepsy, eating disorders, schizophrenia, ADD, and cancer; heart 
disease; blood disorders, kidney disorders, liver diseases, inflammation and autoimmune 
disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 
inflammatory skin disorders, allergies, blood disorders; psoriasis; colon-, ovarian-, testicular-, 
5 lymphatic-, brain-, and pancreatic cancers; leukemia AIDS; thalamus disorders; metabolic 
disorders including diabetes and obesity; lung diseases such as asthma, emphysema, cystic 
fibrosis, and cancer; pancreatic disorders including pancreatic insufficiency; and prostate 
disorders including prostate cancer and other diseases, disorders and conditions of the like. 
NOV78 nucleic acids encoding the MAP kinase-like protein of the invention, or fragments 

10 thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a dual specificity phosphatase-like 
protein includes the nucleic acid whose sequence is provided in Table 78A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 

1 5 be changed from the corresponding base shown in Table 78A while still encoding a protein 

that maintains its dual specificity phosphatase-like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to the sequence of Table 78A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

20 The invention additionally includes nucleic acids or nucleic acid fragments, or 

complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 

25 used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 36% of the bases 
may be so changed. 

The novel protein of the invention includes the dual specificity phosphatase-like 
protein whose sequence is provided in Table 78B. The invention also includes a mutant or 
30 variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 78B while still encoding a protein that maintains its dual specificity phosphatase-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 59% of the amino acid residues may be so changed. 
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These materials are further useful in the generation of antibodies that bind 
immunospeciflcally to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
5 below. 



NOV79 

The disclosed NOV79 (alternatively referred to herein as CG56862-01) includes the 
939 nucleotide sequence (SEQ ID NO:263) shown in Table 79A. A NOV79 ORF begins with 
10 a Kozak consensus ATG initiation codon at nucleotides 19-21 and ends with a stop codon at 
nucleotides 928-930. The disclosed NOV79 maps to human chromosome 20. 



Table 79 A. NOV79 Nucleotide Sequence (SEQ ID NO:263) 



TACAAGCAGGGTCTCCCTATGTTTCCCAGGCTGGTCTCAAACTCCTGGGCTCACACAGTCCTCCTGCCTT 
GGCCTCCCAAAGTTCTGGGATTACAGACCCTGCAGGCGTCGGGCCTGGGCCGTCAGGGCAGCTGTGACCG 
GATCGCTTCCCGGGCGGCGAGCTGGGGGTGCACCCGGACCGCCGCCCCCGGGATCATGGGCAATGGCATG 
ACCAAGGTACTTCCTGGACTCTACCTCGGAAACTTCATTGGTCATCCCGCCAGCCAGATTGGCTCAAGCA 
T C CTGTTTCTTT C AGATGCC AAAGAC C T GG AT C AG C TGGGC CGAAAT AAGAT CAC AC AC ATC ATC T CT AT 
CCATGAGTCACCCCAGCCTCTGCTGCAGGATATCACCTACCTTCGCATCCCGGTCGCTGATACCCCTGAG 
GTACCCATGAAAAAGCACTTCAAAGAATGTATCAACTTCATCCACTGCTGCCGCCTTAATGGGGGGAACT 
GCCTTGTGCACACCACGATTGTGACAGCGTATGTGATGACTGTGACGGGGCTAGGCTGGCGGGACGTGCT 
TGAAGCCATCAAGGCCACCAGGCCCATCGCCAACCCCAACCCAGGCTTTAGGCAGCAGCTTGAAGAGTTT 
GGCTGGGCCAGTTCCCAGAAGGTACAGCTTCGCCGGCAGCTGGAGGAGCGCTTCGGCGAGAGCCCCTTCC 
GCGACGAGGAGGAGTTGCGCGCGCTGCTGCCGCTGTGCAAGCGCTGCCGGCAGGGCTCCGCGACCTCGGC 
CTCCTCCGCCGGGCCGCACTCAGCAGCCTCCGAGGGAACCGTGCAGCGCCTGGTGCCGCGCACGCCCCGG 
GAAGCCCACCGGCCGCTGCCGCTGCTGGCGCGCGTCAAGCAGACTTTCTCTTGCCTCCCCCGGTGTCTGT 
CCCGCAAGGGCGGCAAGTGAGGATGCAGT 



A NOV79 polypeptide (SEQ ID NO:264) encoded by SEQ ID NO:263 is 303 amino 
15 acids in length and is presented using the one-letter amino acid code in Table 79B. The Psort 
profile for NOV79 predicts that this sequence has a signal peptide and is likely to be localized 
to the cytoplasm with a certainty of 0.6500. In alternative embodiments, a NOV79 
polypeptide is located to lysosomes with a certainty of 0.2216. The Signal P predicts a likely 
cleavage site for a NOV79 peptide is between positions 24 and 25, i.e., at the dash in the 
20 sequence VLG-LQ. 



Table 79B. NOV79 Polypeptide Sequence (SEQ ID 
NO:264) 



MFPRLVSNSWAHTVLLPWPPKVLGLQTLQASGLGRQGSCDRIASRAASWGCTRTAAPGIM 
GNGMTKVLPGLYLGNFIGHPASQIGSSILFLSDAKDLDQLGRNKITHIISIHESPQPLLQ 
D I T YL R I P VADT P E VPM KKHF KE CINFIHCC RLNGGNCL VHT T I VT A YVMT VT GL GWRD V 
LEAIKATRPIANPNPGFRQQLEEFGWASSQKVQLRRQLEERFGESPFRDEEELRALLPLC 
KRCRQGSATSASSAGPHSAASEGTVQRLVPRTPREAHRPLPLLARVKQTFSCLPRCLSRK 
GGK 



642 



A BLAST analysis of NOV79 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV79 had high 
homology to other proteins as shown in Table 79C. 



Table 79C. BLASTX results from PatP database for NOV79 



Smallest 
Sum 











High 


Probability 


Sequences producing 


High-scoring Segment Pairs: 


Score 


P(N) 


patp 


AAE04840 


Human 


SGP0 08 phosphatase polypeptide 


1298 


3 . 5e-132 


patp 


AAY68795 


Amino 


acid sequence of a human protein 


433 


1 . 6e-40 


patp 


AAB67167 


Human 


dual-specificity phosphatase DSP-3 


433 


1 . 6e-40 


patp 


AAB66431 


Human 


DSP-3 protein - Homo sapiens , 184 aa. 


433 


1 . 6e-40 


patp 


AAB73216 


Human 


phosphatase AA3 74 7 53 h - Homo sapiens 


433 


1 . 6e-40 



5 

In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 93 of 1 18 bases (78%) identical to a gb:GENBANK- 
ID:HUMFLNG6PD|acc:L44140.1 mRNA from Homo sapiens (chromosome X region from 
filamin (FLN) gene to glucose-6-phosphate dehydrogenase (G6PD). The full amino acid 
10 sequence of the protein of the invention was found to have 273 of 276 amino acid residues 
(98%) identical to, and 274 of 276 amino acid residues (99%) similar to, the 275 amino acid 
residue ptnr:TREMBLNEW-ACC:CAC 10008 protein from Homo sapiens (Human) 
(BA243J16.6 (NOVEL PROTEIN)). NOV79 also has homology to the other proteins shown 
in the BLASTP data in Table 79D. 

15 



Table 79D. NOV79 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%> 


Expect 


gi | 18104942 | r 
ef |NP_542178 . 
1 | (NM 080611) 


dual specificity 
phosphatase- 1 ike 15 
[ Homo sapi en s ] 


243 


241/245 
(98) 


242/245 
(98) 


e-133 


gi | 17458347 | r 
ef |XP_059288 . 
1 | <XM_059288) 


similar to bA243J16.6 
(novel protein with a 
dual specificity 
phosphatase, catalytic 
domain) (H. 

sapiens) [Homo sapiens] 


235 


226/252 
(89) 


227/252 
(89) 


e-121 


gi | 9910432 | re 
f|NP 064570.1 
| (NM_020185) 


mitogen- activated 
protein kinase 
phosphatase x; homolog 
of mouse dual 
specificity phosphatase 
LMW-DSP2; JWK- 
stimulating phosphatase 
1 [Homo sapiens] 


184 


88/186 
(47) 


119/186 
(63) 


3e-44 
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# 



gi | 13183069 | g 
b| AAK15038. 1 | 
AF237619_1 
(AF237619) 


dual specificity 
phosphatase TS-DSP2 [Mus 
musculus] 


184 


87/186 
(46) 


119/186 
(63) 


le-43 


gi | 14726046 | r 
efjxP 046543. 
1 | (XM_046543) 


mitogen- activated 
protein kinase 
phosphatase x [Homo 
sapiens] 


184 


86/186 
(46) 


112/186 
(59) 


2e-41 



This BLASTP data is displayed graphically in the ClustalW in Table 79E. A multiple 
sequence alignment is given, with the NOV79 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
5 Table 79D. 



Table 79E. ClustalW Alignment of NOV79 



NOV7 9 (SEQ ID NO: 264) 

gi | 18104942 | (SEQ ID NO:694) 

gi j 17458347 | (SEQ ID NO: 695) 

gi j 9910432 | (SEQ ID NO:696) 

gi|l3183069| (SEQ ID NO:697) 

gi | 14726046 | (SEQ ID NO:698) 



NOV7 9 

gi | 18104942 | 
gi j 17458347 | 
gi j 9910432 | 
gi | 13183069 | 
gi | 14726046 | 



10 20 30 40 50 

....|....|....|....|....|....|....|....|....|....| 
MFPRLVSNSWAHTVLLPWPPKVLGLQTLQASGLGRQGSCDRIAS RAASWG 



NOV7 9 

gi | 18104942 | 
gi j 17458347 j 
gi j 9910432 | 
gi j 13183069 | 
gi j 14726046 | 



NOV7 9 

gi | 18104942 | 
gi | 17458347 j 
gi | 9910432 | 
gi | 13183069 | 
gi j 14726046 j 



NOV7 9 

gi | 18104942 | 
gi | 17458347 j 
gi j 9910432 | 
gi j 13183069 | 
gi | 14726046 | 



CTRTAAPGI 




70 





LPGLY 




GNF 


LPGLY 




GNF 


LPGLY 




GNF 


LPGLY 




GNF 


LPGLY 




GNF 


LPGLY 




GNF 



80 90 
...|....|....|. 
IGHPASQIGSSILFLS 
IGHPASQIGSSILFLS 

I 

K 

K 

K 



130 



140 





200 



210 



220 



230 



240 
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NOV7 9 

gi | 18104942 | 
gi | 17458347 j 
gi | 9910432 | 
gi | 13183069 | 
gi | 14726046 j 



NOV7 9 

gi | 18104942 | 
gi | 17458347 | 
gi j 9910432 | 
gi j 13183069 | 
gi j 14726046 j 



NOV7 9 

gi | 18104942 | 
gi | 17458347 j 
gi | 9910432 | 
gi | 13183069 | 
gi j 14726046 | 




I 



WAS S QKVj^IiBjRjQl 
WAS SQ- - ^lBrqI 
WAS S Q - - kliH^QK J KB 
KHEVH - - IBySqw™* 
KHEVH- - 
KHEVH- - 




260 270 280 290 300 

I — | — | . | ^| — | — | — | — I - • ■ - 1 

CRQGSATSASSAGPHSAg8EgT|?0RLVPRTPREAHRPLPLLARVKQTgSC 
CRQGSATSASSAGPHSAB^EfflT^RLVPRTPREAHRPLPLLARVKQTHsC 
CRQGSATSASSAGPHSAgSE^vjqRLVPRTPREAHRPLPLLARVKQTHsC 
TapSiXK gWA 

I i^WA 

I Hwa 




Table 79F lists the domain description from DOMAIN analysis results against NOV79. 
This indicates that the NOV79 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 79F. Domain Analysis of NOV79 


gnl | Smart | 


smart00195, DSPc, Dual specificity phosphatase, catalytic 


domain 


SEQ ID NO: 


873 






Score 


CD-Length = 139 residues, 97.8% aligned 
= 108 bits (271), Expect = 3e-25 




NOV7 9 : 
Sbjct : 


63 
1 


GMTKVLPGLYLGNFIGHPASQIGSSILFLSDAKDLDQLGRNKITHIIS-IHESPQPLLQD 
G +++LP LYLG++ SDA +L L + ITH+I+ E P 
GPSEILPHLYLGSY SDASNLALLKKLGITHVINVTEEVPNSNKSG 


121 
45 


NOV7 9 : 
Sbjct 


122 
46 


ITYLRIPVADTPEVPMKKHFKECINFIHCCRLNGGNCLVH TTIVTAYVMTVT 

YL IPV D E + + E + FI GG LVH T++ AY+M 
FLYLGIPVDDNTETKISPYLPEAVEFIEDAEKKGGKVLVHCQAGVSRSATLIIAYLMKYR 


173 
105 


NOV7 9 
Sbj ct 


174 
106 


GLGWRDVLEAI KATRPI ANPNPGFRQQLEEF 2 04 

+ D + +K RPI + PN GF +QL E+ 
NMSLNDAYDFVKERRP IIS PNFGFLRQLI E Y 136 





Mitogen-activated protein (MAP) kinase phosphatases constitute a growing family of 
dual specificity phosphatases thought to play a role in the dephosphorylation and inactivation 
of MAP kinases and are therefore likely to be important in the regulation of diverse cellular 
processes such as proliferation, differentiation, and apoptosis. For this reason it has been 
suggested that MAP kinase phosphatases may be tumor suppressors. 
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NOV79 is predicted to be expressed in at least the following tissues: brain, kidney, 
pancreas, testis, whole organism. This information was derived by determining the tissue 
sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
5 expression data for NOV79 is provided in Example 2. 

The NOV79 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
brain disorders including epilepsy, eating disorders, schizophrenia, ADD, and cancer; heart 
disease; blood disorders, kidney disorders, liver diseases, inflammation and autoimmune 
10 disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 

inflammatory skin disorders, allergies, blood disorders; psoriasis; colon-, ovarian-, testicular-, 
lymphatic-, brain-, and pancreatic cancers; leukemia AIDS; thalamus disorders; metabolic 
disorders including diabetes and obesity; lung diseases such as asthma, emphysema, cystic 
fibrosis, and cancer; pancreatic disorders including pancreatic insufficiency; and prostate 
15 disorders including prostate cancer and other diseases, disorders and conditions of the like. 

The NOV79 nucleic acid encoding the phosphatase-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a dual specificity phosphatase-like 
20 protein includes the nucleic acid whose sequence is provided in Table 79A, or a fragment 

thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 79A while still encoding a protein 
that maintains its dual specificity phosphatase-like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
25 are complementary to the sequence of Table 79A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
30 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 22% of the bases 
may be so changed. 
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The novel protein of the invention includes the dual specificity phosphatase-like 
protein whose sequence is provided in Table 79B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 79B while still encoding a protein that maintains its dual specificity phosphatase-like 
5 activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 2% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
10 prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV80 

The disclosed NOV80 (alternatively referred to herein as CG56882-01) includes the 
15 2039 nucleotide sequence (SEQ ID NO:265) shown in Table 80A. A NOV80 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 100-102 and ends with a TGA 
codon at nucleotides 1947-1949. The disclosed NOV80 maps to human chromosome 10. 



Table 80A. NOV80 Nucleotide Sequence (SEQ ID NO:265) 

CATGTGCTAGGTTATTCCCAGTGCGAGGCCACACTTGGGCCGTCGGAGCAGCCCCTCCTCACTTCAGGGG 
TCACCCTCCCCAAGACCCATTGCCCCATCATGGCCGGGGACCGGCTCCCCAGGAAGGTGATGGACGCCAA 
GAAGCTGGCCAGCCTGCTGCGGGGCGGGCCTGGGGGGGGCCTGGTCATCGACAGTCACTCCTTCCTGGAG 
TACAACAGCTGGCATGTGCTCAGCTCCGTCAACATCTGCTGCTCCAAGCTGGTGAAGTGGCGGTTGCAGA 
AGGGCAAGGTGACCATTGTGGAGTTCATCCAGCCGGCCGCACGCAGCCAGGTGGAGGCCACTGAGCCACA 
GGACGTGGTGGTCTATGACCAGAGCACGCGGGCCGCAGACAGCTTCCTCTCCATCCTGCTGAGCAAGCTG 
GATGGCTGCTTCCACAGCGTGGCCGGCTGCTTCCACAGCATGGCCATCATCACGGGGGGCTTCGCCACCT 
TCTCCTCCTGCTTCCCCGACCTCTGCGAGGGCGAGCCTGCTGCCCTGCTACCCATGAGCCTCTCCCAGTC 
CTGCCTGCTCGTGCCCAGCGTGGGCCTGACCCTCATCCTGCCTCACCTCTACCTGGGCTCGCAGGAAGAC 
GTCCTGAACAAGGATCTGATGACGCAGAATGGAATAAGCTACGTCCTCTATGCCAGCAACTCCTGCCCCA 
AGCCTGACTTCATCTACCAGAGCCACTTCTTGCGGGTCCCCATCAACGACAACTACTGTGAAAAGCTGCT 
GCCCTGGCTGGACAAGTCCATCGAGTTCGTCGATAAAGCCAAGCTGTCCAGCTGCCAAGTCATCGTCCAC 
CGTCTGGCCGGCATCTCCTGCTGTGCCACTATCGCCATCGCCTACATCATGAAGACCATGGGCATGTCCT 
CCGAAGACGCCTACAGGTTTGTGAAGGACCAGCGCCCGTCCATCTCGCCCAACTTCAACTTCCTGGGCCA 
GCTGCTGGAGGACCAGAGCAGCCCGAAGCTGCTGGCCGCCGTGCAGGGCGACGCGGGCACCCCCTCAGGA 
ATGCAGGAGCCTCCCCCCAGCCCTGCGGCCGGGGCCCCACTGCCATGGCTGCCACCACCTACCTCAGAGA 
CCGCTGCCACCAGGAGTGCGGCTGCCAGGGAGGGCGGCCCGAGCGCGGGCAGGAAGCCCCCGGCGCCCCC 
CACGGCCACCAGCACGCTGCAGCAGGGCCTGCGCAGCCTGCGCCTCTCCTCGGACCACCTGCAGGACACC 
AGCCGCCTCAAGCCCTCCTTCTCTCTGGACATCAAGTCGGCCTACGCCCCCAGCAGGCGGCCCGGCGGCC 
CGGGCCCAGCGACCCCGGCGAGGCCCCGAAGCTCTCTGAAAGCTGGACAGCCAGTCGGGGCCATGTTGGG 
CCTGCCCTCGCCCTGCCCGGACGCCGCGCCCGCGGCCCGGGCCCAGCGACCCCGGCGAGGCCCCGAAGCT 
AGCCAGTCGGGGCCATGTTGGGCCCCTGCCCGGACGCCGCGCCCAGGCACGCCCACGGCCCGGCGCGCTA 
CCCCGCGCGCGGCCTTAACTTCGGCTACGCGGCTGCCGGGCCCTGGCCAGCCGGCCAGCCCCGGAGCCTG 
GACGCCACCGCTCGACTCCCTGAAGCGTCCTCGGTGCTTCAGCCCCGAGGGCGTGCAAGGGCCGGGCAGG 
GTGCTGTTTGCGCCCTTCGGCCGGGCGGGCGCCCCGGAACCCAACGGCTGCAGCGACCTGCCACGGCGGG 
AGGCAGCAAGGGCTGAGCCCGGGACGGGTCAGACGAGCTGGCCCGACGAGCTGGCCCCGGATTCGCACTT 
CAAGTGCTGCAGCTGCCAGATGGAGTTCGAGGAGGGCATGGTGGAGGGGCGCGCGCGCGGCGAGGAGCTG 
GCCGCCCTGGGCAAGCAGGGGAGCTTCTCGGGCAGCGTGGAGGTCATCGAGATGTCCTGACCCCTCCGCT 
GCCCTCGGCTCCGCCGCCCGCAGCTGGGCAGTTATAAATATATATTATATATAATGCAAAGAAAGGCAAA 

TGGTTTTAC _ 
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A NOV80 polypeptide (SEQ ID NO:266) encoded by SEQ ID NO:265 is 616 amino 
acids in length and is presented using the one-letter amino acid code in Table 80B. The Psort 
profile for NOV80 predicts that this sequence is a Type lb membrane protein, has no signal 
peptide, and is likely to be localized at the plasma membrane with a certainty of 0.7000. In 
alternative embodiments, a NOV80 polypeptide is located to peroximsomal microbodies with 
a certainty of 0.3000, to the mitochondrial inner membrane with a certainty of 0.2143, or to 
the nucleus with a certainty of 0.3000. 



Table 80B. NOV80 Polypeptide Sequence (SEQ ID 
NO:266) 



MAGDRLPRKVMDAKKLASLLRGGPGGGLVIDSHSFLEYNSWHVLSSVNICCSKLVKWRLQ 
KGKVTIVEFIQPAARSQVEATEPQDWVYDQSTRAADSFLSILLSKLDGCFHSVAGCFHS 
MAIITGGFATFSSCFPDLCEGEPAALLPMSLSQSCLLVPSVGLTLILPHLYLGSQEDVLN 
KDLMTQNGISYVLYASNSCPKPDFIYQSHFLRVPINDNYCEKLLPWLDKSIEFVDKAKLS 
SCQVIVHRLAGISCCATIAIAYIMKTMGMSSEDAYRFVKDQRPSISPNFNFLGQLLEDQS 
SPKLLAAVQGDAGTPSGMQEPPPSPAAGAPLPWLPPPTSETAATRSAAAREGGPSAGRKP 
PAPPTATSTLQQGLRSLRLSSDHLQDTSRLKPSFSLDIKSAYAPSRRPGGPGPATPARPR 
SSL KAGQ P VG AM L GL P S P C PD AAP AARAQR P RRG P E AS Q SG P C W AP ART P RPGT PT ARRA 
TPRAALTSATRLPGPGQPASPGAWTPPLDSLKRPRCFSPEGVQGPGRVLFAPFGRAGAPE 
PNGCSDLPRREAARAEPGTGQTSWPDEIAPDSHFKCCSCQMEFEEGMVEGRARGEELAAIi 
GKQGSFSGSVEVIEMS . 



10 A BLAST analysis of NOV80 was run against the proprietary PatP GENE SEQ Protein 

Patent database. It was found, for example, that the amino acid sequence of NOV80 had high 
homology to other proteins as shown in Table 80C. 



Table 80C. BLASTX results from PatP database for NOV80 



Sequences producing High- scoring Segment Pairs: 

patp:AAW29l50 Dual - speci f ic murine thr-tyr phosphatase 
patp:AAE04 834 Human SGP002 phosphatase polypeptide 
patp : AAU0 9 016 Human dual specificity phosphatase 21117 
patp : AAM2 5744 Human protein sequence 

patp: AAB20325 Human protein phosphatase and kinase protein 



Smallest 
Sum 

High Probability 
Score P (N) 

1873 4.1e-193 

950 6.6e-l08 

950 6.6e-108 

955 7.8e-96 

949 3.4e-95 



15 In a search of sequence databases, it was found, for example, that the nucleic acid 

sequence of this invention has 1587 of 1930 bases (82%) identical to a gb:GENBANK- 
ID:HSU27193|acc:U27193.1 mRNA from Homo sapiens (Human protein-tyrosine 
phosphatase mRNA). The full amino acid sequence of the protein of the invention was found 
to have 489 of 625 amino acid residues (78%) identical to, and 514 of 625 amino acid residues 

20 (82%) similar to, the 625 amino acid residue ptnr:SWiSSNEW-ACC:Q13202 protein from 
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Homo sapiens (Human) (DUAL SPECIFICITY PROTEIN PHOSPHATASE 8 (EC 3.1.3.48) 
(EC 3.1.3.16) (DUAL SPECIFICITY PROTEIN PHOSPHATASE HVH-5)). (NOV80 also 
has homology to the other proteins shown in the BLASTP data in Table 80D. 



Table 80D. NOV80 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi [4758212 | re 
f |NP.__004411 . 1 
| (NM_004420) 


dual specificity 
phosphatase 8; Hi 
phosphatase, vaccinia 
virus homolog; protein 
tyrosine phosphatase; 
serine/threonine 
specific protein 
phosphatase 
[Homo sapiens] 


625 


480/632 
(75) 


506/632 
(79) 


0 . 0 


gi | 6679156 | re 
f |NP_032774.1 
| (NM_008748) 


neuronal 

tyrosine /threonine 
phosphatase 1 
[Mus musculus] 


663 


450/682 
(65) 


476/682 
(68) 


0 . 0 


gi | 17471343 | r 
ef |XP_061101 . 
1 | <XM_0 61101) 


similar to dual 
specificity phosphatase 
8; HI phosphatase, 
vaccinia virus homolog; 
protein 

tyros inephosphatase ; 
serine/ threonine 
specific protein 
phosphatase (H. sapiens) 
[Homo sapiens] 


461 


324/422 
(76) 


333/422 
(78) 


e-140 


gi | 13639013 | r 
ef |XP_012007 . 
2 | (XMJ12007) 


dual specificity 
phosphatase 8 
[Homo sapiens] 


501 


343/473 
(72) 


363/473 
(76) 


e-131 


gi | 12697945 |d 
bj [BAB21791. 1 
| (AB051487) 


KIAA1700 protein 
[Homo sapiens] 


690 


270/668 
(40) 


373/668 
(55) 


e-103 



5 

This BLASTP data is displayed graphically in the ClustalW in Table 80E. A multiple 
sequence alignment is given, with the NOV80 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 80D. 
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Table 80E. ClustalW Alignment of NOV80 


NOV8 0 


(SEQ 


ID NO: 266) 


gi | 4758212 | 


(SEQ 


ID NO:699) 


gi | 6679156 j 


(SEQ 


ID NO:700) 


gi | 17471343 


(SEQ 


ID NO: 701) 


gi j 13639013 


(SEQ 


ID NO: 702) 


gi j 12697945 


(SEQ 


ID NO: 703) 






10 20 30 40 50 


NOV8 0 







649 



• 



gi |4758212 | 
gi j 6679156 j 
gi j 17471343 | 
gi | 13639013 j 
gi | 12697945 j 



NOV8 0 

gi | 4758212 | 
gi | 6679156 j 
gi j 17471343 | 
gi | 13639013 | 
gi | 12697945 | 



NOV8 0 

gi | 4758212 | 
gi j 6679156 j 
gi | 17471343 | 
gi | 13639013 j 
gi j 12697945 j 



NOV8 0 

gi |4758212 | 
gi | 6679156 j 
gi | 17471343 | 
gi | 13639013 | 
gi j 12697945 j 



NOV8 0 

gi | 4758212 | 
gi j 6679156 j 
gi j 17471343 | 
gi | 13639013 | 
gi | 12697945 | 



NOV8 0 

gi | 4758212 | 
gi j 6679156 | 
gi | 17471343 | 
gi | 13639013 | 
gi j 12697945 j 



NOV8 0 

gi | 4758212 | 
gi | 6679156 j 
gi | 17471343 | 
gi | 13639013 | 
gi j 12697945 j 



NOV8 0 

gi |4758212 | 
gi | 6679156 | 
gi j 17471343 | 
gi j 13639013 | 
gi | 12697945 j 



MAGDRLPRKVMDAKKLASLLRGGPG 
MAGDRLPRKVMDAKKLASLLRGGPG 
MAGDRLPRKVMDAKKLASLLRGGPG 



AFEHTT I TS S WQ I KEGGGKRGL I VV(!Sh1m§GT^I^TE§-! 



100 




KvffiL ffi^ pf3 yf5Sfl flsl3^^^ 

110 120 130 140 150 




I 



SFL&ILLS* 
jSVLAgD- -SE 

IsvlaMd--se 



jdgcfhs^a 
JsillskBe 

JsiLLSK& 

cjfe 




H1#dlDC 



160 
• - I - 



iVASLSSD- -CF(3TVLLGKgEKS[|Ng|HgA 
170 180 190 200 



GGFATFSSCFP 
GGFATFSSCFP 
GGFATFSSCFP 
GGFATFSSCFP 




250 



260 
. . I . . 



270 



I 



290 
• I - - 



300 



wldksiefjgdkaklsscqvivh[glagis^atiaiayimktmgmssj»day 
wldksiefidkaklsscqvivhclagisrsatiaiayimktmgmssdday 
wldksiefidkaklsscqvivhclagisrsatiaiayimktmgmssdday 
wldksief^dkaklsscqvivh[3lagisEatiaiayimktmgmss§day 
wldksiefidkakls scq vivhclagisrsatiaiayimktmgmssdday 

WLDKS^FI^KAK^Sj^gSv^VHCLAGISRSATIAIAYIMKi^liBMSijDiAV 



350 




400 



pppspaag 
p3SEpaag| 

pppspaag 
pppspaag 



HV||gvgEgGQKSETP 



650 




410 



420 



NOV8 0 

gi | 4758212 | 
gi j 6679156 | 
gi j 17471343 | 
gi j 13639013 j 
gi j 12697945 j 



NOV8 0 

gi | 4758212 | 
gi | 6679156 | 
gi | 17471343 | 
gi | 13639013 j 
gi | 12697945 | 



NOV8 0 

gi |4758212 | 
gi j 6679156 j 
gi j 17471343 | 
gi j 13639013 j 
gi | 12697945 | 



NOV8 0 

gi | 4758212 | 
gi | 6679156 j 
gi | 17471343 | 
gi | 13639013 | 
gi j 12697945 j 



NOV8 0 

gi | 4758212 | 
gi j 6679156 j 
gi j 17471343 | 
gi j 13639013 j 
gi j 12697945 j 



NOV8 0 

gi | 4758212 | 
gi j 6679156 | 
gi j 17471343 | 
gi | 13639013 j 
gi j 12697945 | 



NOV8 0 

gi [4758212 | 
gi | 6679156 j 
gi j 17471343 | 
gi | 13639013 j 
gi j 12697945 j 



NOV8 0 

gi |4758212 | 
gi | 6679156 j 
gi | 17471343 | 




gp KLCraLDSjSsgAA - - fgeff 

gPSyTLDGTNKjJCQi 



K/QgLsfQ 



gPRRRPK 
3TSgtDKEEASlQ 



510 



520 



530 



540 



550 




TPRPGTPT|£R g 

sHglnfgd^ 

LNFGDTJ 




PAGSlsgg S3Z H sfflG LNFGDAaJ^oJJE 1 IGf^A ffl^? pg 

kklot^psdSo^krBhsvrts^^gtaq[5Isl™pI1hrs"svednyhtsfl 



560 



570 



580 



590 



600 




- - Ji Jc4 JeM^j Gigg g ffl Ajggg^g GH- -^p; 

FGllSTSO^LTKS^LGLKGlilHSDX'LAraC)Hs r l'B9rv 




760 
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gi | 13639013 | PPPAARp9HlS?ffiYYI _ ~~~ " ~~ " " ~~ 

gi j 12697945 j iaS Baflgaj - - 



Table 80F lists the domain description from DOMAIN analysis results against NOV80. 
This indicates that the NOV80 sequence has properties similar to those of other proteins 
known to contain this domain. 
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Table 80F. Domain Analysis of NOV80 


gnl | Smart | smart00195, DSPc, Dual specif icitv phosphatase, catalytic 


domain 


SEQ ID NO: 


873 






Score 


CD-Length = 13 9 residues, 97.1% aligned 
= 154 bits (388), Expect = 2e-38 




NOV8 0 
Sbjct : 


162 
1 


GLTLILPHLYLGSQEDVLNKDLMTQNGISYVLYASNSCPKPDFIYQSHFLRVPINDNYCE 
G + ILPHLYLGS D N L+ + GI++V+ + P +L +P++DN 
GPSEILPHLYLGSYSDASNLALLKKLGITHVINVTEEVPN-SNKSGFLYLGIPVDDNTET 


221 
59 


NOV8 0 : 
Sbjct : 


222 
60 


KLLPWLDKS IEFVDKAKLS SCQVI VHRLAGI S CCAT I AI AYIMKTMGMSSEDAYRFVKDQ 
K+ P+L +++EF++ A+ +V+VH AG+S AT+ IAY+MK MS DAY FVK++ 
KI S P YL. PEAVE F I EDAEKKGGKVL VHCQAGVS RSATL 1 1 AYLMKYRNMSLNDAYDFVKER 


281 
119 


NOV8 0 : 
Sbjct : 


282 
120 


RPS I S PNFNFLGQLLE 2 97 

RP ISPNF FL QL+E 

RPI ISPNFGFLRQLIE 13 5 





Mitogen-activated protein (MAP) kinase phosphatases constitute a growing family of 
dual specificity phosphatases thought to play a role in the dephosphorylation and inactivation 
of MAP kinases and are therefore likely to be important in the regulation of diverse cellular 

10 processes such as proliferation, differentiation, and apoptosis. For this reason it has been 
suggested that MAP kinase phosphatases may be tumor suppressors. 

NOV80 is predicted to be expressed in at least the following tissues: kidney. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, literature 

15 sources, and/or RACE sources. Further expression data for NOV80 is provided in Example 2. 
The NOV80 nucleic acids and proteins of are useful in potential therapeutic 
applications implicated in various pathological disorders described further herein, for example, 
brain disorders including epilepsy, eating disorders, schizophrenia, ADD, and cancer; heart 
disease; blood disorders, kidney disorders, liver diseases, inflammation and autoimmune 

20 disorders including Crohn's disease, IBD, allergies, rheumatoid and osteoarthritis, 

inflammatory skin disorders, allergies, blood disorders; psoriasis; colon-, ovarian-, testicular-, 
lymphatic-, brain-, and pancreatic cancers; leukemia AIDS; thalamus disorders; metabolic 
disorders including diabetes and obesity; lung diseases such as asthma, emphysema, cystic 
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fibrosis, and cancer; pancreatic disorders including pancreatic insufficiency; and prostate 
disorders including prostate cancer and other diseases, disorders and conditions of the like. 
The NOV80 nucleic acid encoding the phosphatase-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
5 the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a dual specificity phosphatase-like 
protein includes the nucleic acid whose sequence is provided in Table 80A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 80A while still encoding a protein 

10 that maintains its dual specificity phosphatase-like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to the sequence of Table 80A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 

15 complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 

20 In the mutant or variant nucleic acids, and their complements, up to about 1 8% of the bases 
may be so changed. 

The novel protein of the invention includes the dual specificity phosphatase-like 
protein whose sequence is provided in Table 80B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 

25 in Table 80B while still encoding a protein that maintains its dual specificity phosphatase-like 
activities and physiological functions, or a functional fragment thereof In the mutant or 
variant protein, up to about 22% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 

30 methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV81 
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NOV81 includes two galactosyltransferase-like proteins, designated herein as NOV81a 
and NOV81b. 



NOV81a 

5 The disclosed NOV81a (alternatively referred to herein as CG56283-01) includes the 

1247 nucleotide sequence (SEQ ID NO:267) shown in Table 81 A. A NOV81a ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 33-35 and ends with a stop codon 
at nucleotides 1224-1226. The disclosed NOV81a maps to human chromosome 19. 



Table 81A. NOV81a Nucleotide Sequence (SEQ ID NO:267) 



CTCCCGCGGCCGCCCCTTCCCTGGGCCGGGTCATGCGCTGCCCCAAGTGCCTTCTCTGCCTGTCAGCACT 
GCTCACACTCCTGGGCCTCAAAGTGTACATCGAATGGACATCCGAGTCCCGGCTCAGCAAGGCCTACCCC 
AGCCCTCGGGGCACCCCGCCAAGCCCCACGCCAGCCAACCCTGAGCCCACCCTACCTGCCAACCTCTCCA 
CCCGCCTGGGCCAGACTATCCCGCTGCCCTTTGCTTACTGGAACCAGCAGCAGTGGCGGCTGGGGTCCCT 
GCCCAGTGGGGACAGCACTGAAACGGGGGGCTGCCAGGCTTGGGGGGCCGCCGCCGCCACCGAGATCCCT 
GACTTCGCCTCCTACCCCAAGGACCTCCGCCGCTTCTTGCTGTCAGCAGCCTGCCGGAGCTTCCCACAGT 
GGCTGCCTGGAGGTGGTGGCGGCCAAGTCTCCAGCTGCTCAGATACTGATGTCCCCTACCTGCTGTTGGC 
CGTCAAGTCAGAACCAGGGCGCTTTGCAGAACGACAGGCCGTGAGAGAGACGTGGGGCAGTCCAGCTCCA 
GGGATCCGGCTGCTCTTCCTGCTAGGGTCTCCGGTAGGTGAGGCGGGGCCTGACCTAGACTCACTAGTGG 
CATGGGAGAGCCGTCGCTACAGTGACCTGCTGCTCTGGGACTTCCTCGACGTCCCATTCAACCAGACGCT 
CAAAGACCTGCTGCTGCTGGCCTGGCTGGGCCGCCACTGCCCCACCGTGAGTTTTGTCTTGCGAGCTCAG 
GACGATGCCTTTGTACACACCCCTGCCCTGCTGGCTCACCTGCGGGCCCTGCCACCTGCCTCGGCCCGAA 
GCCTCTACCTGGGTGAGGTCTTTACCCAGGCCATGCCTCTCCGGAAGCCAGGAGGACCCTTCTATGTGCC 
CGAGTCCTTCTTCGAAGGTGGCTACCCAGCCTATGCAAGCGGGGGTGGCTACGTCATTGCCGGGCGCCTG 
GCACCCTGGCTGCTGCGGGCGGCAGCCCGTGTGGCACCCTTCCCCTTTGAGGACGTCTACACTGGCCTTT 
GCATCCGAGCCCTGGGCCTGGTGCCCCAGGCCCACCCAGGCTTCCTCACAGCCTGGCCAGCAGACCGCAC 
TGCGGACCACTGTGCTTTCCGCAACCTGCTGCTGGTACGGCCCCTGGGCCCCCAGGCCAGCATTCGGCTC 
TGGAAACAACTGCAAGACCCAAGGCTCCAGTGCTGACTCTCATTGGGGAGGGCGGAG 

10 ^ ^" 

A NOV81a polypeptide (SEQ ID NO:268) encoded by SEQ ID NO:267 is 397 amino 
acids in length and is presented using the one-letter amino acid code in Table 81B. The Psort 
profile for NOV81a predicts that this sequence has a signal peptide and is likely to be 
localized to lysosomes with a certainty of 0.8650, or to the outside of the cell with a certainty 
15 of 0.8191 . The Signal P predicts a likely cleavage site for a NOV81a peptide is between 
positions 34 and 35, i.e., at the dash in the sequence SKA-YP. 



Table 81B. NOV81a Polypeptide Sequence (SEQ ID 
NO:268) 



MRCPKCLLCLSALLTLLGLKVYIEWTSESRLSKAYPSPRGTPPSPTPANPEPTLPANLST 
RLGQTI PLPFAYWNQQQWRLGSLPSGDSTETGGCQAWGAAAATEI PDFAS YPKDLRRFLL 
SAACRS FPQWLPGGGGGQVSS CSDTDVP YLLLAVKSE PGRFAERQAVRET WGS PAPGIRL 
LFLLGSPVGEAGPDLDSLVAWESRRYSDLLLWDFLDVPFNQTLKDLLLLAWLGRHCPTVS 
FVLRAQDDAFVHTPALLAHLRALPPASARSLYLGEVFTQAMPLRKPGGPFYVPESFFEGG 
YPAYASGGGYVIAGRLAPWLLRAAARVAPFPFEDVYTGLCIRALGLVPQAHPGFLTAWPA 
DRTADHC AFRNLLLVRPLGPQAS I RLWKQLQDPRLQC 



NOV81b 
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The disclosed NOV81b (alternatively referred to herein as CG56283-02) includes the 
1368 nucleotide sequence (SEQ ID NO:269) shown in Table 81C. A SEC2 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 4-6 and ends with a TGA codon at 
nucleotides 1 195-1 197. In Table 81C, putative untranslated regions are indicated by 
underlining, and the stop and start codons are indicated in bold. The disclosed NOV81b maps 
to human, chromosomes 19. 



Table 81C. NOV81b Nucleotide Sequence (SEQ ID 
NO:269) 



GTC ATGCGCTGCCCCAAGTGCCTTCTCTGCCTGTCAGCACTGCTCACACTCCTGGGCCTC 
AAAGTGTACATCGAGTGGACATCCGAGTCCCGGCTCAGCAAGGCCTACCCCAGCCCTCGG 
GGCACCCCGCCAAGCCCCACGCCAGCCAACCCTGAGCCCACCCTACCTGCCAACCTCTCC 
ACCCGCCTGGGCCAGACTATCCCGCTGCCCTTTGCTTACTGGAACCAGCAGCAGTGGCGG 
CTGGGGTCCCTGCCCAGTGGGGACAGCACTGAAACGGGGGGCTGCCAGGCTTGGGGGGCC 
GCCGCCGCCACCGAGATCCCTGACTTCGCCTCCCACCCCAAGGACCTCCGCCGCTTCTTG 
CTGTCAGCAGCCTGCCGGAGCTTCCCACAGTGGCTGCCTGGAGGTGGTGGCAGCCAAGTC 
T C C AGCTG CT C AGAT AC TG ATGT C C C C T AC CT GCTGT TGGC CGT C AAGT C AGAACC AGGG 
CGCTTTGCAGAACGACAGGCCGTGAGAGAGACGTGGGGCAGTCCAGCTCCAGGGATCCGG 
CTGCTCTTCCTGCTAGGGTCTCCGGTGGGTGAGGCGGGGCCTGACCTAGACTCACTAGTG 
GCCTGGGAGAGCCGTCGCTACAGTGACCTGCTGCTCTGGGACTTCCTCGACGTCCCATTC 
AACCAGACGCTCAAAGACCTGCTGCTGCTGGCCTGGCTGGGCCGCCACTGCCCCACCGTG 
AGTTTTGTCTTGCGAGCTCAGGACGATGCCTTTGTACACACCCCTGCCCTGCTGGCTCAC 
CTGCGGGCCCTGCCACCTGCCTCGGCCCGAAGCCTCTACCTGGGTGAGGTCTTTACCCAG 
GCCATGCCTCTCCGGAAGCCAGGAGGACCCTTCTATGTGCCCGAGTCCTTCTTCGAAGGT 
GGCTACCCAGCCTATGCAAGCGGGGGTGGCTACGTCATTGCCGGGCGCCTGGCACCCTGG 
CTGCTGCGGGCGGCAGCCCGTGTGGCACCCTTCCCCTTTGAGGACGTCTACACTGGCCTT 
TGCATCCGAGCCCTGGGCCTGGTGCCCCAGGCCCACCCAGGCTTCCTCACAGCCTGGCCA 
GCAGACCGCACTGCGGACCACTGTGCTTTCCGCAACCTGCTGCTGGTACGGCCCCTGGGC 
CCCCAGGCCAGCATTCGGCTCTGGAAACAACTGCAAGACCCAAGGCTCCAGTGCTGACTC 
TCATTGGGGAGGGCGGAGGTGCTGACCTGGCCCCGGCCCTGGCCTGGGCCTCTGGGGCCG 
GCCCCTGGCTCAGCCCCTCCTTCCAGGTCTTGATGGGAGGGAGGAGGGCCCAGAAGCTGG 
ACAACTTAAGCCACTCCTTGGCCTCCCCCAGCCAGGTGAGTGAGCTAT 



10 
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A NOV81b polypeptide (SEQ ID NO:270) encoded by SEQ ID NO:269 is 397 amino 
acids in length and is presented using the one-letter amino acid code in Table 8 ID. The Psort 
profile for NOV81b predicts that this sequence has a signal peptide and is likely to be 
localized to lysosomes with a certainty of 0.8650, to the exterior of the cell with a certainty of 
0.8190. The Signal P predicts a likely cleavage site for-a NOV81b peptide is between 
positions 34 and 35, i.e., at the dash in the sequence SAK-YP. 



Table 81D. NOV81b Polypeptide Sequence (SEQ ID 
NO:270) 



MRCPKCLLCLSALLTLLGLKVYIEWTSESRLSKAYPSPRGTPPSPTPANPEPTLPANLST 
RLGQTIPLPFAYWNQQQWRLGSLPSGDSTETGGCQAWGAAAATEIPDFASHPKDLRRFLL 
SAACRSFPQWLPGGGGSQVSSCSDTDVPYLLLAVKSEPGRFAERQAVRETWGSPAPGIRL 
LFLLGSPVGEAGPDLDSLVAWESRRYSDLLLWDFLDVPFNQTLKDLLLLAWLGRHCPTVS 
FVLRAQDDAFVHTPALLAHLRALPPASARSLYLGEVFTQAMPLRKPGGPFYVPESFFEGG 
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YPAYASGGGYVIAGRLAPWLLRAAARVAPFPFEDWTGLCIRALGLVPQAHPGFLTAWPA 
DRTADHCAFRNLLLVRPLGPQAS I RXjWKQLQDPRLQC 



A BLAST analysis of NOV81 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV81 had high 
5 homology to other proteins as shown in Table 8 IE. 



Table 81E. BLASTX results from PatP database for NOV81 

Smallest 
Sum 







High 


Probability 


Sequences producing High- scoring Segment Pairs: 


Score 


P (N) 


patp 


:AAB03619 Human beta- 1 , 3 -galactosyltransf erase Znssp2 


2130 


2 .4e-220 


patp 


AAB03620 Murine beta- 1 , 3 -galactosyltransf erase Znssp 


1585 


1 .4e-162 


patp 


AAM419 87 Human polypeptide 


1528 


1.5e-156 - 


patp 


AAE05767 Human secreted protein (SECP) 


641 


1 . 5e-62 


patp 


AAM4 02 01 Human polypeptide 


641 


1 . 5e-62 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 647 of 1088 bases (59%) identical to a gb:GENBANK- 

10 ID:AP001754|acc:AP00 1754.1 mRNA from Homo sapiens (genomic DNA, chromosome 21q, 
section 98/105). The full amino acid sequence of the protein of the invention was found to 
have 127 of 343 amino acid residtues (37%) identical to, and 194 of 343 amino acid residues 
(56%) similar to, the 397 amino acid residue ptnr :TREMBLNE W- ACC : A AD09763 protein 
from Mus musculus (Mouse) (BETA-l,3-N-ACETYLGLUCOSAMINYLTRANSFERASE 

15 (EC 2.4.1.149)). NOV81 also has homology to the other proteins shown in the BLASTP data 
in Table 8 IF. 



Table 81F. NOV81 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 

(%> 


Expect 


gi 1 14290592 |g 
b| AAH09075 . 1 | 
AAH09075 
(BC009075) 


beta-l, 3-N- 

acetylglucosaminyltransf 
erase 1 [Mus musculus] 


397 


126/350 
(36) 


192/350 
(54) 


4e-58 


gi | 16973463 |g 
b|AAL32299.l| 
AF3 218 31_1 
(AF321831) 


beta-3- 

galactosyl transferase 
[Danio rerio] 


406 


128/346 
(36) 


186/346 
(52) 


le-57 



656 



V 
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gi | 9938024 | re 
f |NP_058584 . 2 
| (NM_016888) 


UDP-GlcNAc:betaGal beta- 
1, 3-N- 

ace tylglucosaminyl transf 
erase 1; beta-l,3-N- 
ace tylglucosaminyl transf 
erase ; beta- 1 , 3-N- 
ace tylglucosaminyl transf 
erase 1; UDP- 
Gal :betaGlcNAc beta 1,3- 
galactosyltransf erase , 
pcjiypepuiue o 
[Mus musculus] 


397 


125/350 
(35) 


141/350 
(53) 


2e-57 


gi | 9845238 | re 
f|NP 006568.2 
| (NM_006577) 


beta-1, 3-N- 

ace tylglucosaminyl transf 
erase bGnT-i; beta3gal- 
T5 gene; beta-l,3-N- 
ace tylglucosaminyl transf 
erase bGnT-2 
[Homo sapiens] 


397 


121/351 
(34) 


187/351 
(52) 


6e-56 


gi | 9664889 | gb 1 
| AAF97254 . 1 | A 
F2 8 82 0 9_l 
(AF288209) 


beta 

galactosyl transf erase 
bGalT7 [Homo sapiens] 


393 


121/351 
(34) 


187/351 
(52) 


6e-56 



This BLASTP data is displayed graphically in the ClustalW in Table 81G. A multiple 
sequence alignment is given, with the NOV81 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 8 IF. 



Table 81G. ClustalW Alignment of NOV81 



NOV8 la 
NOV8 lb 
gi | 14290592 | 
gi j 16973463 j 
gi | 9938024 | 
gi | 9845238 | 
gi | 9664889 | 



NOV81a 

N0V81b 

gi | 14290592 | 

gi j 16973463 | 

gi | 9938024 | 

gi | 9845238 | 

gi j 9664889 j 



N0V81a 
NOV8 lb 
gi | 14290592 | 
gi | 16973463 j 
gi | 9938024 | 
gi | 9845238 | 
gi | 9664889 j 



NOV81a 



(SEQ ID NO: 268) 

(SEQ ID NO:270) 

(SEQ ID NO: 704) 

(SEQ ID NO: 705) 

(SEQ ID NO: 706) 

(SEQ ID NO:707) 

(SEQ ID NO: 70 8) 



CPfcL§CjlS. 
SVG|RRgK§LG|j 

qgpHr--kpkv| 
svcHrr^kMlgI 
svgHrr|IkBlg1 

MVsS- -S 





70 



kns^dkSg 

|3lg8EKyG 
80 



40 50 
|....|....|....| 

tpsprgtppsptpanp 
ypsprgtppsptpanp 
IdkIgS 



100 



■GgTQP - LP FAYW|jC|QQWgLgS| 
,G&TyP - LPFAYWNQQQWRlBs 
~ KMRWYjgP ljffi^)/AN^TaE| 

1#2 RWY 3 P lgNRVANQT@Ej 
QyBp iS^MjlTN^Tr 

qyi"p: 

120 



PSGDg- 

psgd! 

.TSPSl-j 
iPDWlJT 

.Tspfe-gs 

jEAGRLSw-IS 

:agrlsS-is 

140 




130 

KDL? Eig 5aaB3sfpqv^pgg . :< K - 3S I L 



150 
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NOV81b 
gi | 14290592 | 
gi | 16973463 | 
gi | 9938024 | 
gi | 9845238 | 
gi j 9664889 j 



NOV81a 
NOV8 lb 
gi | 14290592 | 
gi j 16973463 j 
gi j 9938024 | 
gi j 9845238 | 
gi j 9664889 | 



NOV81a 
NOV8 lb 
gi | 14290592 | 
gi | 16973463 | 
gi j 9938024 | 
gi j 9845238 j 
gi j 9664889 j 



NOV81a 

NOV81b 

gi | 14290592 | 

gi j 16973463 | 

gi | 9938024 | 

gi | 9845238 | 

gi | 9664889 j 



NOV81a 
NOV8 lb 
gi | 14290592 | 
gi | 16973463 | 
gi | 9938024 | 
gi j 9845238 j 
gi | 9664889 j 



NOV81a 

NOV8lb 

gi | 14290592 | 

gi j 16973463 | 

gi j 9938024 | 

gi | 9845238 j 

gi|9664889| 



NOV81a 
NOV8 lb 
gi | 14290592 | 
gi j 16973463 | 
gi j 9938024 
gi | 9845238 
gi j 9664889 




DFNgfcPDRFKDFLLYgRCRhgY 

DFNj|LPDRFKDFLLYBRCRgY 
FNgpPDRFKDFLLYgRCRgY 

fnHlpdrfkdfllyBrcriBy 




160 



170 



200 





410 
..|....| 



TVSEQLLLIGFTD 
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Table 81H lists the domain description from DOMAIN analysis results against 
NOV81 . This indicates that the NOV81 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 81H. Domain Analysis of NO V81 


gnl | Pf am| pf am01762 , Galactosyl T, Galactosyl trans f erase . This family inrluHpa 


the galactosyl transferases UDP-galactose : 2 -acetamido- 2 -deoxy-D-glucose3beta- 


galactosyltransf erase and UDP -Gal : beta- GlcNAc beta 1 , 3 -galactosyltranf erase . 


Specific galactosyltransferases transfer galactose to GlcNAc terminal chains 


in the 


synthesis of the lacto-series oligosaccharides types 1 and 2. SEQ ID 


NO: 874 










CD- Length = 195 residues, 98.5% aligned 


Score = 


77.4 bits (189), Expect = le-15 


NOV81 : 


162 


AERQAVRETWGSP- -APGIRL- -LFLLGSPVGEAGPDLDSLVAWESRRYSDLLLWDFLDV 217 






A R A+R+TW + + G R+ LFL+G + L LV E+R Y D+++ D D 


Sbjct : 


1 


ARRNAIRKTWMNQNNSRGGRIKSLFLVG- -LAALDGKLKKLVMEEARLYGDI I WDLEDS 5 8 


NOV8 1 : 


218 


PFNQTLKDLLLIAWLGRHCPTVSFVLRAQDDAFVHTPALIJ^LRALPPASARSLYLGEVF 2 7 7 






N TLK L +L ++ CP + + DD FV+ LL+ L + + G + 


Sb j Ct : 


59 


YLNLTLKTLTILLYWSKCPNAKLIGKIDDDVFVNPDNLLSLLEREYIDPSPLSFYGYII ' 118 


NOV81 : 


278 


TQAMPLRKPGGPFYVPESFF-EGGYPAYA^GGGWIAGRJ^PWLJLRAAARVAPFPFEDVY 33 6 






P+R +YVP + + YP Y SG Y+ + + AP +L+A+ EDV 


Sbjct : 


119 


KNGEPVRTKKSKWYVPPTAYPCSNYPPYLSGPFYILSRDAAPLiILiKAS KHRRFIKIEDVL 178 


NOV8 1 : 


337 


-TGLCIRALGLVPQ 34 9 






TG+ LG+ 


Sbjct : 


179 


ITGILALDLGI SRI 192 



5 ~' ~ 

There are 2 known types of carbohydrate chains in the lacto series of oligosaccharides: 
type 1 chains, which contain the Gal(beta-l-3)GlcNAc linkage, and type 2 chains, which 
contain the topoisomer Gal(beta-l-4)GlcNAc. The biosynthesis of both types of chains is 
catalyzed by specific galactosyltransferases (GalTs), which transfer galactose (Gal) to N- 
10 acetylglucosamine (GIcNAc)-terminating chains. Beta-4-GalT enzymes (e.g., GGTB2; are the 
galactosyltransferases responsible for type 2 chain biosynthesis, while beta-3-GalTs are the 
type 1 elongating enzymes. 

Kolbinger et al. (1998) searched an expressed sequence tag (EST) database with the 
amino acid sequence of a human beta-3-GalT, which they called beta-3-GalTl, and identified 
1 5 human brain cDNAs encoding a novel beta-3-GalT, which they named beta-3-GalT2. The 

deduced 422-amino acid beta-3-GalT2 protein has a predicted type II transmembrane topology 
with 5 potential N-glycosylation sites, and a predicted molecular mass of 49,202 Da. Beta-3- 
GalT2 shares 46% amino acid identity with beta-3-GalTl, but has a 17-amino acid extension 
at the carboxy terminus and longer cytoplasmic and stem regions. Beta-3-GalT2 directed the 
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synthesis of type 1 chains in mammalian cells and transferred Gal to GlcNAc- and Gal- 
terminating acceptors in enzymatic assays. 

Northern blot analysis demonstrated strong expression of a 3.5-kb beta-3-GalT2 
transcript, and weaker expression of a 2.8-kb transcript, in heart and brain. Amado et al. 
5 (1998) stated that a human beta-3-galactosyltransferase gene, called beta-3-GalTl by them, 
was isolated from a melanoma cell line using a transfection-cloning strategy. Beta-3-GalTl is 
a predicted 326-amino acid protein. By carrying out a BLAST search of an EST database with 
the beta-3-GalTl coding sequence, Amado et al. (1998) identified cDNAs encoding 3 other 
beta-3-galactosyltransferases, beta-3-GalT2, beta-3-GalT3, and beta-3-GalT4. The sequences 
10 of the 4 predicted proteins share 29 to 42% identity and have several conserved short sequence 
motifs. All 4 appear to be evolutionarily related, since their coding regions are contained in a 
single exon. 

Using an insect cell expression system, Amado et al. (1998) showed that beta-3-GalTl 
and beta-3-GalT2 are UDP-galactose:beta-N-acetyl-glucosamine beta-1,3 

1 5 galactosyltransferases with similar kinetic properties. Northern blot analysis revealed that 
beta-3-GalTl is expressed as a 6.5-kb mRNA exclusively in brain. Hennet et al. (1998) 
identified a mouse beta-3-GalTl homolog, designated beta-3-GalTI, and found that the coding 
region was contained in a single exon. 

NOV81 is predicted to be expressed in at least the following tissues: colon, blood, and 

20 lymphocyte. This information was derived by determining the tissue sources of the sequences 
that were included in the invention including but not limited to SeqCalling sources, public 
EST sources, literature sources, and/or RACE sources. Further expression data for NOV81 is 
provided in Example 2. 

The NOV81 nucleic acids and proteins of are useful in potential therapeutic 

25 applications implicated in various pathological disorders described further herein, for example, 
hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, autoimmume disease, 
allergies, immunodeficiencies, transplantation, graft versus host disease (GVHD), 
lymphaedema, anemia, ataxia-telangiectasia, autoimmume disease,immunodeficiencies, 
Hirschsprung's disease, Crohn's Disease, appendicitis as well as other diseases, disorders and 

30 conditions. NOV8 1 nucleic acids encoding the galactosyltransferase-like protein of the 

invention, or fragments thereof, may further be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a beta-l,3-galactosyItransferase-like 
protein includes the nucleic acid whose sequence is provided in Table 81 A or 81C, or a 
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fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 81 A or 81 C while still 
encoding a protein that maintains its beta-l,3-galactosyltransferase-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to the sequence of Table 81 A or 81 C, 
including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 41% of the bases 
may be so changed. 

The novel protein of the invention includes the beta-l,3-galactosyltransferase-like 
protein whose sequence is provided in Table 81 B or 8 IB. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 81B or 81D while still encoding a protein that maintains its beta-1,3- 
galactosyltransferase-like activities and physiological functions, or a functional fragment 
thereof. In the mutant or variant protein, up to about 63% of the amino acid residues may be so 
changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV82 

The disclosed NOV82 (alternatively referred to herein as CG56983-01) includes the 
348 nucleotide sequence (SEQ ID NO:271) shown in Table 82A. A NOV82 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 9-1 1 and ends with a stop codon at 
nucleotides 321-322. The disclosed NOV82 maps to human chromosome X. 
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Table 82A. NOV82 Nucleotide Sequence (SEQ ID NO:271) 



ctatccctatggtgtcggtgtgcaggccgtggcctgctgtggccatagcacttctggctctgctggtctg" 

CCTGGGGGCGCTGGTCGACACCTGCCCCATCAAACCCGAGGCTCCTGGCGAAGACGAGTCCCTGGAGGAG 
CTGAGCCACTATTATGCTTCCCTGTGCCACTACCTCAACGTGGTCACCAGACAGTTAATTTCAGAGAGAA 
AC CTAC C AGAC AC C ATTGTGTC C AAGGAAGT AT T TT TC ACAAGC ACAAAGG AAAG AC CTG TGAGGACACA 
GAAGGAAGGTTGCCATCTGCAAGCCAAGGAGAGAAGCCTCTGAAAAAACCAAACCTGCTGGCACCTTG 



A NOV82 polypeptide (SEQ ID NO:272) encoded by SEQ ID NO:271 is 104 amino 
acids in length and is presented using the one-letter amino acid code in Table 82B. The Psort 
profile for NOV82 predicts that this sequence has a signal peptide and is likely to be localized 
at the exterior of the cell with a certainty of 0.8200. In alternative embodiments, a NOV82 
polypeptide is located to the endoplasmic reticulum (membrane) with a certainty of 0.1000. 
The Signal P predicts a likely cleavage site for a NOV82 peptide is between positions 28 and 
29, i.e., at the dash in the sequence VDT-CP. 



Table 82B. NOV82 Polypeptide Sequence (SEQ ID 
NO:272) 



MVSVCRPWPAVAIALLALLVCLGALVDTCPIKPEAPGEDESLEELSHYYASLCHYLNVVT 
RQLISERNLPDTIVSKEVFFTSTKERPVRTQKEGCHLQAKERSL 



A BLAST analysis of NOV82 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV82 had high 
homology to other proteins as shown in Table 82C. 



Table 82C. BLASTX results from PatP database for NOV82 



Smallest 
Sum 

High Probability 

Sequences producing High-scoring Segment Pairs: Score P (N) 

patp:AAE09439 Human sbghPYYa protein - Homo sapiens 329 1.7e-29 

patp:AAB08020 Amino acid sequence of a human peptide yY 3 01 l.*6e-26 

patp:AAG75364 Human colon cancer antigen protein * 293 l.le-25 

patp:AAY14602 Amino acid sequence of the baboon PY 221 4*7e-18 

patp:AAY43334 Neuropeptide Y - Synthetic, 97 aa. - 188 1 5e-14 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 217 of 288 bases (75%) identical to a gb:GENBANK- 
ID:HUMPYYP3|acc:D13902.1 mRNA from Homo sapiens (Human mRNA for peptide YY). 
The full amino acid sequence of the protein of the invention was found to have 62 of 94 amino 
acid residues (65%) identical to, and 73 of 94 amino acid residues (77%) similar to, the 97 
amino acid residue ptnr:SWISSNEW-ACC:P10082 protein from Homo sapiens (Human) 
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(PEPTIDE YY PRECURSOR (PYY)). NOV82 also has homology to the other proteins 
shown in the BLASTP data in Table 82D. 



Table 82D. NOV82 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 

(%) 


Expect 


gi | 1172796 | sp | PI 
0 0 82 | PYY_HUMAN 


PEPTIDE YY PRECURSOR 
(PYY) (PEPTIDE 
TYROSINE TYROSINE) 


97 


62/94 
(65) 


73/94 
(76) 


2e-23 


gi | 131753 | Sp | P10 
631 | PYY_RAT 


PEPTIDE YY PRECURSOR 
(PYY) (PEPTIDE 
TYROSINE TYROSINE) 


97 


61/94 
(64) 


72/94 
(75) 


8e-23 ! 


gi | 4758982 | ref |N 
P 004151. l| 
(NM 004160) 


peptide YY 
[Homo sapiens] 


90 


60/91 
(65) 


71/91 
(77) 


4e-22 1 


gi |422871 |pir | |S 
34569 


peptide YY precursor 
(clone L.2) - human 
(fragment) 


90 


60/91 
(65) 


71/91 
(77) 


4e-22 ! 


gi |422870 |pir | |S 
34568 


peptide YY precursor 
(clone LI) - human 
(fragment) 


90 


59/91 
(64) 


70/91 
(76) 


2e-21 



5 This BLASTP data is displayed graphically in the ClustalW in Table 82E. A multiple 

sequence alignment is given, with the NOV82 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 82D. 



Table 82E. ClustalW Alignment of NOV82 

_____ ■ (SEQ ID NO : 2 72) — 

gi|ll72796| (SEQ ID NO:709) 

gi | 131753 | (SEQ ID NO:710) 

gi|4758982| (SEQ ID NO:711) 

gi|42287l| (SEQ ID NO:712) 

gi|422870| (SEQ ID NO:713) 



NOV82 

gi | 1172796 | 
gi | 131753 | 
gi |4758982 | 
gi | 422871 | 
gi |422870 | 



NOV82 

gi | 1172796 | 
gi j 131753 | 
gi | 4758982 | 
gi | 422871 | 
gi | 422870 | 



NOV82 
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gi|H72796| 

gi j 131753 | 

gi|4758982| 

gi|42287l| 

gi|422870| 



Table 82F lists the domain description from DOMAIN analysis results against NOV82. 
This indicates that the NOV82 sequence has properties similar to those of other proteins 
known to contain this domain. 

5 

Table 82F. Domain Analysis of NOV82 

gnllPfam|pfam00159 , hormone3, Pancreatic hormone peptide SEQ ID NO : 875 

CD-Length = 3 6 residues, 91.7% aligned 
Score = 43.1 bits (100), Expect = 8e-06 

NOV82 : 30 PIKPEAPGEDESLEELSHYYASLCHYLNVVTRQ 62 

P KPE PG+D S E+L+ Y +L Y+N++TR 
Sbjct: 2 PSKPEYPGDDASPEDLAQYLRALRQYINLITRP 34 

Pancreatic hormone (PP) is a peptide synthesized in pancreatic islets of Langherhans, 
which acts as a regulator of pancreatic and gastrointestinal functions. The hormone is 
produced as a larger propeptide, which is enzymatically cleaved to yield the mature active 

10 peptide, which is 36 amino acids in length and has an amidated C-terminus. The hormone has 
a globular structure with residues 2-8 forming a left-handed poly-proline-II-like helix, residues 
9-13 a beta turn, and 14-32 an alpha-helix,held close to the first helix by hydrophobic 
interactions. Unlike glucagon, another peptide hormone, the structure of pancreatic peptide is 
preserved in aqueous solution. Both N- and C-termini are required for activity: receptor 

15 binding and activation functions may reside in the N- and C-termini respectively. 

PYY is secreted from endocrine cells in the lower small intestine, colon, and pancreas. 
It acts on the gastrointestinal tract as an inhibitor of gastric acid secretion, gastric emptying, 
digestive enzyme secretion by the pancreas, and gut motility (Leiter et al., 1987). A related 
peptide, pancreatic polypeptide is secreted only by cells within the endocrine and exocrine 

20 pancreas and specifically inhibits the secretion of enzymes and bicarbonate from the exocrine 
pancreas. A third member of this gene family is neuropeptide Y. 

Each of these proteins are synthesized with a signal peptide sequence followed by a 
36-amino acid active peptide and a carboxyterminal peptide. During maturation, the signal and 
carboxyterminal peptides are cleaved and a common carboxyterminal tyrosine in the mature 

25 peptide is amidated. Hort et al. (1995) cloned the human PYY gene by screening a genomic 
library with a PCR product produced from the rat locus. The gene contains 4 exons spanning 
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about 1.2-kb of DNA. Exon 1 represents 5-prime untranslated sequence and is 75% identical 
to the comparable rat sequence. 

PYY and PPY are about 10-kb apart and are mapped them by fluorescence in situ 
hybridization to 17q21. Based on a comparison of the 3 gene sequences,it has been concluded 
5 that NPY and PYY are the result of a gene duplication event, and that a subsequent tandem 
duplication produced the PPY gene. Pancreatic polypeptide, peptide tyrosine-tyrosine (PYY), 
and neuropeptide tyrosine (NPY), three members of a family of structurally related peptides, 
are mainly expressed in the endocrine pancreas, in endocrine cells of the gut, and in the brain, 
respectively. Synthetic human PYY prepared using a solid-phase synthetic technique was 

10 found to be structurally identical to the natural peptide. 

NOV82 is predicted to be expressed in at least the following tissues: endothelium, 
esophageal carcinoma. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 

1 5 NOV82 is provided in Example 2. 

The novel nucleic acids of the invention encoding a PEPTIDE YY -like proteins 
includes the nucleic acid whose sequence is provided in Table 82A, or a fragment thereof. The 
invention also includes a mutant or variant nucleic acid any of whose bases may be changed 
from the corresponding base shown in Table 82A while still encoding a protein that maintains 

20 its PEPTIDE YY -like activities and physiological functions, or a fragment of such a nucleic 
acid. The invention further includes nucleic acids whose sequences are complementary to the 
sequence of Table 82A, including nucleic acid fragments that are complementary to any of the 
nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 

25 complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 

30 In the mutant or variant nucleic acids, and their complements, up to about 25% of the bases 
may be so changed. 

The novel protein of the invention includes the PEPTIDE YY -like protein whose 
sequence is provided in Table 82B. The invention also includes a mutant or variant protein any 
of whose residues may be changed from the corresponding residue shown in Table 82B while 



still encoding a protein that maintains its PEPTIDE YY -like activities and physiological 
functions, or a functional fragment thereof. In the mutant or variant protein, up to about 35% 
of the amino acid residues may be so changed. 

The NOV82 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, esophageal 
carcinoma, inflammatory bowel disease, diverticular disease, as well as other diseases, 
disorders and conditions. NOV82 nucleic acids encoding the peptide YY-like protein of the 
invention, or fragments thereof, may further be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV83 

The disclosed NOV83 (alternatively referred to herein as CG56890-01) includes the 
1701 nucleotide sequence (SEQ ID NO:273) shown in Table 83 A. A NOV83 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 19-21 and ends with a stop codon 
at nucleotides 1669-1671. The disclosed NOV83 maps to human chromosome 17. 



Table 83A. NOV83 Nucleotide Sequence (SEQ ID NO:273) 

TGCGCCCCGTGCTCAGCCATGGTGGACATGGGGGCCCTGGACAACCTGATCGCCAACACCGCCTACCTGC 
AGGCCCGGAAGCCCTCGGACTGCGACAGCAAAGAGCTGCAGCGGCGGCGGCGTAGCCTGGCCCTGCCCGG 
GCTGCAGGGCTGCGCGGAGCTCCGCCAGAAGCTGTCCCTGAACTTCCACAGCCTGTGTGAGCAGCAGCCC 
ATCGGTCGCCGCCTCTTCCGTGACTTCCTAGCCACAGTGCCCACGTTCCGCAAGGCGGCAACCTTCCTAG 
AGGACGTGCAGAACTGGGAGCTGGCCGAGGAGGGACCCACCAAAGACAGCGCGCTGCAGGGGCTGGTGGC 
CACTTGTGCGAGTGCCCCTGCCCCGGGGAACCCGCAACCCTTCCTCAGCCAGGCCGTGGCCACCAAGTGC 
CAAGCAGCCACCACTGAGGAAGAGCGAGTGGCTGCAGTGACGCTGGCCAAGGCTGAGGCCATGGCTTTCT 
TGCAAGAGCAGCCCTTTAAGGATTTCGTGACCAGCGCCTTCTACGACAAGTTTCTGCAGTGGAAACTCTT 
CGAGATGCAACCAGTGTCAGACAAGTACTTCACTGAGTTCAGAGTGCTGGGGAAAGGTGGTTTTGGGGAG 
GTAAAAAACACTGGGAAGATGTATGCCTGTAAGAAACTGGACAAGAAGCGGCTGAAGAAGAAAGGTGGCG 
AGAAGATGGCTCTCTTGGAAAAGGAAATCTTGGAGAAGGTCAGCAGCCCTTTCATTGTCTCTCTGGCCTA 
TGCCTTTGAGAGCAAGACCCATCTCTGCCTTGTCATGAGCCTGATGAATGGGGGAGACCTCAAGTTCCAC 
ATCTACAACGTGGGCACGCGTGGCCTGGACATGAGCCGGGTGATCTTTTACTCGGCCCAGATAGCCTGTG 
GGATGCTGCACCTCCATGAACTCGGCATCGTCTATCGGGACATGAAGCCTGAGAATGTGCTTCTGGATGA 
C CTCGG C AACTG C AGGTT AT CT GAC CTGGGG CTGGC CGTGG AG ATG AAGGGTGG C AAG CC C AT C AC C C AG 
AGGCAGGCTGGAACCAATGGTTACATGGCTCCTGAGATCCTAATGGAAAAGGTAAGTTATTCCTATCCTG 
TGGACTGGTTTGCCATGGGATGCAGCATTTATGAAATGGTTGCTGGACGAACACCATTCAAAGATTACAA 
GGAAAAGGTCAGTAAAGAGGATCTGAAGCAAAGAACTCTGCAAGACGAGGTCAAATTCCAGCATGATAAC 
TTCACAGAGGAAGCAAAAGATATTTGCAGGCTCTTCTTGGCTAAGAAACCAGAGCAACGCTTAGGAAGCA 
GGAGAGAAAAGTCTGATGATCCCAGGAAACATCATTTCTTTAAAACGATCAACTTTCCTCGCCTGGAAGC 
TGGCCTAATTGAACCCCCATTTGTGCCAGACCCTTCAGTGGTTTATGCCAAAGACATCGCTGAAATTGAT 
GATTTCTCTGAGGTTCGGGGGGTGGAATTTGATGACAAAGATAAGCAGTTCTTCAAAAACTTTGCGACAG 
GTGCTGTTCCTATAGCATGGCAGGAAGAAATTATAGAAACGGGACTGTTTGAGGAACTGAATGACCCCAA 
CAGACCTACGGGTTGTGAGGAGGGTAATTCATCCAAGTCTGGCGTGTGTTTGTTATTGTAAATTGCTCTC 
TTTACCAGACAGGCAGCAGGA 
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A NOV83 polypeptide (SEQ ID NO:274) encoded by SEQ ID NO:273 is 550 amino 
acids in length and is presented using the one-letter amino acid code in Table 83B. The Psort 
profile for NOV83 predicts that this sequence is likely to be localized to the nucleus with a 
5 certainty of 0.9685. In alternative embodiments, a NOV83 polypeptide is located to 
microbodies with a certainty of 0. 1 3 1 7. 



Table 83B. NOV83 Polypeptide Sequence (SEQ ID 
NO:274) 

MVDMGALDNLIANTAYLQARKPSDCDSKELQRRRRSLALPGLQGCAELRQKLSLNFHSLC 
EQQPIGRRLFRDFLATVPTFRKAATFLEDVQNWELAEEGPTKDSALQGLVATCASAPAPG 
NPQPFLSQAVATKCQAATTEEERVAAVTLAKAEAMAFLQEQPFKDFVTSAFYDKFLQWKL 
FEMQ P VS DK Y FTE FR VLG KGG FGE VKNTG KM Y AC KKLDK KRL K KKGG E KM ALLE KE I L E K 
VSSPFIVSLAYAFESKTHLCLVMSLMNGGDLKFHIYNVGTRGLDMSRVIFYSAQIACGML 
HLHELGIVYRDMKPENVLLDDLGNCRLSDLGLAVEMKGGKPITQRQAGTNGYMAPEILME 
KVSYSYPVDWFAMGCSI YEMVAGRTPFKDYKEKVSKEDLKQRTLQDEVKFQHDNFTEEAK 
DICRLFLAKKPEQRLGSRREKSDDPRKHHFFKTINFPRLEAGLIEPPFVPDPSWYAKDI 
AEIDDFSEVRGVEFDDKDKQFFKNFATGAVPIAWQEEIIETGLFEELNDPNRPTGCEEGN 
SSKSGVCLLL 



A BLAST analysis of NOV83 was run against the proprietary PatP GENESEQ Protein 
10 Patent database. It was found, for example, that the amino acid sequence of NOV83 had high 
homology to other proteins as shown in Table 83C. 



Table 83C. BLASTX results from PatP database for NOV83 










Smal lest 










Sum 








High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P (N) 


patp 


•AAU03502 


Human protein kinase #2 - Homo sapiens 


2812 


1 . 3e-292 


patp 


:AAY57085 


Human rhodopsin kinase amino acid sequence 


1255 


1 . 3e-127 


patp 


AAY24423 


GRK4 polymorphism GRK4- alpha protein 


1248 


7 . Oe-127 


patp 


:AAY24424 


GRK4 polymorphism GRK4-beta protein 


1190 


1 .7e-124 


patp 


AAY24 4 25 


GRK4 polymorphism GRK4- gamma protein 


1215 


2 . 2e-123 



In a search of sequence databases, it was found, for example, that the nucleic acid 
1 5 sequence of this invention has 1 359 of 1 700 bases (79%) identical to a gb:GENB ANK- 

ID:AF063016|acc:AF063016.1 mRNA from Spermophilus tri decern lineatus (Spermophilus 
tridecemlineatus G protein-coupled receptor kinase GRK7 mRNA). The full amino acid 
sequence of the protein of the invention was found to have 463 of 549 amino acid residues 
(84%) identical to, and 502 of 549 amino acid residues (91%) similar to, the 548 amino acid 
20 residue ptnr:SPTREMBL-ACC:Q9Z2G7 protein from Spermophilus tridecemlineatus 
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(Thirteen-lined ground squirrel) (G PROTEIN-COUPLED RECEPTOR KINASE GRK7). 
NOV83 also has homology to the other proteins shown in the BLASTP data in Table 83D. 



Table 83D. NOV83 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%> 


Expect 


gi | 17026318 | g 
b | AAL33880 . 1 | 
AF2 82 2 6 9_1 
(AF282269) 


G protein-coupled 
receptor kinase 7 [Homo 
sapiens] 


553 


548/555 
(98) 


548/555 
(98) 


0 . 0 


gi | 17933490 | g 
b| AAL06241.1 | 
(AY049726) 


retina G protein- coupled 
receptor kinase 7 
[Bos taurus] 


552 


470/555 
(84) 


503/555 
(89) 


0 . 0 


gi | 17026320 |g 
b | AAL33881 . 1 | 
AF282270_1 
(AF282270) 


G protein- coupled 
receptor kinase 7 [Sus 
scrof a] 


553 


465/555 
(83) 


504/555 
(90) 


0 . 0 


gi |4001826 |gb 
| AAC95001 . 1 | 
(AF063016) 


G protein- coupled 
receptor kinase GRK7 
[Spermophi lus 
tridecemlineatus ] 


548 


463/554 
(83) 


502/554 
(90) 


0.0 


gi | 17443851 | r 
ef |XP_067593 . 
1 | (XM_067593) 


similar to G protein- 
coupled receptor kinase 
GRK7 (H. sapiens) 
[ Homo sapi en s ] 


692 


326/350 
(93) 


328/350 
(93) 


e-172 



5 This BLASTP data is displayed graphically in the ClustalW in Table 83E. A multiple 

sequence alignment is given, with the NOV83 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 83D. 



Table 83E. ClustalW Alignment of NOV83 



NOV83 SEQ ID NO: 274) 

gi | 17026318 | SEQ ID NO:714) 

gi|l7933490| SEQ ID NO:715) 

gi|l7026320| SEQ ID NO:716) 

gi|4001826| SEQ ID NO:717) 

gi|l744385l| SEQ ID NO:718) 



10 20 30 40 50 

....|....|....|....|....|....|....|....|....|....| 

NOV83 

gi 1 17026318 | : 

gi | 17933490 j 

gi | 17026320 | 

gi j 4001826 | 

gi|l744385l| MPGEYKVTAPETSARDTHASVQVEAYQGREVHREISTLLKKAAALPRKSS 



NOV83 

gi | 17026318 | 
gi | 17933490 | 
gi | 17026320 j 
gi [4001826 | 



60 70 80 90 100 

|....|....|....|....|....|....|....|....|....| 
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gi | 17443851 | RETSFLRTAGSAEQSALHGCHKDTRNMRTSVYKGSVPCSDVGCDKFELQR 

110 120 130 140 150 



NOV8 3 

gi | 17026318 | 
gi | 17933490 | 
gi j 17026320 | 
gi|4001826| 
gi j 17443851 1 



NOV8 3 

gi | 17026318 | 
gi j 17933490 j 
gi j 17026320 | 
gi | 4001826 | 
gi | 17443851 1 



NOV83 

gi | 17026318 | 
gi j 17933490 | 
gi | 17026320 | 
gi | 4001826 | 
gi | 17443851 | 



NOV83 

gi | 17026318 | 
gi j 17933490 | 
gi j 17026320 j 
gi j 4001826 | 
gi | 17443851 | 



NOV83 

gi | 17026318 | 
gi j 17933490 j 
gi j 17026320 j 
gi j 4001826 | 
gi | 17443851 | 



NOV8 3 

gi | 17026318 | 
gi | 17933490 j 
gi | 17026320 | 
gi |4001826 | 
gi j 17443851 1 



NOV8 3 

gi | 17026318 | 
gi j 17933490 | 
gi j 17026320 | 
gi |4001826 | 
gi | 17443851 | 



NOV8 3 

gi | 17026318 | 



I 



I 



GHRGFVSFSKSKPEPEPEFRPSLQPPPLAAVSRAPDATRAAGPGRRESES 



160 



170 



180 



190 



200 



EFGNEELESIFSGLENNQRLQGLSLRYCGLGPQSGLRLGSVISQSAICEL 
210 220 230 240 250 



I 



I 



I 



FLDGNYLECSGALALLRPIAGFAETQGEDQPAPGSPDTGNPPQRLQGHRT 
260 270 280 290 300 



I 



HCKSLGRCLTREGKAASSPPALLCFPWECAPCSA 
310 320 330 



I 



VDMGgLDNL I ANT AY 
MVDMGgLDNL I ANT AY 
MVDMGgLDNL I ant ay 
MVDMGgLDNL I ANT AY 

JjDMGgLDNL I ANTAY 
MVDMGSLDNL I ANTAY 



340 



350 




360 



RRLFRDFLATVP 
RRLFRDFLATVP 
RRLFRDFLATVP 
RRLFRDFLATVP 
RRLFRDFLATVP 
RRLFRDFLATVP 



410 




380 



390 



400 



fledvqnwelaeegpgkgsalqglvatc 
fledvqnwelaeegpHkBsalqglvatc. 

FLE|VQ|WELAEEGPgKgSALQGLv3TCAg 
FLE|vQgWELAEEGpBKBsnLQ0LVATCAB 

fledvqnwelaeegpBkjjsuilqSlSatc. 

fledvqnwelaeegpHkSsalqglvatca 



420 



430 



s 

D 

IS 

450 




I 



460 
. . I . . 



I 



470 

. . I . . 



I 



480 
. . I . . 



I 



490 



500 
• - I 



SgFYDKFLQWKLFEMQPVSDKYFTEFRVLGKGGFGj 



^jSBFYDKFLQWKLFEMQPVSDKYFTEFRVLGKGGFGEVCAVQVKNTGKM 
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gi | 17933490 | 
gi | 17026320 | 
gi j 4001826 | 
gi | 17443851 | 



NOV8 3 

gi | 17026318 | 
gi j 17933490 j 
gi | 17026320 j 
gi|4001826| 
gi j 17443851 | 



NOV8 3 

gi | 17026318 | 
gi j 17933490 | 
gi | 17026320 j 
gi | 4001826 | 
gi | 17443851 | 



NOV8 3 

gi | 17026318 | 
gi | 17933490 j 
gi j 17026320 | 
gi j 4001826 | 
gi j 17443851 | 



NOV83 

gi | 17026318 | 
gi j 17933490 | 
gi | 17026320 j 
gi |4001826 | 
gi | 17443851 | 



NOV83 

gi | 17026318 | 
gi | 17933490 j 
gi j 17026320 | 
gi|4001826| 
gi j 17443851 | 



NOV83 

gi | 17026318 | 
gi | 17933490 j 
gi j 17026320 j 
gi | 4001826 | 
gi | 17443851 | 



NOV8 3 

gi | 17026318 | 
gi | 17933490 j 
gi | 17026320 j 
gi | 4001826 | 
gi | 17443851 | 



fydkflqwkgfemqpvsdkyfgefrvlgkggfgevcavqvkntgkm 
fydkflqwk!|femqpvsdkyf@efrvlgkggfgevcavqvkntgkm 
fyd^flqwklfemqpvsdkyftefrvlgkggfgevcavqvj^ntgkm 
fydkflqwklfemqpvsdkyftefrvlgkggfgevcavqvkntgkm 



510 



I 



520 
. . I . . 



I 



530 
. . I . . 



I 



540 
- I - ■ 



I 



S50 
■ I 



YACKKLDKKRLKKKGGEKMALLEKE 
YACKKLDKKRLKKKGGEKMALLEKE 
YACKKLDKKRLKKKGGEKMALLEKE 
YACKKLDKKRLKKKGGEKMALLEKE 
YACKKLDKKRLKKKGGEKMALLEKE 
YACKKLDKKRLKKKGGEKMALLEKE 



ilekvsspfivslayafeskthlcl 
ilekvsspfivslayafeskthlcl 
ilej|vsspfivslayafeskShlcl 
ilekvsspfivslayafeskthlcl 

ILEKVSSPFIVSLAYAFESKTHLCL 
ILEKVSSPFIVSLAYAFESKTHLCL 



600 



mslmnggdlkfhiynvgtrgldmsrvifysaqigcg|jlhlhglgivyrd 
vmslmnggdlkfhiynvgtrgldmsrvifysaqiBcg||lhlhHlgivyrd 
vmslmnggdlkfhiy|vg33gldmsrvifysaqiHcg|lhlhBlgivyrd 
vmslmnggdlkfhi y|vg§rgldm|Srvi fysaq^cgSlhlhBlgi vyrd 

VMSLMNGGDLKFHIYNVGTRGL^MSRVIFY^AQ^HcGfflLHLHgLGIVYRD 

vmslmnggdlkfhiynvgtrgldmsrvifysaqiscgSlhlhBlgivyrd 



rRGLDMSRVIFYSAQIgCGgLHLHl3LGIVYRD 
rRGLDMSRVI FYSAQiHcgSlHLH 

PgldmsrvifysaqiBcgSlhlh 
GLDM^RVI FYSAQgCG|LHLHSLGI VYRD 




810 



820 



830 



iawqeei ietglfeelndpnrp«gcy 
iawqeeiietglfeelndpnrpHgcr 
iawqeei ietg^f^elndpnrpggcs 
i awqee i i etglfeelndpnrpggcj 
SawqeeiietglfeelndpnrpBgiS 



SSgSGVCLLL 

;|ss|sgvclll 
g|ss|sgvclll 



EG 

EGij 
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Table 83F lists the domain description from DOMAIN analysis results against NOV83. 
This indicates that the NOV83 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 83F. Domain Analysis of NOV83 



gnliSmartlsmart00220 , S_TKc, Serine/Threonine protein kinases, catalytic domain; Phosphotransferases. 
Serine or threonine-specific kinase subfamily. SEQ ID NO : 87 6 

CD-Length = 256 residues, 100.0% aligned 
Score = 208 bits (530), Expect = 6e-55 



NOV83 : 


191 


Sbjct : 


1 


NOV83 : 


246 


Sb j Ct : 


60 


NOV83 : 


306 


Sbjct : 


118 


NOV83 : 


366 


Sb j Ct : 


177 


NOV83 : 


424 


Sbj Ct : 


234 



FTE FRVLGKGGFGE V KNTG KM Y AC KKLDKKRLKKKGGEKMAlAiE KE I LE KV S S P F 245 

+ VLGKG FG+V K TGK+ A K + K++LKKK E L E +IL+K+ P 

YELLEVLGKGAFGKVYLARDKKTGKLVAIKVIKKEKLKKKKRE-RILREIKILKKLDHPN 59 

I VSLAYAFESKTHLCLVMSLMNGGDLKFHI YNVGTRGLDMSRVI FYSAQI ACGMLHLHEL 3 0 5 
IV L FE L LVM GGDL + G L FY+ QI + +LH 

IVKLYDVFEDDDKLYLVMEYCEGGDLFDLLKKRG- -RLSEDEARFYARQILSALEYLHSQ 117 

GIVYRDMKPENVLLDDLGNCRLSDLGLAVEMKGGKPITQRQAGTNGYMAPEILMEKVSYS 3 6 5 
GI++RD+KPEN+LLD G+ +L+D GLA ++ G + GT YMAPE+L+ K Y 

GIIHRDLKPENILLDSDGHVKLADFGLAKQLDSGGTLLTTFVGTPEYMAPEVLLGK-GYG 176 

YPVDWFAMGCSIYEMVAGRTPFKDYKEKVSKEDLiKQRTLQDEVKF- -QHDNFTEEAKDIC 423 

VD +++G +YE++ G+ PF + L ++ + F + EAKD+ 

KAVDIWSLGVILYEL.LTGKPPFPGDDQ LLALFKKIGKPPPPFPPPEWKISPEAKDLI 23 3 

RLFLAKKPEQRLGSRREKSDDPRKHHFF 451 
+ L K PE+RL +++ +H FF 

KKLLVKDPEKRLT AEEALEHPFF 256 



Serine/threonine protein kinases are an extensive family of enzymes that catalyzes the 
phosphorylation of serine or threonine residues on its target protein. Protein kinases share a 
conserved catalytic core common to both serine/ threonine and tyrosine protein kinases. This 
domain contains residues, which are specific to the distinct types of protein kinases 

10 The S6/H4 kinase purified from human placenta catalyzes phosphorylation of the S6 

ribosomal protein, histone H4, and myelin basic protein. In vitro activation of the p60 S6/H4 
kinase requires removal of an autoinhibitory domain by mild trypsin digestion and 
autophosphorylation of the catalytic domain (p40 S6/H4 kinase). The two 
autophosphorylation/autoactivation sites contain the sequences SSMVGTPY (site 1) and 

15 SVIDPVPAPVGDSHVDGAAK (site 2) (SEQ ID NO: 1381). These sequences identify S6H4 
kinase as the rac-activated PAK65 (Martin, G. A., Bollag, G., McCormick, F. and Abo, A. 
(1995) EMBO J. 14, 1971-1978). Site 1 phosphorylation is most rapid, but activation does not 
occur until site 2 is autophosphorylated. The site 1 phosphorylation occurs by an 
intramolecular mechanism whereas site 2 autophosphorylation occurs by an intermolecular 

20 mechanism. A model is proposed in which phosphorylation of sites 1 and 2 occurs 

sequentially. The model proposes that trypsin treatment of the inactive holoenzyme removes 
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an inhibitory rac-binding domain which blocks MgATP access to the catalytic site. The 
pseudosubstrate domain at site 1 is autophosphorylated and subsequent bimolecular 
autophosphorylation at site 2 fully opens the catalytic site. Phosphorylation by a regulatory 
protein kinase may occur at site 2 in vivo. 
5 Rapid regulation of G-protein-coupled receptors (GPCRs) involves agonist- promoted 

receptor phosphorylation by GPCR kinases (GRKs) . This process is followed by arrestin 
binding and transient receptor internalisation. It has been shown that beta-adrenergic receptor 
kinase (beta ARK-1 or GRK2) follows a similar pattern of internalisation upon agonist 
activation of beta(2)-adrenergic receptors (beta(2)AR) and that beta ARK expression levels 

1 0 modulate receptor sequestration. 

Such studies indicate a functional relationship between receptor phosphorylation and 
sequestration, showing that beta ARK not only translocates from the cytoplasm to the plasma 
membrane in response to receptor occupancy, but also shares endocytic mechanisms with the 
beta(2)AR . These results suggest a role for beta ARK in the sequestration process, or 

1 5 involvement of receptor internalisation in the intracellular trafficking of the kinase. 

NOV83 is predicted to be expressed in at least the following tissues: retina, spleen. 
This information was derived by determining the tissue sources of the sequences that were 
included in the invention including but not limited to SeqCalling sources, public EST sources, 
literature sources, and/or RACE sources. Further expression data for NOV83 is provided in 

20 Example 2. 

The NOV83 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, hemophilia, 
hypercoagulation, idiopathic thrombocytopenic purpura, immunodeficiencies, graft vesus host 
disease, Von Hippel-Lindau (VHL) syndrome, diabetes, tuberous sclerosis, as well as other 

25 diseases, disorders and conditions. NOV83 nucleic acids encoding the GPCR-like protein of 
the invention, or fragments thereof, may further be useful in diagnostic applications, wherein 
the presence or amount of the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a G protein-coupled receptor kinase 
GRK7-like protein includes the nucleic acid whose sequence is provided in Table 83A, or a 

30 fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 

bases may be changed from the corresponding base shown in Table 83A while still encoding a 
protein that maintains its G protein-coupled receptor kinase GRK7-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 




nucleic acids whose sequences are complementary to the sequence of Table 83 A, including 
nucleic acid fragments that are complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
5 include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 21% of the bases 

10 may be so changed. 

The novel protein of the invention includes the G protein-coupled receptor kinase 
GRK7-like protein whose sequence is provided in Table 83B. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 83B while still encoding a protein that maintains its G Protein- 

15 Coupled Receptor Kinase GRK7-like activities and physiological functions, or a functional 

fragment thereof. In the mutant or variant protein, up to about 16% of the amino acid residues 
may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 

20 methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV84 

25 The disclosed NOV84 (alternatively referred to herein as CG56912-01) includes the 

2355 nucleotide sequence (SEQ ID NO:275) shown in Table 84A. A NOV84 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 11-13 and ends with a stop codon 
at nucleotides 2336-2338. The disclosed NOV84 maps to human chromosome 17. 



Table 84A. NOV84 Nucleotide Sequence (SEQ ID NO:275) 

AGGCTGGAGGATGGGGCAGCAGGTCCTAGCGGGGGAGGGCAGGGGAGCCTCTGGGCAGGTACGGCCTGAC 
GCCCCGGGTCCTCCCGCCCCGCCAGGCCTGACGGAGGACGAGGACGTGCGCGCCATGCTGCGGGGCTCCC 
GGCTCCGCAAGATCCGCTCGCGCACGTGGCACAAGGAGCGGCTGTACCGGCTGCAGGAGGACGGCCTGAG 
CGTGTGGTTCCAGCGGCGCATCCCGCGTGCGCCATCGCAGCACATCGTCCTCCGCCCTGACCCGGCCCTC 
CTCTCAGTCTTCGTGCAGCACATCGAGGCGGTCCGCGAGGGCCACCAGTCCGAGGGCCTGCGGCGCTTCG 
GGGGTGCCTTCGCGCCAGCGCGCTGCCTCACCATCGCCTTCAAGGGCCGCCGCAAGAACCTGGACCTGGC 
GGCGCCCACGGCTGAGGAAGCGCAGCGCTGGGTGCGCGGTCTGACCAAGCTCCGCGCGCGCCTGGACGCC 
ATGAGCCAGCGCGAGCGGCTAGACCAATATTGGATCCACTCCTATCTGCACCGGGCTGACTCCAACCAGG 
ACAGCAAGATGAGCTTCAAGGAGATCAAGAGCCTGCTGAGAATGGTCAACGTGGACATGAACGACATGTA 
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CGCCTACCTCCTCTTCAAGCAGGAGTGTGACCACTCCAACAACGACCGTCTAGAGGGGGCTGAGATCGAG 
GAGTTCCTGCGGCGGCTGCTGAAGCGGCCGGAGCTGGAGGAGATCTTCCATCAGTACTCGGGCGAGGACC 
GCGTGCTGAGTGCCCCTGAGCTGCTGGAGTTCCTGGAGGACCAGGGCGAGGAGGGCGCCACACTGGCCCG 
CGCCCAGCAGCTCATTCAGACCTATGAGCTCAACGAGACACCCTCTCCTGCCACCCCTATGACACTGGAT 
GGC T T C ATG ATGT AC CT GTTGT C GC CGG AGGGGGCT GC CTTGGAC AAC AC C C AC ACGTGTGTGT TC C AGG 
ACATGAACCAGCCCCTTGCCCACTACTTCATCTCTTCCTCCCACAACACCTATCTGACTGACTCCCAGAT 
CGGGGGGCCCAGCAGCACCGAGGCCTATGTTAGGGCCTTTGCCCAGGGATGCCGCTGCGTGGAGCTGGAC 
T GCTGGG AGGGG C C AGG AGGGG AGC C CGT C AT CT AT C ATGGCC AT AC C CTC AC C T C C AAG ATT CT CTT C C 
GGGACGTGGTCCAAGCCGTGCGCGACCATGCCTTCACGCTGTCCCCTTACCCTGTCATCCTATCCCTGGA 
CAACCACGACGGGCTGGAGCAGCAGGCTGCCATGGCCCGCCACCTCTGCACCATCCTGGGGGACATGCTG 
GTGACACAGGCGCTGGACTCCCCAAATCCCGAGGAGCTGCCATCCCCAGAGCAGCTGAAGGGCCGGGTCC 
TGGTGAAGGGAAAGAAGCTGCCCGCTGCTCGGAGCGAGGATGGCCGGGCTCTGTCGGATCGGGAGGAGGA 
GGAGG AGG ATGACGAGG AGG AAG AAGAGG AGGTGGAGGCTGC AGCGC AGAGG CGG CT GGC CAAGC AGATC 
TCCCCGGAGCTGTCGGCCCTGGCTGTGTACTGCCACGCCACCCGCCTGCGGACCCTGCACCCTGCCCCCA 
ACGCCCCACAACCCTGCCAGGTCAGCTCCCTCAGCGAGCGCAAAGCCAAGAAACTCATTCGGGAGGCAGG 
GAACAGCTTTGTCAGGCACAATGCCCGCCAGCTGACCCGCGTGTACCCGCTGGGGCTGCGGATGAACTCA 
GCCAACTACAGTCCCCAGGAGATGTGGAACTCGGGCTGTCAGCTGGTGGCCTTGAACTTCCAGACGCCAG 
GCTACGAGATGGACCTCAATGCCGGGCGCTTCCTAGTCAATGGGCAGTGTGGCTACGTCCTAAAACCTGC 
CTGCCTGCGGCAACCTGACTCGACCTTTGACCCCGAGTACCCAGGACCTCCCAGAACCACTCTCAGCATC 
CAGGTGCTGACTGCACAGCAGCTGCCCAAGCTGAATGCCGAGAAGCCACACTCCATTGTGGACCCCCTGG 
TGCGCATTGAGATCCATGGGGTGCCCGCAGACTGTGCCCGGCAGGAGACTGACTACGTGCTCAACAATGG 
CTTCAACCCCCGCTGGGGGCAGACCCTGCAGTTCCAGCTGCGGGCTCCGGAGCTGGCACTGGTCCGGTTT 
GTGGTGGAAGATTATGACGCCACCTCCCCCAATGACTTTGTGGGCCAGTTTACACTGCCTCTTAGCAGCC 
TAAAGCAAGGGTACCGCCACATACACCTGCTTTCCAAGGACGGGGCCTCACTGTCACCAGCCACGCTCTT 
C ATC C AAAT C CG CAT CC AG CGCT CCTGAGGG C C C AC C TC AC T CGC 



A NOV84 polypeptide (SEQ ID NO:276) encoded by SEQ ID NO:275 is 775 amino 
acids in length and is presented using the one-letter amino acid code in Table 84B. The Psort 
profile for NOV84 predicts that this sequence is likely to be localized to the cytoplasm with a 
5 certainty of 0.4500. In alternative embodiments, a NOV84 polypeptide is located to 
microbodies with a certainty of 0.3000. 



MGQQVLAGEGRGASGQVRPDAPGPPAPPGLTEDEDVRAMLRGSRLRKIRSRTWHKERLYR 
LQEDGLSWFQRRIPRAPSQHIVLRPDPALLSVFVQHIEAVREGHQSEGLRRFGGAFAPA 
RCLTIAFKGRRKNLDLAAPTAEEAQRWVRGLTKLRARLDAMSQRERLDQYWIHSYLHRAD 
SNQDSKMSFKEIKSLLRMVNVDMNDMYAYLLFKQECDHSNNDRLEGAEIEEFLRRLLKRP 
ELEE I FHQYSGEDRVLSAPELLE FLEDQGEEGATLARAQQLI QTYELNET PS P ATPMTLD 
G FMMY LL S P EG AALDNTHT C V FQDMNQ P L AH Y F I S S S HNT YLTDS Q I GGP S S T E A Y VRAF 
AQGCRCVELDCWEGPGGEPVIYHGHTLTSKILFRDWQAVRDHAFTLSPYPVILSLDNHD 
GLEQQAAMARHLCTILGDMLVTQALDSPNPEELPSPEQLKGRVLVKGKKLPAARSEDGRA 
LSDREEEEEDDEEEEEEVEAAAQRRLAKQISPELSALAVYCHATRLRTLHPAPNAPQPCQ 
VSSLSERKAKKLIREAGNSFVRHNARQLTRVYPLGLRMNSANYSPQEMWNSGCQLVALNF 
QTPGYEMDLNAGRFLVNGQCGYVLKPACLRQPDSTFDPEYPGPPRTTLSIQVLTAQQLPK 
LNAE K PHS I VD P L VR I E I HGVP ADCARQETD YVLNNG FN P RWGQTLQ FQL RAP E L AL VRF 
WEDYDATSPNDFVGQFTLPLSSLKQGYRHIHLLSKDGASLSPATLFIQIRIQRS 



A BLAST analysis of NOV84 was run against the proprietary PatP GENESEQ Protein 



10 Patent database. It was found, for example, that the amino acid sequence of NOV84 had high 
homology to other proteins as shown in Table 84C. 



Table 84B. NOV84 Polypeptide Sequence (SEQ ID 
NO:276) 



Table 84C. BLASTX results from PatP database for NOV84 



Smallest 



674 












Sum 








High 


Probability 


Sequences producing High- scoring Segment Pairs: 


Score 


P(N> 


patp 


AAG63220 


Amino acid sequence of a human protein 


3735 


0.0 


patp 


AAB47516 


Human phospholipase C, 16 835 - Homo sapiens 


3734 


0 . 0 


patp 


AAY81394 


Rat phospholipase C-delta-1 - Rattus sp . 


1882 


4.6e-194 


patp 


AAE10440 


Novel human phospholipase protein #7 


1783 


1 .4e-183 


patp 


AAE11925 


Human CG121 (or C592) lipase protein #1 


1182 


6. 9e-120 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 954 of 1415 bases (67%) identical to a gb:GENBANK- 
ID:OCPLCMR|acc:Z49747.1 mRNA from Oryctolagus cuniculus (O.cuniculus mRNA for 
5 phospholipase C). The full amino acid sequence of the protein of the invention was found to 
have 381 of 745 amino acid residues (51%) identical to, and 524 of 745 amino acid residues 
(70%) similar to, the 745 amino acid residue ptnr:SPTREMBL-ACC:Q60450 protein from 
Cricetulus griseus (Chinese hamster) (PHOSPHOLIPASE C-DELTA1). NOV84 also has 
homology to the other proteins shown in the BLASTP data in Table 84D. 

10 



Table 84D. NOV84 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi | 14715017 |g 
b|AAH10668.l| 
AAH10668 
(BC010668) 


Similar to phospholipase 
C, delta [Homo sapiens] 


613 


604/615 
(98) 


606/615 
(98) 


0.0 


gi | 17481372 | r 
ef |XP_053638 . 
2 | (XM 053638) 


hypothetical protein 
XP_0 53 63 8 
[Homo sapiens] 


581 


472/500 
(94) 


475/500 
(94) 


0 . 0 


gi | 2137061 | pi 
r| |PC4183 


1 -phosphat idyl inositol- 
4, 5-bisphosphate 
phosphodiesterase (EC 
3.1.4.11) delta-1 - 
Chinese hamster 
(fragment) 


745 


378/758 
(49) 


522/758 
(67) 


0.0 


gi|839398l|re 
f |NP_058731 . 1 
| (NM_017035) 


phospholipase C-deltal 
[Rattus norvegicus] 


756 


368/757 
(48) 


501/757 
(65) 


0 . 0 


gi | 9790167 | re 
f |NP_062650. 1 
| (NM_019676) 


phospholipase C, delta ; 
PLC-delta 1; 
phospholipase C delta-1 
[Mus musculus] 


756 


374/757 
(49) 


503/757 
(66) 


0.0 



This BLASTP data is displayed graphically in the ClustalW in Table 84E. A multiple 
sequence alignment is given, with the NOV84 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
15 Table 84D. 



Table 84E. ClustalW Alignment of NOV84 
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NOV84 (SEQ ID NO: 276) 

gi|l4715017| (SEQ ID NO:719) 

gi | 17481372 | (SEQ ID NO:720) 

gi|213706l| (SEQ ID NO:721) 

gi|839398l| (SEQ ID NO:722) 

gi | 9790167 | (SEQ ID NO:723) 



NOV84 

gi | 14715017 | 
gi | 17481372 | 
gi | 2137061 | 
gi j 8393981 j 
gi | 9790167 | 



NOV84 

gi | 14715017 | 
gi | 17481372 | 
gi | 2137061 | 
gi j 8393981 j 
gi | 9790167 | 



NOV84 

gi | 14715017 | 
gi j 17481372 | 
gi j 2137061 | 
gi | 8393981 j 
gi j 9790167 j 



NOV84 

gi | 14715017 | 
gi | 17481372 | 
gi | 2137061 | 
gi|839398l| 
gi j 9790167 j 



NOV84 

gi | 14715017 | 
gi j 17481372 j 
gi | 2137061 1 
gi j 8393981 j 
gi | 9790167 j 



NOV84 

gi | 14715017 | 
gi | 17481372 j 
gi j 2137061 | 
gi I 8393981 1 
gi | 9790167| 



NOV84 

gi | 14715017 | 
gi j 17481372 | 
gi j 2137061 | 
gi | 8393981 | 



10 20 30 

....|....|....|....|....|.. 
MGQQ VL AGE GRG AS GQ VR PD APG P PAP 



40 



50 



-MDSGRDFLTLF 
- MDSGRDFLTLt 




70 



80 



90 



YjSLQED 



^^^ F i^-S p i3fiS s Q H l VLRPDP ^ IjSVF ^3 H SN 

IrH- 
Is 3k 




150 



VRgGH^EGL 

vrSghHSegl 1 
K/rBgh^egl 
vrSghIBegl 




320 



330 



DGFgMYLLS 

dgfSmylls 
dgfSmylls 
dgfSmylls 
dgfImylls 




340 
. ■ I . . 



SSSHNTYL 
SSSHNTYL 
SSSHNTYL 
SSSHNTYL 
SSSHNTYL 
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gi | 9790167 | 



NOV84 

gi | 14715017 | 
gi j 17481372 | 
gi j 2137061 | 
gi j 8393981 | 
gi | 9790167 j 



NOV84 

gi | 14715017 | 
gi j 17481372 j 
gi |2137061 | 
gi | 8393981 | 
gi j 9790167 | 



NOV84 

gi | 14715017 | 
gi | 17481372 j 
gi | 2137061 | 
gi | 8393981 j 
gi | 9790167 | 



NOV84 

gi | 14715017 | 
gi | 17481372 | 
gi | 2137061 | 
gi j 8393981 | 
gi j 9790167 j 



NOV84 

gi | 14715017 | 
gi j 17481372 | 
gi j 2137061 | 
gi j 8393981 j 
gi j 9790167 | 



NOV84 

gi | 14715017 | 
gi | 17481372 j 
gi|213706l| 
gi j 8393981 j 
gi j 9790167 | 



NOV84 

gi | 14715017 | 
gi j 17481372 j 
gi | 2137061 | 
gi j 8393981 j 
gi | 9790167 j 



NOV84 

gi | 14715017 | 




NPEEjj 

NPEEM 

npeeB 

VTMsB 
VTTSN 
VTTsB 




530 



540 550 
I I I - -^'l ^1 

hStrlrtlhpapnIpwp - c$| 

EO^TRLRTLHPAPlSrMpgP - 
H^TRLRPdJ|ir 

k^vhfggf^n 
kIvhfggfs's 
kPvhfggfIs 
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gi I 17481372 I R 

gi j 2137061 | s S<ffl AV l T jj 

gi j 8393981 j sIqESavIT!! 

gi j 9790167 j S; 



NOV84 

gi|l4715017| 
gijl7481372| 
gi|213706l| 
gi | 8393981 j 
gi j 9790167 | 




Table 84F lists the domain description from DOMAIN analysis results against NOV84. 
This indicates that the NOV84 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 84F. Domain Analysis of NOV84 



gnliSmartismart00148 , PLCXc, Phospholipase C, catalytic domain (part); domain X; Phosphoinositide- 

specific phospholipases C. These enzymes contain 2 regions (X and Y) which together form a TIM barrel-like 

structure containing the active site residues. Phospholipase C enzymes (PI-PLC) act as signal transducers 

that generate two second messengers, inositol-l,4,5-trisphosphate and diacylglyceroL The bacterial enzyme 

appears to be a homologue of the mammalian PLCs. SEQ ID NO : 877 

CD-Length = 145 residues, 93.1% aligned 
Score =186 bits (471), Expect = 6e-48 



NOV84: 
Sbjct : 
NOV84 : 
Sbjct : 
NOV84 : 
Sbjct: 



323 QDMNQPLAHYFISSSHNTYLTDSQIGGPSSTEAYVRAFAQGCRCVELDCWEGPGGEPVIY 3 82 

QDM++PL+HYFI+SSHNTYLT Q+ G SS E Y++A GCRCVELDCW+GP GEPVIY 
1 QDMSKPLSHYFINSSHNTYLTGKQLWGESSVEGYIQALKHGCRCVELDCWDGPDGEPVIY 60 

3 83 HGHTLiTSKI LFRDWQAVRDHAFTLS PYPVI LSLDNHDGLEQQAAMARHLCTI LGDMLVT 442 

HGHT T I +V++A++ AF SPYPVILSL+NH +QQA MA+ I GD+L T 

61 HGHTFTLP I KLSEVLE AI KKFAFVTS PYP VI LSLENHCS PDQQAKMAQMFKE I FGDLLYT 120 . 

44 3 QALDSPNPEELPSPEQ 4 58 
S + E LPSPEQ 

121 PPTTS - SLEYLPSPEQ 13 5 



Phosphoinositide-specific phospholipase C acts as a signal transducer that generates 
two messengers, diacylglycerol and inositol 1 ,4,5-trisphosphate, by hydrolyzing inositol 
phospholipids. Molecules belonging to the PLC family are divided into subfamilies, PLC-beta, 
10 PLC-gamma, and PLC-delta, whose amino acid sequences are highly conserved in two distinct 
regions designated X and Y. PLC-delta- 1 is distinguished from PLC-gamma by lack of the 
SH2 and SH3 domains that are essential for activation of PLC-gamma by tyrosine protein 
kinases, and from PLC-beta by lack of the C-terminal region of PLC-beta that is responsible 
for binding and activation by G proteins. 
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Cheng et al. (1995) cloned cDNA for human PLC-delta-1 and localized the gene to 
chromosome 3 by means of a human/rodent somatic cell panel (Lyu et al., 1996). In the course 
of a large-scale sequencing analysis of genomic DNA in the vicinity of the homozygous 
deletion on chromosome 3p found in a lung cancer cell line, Ishikawa et al. (1997) found that 
5 the gene encoding phospholipase C, delta- 1 (PLCD1) is located just distal to the region 

removed by the deletion. They found that the gene consists of 15 exons and spans about 22 kb. 
By fluorescence in situ hybridization, they localized the PLCD1 gene to 3p22-p21.3. 
Shimohama et al. (1998) examined the entire sequences corresponding to protein-coding 
exons 2-15 of the hamster PLC-delta-1 gene in genomic DNA derived from the leukocytes of 
10 13 unrelated patients with early-onset sporadic Alzheimer disease. In 1 of these patients whose 
clinical features and course did not differ from those of the other 12 cases, they found a 
change of codon CGC (arg) to CAC (his), located in the pleckstrin homology domain of the 
PLCD1 gene. They stated that this was the first mutation found in the human PLC genes. 
Site-directed mutagenesis of the glutathione-S-transferase (GST/PLCD1) fusion 
15 protein changing argl05 to his resulted in a 4-fold decrease in the affinity of specific binding 
and a reduction in hydrolyzing activity to about 40% of that of the wildtype enzyme. This 
remarkable loss of function could be interpreted in terms of a conformational change in the 
pleckstrin homology domain. Shimohama et al. (1998) found that the argl05-to-his mutation 
was present in heterozygous state in the patient with AD. The mutation was not found in DNA 
20 extracted from leukocytes of 23 unrelated patients with familial AD, 23 unrelated patients with 
early-onset sporadic AD, 46 unrelated patients with late-onset sporadic AD, and 456 
nondemented control subjects. Thus the change did not appear to be a common polymorphism. 

However, determination of the possible pathologic role required transgenic studies of 
the mutant gene to determine the role of the enzyme and the mutation and a search for other 
25 mutations in the pleckstrin homology domain of PLC genes in human subjects with genetic 
disorders. In vitro single point mutagenesis, inositol phospholipid hydrolysis, and substrate 
protection experiments were used to identify catalytic residues of human 
phosphatidylinositide-specific phospholipase C delta 1 (PLC delta 1) isolated from a human 
aorta cDNA library. Invariant amino acid residues containing a functional side chain in the 
30 highly conserved X region were changed by in vitro mutagenesis. Most of the mutant enzymes 
were still able to hydrolyze inositol phospholipid with activity ranging from 10 to 100% of 
levels in the wild type enzyme. Exceptions were mutants with the conversion of Arg338 to 
Leu (R338L), Glu341 to Gly (E341G), or His356 to Leu (H356L), which made the enzyme 
severely defective in hydrolyzing inositol phospholipid. Phospholipid vesicle binding 
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experiments showed that these three cleavage-defective mutant forms of PLC delta 1 could 
specifically bind to phosphatidyl inositol 4,5-bisphosphate (PIP2) with an affinity similar to 
that of wild type enzyme. Western blotting analysis of trypsin-treated enzyme-PIP2 complexes 
revealed that a 67-kDa major protein fragment survived trypsin digestion if the wild type 
5 enzyme, E341G, or H356L mutant PLC delta 1 was preincubated with 7.5 microM PIP2, 

whereas if it was preincubated with 80 microM PIP2, the size of major protein surviving was 
comparable to that of intact enzyme. However, mutant enzyme R338L was not protected from 
trypsin degradation by PIP2 binding. These observations suggest that PLC delta 1 can 
recognize PIP2 through a high affinity and a low affinity binding site and that residues Glu341 

10 and His356 are not involved in either high affinity or low affinity PIP2 binding but rather are 
essential for the Ca(2+)-dependent cleavage activity of PLC. 

NOV84 is predicted to be expressed in at least the following tissues: aorta, brain, 
colon, foreskin, heart, muscle, placenta, stomach, uterus, whole embryo, brain, colon, eye, 
head and neck, lung, muscle, ovary, pancreas, placenta, skin, stomach, uterus. This 

1 5 information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, literature 
sources, and/or RACE sources. Further expression data for NOV84 is provided in Example 2. 

The NOV84 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, 

20 cardiomyopathy, atherosclerosis, hypertension, congenital heart defects, aortic stenosis, atrial 
septal defect (ASD), atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis 
, subaortic stenosis, ventricular septal defect (VSD), valve diseases, tuberous sclerosis, 
scleroderma, obesity, transplantation as well as other diseases, disorders and conditions. 
NOV84 nucleic acids encoding the phospholipase-Iike protein of the invention, or fragments 

25 thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a Phospholipase C delta 1 -like 
protein includes the nucleic acid whose sequence is provided in Table 84A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 

30 be changed from the corresponding base shown in Table 84A while still encoding a protein 
that maintains its Phospholipase C delta 1 -like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to the sequence of Table 84A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 
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The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
5 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 33% of the bases 
may be so changed. 

The novel protein of the invention includes the Phospholipase C delta 1-like protein 
10 whose sequence is provided in Table 84B. The invention also includes a mutant or variant 

protein any of whose residues may be changed from the corresponding residue shown in Table 
84 while still encoding a protein that maintains its Phospholipase C delta 1 -like activities and 
physiological functions, or a functional fragment thereof. In the mutant or variant protein, up 
to about 49% of the amino acid residues may be so changed. 
1 5 These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

20 

NOV85 

The disclosed NOV85 (alternatively referred to herein as CG56955-01) includes the 
4091 nucleotide sequence (SEQ ID NO:277) shown in Table 85 A. A NOV85 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 13-15 and ends with a stop codon 
25 at nucleotides 4078-4080. 



Table 85A. NOV85 Nucleotide Sequence (SEQ ID NO:277) 

ACTACTTGTGGAATGTCACTGCCTCGGGGTATTTCACAAGACAGGTCACCTCTTGTGAAAGTCCGAAGTA 
ATTCTCTGAAAGCTCCTTCCACGCATGTCACAAAACCATCATTTAGCCAGAAATCATTTGTTTCTATGAG 
AGACCAAAGACCAGTAAATCACTTGCATCAGAACAGTCTGTTGAATCAGCAGACATGGGTAAGGACTGAC 
AGTGCCCCCGATCAGCAAGTGGAGACTGGGAAATCCCCCTCTTTATCTGGAGCCTCTGCCAAGCCTGCCC 
CTCAGTCGAGTGAAAACGCTGGTACTTCAGATTTAGAACTACCTGTCAGTCAAAGGAATCAAGATTTAAG 
TTTACAAGAGGCTGAAACTGAGCAATCAGATACTTTAGATAATAAAGAAGCTGTCATCCTAAGGGAAAAA 
CCTCCATCTGGACGCCAGACACCGCAGCCTTTAAGGCATCAGTCTTACATCTTGGCAGTAAATGACCAGG 
AGACCGGGTCAGACACTACCTGCTGGCTGCCCAATGATGCACGTCGAGAGGTCCACATAAAAAGAATGGA 
GGAAAGAAAAGCCTCGAGTACCAGTCCGCCTGGCGATTCTTTGGCTTCCATCCCATTTATAGATGAACCA 
ACTAGCCCTAGCATTGATCATGATATTGCACATATCCCTGCCTCTGCTGTTATATCAGCCTCTACCTCTC 
AGGTCCCCTCCATAGCAACAGTTCCTCCTTGCCTCACAACTTCAGCTCCATTAATTCGCCGTCAGCTCTC 
ACATGACCACGAATCTGTTGGCCCTCCTAGCCTGGATGCTCAGCCCAACTCAAAGACAGAAAGATCAAAA 

TC AT ATG ATG AGGGTC TGG AT GATTAC AG AG AAG ATG C AAAAT TGT CC T TT AAGC ACGT AT CT AGT C TG A 
AGGGAATCAAGATCGCAGACAGCCAAAAGTCATCAGAAGACTCTGGGTCCAGAAAAGATTCTTCCTCAGA 
GGTCTTCAGTGATGCTGCCAAGGAAGGGTGGCTTCATTTCCGACCCCTTGTCACCGATAAGGGCAAGCGA 
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GTTGGTGGAAGTATTCGGCCATGGAAACAGATGTATGTTGTCCTTCGGGGTCATTCACTTTACCTGTACA 

AAGATAAAAGAGAGCAGACGACTCCGTCTGAGGAAGAGCAGCCCATCAGTGTTAATGCTTGCTTGATAGA 

CATCTCTTACAGTGAGACCAAGAGGAAAAATGTGTTTCGACTCACCACGTCCGACTGTGAATGCCTGTTT 

CAGGCTGAAGACAGAGATGATATGCTAGCTTGGATCAAGACGATCCAGGAGAGCAGCAACCTAAACGAAG 

AGG AC ACTGGAGT C ACT AAC AGGGATC T AATT AGT CG AAGAAT AAAAG AAT AC AAC AAT CT GATG AGC AA 

AGCAGAACAGTTGCCAAAAACACCTCGCCAGAGTCTCAGCATCAGGCAAACTTTGCTTGGTGCTAAATCA 

GAGCCAAAGACTCAAAGCCCACACTCTCCGAAGGAAGAGTCGGAAAGGAAACTTCTCAGTAAAGATGATA 

C C AGTCCC C C AAAAGAC AAAGG C AC ATGG AG AAAAGGC AT T CC AAGT AT C ATG AGAAAG AC ATT TGAGAA 

AAAGCCAACTGCTACAGGAACTTTCGGCGTCCGACTAGATGACTGCCCACCAGCTCATACTAATCGGTAT 

ATTCCATTAATAGTTGACATATGTTGCAAATTAGTTGAAGAAAGAGGTCTTGAATATACAGGTATTTATA 

GAGTTCCTGGAAATAATGCAGCCATCTCAAGTATGCAAGAAGAACTCAACAAGGGAATGGCTGATATTGA 

TATACAAGATGATAAATGGCGAGATTTGAATGTGATAAGCAGTTTACTAAAATCCTTCTTCAGAAAACTC 

CCTGAGCCTCTCTTCACAAATGATAAATATGCTGATTTTATTGAAGCCAATCGTAAAGAAGATCCTCTAG 

ATCGTCTGAAAACATTAAAAAGACTAATTCACGATTTGCCTGAACATCATTATGAAACACTTAAGTTCCT 

TTCAGCTCATCTGAAGACAGTGGCAGAAAATTCAGAGAAAAAAAATAAGATGGAACCAAGAAACCTAGCA 

ATAGTGTTTGGTCCCACCCTTGTTCGAACATCAGAAGACAACATGACCCACATGGTCACCCACATGCCTG 

ACCAGTACAAGATTGTAGAAACGCTCATCCAGCACCATGACTGGTTTTTCACAGAAGAAGGTGCTGAAGA 

GCCTCTTACAACAGTGCAGGAGGAAAGCACAGTAGACTCCCAGCCAGTGCCAAACATAGATCATTTACTC 

AC C AAC AT TGG AAGGACAGGAGT CTC C C C AGG AGATGT AT C AG ATT C AGCT AC T AGTGACT CAAC AAAAT 

CTAAGGGTTCTTGGGGATCTGGAAAGGATCAGTATAGCAGGGAACTGCTTGTGTCCTCCATCTTTGCAGC 

TGC T AGT CGCAAG AGG AAG AAGC CG AAAG AAAAAG C ACAGC CT AG C AG CTC AGAAG ATG AACTGGAC AAT 

GTATTTTTTAAGAAAGAAAATGTGGAACAGTGTCACAATGATACTAAAGAGGAGTCCAAAAAAGAAAGTG 

AGAC ACTGGGC AG AAAAC AG AAGAT C ATC ATTGCC AAAGAAAAC AGC ACT AGGAAAGAC CC C AG C ACGAC 

AAAAG ATG AAAAG AT AT CACT AGG AAAAGAGAGC ACGCCT T CTGAAGAACC CT CAC CAC C ACAC AACTC A 

AAACACAACAAGTCACCAACTCTCAGCTGTCGCTTTGCCATCCTGAAAGAGAGCCCCAGGTCACTTCTGG 

CACAGAAGTCCTCCCACCTTGAAGAGACAGGCTCTGACTCTGGCACTTTGCTCAGCACGTCTTCCCAGGC 

CTCCCTGGCAAGGTTTTCCATGAAGAAATCAACCAGTCCAGAAACGAAACATAGCGAGTTTTTGGCCAAC 

GTCAGCACCATCACCTCAGATTATTCCACCACATCGTCTGCTACATACTTGACTAGCCTGGACTCCAGTC 

GACTG AG C CCTGAGGTGC AAT C CGTGG CAGAGAGC AAGGGGG ACG AGG CAG ATGACGAG AG AAGCG AAC T 

CATCAGTGAAGGGCGGCCTGTGGAAACCGACAGCGAGAGCGAGTTTCCCGTGTTCCCCACAGCCTTGACT 

TCAGAGAGGCTTTTCCGAGGAAAACTGCAAGAAGTGACTAAGAGCAGCCGGAGAAATTCTGAAGGAAGTG 

AATTAAGTTGCACCGAGGGAAGTTTAACATCAAGTTTAGATAGCCGGAGACAGCTCTTCAGTTCCCATAA 

AC T CAT CG AATGT G AT ACTCT TTC C AGGAAAAAAT C AGC TAG ATT C AAGT C AG AT AGTGGAAGT C T AGG A 

GATGC CAAGAATG AGAAAG AAGC ACCT TCGTT AAC T AAAGTGT TTGATGTT AT GAAAAAAGGAAAGT C AA 

CTGGGAGTTTACTGACACCCACCAGAGGCGAATCCGAAAAACAGGAACCCACATGGAAAACGAAAATAGC 

AGATCGGTTAAAACTGAGACCCAGAGCCCCTGCGGATGACATGTTTGGAGTAGGGAATCACAAAGTGAAT 

GCCGAGACTGCTAAAAGGAAAAGCATCCGGCGCAGACATACACTAGGAGGGCACAGAGATGCTACCGAAA 

TCAGCGTTTTGAATTTTTGGAAAGTGCATGAGCAGAGCGGGGAGAGAGAATCTGAACTTTCAGCTGTAAA 

CCGGTTAAAACCAAAATGCTCAGCCCAGGACCTTTCCATCTCAGACTGGCTGGCCAGGGAACGCCTACGC 

AC C AGT AC CT C TG AC C TT AGC AGAGG AGAAAT CGG AG AT C CC C AGAC AGAG AAC C C AAG CAC AC GAG AAA 

TAGCCACGACCGACACACCTTTGTCTCTTCATTGCAACACAGGCAGTTCTTCCAGCACCTTGGCTTCAAC 

AAACAGGCCCCTTCTTTCCATACCACCACAGTCACCTGACCAAATAAACGGAGAAAGCTTCCAGAACGTG 

AGCAAAAATGCTAGTTCTGCAGCGAATGCCCAACCTCATAAACTGTCTGAAACCCCAGGCACTAAAGCAG 

AGTTTCATCCCTGTCTTTAAACTGGGGGTAT „ 

A NOV85 polypeptide (SEQ ID NO:278) encoded by SEQ ID NO:277 is 1355 amino 
acids in length and is presented using the one-letter amino acid code in Table 85B. The Psort 
profile for NOV85 predicts that this sequence is likely to be localized to the nucleus with a 
certainty of 0.7000. In alternative embodiments, a NOV85 polypeptide is located to the 
mitochondrial matrix space with a certainty of 0.3600, or to microbodies with a certainty of 
0.3000. 



Table 85B. NOV85 Polypeptide Sequence (SEQ ID 
NO:278) 

MSLPRGISQDRSPLVKVRSNSLKAPSTHVTKPSFSQKSFVSMRDQRPVNHLHQNSLLNQQ 
TWVRTDSAPDQQVETGKSPSLSGASAKPAPQSSENAGTSDLELPVSQRNQDLSLQEAETE 
QSDTIiDNKEAVILREKPPSGRQTPQPLRHQSYILAVNDQETGSDTTCWLPNDARREVHIK 
RMEERKASSTSPPGDSLASIPFIDEPTSPSIDHDIAHIPASAVISASTSQVPSIATVPPC 
LTTSAPLIRRQLSHDHESVGPPSLDAQPNSKTERSKSYDEGLDDYREDAKLSFKHVSSLK 
GIKIADSQKSSEDSGSRKDSSSEVFSDAAKEGWLHFRPLVTDKGKRVGGSIRPWKQMYW 
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Ti ill V IP ; r; 



is it to 



fa i -a ' 



LRGHSLYLYKDKREQTTPSEEEQPISVNACLIDISYSETKRKNVFRLTTSDCECLFQAED 
RDDMLAWIKTIQESSNLNEEDTGVTNRDLISRRIKEYNNLMSKAEQLPKTPRQSLSIRQT 
LLGAKSEPKTQSPHSPKEESERKLLSKDDTSPPKDKGTWRKGIPSIMRKTFEKKPTATGT 
FGVRLDDCPPAHTNRYIPLIVDICCKLVEERGLEYTGIYRVPGNNAAISSMQEELNKGMA 
DIDIQDDKWRDLNVISSLLKSFFRKLPEPLFTNDKYADFIEANRKEDPLDRLKTLKRLIH 
DLPEHHYETLKFLSAHLKTVAENSEKKNKMEPRNLAIVFGPTLVRTSEDNMTHMVTHMPD 
QYKIVETLIQHHDWFFTEEGAEEPLTTVQEESTVDSQPVPNIDHLLTNIGRTGVSPGDVS 
DSATSDSTKSKGSWGSGKDQYSRELLVSS I FAAASRKRKKPKEKAQPSSSEDELDNVFFK 
KENVEQCHNDTKEESKKESETLGRKQKI I IAKENSTRKDPSTTKDEKISLGKESTPSEEP 
SPPHNSKHNKSPTLSCRFAILKESPRSLLAQKSSHLEETGSDSGTLLSTSSQASLARFSM 
KKSTSPETKHSEFLANVSTITSDYSTTSSATYLTSLDSSRLSPEVQSVAESKGDEADDER 
SEIilSEGRPVETDSESEFPVFPTALTSERLFRGKLQEVTKSSRRNSEGSELSCTEGSLTS 
SLDSRRQLFSSHKLIECDTLSRKKSARFKSDSGSLGDAKNEKEAPSLTKVFDVMKKGKST 
GSLLTPTRGESEKQEPTWKTKIADRLKLRPRAPADDMFGVGNHKVNAETAKRKSIRRRHT 
LGGHRDAT E I S VLNF WK VHEQ S G E RE S E L S A VNR L K P KC S AQDL SIS D WL ARE RL RT S T S 
DLSRGEIGDPQTENPSTREIATTDTPLSLHCNTGSSSSTLASTNRPLLSIPPQSPDQING 
ES FQNVSKNASSAANAQPHKLSETPGTKAEFHPCL 



A BLAST analysis of NOV85 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV85 had high 
homology to other proteins as shown in Table 85C. 
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Table 85C. BLASTX results from PatP database for NOV85 










Smallest 










Sum 








High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P (N) 


patp 


AAB97911 


Human G-protein activating protein 


6012 


0 . 0 


patp 


AAB41660 


Human ORFX ORF14 24 polypeptide sequence 


4368 


0.0 


patp 


:AAM93705 


Human polypeptide 


852 


3.9e-84 


patp 


:AAU17101 


Novel signal transduction pathway protein 


618 


6 .3e-59 


patp 


:AAB64387 


Amino acid sequence of human intracellular 


426 


2 . le-38 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 3871 of 3875 bases (99%) identical to a gb:GENBANK- 
ID:AB037845|acc:AB037845.1 mRNA from Homo sapiens (mRNA for KIAA1424 protein). 
10 The full amino acid sequence of the protein of the invention was found to have 1285 of 1287 
amino acid residues (99%) identical to, and 1286 of 1287 amino acid residues (99%) similar 
to, the 1286 amino acid residue ptnr:SPTREMBL-ACC:Q9P2C3 protein from Homo sapiens 
(Human) (KIAA1424 PROTEIN). NOV85 also has homology to the other proteins shown in 
the BLASTP data in Table 85D. 
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Table 85D. NOV85 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%> 


Positive 
(%) 


Expect 


gi | 7243229 | db 
j |BAA92662 . 1 | 
(AB037845) 


KIAA1424 protein 
[Homo sapiens] 


1286 


1285/1287 
(99) 


1286/1287 
(99) 


0 . 0 




gi | 14736381 | r 
ef |XP_038564 . 
1 | (XM 038564) 


hypothetical protein 
XP_038564 
[Homo sapiens] 


1173 


1172/1174 
(99) 


1173/1174 
(99) 


0 . 0 


gi|7959263|db 
j | BAA96025 . 1 | 
(AB040934) 


KIAA1501 protein 
[ Homo s api en s ] 


735 


331/685 
(48) 


433/685 
(62) 


e-148 


gi | 14748447 | r 
ef |XP_041405 . 
1 | (XM_041405) 


hypothetical protein 
XP_0414 05 
[Homo sapiens] 


304 


277/284 
(97) 


279/284 
(97) 


e-135 


gi | 12856667 | d 
bj |BAB30742. 1 
| (AK017433) 


KIAA14 24 PROTEIN 
(FRAGMENT) -putative [Mus 
musculus] 


385 


301/384 
(78) 


328/384 
(83) 


e-127 



This BLASTP data is displayed graphically in the ClustalW in Table 85E. A multiple 
sequence alignment is given, with the NOV85 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
5 Table 85D. 



Table 85E. ClustalW Alignment of NOV85 



NOV8 5 (SEQ ID NO: 27 8) 

gi|7243229| (SEQ ID NO:724) 

gi|l473638l| (SEQ ID NO:725) 

gi|7959263| (SEQ ID NO:726) 

gi | 14748447 | (SEQ ID NO: 727) 

gi | 12856667 j (SEQ ID NO:728) 



0 40 50 



NOV8 5 

gi | 7243229 | 
gi | 14736381 | 
gi | 7959263 | 
gi | 14748447 | 
gi j 12856667 j 



NOV8 5 

gi | 7243229 | 
gi|l473638l| 
gi | 7959263 | 
gi j 14748447 | 
gi | 12856667 | 



NOV8 5 

gi|7243229 | 
gi | 14736381 | 
gi j 7959263 | 
gi | 14748447 | 
gi | 12856667 | 



NOV85 

gi | 7243229 | 
gi | 14736381 | 
gi | 7959263 | 
gi | 14748447 | 
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PDQQVETGKSPSLSGASAKPAPQSSENAGTSDLELPVSQRSQDLSLQEAE 



60 70 80 90 100 
I I I I I I I I i [ 

TEQSDTLDNKEAVILREKPPSGRQTPQPLRHQSYILAVNDQETGSDTTCW 



110 120 130 140 150 

LPNDARREVHIKRMEERKASSTSPPGDSLASIPFIDEPTSPSIDHDIAHI 



160 170 180 190 200 

PASAVISASTSQVPSIATVPPCLTTSAPLIRRQLSHDHESVGPPSLDAQP 



i! ii fi 



gi|l2856667| 

210 220 230 240 250 

....|....|....|....|....|...-|.---|...-|....|.--.| 

NOV85 MSLPRGISQDRSP 

gi | 7243 229 | NSKTERSKSYDEGLDDYREDAKLSFKHVSSLKGIKIADSQKSSEDSGSRK 

gi|l473638l| 

gi|7959263| 

gi | 14748447 | 

gijl2856667| 

260 270 280 290 300 

. ... | .... | .... | .... | .... | .... | ... | .... I I .... I 

NOV8 5 LVKVRSNSLKAPSTHVTKPSFSQKSFVSMRDQRPVNHLHQNSLLNQQTWV 
gi | 7243229 | DSSS E V F S DAAKE G WLH FR P L VTD KGKR VGG S I RP W KQM Y WLRGH S L Y L 

gi|l473638l| 

gi|7959263| 

gi j 14748447 | 

gi|l2856667| 

310 320 330 340 350 

NOV8 5 RTDS AP DQQ VETGK S P S L S GAS AKP AP Q S S ENAGT S DL E L P VS QRNQDL S 

gi | 7243229 | YKDKREQTTPSEEEQPISVNACLIDISYSETKRKNVFRLTTSDCECLFQA 

gi|l473638l| 

gi|7959263| 

gi | 14748447 | 

gi|l2856667| 

360 370 380 390 400 

....|....|....|....|....|....|....|....|...-|-.-.| 
NOV8 5 LQEAETEQSDTLDNKEAVILREKPPSGRQTPQPLRHQSYILAVNDQETGS 
gi|7243229| EDRDDMLAWIKTIQESSNLNEEDTGVTNRDLISRRIKEYNNLMSKAEQLP 

gi|l473638l| 

gi | 7959263 | 

gi j 14748447 | 

gi|l2856667| 

410 420 430 440 450 

NOV8 5 DTTCWLPNDARREVHIKRMEERKASSTSPPGDSLASIPFIDEPTSPSIDH 

gi | 7243229 | KTPRQSLSIRQTLLGAKSEPKTQSPHSPKEESERKLLSKDDTSPPKDKGT 

gi | 14736381 | MEERKASSTSPPGDSLASIPFIDEPTSPSIDH 

gi | 7959263 | 

gi|l4748447| 

gi | 12856667 j 

460 470 480 490 500 

....|....|....|....|....|....|....|....|....|-...| 
NOV85 DIAHIPASAVISASTSQVPSIATVPPCLTTSAPLIRRQLSHDHESVGPPS 
gi | 7243229 | WRKGIPSIMRKTFEKKPTATGTFGVRLDDCPPAHTNRYIPLIVDICCKLV 
gi | 14736381 | DIAHIPASAVISASTSQVPSIATVPPCLTTSAPLIRRQLSHDHESVGPPS 

gi|7959263| 

gi j 14748447 | 

gi | 12856667 | 

510 520 530 540 550 

....|....|....|....|....|....|....|....|..-.|...-| 
NOV8 5 LDAQPNSKTERSKSYDEGLDDYREDAKLSFKHVSSLKGIKIADSQKSSED 
gi | 7243 22 9 | EERGLEYTGIYRVPGNNAAISSMQEELNKGMADIDIQDDKWRDLNVISSL 
gi|l473638l| LDAQPNSKTERSKSYDEGLDDYREDAKLSFKHVSSLKGIKIADSQKSSED 

gi|7959263| 

gi | 14748447 | 

gi|l2856667| 

560 570 580 590 600 

....|....|....|....|....|....|....|. ...|....|....| 
NOV8 5 SGS R KD S S S E VF S D AAKE GWLH FR P L VTDKG KRVGG S I RPWKQMYWLRG 

gi|7243229| LKSFFRKLPEPLFTNDKYADFIEANRKEDPLDRLKTLKRLIHDLPEHHYE 
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gi | 14736381 | 
gi | 7959263 | 
gi j 14748447 | 
gi j 12856667 j 



NOV8 5 

gi | 7243229 | 
gi | 14736381 | 
gi | 7959263 | 
gi | 14748447 | 
gi | 12856667 j 



SGSRKDSSSEVFSDAAKEGWLHFRPLVTDKGKRVGGSIRPWKQMYVVLRG 



NOV85 

gi 
gi 
gi 
gi 
gi 



7243229 | 
1473638l| 
7959263 | 
14748447 | 
12856667 | 



NOV85 

gi | 7243229 | 
gi j 14736381 | 
gi | 7959263 | 
gi | 14748447 | 
gi | 12856667 | 



NOV8 5 

gi | 7243229 | 
gi j 14736381 | 
gi j 7959263 | 
gi | 14748447 | 
gi j 12856667 | 



NOV8 5 

gi | 7243229 | 
gi j 14736381 | 
gi | 7959263 | 
gi | 14748447 | 
gi j 12856667 j 



NOV8 5 

gi | 7243229 | 
gi | 14736381 | 
gi j 7959263 | 
gi'j 14748447 | 
gi j 12856667 j 



NOV85 

gi | 7243229 | 
gi j 14736381 | 
gi j 7959263 | 
gi | 14748447 | 
gi j 12856667 j 



650 



610 620 630 640 

....|....|....|....|....|....|....|...-|....|....| 
HSLYLYKDKREQTTPSEEEQPISVNACLIDISYSETKRKNVFRLTTSDCE 
TLKFLS7AHLKTV7A£NSEKNKMEPRNLAIVFGPTLVRTSEDNMTHMVTHMP 
HSLYLYKDKREQTTPSEEEQPISVNACLIDISYSETKRKNVFRLTTSDCE 



660 



700 



670 680 690 

CLFQAE DRDDML AW IKTIQES SNLNEEDTGVTNRDL I S RR I KEYNNLM S K 
DQYKIVETLIQHHDWFFTEEGAEEPLTTVQEESTVDSQPVPNIDHLLTNI 
CL FQ AE DRDDML AW IKTIQES SNLNE EDTG VTNRDL ISRRI KE YNNLMS K 



710 720 730 740 750 

....|....|....|....|....|.---|....|....|-...|.-.-| 
AEQLPKTPRQSLSIRQTLLGAKSEPKTQSPHSPKEESERKLLSKDDTSPP 
GRTGVSPGDVSDSATSDSTKSKGSWGSGKDQYSRELLVSSIFAAASRKRK 
AEQLPKTPRQSLSIRQTLLGAKSEPKTQSPHSPKEESERKLLSKDDTSPP 
GLDDLG 



760 



770 780 790 800 

....|....|....|....|....|....|....|....|....|....| 

KDKGTWRKGIPSIMRKTFEKKPTATGTFGVRLDDCPPAHTNRYIPLIVDI 
KPKEKAQPSSSEDELDNVFFKKENVEQCHNDTKEESKKESETLGRKQKII 
KDKGTWRKGI PS IMRKTFEKKPTATGTFGVRLDDCPPAHTNRYI PLIVDI 
Y I GYRS Y S PS FQRRTGLLHALS FRDS P FGGLPTFNLAQS PAS FP PE AS E P 



810 



850 



820 830 840 

....|....|.-..|..-.|.---|... (L |....|...-|...-|-...| 
CCKLVEERGLEYTGIYRVPGNNAAISSMQEELNKGMADIDIQDDKWRDLN 
IAKENSTRKDPSTTKDEKISLGKESTPSgEPSPPHNSKHNKSPTLSCRFA 
CCKLVEERGLEYTGIYRVPGNNAAISSMgEELNKGMADIDIQDDKWRDLN 
PRWRPEPSTRALEPPAEDRGDEWLRQ^PPTGRKVQLTPARQMNTLGFGD 



860 



900 



870 880 890 

....|....|....|....|....|...J....|....|....|...-| 
VISSLLKHFFR^LPEPLFTNDKYADFIEpNRKEDPLDRLKTLKRLIHDLP 
ILKESPR sLLAgKSSHLEETGSDSGTLLiTSSQASLARFSMKKSTSPETK 

vissllkSffr^peplftndkyadfieI^^^^'"^ " r T 1 ^ r VT1T T ljr,r 1 

ES PE PE A@GRG§RLGRKVAPLATTEDSl| 



RKEDPLDRLKTLKRLIHDLP 
IPFIDEPTSPSI DLQAKHVP 



920 



930 



910 

I | I J - • • • I ■ 

EHH|E|LKFLiAHLKlVAE^EKKNKj 
HSE^LANVSTITSDYSTTSS^TYLTSj 
EH^ElLKFLp!AHLK^VAE|?SEK-NKj 
AS Aw|s AMNSAP VL'GT S P§^PTFT FTLGRj 
1 p|PGj 

- -e2l§iagt5tsdy§tts§3tylts^ss: 




940 950 

I | I I 

llVFGPTLVR T 

;PEVQSVAESKGD E 

IVFGPTLVR T 

gQDC SSI KAGRR S S YLL A 

^ IVFGPTLVR T 

'PEVQSVAESKGD E 



960 



970 



980 



990 



1000 
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m 



NOV8 5 

gi | 7243229 | 
gi | 14736381 | 
gi j 7959263 | 
gi | 14748447 | 
gi j 12856667 j 



NOV8 5 

gi | 7243229 | 
gi | 14736381 | 
gi|7959263| 
gi j 14748447 | 
gi | 12856667 j 



NOV8 5 

gi | 7243229 | 
gi | 14736381 | 
gi | 7959263 | 
gi j 14748447 | 
gi | 12856667 j 



NOV8 5 

gi | 7243229 | 
gi j 14736381 | 
gi | 7959263 | 
gi | 14748447 | 
gi | 12856667 j 



NOV 8 5 

gi | 7243229 | 
gi | 14736381 | 
gi j 7959263 | 
gi j 14748447 | 
gi | 12856667 j 



NOV 8 5 

gi | 7243229 | 
gi|l473638l| 
gi | 7959263 | 
gi j 14748447 | 
gi|l2856667| 



NOV85 

gi | 7243229 | 
gi j 14736381) 
gi j 7959263 | 
gi j 14748447 | 
gi | 12856667 j 



NOV85 

gi | 7243229 | 
gi j 14736381 1 
gi j 7959263 | 
gi | 14748447 | 



INI : - -HMPD^yJS^eG 

Se rSeSj^eg r p ve td s e se?|p]8 f pE 
InySl HMPD ? pfli:y 5 EE 

I TTER jKSCDDGIJtfTFRDEGRVLfeilPNS 
§EfflNM"rfHfiBS HMPDQYKfVEjj" 



|EGRPLETDSESgjP§FP| 



2HHD 1 
iT S ERLHRGKLi 1 

}HHDW§- 
JPSLRMLRSF 

dTSDRLHRG: 



1010 




!sqp\; 



CTEGSLTS8L HsRRQLFSSHKfilEC BTLSRK 



®TVO] 

SegsL _,. 

S tv Qh e S^ v S sqpv B ni 

|8EDA|JAP f £jKRHSTSDLS 

ywofc^vSsQPv^fiiS 

lEGSjT " 



1020 1030 
[LLTO0GjaTS3S PG HVSDjgA* 




rSRRQQFSSHRfllEcSTLSRKKgAR 



1060 



1070 



1080 



I 



KPKEKAQg 



JWGSgKDgYSiagLLVSSl I 

@gslEd7a&ne5§ap s lt'kv fdvm | 

^GS^DlYSISLLV^IF^gSHKB. _ 

K P VY A AT . RURS L S L SlKE R^Sp G P^AAG^^B^GAG^DETAAP VC I G§C L V B3f 
HwGSgKD|YsfflLLVS$I F^^RjaRgKPKEKAC " 

^sp^tMteIStpalMkmfdv^^ 




fepGgVSD^A 
'LYYKQI L|[KKGKKA< 

s pgSvsdSatBdst! 



KPKEKAQ 
STGSLLT 



1110 



1120 



1130 



1140 




I 



_ jgCHNDTgEi|SKKi$i^LGygQKI0I0 
glAl^|LKLRPRAPADDMFGVGNH aVNAETgSR 
^p§CHNDTlE^>KiS^LG|HQKiniHSE] 
fe^LTTAD FCgYL F^AgDRDDMLG WMRgl RENj$ 
_^CHNDTlrES§KKfeSH T LG|raQKI~ 
WlA^LKLRp|APgDDMFGVGN|SpTAET! 




1170 



HRU 



1160 

iek^3lg{9estp| 

ISV|iJ$FW HVHE< 

stJ^dekBSlg gEST 
^gcanqal|Bkklndyrkvs 
!Mst^dek H8 lg{9estpBe§p1 

HRDAipi svISfwHaheobaHk: 




1180 1190 1200 

I — J- — L — I 1 

jP@PPHgKaKS^LSc|- F|JjLKES 

Jel^avj|rlkHkcSa^dl|Bsdw: 

PPHteK^KSg|L@Cl-F^0LKESP| 

pkSd*s!pkg|rglgglkse flkqs J| 

PHNSEfegjKSgSLBcg- FgHLKDSPr 
L§^viiRLKHKC@AlDL|aSDW: 



1210 

. i -.l^ 1 l----l 



1220 




1260 



1270 



1280 



1290 



I 



I 



I 



I 



I 



I 



1300 
■■I 



peHkhseflanvstitsdysttssatyltsldssrlspevqsvaeskgde 

ASMNgP 

peBkhseflanvstitsdysttssatyltsldssrlspevqsvaeskgde 
atenQrvplivaaccrivearg lestgiyrvpgnnawsslqeql 




1310 



1320 



1330 



1340 



I 



I 



adderselisegrpvetdsesefpvfptaltsef 

adderselisegrpvetdsesefpvfptaltse: 
nrgpgdiniiqderwqdlnvissllksffrklpep 

R^ 



|RGKLQEVTKS SjRfj 
jSIP PQSP 

|rgklqevtkss| 
■ddkyndfieanSi 




687 



gi | 12856667 | 



NOV85 

gi | 7243229 | 
gi | 14736381 | 
gi (7959263 | 
gi j 14748447 | 
gi j 12856667 | 



NOV8 5 

gi | 7243229 | 
gi j 14736381 | 
gi j 7959263 | 
gi j 14748447 | 
gi | 12856667 j 



NOV8 5 

gi | 7243229 | 
gi | 14736381 | 
gi | 7959263 | 
gi j 14748447 | 
gi j 12856667 j 



NOV85 

gi | 7243229 | 
gi j 14736381 1 
gi | 7959263 | 
gi j 14748447 | 
gi | 12856667 j 



NOV8 5 

gi | 7243229 | 
gi j 14736381 | 
gi j 7959263 | 
gi j 14748447 | 
gi|l2856667| 



NOV85 

gi | 7243229 | 
gi j 14736381 | 
gi | 7959263 | 
gi | 14748447 | 
gi | 12856667 | 



-pIBtp- 



1360 



1370 



1380 



1390 



1400 



• I • -iJ. — I • i -.i- ■ I -i- L^-J — 1 — 1 — 1 

SEG S^§CTEGSLTHSLDgRRQL||SS ^gl |CDTLSRKKSARFKSDS 

D ^l3GESFQNvSK||ABsA^AQp||gJs gTPG 

SEG Sg3^CTEGSLT§SLD§RRQLHsS ^M l§CDTLSRKKSARFKSDS 

EDARERMRTLRKL I RDLPGMy YETL kBl VGHgKT I ADHS E KNKME P RNLA 

5 jR--lH--BK£vcc- 



D pffRESFQ] 



tanih: 



srasPD- 



1410 



1420 



1430 



1440 



1450 



I 



I 



GSLGDAKNEKEAPSLTP 



F^ 



I 



I 



gVMKKGKSTGSLLTPTRGESEKQEPTWKTKI 

TKABFHPCL 

GSLGDAKNEKEAPSLTKgFgVMKKGKSTGSLLTPTRGESEKQEPTWKTKI 
LVFGPTLVRTSEDNMTDkvTHMPDRYKIVETLIQHSDWFFSDEEDKGERT 



C TKAETPP 



1460 



1470 



1480 



1490 



1500 



ADRLKLRPRAPADDMFGVGNHKVNAETAKRKS I RRRHTLGGHRDATEI S V 

ADRLKL R PRAP ADDM FG VGNHK VNAET AKR KS I RRRHT LGGHRDATE I S V 
PVGDKEPQAVPNIEYLLPNIGRTVPPGDPGSADLLEI 



1550 



1510 1520 1530 1540 

....|....|....|....|....|.-..|....|....|....|-.--| 
LNFWKVHEQSGERESELSAVNRLKPKCSAQDLS I SDWLARERLRTSTSDL 

LNFWKVHEQSGERESELSAVNRLKPKCSAQDLSISDWLARERLRTSTSDL 



1560 1570 1580 1590 1600 

....|....|....|....|....|....|.-..|.-..|....|....| 
SRGEIGDPQTENPSTREIATTDTPLSLHCNTGSSSSTLASTNRPLLSIPP 

SRGEIGDPQTENPSTREIATTDTPLSLHCNTGSSSSTLASTNRPLLSIPP 



1610 1620 1630 1640 

....|....|....|....|....|....|....|....|... 

QSPDQINGESFQNVSKNASSAANAQPHKLSETPGTKAEFHPCL 
QSPDQINGESFQNVSKNASSAANAQPHKLSETPGSKAEFHPCL 



Table 85F lists the domain description from DOMAIN analysis results against NOV85. 
This indicates that the NOV85 sequence has properties similar to those of other proteins 
known to contain this domain. 
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Table 85F. Domain Analysis of NOV85 


enllSm 


artlsmart00324, RhoGAP, GTPase-activator protein for Rho-like GTPases; GTPase activator proteins 


towards Rho/Rac/Cdc42-like small GTPases. SEQ ID NO : 87 8 






Score = 


CD-Length = 175 residues, 100.0% aligned 
169 bits (428), Expect = 9e-43 




N0V8 5 
Sbjct 


: 555 
: 1 


RYIPLIVDICCKLVEERGLEYTGIYRVPGNNAAISSMQEELNKGMADIDIQDDKWRDLNV 
+ IP+IV+ C + +E+RGL+ GIYR G+ + + + +E + G D D++ 
KPIPIIVEKCIEYLEKRGLDTEGIYRKSGSASRVKELREAFDSGPDPD- -LDLSEYDVHD 


614 
58 


NOV8 5 
Sbjct 


: 615 
: 59 


ISSLLKSFFRKLPEPLFTNDKYADFIEANRKEDPLDRLKTLKRLIHDLPEHHYETLKFLS 
+ + LLK F R+LPEPL T + Y +FIEA + ED +RL+ L+ L+ LP + TL++L 
VAGLLKLFLRELPEPLITFELYEEFIEAAKLEDEEERLRALRELLSLLPPANRATLRYLL 


674 
118 


NOV85 
Sbjct 


: 675 
: 119 


AHLKTVAENSEKKNKMEPRNLAIVFGPTLVRTSEDNMTHMVTHMPDQYKIVETLIQHHD 
AHL VAE+SE +NKM RNLAIVFGPTL+R + + + Q K+VE LI++ D 
AHLNRVAEHSE - ENKMTARNLAI VFGPTLLRPPDGESA- SLKDIRHQNKWEFLIENAD 


733 
175 



Rho GTPases control a variety of cellular processes. There are three subtypes of Rho 
GTPases in the Ras superfamily of small G proteins: RHO, RAC, and CDC42. GTPase- 
activating proteins (GAPs) bind activated forms of Rho GTPases and stimulate GTP 
5 hydrolysis. Through this catalytic function, Rho GAPs negatively regulate Rho-mediated 

signals. GAPs may also serve as effector molecules and play a role in signaling downstream of 
Rho and other Ras-like GTPases. 

By screening a Jurkat cDNA library using a yeast 2-hybrid system with an activated 
form of RAC as bait, followed by screening a placenta cDNA library, Toure et al. (1998) 

10 isolated a cDNA encoding RACGAP1, which they called MGCRACGAP. The predicted 527- 
amino acid RACGAP1 protein has a large N-terminal region containing a protein kinase C- 
% like cysteine-rich motif. RACGAP1 shares highest homology with the Drosophila RnRacGAP 
and the chimerins of rat and human. Functional analysis showed that the GAP domain of 
RACGAP1 exhibits strong GAP activity towards CDC42, RAC1, and RAC2. Northern blot 

15 analysis detected an approximately 3.2-kb RACGAP1 transcript that was most abundantly 

expressed in testis, with low expression in most other tissues. Western blot analysis detected a 
RACGAP1 protein of 58 kD in testis extracts. In situ hybridization showed that RACGAP1 
expression is restricted to germ cells in mature testis. Human breakpoint cluster region (bcr) 
gene product is a member of a group of GTPase-activating proteins that act exclusively on 

20 members of the Ras-related Rho subfamily. A complementary DNA was isolated from 

Caenorhabditis elegans that encoded a polypeptide of 1438 amino acid residues, CeGAP, 
which contains a domain with sequence similarity to the COOH-terminal segment (GTPase- 
activating protein region) of Bcr and other known GTPase-activating proteins of the Rho 
subfamily. It also contains a "pleckstrin homology" motif, present in many signaling proteins 
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including GTPase-activating proteins and nucleotide exchange factors. The Bcr-like domain of 
CeGAP exhibited activity not only on members of the C. elegans and human Rho subfamily 
but surprisingly also on C. elegans Ras protein (let-60), human Ras, and Rab3A. 

CeGAP is therefore the first GTPase-activating protein acting on Ras-related proteins 
5 across different subfamilies. Together with the presence of the pleckstrin homology motif, this 
suggests a central and integrative role for CeGAP in a signaling pathway common to Ras and 
related proteins. 

NOV85 is predicted to be expressed in at least the following tissues: pancreas, 
stomach, brain, bone. This information was derived by determining the tissue sources of the 

10 sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV85 is provided in Example 2. 

The NOV85 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Alzheimer's 

15 disease, stroke, tuberous sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, 
cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, 
leukodystrophies, behavioral disorders, addiction, anxiety, pain, neuroprotection, 
hypercalceimia, ulcers, diabetes, Von Hippel-Lindau (VHL) syndrome, pancreatitis, obesity as 
well as other diseases, disorders and conditions. NOV85 nucleic acids encoding the CeGAP- 

20 like protein of the invention, or fragments thereof, may further be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein are to be 
assessed. 

The novel nucleic acid of the invention encoding a GTPase activating protein-like 
protein includes the nucleic acid whose sequence is provided in Table 85A, or a fragment 

25 thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 85 A while still encoding a protein 
that maintains its GTPase activating protein-like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to the sequence of Table 85 A, including nucleic acid fragments that are 

30 complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
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in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1% of the bases may 
be so changed. 

The novel protein of the invention includes the GTPase activating protein-like protein 
whose sequence is provided in Table 85B. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
85B while still encoding a protein that maintains its GTPase activating protein-like activities 
and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 
up to about 1% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV86 

The disclosed NOV86 (alternatively referred to herein as CG56957-01) includes the 
3451 nucleotide sequence (SEQ ID NO:279) shown in Table 86A. A NOV86 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 35-37 and ends with a stop codon 
at nucleotides 3442-3445. 



Table 86A. NOV86 Nucleotide Sequence (SEQ ID NO:279) 

CCCGCCCCCTCGGGCTCCCGGCCGGGGCCCCATCATGTTCTCCAGGAAGAAACGAGAGCTCATGAAAACC 
CCTTCCATCTCGAAAAAGAACCGCGCGGGAAGCCCCAGCCCGCAGCCCTCGGGGGAGCTGCCCAGGAAGG 
ATGGGGCTGACGCGGTGTTCCCCGGACCAAGCCTGGAGCCGCCCGCTGGGTCCTCCGGCGTCAAGGCCAC 
AGGGACCCTCAAGCGGCCCACCAGCCTGAGCCGCCACGCCAGCGCGGCTGGCTTCCCCCTGTCGGGTGCT 
GCCTCCTGGACACTGGGCCGGAGCCACCGGAGCCCACTGACAGCCGCCAGCCCGGGCGAGCTGCCCACCG 
AGGGTGCCGGCCCGGACGTCGTCGAGGACATCTCCCATCTGCTGGCGGACGTGGCCCGCTTCGCTGAGGG 
CCTTGAGAAACTTAAGGAGTGTGTGTTGCGTGACGACCTCCTTGAGGCCCGCCGCCCGCGGGCCCACGAG 
TGCCTGGGTGAGGCTCTGCGTGTCATGCATCAGATCATCTCCAAGTACCCGCTGCTGAACACCGTGGAGA 
CGCTCACCGCAGCCGGCACCCTCATTGCCAAGGTCAAAGCCTTCCATTATGAGAGCAACAATGATCTGGA 
GAAACAGGAGTTCGAGAAGGCCCTGGAGACGATTGCTGTGGCCTTCAGTAGCACAGTGTCCGAGTTCCTC 
ATGGGTGAAGTGGACAGCAGCACCCTCCTAGCAGTGCCTCCTGGGGACTCGAGCCAGTCCATGGAAAGCC 
TGTATGGACCGGGCAGTGAGGGCACGCCTCCCAGCCTGGATGACTGTGACGCCGGCTGCCTGCCCGCCGA 
GGAGGTGGACGTGCTGCTACAGCGCTGTGAGGGGGGCGTGGATGCCGCACTGCTGTATGCCAAGAACATG 
GCCAAGTACATGAAGGACCTCATCAGCTACCTGGAGAAGCGGACGACGCTGGAGATGGAGTTTGCCAAGG 
GCCTGCAGAAGATCGCTCACAACTGCAGACAGAGCGTCATGCAGGAGCCCCACATGCCGCTCCTGTCCAT 
CTACTCGCTGGCCCTGGAGCAGGACCTGGAGTTCGGCCACAGCATGGTGCAGGCGGTGGGCACCTTGCAG 
ACCCAGACCTTCATGCAGCCCCTGACCCTGCGGCGGCTTGAACACGAGAAGCGCAGGAAGGAGATCAAGG 
AGGCCTGGCACCGTGCCCAGAGGAAGCTGCAAGAGGCGGAGTCCAACCTGCGCAAGGCCAAGCAGGGTTA 
CGTGCAGCGCTGCGAGGACCACGACAAGGCTCGCTTCCTCGTGGCCAAGGCGGAGGAGGAGCAGGCTGGC 
AGCGCGCCGGGAGCAGGCGGCACGGCCACCAAGACCCTGGACAAGCGGCGGCGGCTGGAGGAGGAGGCCA 
AGAACAAGGCGGAGGAAGCTATGGCCACCTACCGCACCTGCGTGGCCGACGCGAAGACGCAGAAGCAGGA 
GCTGGAGGATACCAAGGTGACGGCGCTGCGGCAGATCCAGGAGGTCATCCGGCAGAGCGACCAAACCATC 
AAGTCGGCCACGATCTCCTACTACCAGATGATGCATATGCAGACGGCGCCGCTGCCCGTGCACTTCCAGA 
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TGCTGTGTGAGAGCAGCAAGCTGTATGACCCAGGCCAGCAGTACGCCTCCCACGTGCGCCAGCTGCAGCG 
GGACCAGGAGCCCGATGTGCACTACGACTTTGAGCCCCACGTCTCCGCCAACGCCTGGTCCCCCGTCATG 
CGTGCCCGGAAGAGCAGCTTCAACGTGAGTGATGTGGCGCGGCCCGAGGCTGCCGGGAGCCCCCCAGAAG 
AAGGCGGGTGCACTGAGGGCACACCTGCCAAGGACCACAGGGCCGGGCGAGGACACCAGGTTCACAAGTC 
ATGGCCGCTCTCGATCTCAGACTCGGACAGTGGGCTGGACCCCGGCCCTGGCGCAGGGGACTTTAAGAAG 
TTCGAGCGGACGTCATCCAGTGGTACCATGTCGTCCACGGAGGAGCTGGTGGACCCAGACGGTGGAGCCG 
GGGCTTCAGCCTTTGAGCAGGCTGACCTCAACGGCATGACCCCCGAGCTGCCGGTGGCCGTGCCCAGTGG 
ACCGTTCCGCCACGAGGGGCTGTCCAAGGCGGCCCGTACTCACCGGCTCCGGAAGCTCCGCACGCCCGCC 
AAGTGCCGCGAGTGCAACAGCTACGTCTACTTCCAGGGTGCTGAGTGTGAAGAGTGCTGCCTGGCCTGCC 
ACAAGAAATGTCTGGAGACGCTGGCCATACAGTGCGGGCACAAGAAGCTGCAGGGCCGCCTGCAGCTGTT 
CGGCCAGGACTTCAGCCACGCGGCCCGCAGCGCCCCCGACGGCGTGCCCTTCATCGTCAAGAAGTGCGTC 
TGCGAGATCGAGCGGCGGGCGCTGCGCACCAAGGGCATCTACCGGGTCAATGGGGTAAAGACACGCGTGG 
AGAAGCTGTGCCAGGCCTTCGAGAACGGCAAGGAGCTGGTCGAGCTGTCGCAGGCCTCGCCCCACGACAT 
CAGCAACGTCCTCAAGCTCTACCTGCGTCAGCTTCCCGAGCCGCTCATCTCCTTCCGCCTCTACCACGAG 
CTCGTAGGGCTGGCCAAGGACAGCCTGAAGGCAGAGGCCGAGGCCAAGGCGGCGTCCCGGGGCCGGCAGG 
ACGGCTCGGAGAGCGAGGCAGTGGCGGTGGCCCTGGCAGGTCGGCTGCGGGAGCTCCTGCGGGACCTGCC 
GCCTGAGAACCGGGCCTCGCTGCAGTACCTGCTGCGTCACCTACGCAGGATCGTGGAGGTGGAGCAGGAC 
AACAAGATGACCCCCGGGAACCTGGGCATCGTGTTCGGGCCCACGCTGCTTCGGCCACGGCCCACCGAGG 
CCACCGTGTCCCTCTCCTCCCTGGTGGATTATCCCCATCAGGCCCGCGTCATCGAGACTCTCATCGTCCA 
CTACGGCCTGGTCTTCGAGGAGGAGCCGGAGGAGACCCCCGGGGGCCAGGACGAGTCATCCAACCAGCGA 
GCTGAGGTAGTCGTCCAGGTGCCGTACCTGGAGGCGGGCGAGGCGGTGGTCTACCCGCTGCAGGAGGCGG 
CGGCGGACGGGTGCAGAGAATCCCGAGTTGTGTCCAACGATTCGGACTCGGACCTAGAGGAGGCCTCCGA 
GCTGCTGTCCTCATCGGAGGCCAGTGCCCTGGGCCACCTCAGCTTCCTGGAGCAGCAGCAGAGCGAGGCC 
AGCCTAGAGGTGGCTTCTGGCAGCCACAGCGGCAGTGAGGAGCAGCTGGAGGCCACAGCCCGGGAGGACG 
GGGACGGGGACGAGGACGGCCCGGCCCAGCAGCTCTCAGGATTCAACACCAACCAGTCCAACAACGTGCT 
GCAGGCCCCACTGCCCCCCATGAGGCTCCGTGGCGGGCGGATGACACTGGGCTCCTGCAGGGAAAGGCAG 
CCGGAATT CGTGTGAGCTGGG „ _ 

A NOV86 polypeptide (SEQ ID NO:280) encoded by SEQ ID NO:279 is 1 136 amino 
acids in length and is presented using the one-letter amino acid code in Table 86B. The Psort 
profile for NOV86 predicts that this sequence is likely to be localized to the nucelus with a 
5 certainty of 0.9800. 



Table 86B. NOV86 Polypeptide Sequence (SEQ ID 
NO:280) 

MFSRKKRELMKTPSISKKNRAGSPSPQPSGELPRKDGADAVFPGPSLEPPAGSSGVKATG 
TLKRPTSLSRHASAAGFPLSGAASWTLGRSHRSPLTAASPGELPTEGAGPDVVEDISHLL 
AD VAR F AEGLE KL KE C VL RDDLLE ARRP RAHE C LGE AL R VMH Q 1 1 SKYPLLNTVETLTAA 
GTLIAKVKAFHYESNNDLEKQEFEKALETIAVAFSSTVSEFLMGEVDSSTLLAVPPGDSS 
QSMESLYGPGSEGTPPSLDDCDAGCLPAEEVDVLLQRCEGGVDAALLYAKNMAKYMKDLI 
SYLEKRTTLEMEFAKGLQKIAHNCRQSVMQEPHMPLLSIYSLALEQDLEFGHSMVQAVGT 
LQTQTFMQPLTLRRLEHEKRRKEIKEAWHRAQRKLQEAESNLRKAKQGYVQRCEDHDKAR 
FLVAKAEEEQAGSAPGAGGTATKTLDKRRRLEEEAKNKAEEAMATYRTCVADAKTQKQEL 
EDTKVTALRQIQEVIRQSDQTIKSATISYYQMMHMQTAPLPVHFQMLCESSKLYDPGQQY 
ASHVRQLQRDQE PDVHYDFEPHVS ANAWS PVMRARKS S FNVS DVARPEAAGS P PEEGGCT 
EGTPAKDHRAGRGHQVHKSWPLSISDSDSGLDPGPGAGDFKKFERTSSSGTMSSTEELVD 
PDGGAGASAFEQADLNGMTPELPVAVPSGPFRHEGLSKAARTHRLRKLRTPAKCRECNSY 
VYFQGAECEECCLACHKKCLETLAIQCGHKKLQGRLQLFGQDFSHAARSAPDGVPFIVKK 
CVCEIERRALRTKGIYRVNGVKTRVEKLCQAFENGKELVELSQASPHDISNVLKLYLRQL 
PE PL I S FRL YHEIj VGLAKDS L KAE AE AKAAS RGRQDG S E S E AVAV AL AGRLRE LL RDL P P 
ENRASLQYLLRHLRRIVEVEQDNKMTPGNLGIVFGPTLLRPRPTEATVSLSSLVDYPHQA 
RVIETLIVHYGLVFEEEPEETPGGQDESSNQRAEVWQVPYLEAGEAWYPLQEAAADGC 
RESRWSNDSDSDLEEASELLSSSEASALGHLSFLEQQQSEASLEVASGSHSGSEEQLEA 
TAREDGDGDEDGPAQQLSGFNTNQSNNVLQAPLPPMRLRGGRMTLGSCRERQPEFV 



A BLAST analysis of NOV86 was run against the proprietary PatP GENE SEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV86 had high 
10 homology to other proteins as shown in Table 86C. 
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Table 86C. BLASTX results from PatP database for NOV86 










Smallest 










Sum 








High 


Probability 


Sequences producing High- scoring Segment Pairs: 


Score 


P (N) 


patp 


AAU17313 


Novel signal transduction pathway protein 


2855 


5 -7e-299 


patp 


AAW75995 


GTPase activating protein (GAP) , PARG 


1497 


2.9e-153 


patp 


AAY90268 


Human GTP-ase activating polypeptide PARG 


1497 


2.9e-153 


patp 


AAU17459 


Novel signal transduction pathway protein 


1265 


1 . le-128 


patp 


AAU17460 


Novel signal transduction pathway protein 


1127 


4 .7e-114 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 3447 of 3451 bases (99%) identical to a gb:GENBANK- 
ID:D86976|acc:D86976.1 mRNA from Homo sapiens (Human mRNA for KIAA0223 gene). 
The full amino acid sequence of the protein of the invention was found to have 1 134 of 1 136 
amino acid residues (99%) identical to, and 1 135 of 1 136 amino acid residues (99%) similar 
to, the 1 165 amino acid residue ptnr:SPTREMBL-ACC:Q92619 protein from Homo sapiens 
(Human) (MYELOBLAST KIAA0223). NOV86 also has homology to the other proteins 
shown in the BLASTP data in Table 86D. 



Table 86D. NOV86 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 

(%) 


Expect 


gi | 14765644 | r 
ef|XP 037574. 
1 | (XM_037574) 


minor histocompatibility 
antigen HA-1 
[Homo sapiens] 


113 6 


1134/1136 
(99) 


1135/1136 
(99) 


0 . 0 


gi | 1504026 | db 
j |BAA13212 . 1 | 
(D86976) 


similar to C.elegans 
protein (Z37093) [Homo 
sapiens] 


1165 


1134/1136 
(99) 


1135/1136 
(99) 


0.0 


gi | 2896796 | gb 
|AAC03237.l| 
(AC004151) 


D1013 901 [Homo sapiens] 


996 


994/996 
(99) 


995/996 
(99) 


0 . 0 


gi|l2857707|d 
bj |BAB31085 . 1 
| (AK018130) 


MYELOBLAST KIAA02 23 
(FRAGMENT) -putative [Mus 
musculus] 


523 


423/542 
(78) 


460/542 
(84) 


0 . 0 


gi | 13635768 | r 
ef |XP_017232 . 
1 | (XM_017232) 


similar to PTPL1- 
associated RhoGAP 1 (H. 
sapiens) [Homo sapiens] 


1261 


345/881 
(39) 


509/881 
(57) 


e-160 



This BLASTP data is displayed graphically in the ClustalW in Table 86E. A multiple 
sequence alignment is given, with the NOV86 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 86D. 
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Table 86E. ClustalW Alignment of NOV86 



NOV8 6 

gi | 14765644 | 
gi | 1504026 | 
gi | 2896796 | 
gi j 12857707 | 
gi | 13635768 j 



(SEQ ID NO: 280) 

(SEQ ID NO: 72 9) 

(SEQ ID NO:730) 

(SEQ ID NO:731) 

(SEQ ID NO:732) 

(SEQ ID NO: 73 3) 



10 



I 



20 
.|.. 



30 



40 

NOV8 6 MFSRKKRELMKTPSIg 

gi | 14765644 | MFSRKKRE^MKTPSlI 

gi | 1504026 | QCGPEAAS AGS CS AETPS P PPRAPGRGP IM FSRKKREJ^MKTPS I B 

gi|2896796| 

gi 12857707 | " f 

gi | 13635768 | M1AHKQKK! 




NOV8 6 

gi | 14765644 | 
gi | 1504026 | 
gi | 2896796 j 
gi j 12857707 | 
gi | 13635768 | 



NOV8 6 

gi | 14765644 | 
gi j 1504026 | 
gi j 2896796 | 
gi | 12857707 | 
gi j 13635768 j 



NOV8 6 

gi | 14765644 | 
gi | 1504026 | 
gi |2896796 j 
gi | 12857707 | 
gi | 13635768 j 



NOV8 6 

gi | 14765644 | 
gi | 1504026 | 
gi j 2896796 j 
gi j 12857707 | 
gi | 13635768 j 



NOV8 6 

gi | 14765644 | 
gi | 1504026 | 
gi j 2896796 j 
gi | 12857707 | 
gi | 13635768 j 



NOV8 6 

gi | 14765644 | 



60 



GSPSPQPgGELPE 
GSPSPQPgGELPK 
GpbsPQPgGELPF 



70 80 90 100 

I — I — I — I — I — I — I 

dA^fpgpsleHpagssgvkatgtlkrptslsrh 

kv ; FPGPSLpj|PAGSSGVKATGTLKRPTSLSRH 

^fpgpsleBpagssgvkatgtlkrptslsrh 



W^SGQLSfDITTSlM@LKlgsSNSIFpB- 



110 



120 



130 



150 



I 



I 



I 



I 



I 



asaagfplsgaas wtlgrshrs pltaas pgelptegagpdvvedgsi 
asaagfplsgaaswtlgrshrspltaaspgelptegagpdw|dHsi 
asaagf pls gaas wtlgrshrs pltaas pgelptegagpdwidf 



160 




170 
. . I . . 



I 



180 



190 



200 



^LggLKE^jggDLLEARRPRAHECLGEALRVMHQIISKYPLL 

IBlBBlke^^Mdllearrpraheclgealrvmhqiiskypll 
^lke !SIBB dllearrpraheclgealrvmhqi i skypll 

DLLEARRPRAHECLGEALRVMHQI I SKYPLL 



K^^roflL yHB^fs |cf§e vigji r3e[|l[J^|ksB^PqnJJ 

210 220 230 240 250 




LIAKVKAFHYES 
'LIAKVKAFHYES 
'LIAKVKAFHYES 
'LIAKVKAFHYES 




240 



I 



QEFEKALETIAVAFSST 
QEFEKALETIAVAFSST 
QEFEKALETIAVAFSST 
QEFEKALETIAVAFSST 



jv^TEvHjENBNDLFHvFS^a^FTgc^ll 



I 



260 
- . I - . 



270 
. . I . . 



I 



280 
. . I . . 



290 



I 



I 



300 

^1 



VSEFLMGEVDSSTLLAVPPGDSSQSMESLYGPGSEGTPPSLjlDCDAGCLP 
VSEFLMGEVDSSTLLAVPPGDSSQSMESLYGPGSEGTPPSLgDCDAGCLP 
VSEFLMGEVDSSTLLAVPPGDSSQSMESLYGPGSEGTPPSLgDCDAGCLP 
VSEFLMGEVDSSTLLAVPPGDSSQSMESLYGPGSEGTPPSLI3DCDAGCLP 



310 



330 



340 



FS 



350 



_1 



\EEVDVLLQRCEGGVDAALLYAKNMAKYMKDLISYLEKRTTLEMEFAKGL 
EEVDVLLQRCEGGVDAALLYAKNMAKYMKDLISYLEKRTTLEMEFAKGL 
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AEEVDVLLQRCEGGVDAALLYAKNMAKYMKDLISYLEKRTTLEMEFAKGL 
AE EVDVLLQRCEGGVDAALLYAKNMAKYMKDLISYLEKRTTLEMEFAKGL 



gi | 1504026 | 
gi|2896796| 
gi | 12857707 | 
gi | 13635768 j 



NOV8 6 

gi | 14765644 | 
gi j 1504026 | 
gi |2896796 j 
gi j 12857707 | 
gi|l3635768| 



NOV8 6 

gi | 14765644 | 
gi | 1504026 | 
gi j 2896796 j 
gi j 12857707 | 
gi 1 13635768 j 



NOV8 6 

gi | 14765644 | 
gi | 1504026 | 
gi | 2896796 | 
gi 1 12857707 | 
gi | 13635768 | 



NOV8 6 

gi | 14765644 | 
gi j 1504026 | 
gi | 2896796 | 
gi | 12857707 | 
gi j 13635768 | 



NOV8 6 

gi | 14765644 | 
gi j 1504026 | 
gi|2896796| 
gi | 12857707 | 
gi j 13635768 j 



NOV8 6 

gi|l4765644| 
gi | 1504026 | 
gi|2896796| 
gi j 12857707 | 
gi | 13635768 j 



NOV8 6 

gi | 14765644 | 
gi j 1504026 | 
gi|2896796 j 
gi j 12857707 | 
gi j 13635768 | 



PL@i3N§aLlNTDS|lLg3j 



WS| 



T|Ni 



360 



370 
I I . . 



380 



390 



I 



I 



400 
- - I 



QKIAHNCRQSVMQEPHMPLLSIYSLALEQDLEFGHSMVQAVGTLQTQTFM 
QKIAHNCRQSVMQEPHMPLLSIYSLALEQDLEFGHSMVQAVGTLQTQTFM 
QKIAHNCRQSVMQEPHMPLLSIYSLALEQDLEFGHSMVQAVGTLQTQTFM 
QKIAHNCRQSVMQEPHMPLLSIYSLALEQDLEFGHSMVQAVGTLQTQTFM 



SS 



iatStpIgi^e: 

410 420 



I 



I 



430 
. . I . - 



I 



440 



450 

^1 



QPLTLRRLEHEKRRKEIKEAWHRAQRKLQEAESNLRKAKQGYVQRCEDHD 
QPLTLRRLEHEKRRKEIKEAWHRAQRKLQEAESNLRKAKQGYVQRCEDHD 
Q PLTLRRLEHE KRRKE I KE AWHRAQRKLQE AE SNLRKAKQG Y VQRC EDHD 
QPLTLRRLEHEKRRKEIKEAWHRAQRKLQEAESNLRKAKQGYVQRCEDHD 



I 



460 

. . I . . 



I 



470 
- - I - - 



480 



490 



I 



I 



KARFLVAKAEEEQAGSAPGAGS TATKTLDKRRRLEEEAKNKAEEAMATYR 
KARFLVAKAEEEQAGSAPGAGg TATKTLDKRRRLEEEAKNKAEEAMATYR 
KARFLVAKAEEEQAGSAPGAGg TATKTLDKRRRLEEEAKNKAEEAMATYR 
KARFLVAKAEEEQAGSAPGAG§TATKTLDKRR RLEEE AKN KAEEAMATYR 



510 
. . I . . 



I 



520 
. . | - - 



530 



540 



550 



I 



I 



I 



TCVADAKTQKQELEDTKVTALRQIQEVIRQSDQTIKSATISYYQMMHMQT 
TCVADAKTQKQELEDTKVTALRQIQEVIRQSDQTIKSATISYYQMMHMQT 
TCVADAKTQKQELEDTKVTALRQIQEVIRQSDQTIKSATISYYQMMHMQT 
TCVADAKTQKQELEDTKVTALRQI QEVIRQSDQTIKSATISYYQMMHMQT 



560 




I 



EI 



570 
. . I . . 



580 
. . I . . 



590 



600 



LCESSKLYDPGQQYASHVRQLQRDQEPDVHYDFEPHVS, 
LCESSKLYDPGQQYASHVRQLQRDQEPDVHYDFEPHVS. 
LCESSKLYDPGQQYASHVRQLQRDQEPDVHYDFEPHVS. 
LCESSKLYDPGQQYASHVRQLQRDQEPDVHYDFEPHVS. 



gs[§ADSL@s| 



?HS E F^ATNS TpQEK^DGpVNK^^^ - § 



610 



I 



I 



620 
. . I . . 



I 



630 
. . I . . 



640 



650 



I 



WSPVMRARKSSFNVSDVARPEAAGSPPEEGGCTEGTPAKDHRAGRGHQVH 
WSPVMRARKSSFNVSDVARPEAAGSPPEEGGCTEGTPAKDHRAGRGHQVH 
WSPVMRARKSSFNVSDVARPEAAGSPPEEGGCTEGTPAKDHRAGRGHQVH 
W S P VMR AR K S S FNV S D V AR P EAAGSPP E E GG C T E GT P A KDH RAGRG HQ VH 

feggH^eE^ p^kdhrEgrg hqvh 



Q PS GFGPANS L[|D WRIigdS SNK I [ 




710 



720 



730 



740 
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NOV8 6 

gi | 14765644 | 
gi | 1504026 | 
gi | 2896796 j 
gi | 12857707 | 
gi j 13635768 j 



NOV8 6 

gi | 14765644 | 
gi j 1504026 | 
gi | 2896796 j 
gi j 12857707 | 
gi | 13635768 j 



NOV8 6 

gi | 14765644 | 
gi | 1504026 | 
gi |2896796 | 
gi | 12857707 | 
gi | 13635768 j 



NOV8 6 

gi | 14765644 | 
gi | 1504026 | 
gi j 2896796 | 
gi | 12857707 | 
gi j 13635768 j 



NOV8 6 

gi | 14765644 | 
gi j 1504026 | 
gi | 2896796 | 
gi | 12857707 | 
gi j 13635768 j 



NOV8 6 

gi | 14765644 | 
gi j 1504026 | 
gi j 2896796 j 
gi | 12857707 | 
gi | 13635768) 



NOV8 6 

gi | 14765644 | 
gi j 1504026 | 
gi |2896796 j 
gi | 12857707 | 
gi | 13635768) 



NOV8 6 

gi | 14765644 | 
gi | 1504026 | 
gi | 2896796 | 
gi | 12857707 | 




rSIiQPQDVMCSIGWDQGCFPKPLLiSPEERDIERSMKSLF 



1060 



1070 



I 



1080 
. - | | 



1090 
>.)... 



1100 

• ♦I 



EEPEETPGGQDESSNQRAEVWQVPYLEAGj 
EEPEETPGGQDESSNQRAEWVQVPYLEAG 
EEPEETPGGQDESSNQRAEWVQVPYLEAG 
EEPEETPGGQDESS NQRAEWV Q VPY LEAG 

EE PEEBffiGgQ^Bg^BBBBQ^E^g 
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gi | 13635768 | 



NOV8 6 

gi | 14765644 | 
gi j 1504026 | 
gi|2896796 j 
gi j 12857707] 
gi|l3635768| 



NOV8 6 

gi | 14765644 | 
gi | 1504026 | 
gi j 2896796 j 
gi | 12857707 | 
gi j 13635768| 



NOV8 6 

gi | 14765644 | 
gi | 1504026 | 
gi | 2896796 j 
gi j 12857707 | 
gi |13635768| 



NOV8 6 

gi | 14765644 | 
gi | 1504026 | 
gi | 2896796 | 
gi j 12857707 | 
gi j 13635768 j 



NOV8 6 

gi | 14765644 | 
gi | 1504026 | 
gi | 2896796 j 
gi | 12857707 | 
gi | 13635768 | 



NOV8 6 

gi | 14765644 | 
gi j 1504026 | 
gi | 2896796 j 
gi j 12857707| 
gi | 13635768 j 



FSSKEDIHTSESESKIFERATSFj33^ERK@^GKCI^C'||SDKAQL[jLDQ 
1120 1130 1140 




1150 
I 



1160 



1170 



:l|PRTKIRPVSLPVDRLLL 
1180 1190 1200 



I 



1190 
..|... 



ASPPNERNGRNMGNVNLDKFCKNPAFEGVNRKDAATTVCSKFNGFDQQTL 
1210 1220 1230 1240 




QKI QDKQY{|QN@lSAK| 

1260 1270 



1280 



|||GDHSINgT£|PSKP 
1290 




1300 
I 



YAEPVRS 



1310 



1320 



1330 



1340 



1350 



GGRMTLGSCRERQPEFV 
GGRMTLGSCRERQPEFV 
GGRMTLGSCRERQPEFV 
GGRMTLGSCRERQPEFV 



jpGTDHDPHGLWKSMPDPDKASACPGQATGQPKEDS 



1360 
..|... 



1370 
..|... 



1380 
..|... 



EELGLPDVNPMCQRPRLKRMQQFEDLEGEIPQFV 



Table 86F lists the domain description from DOMAIN analysis results against NOV86. 
This indicates that the NOV86 sequence has properties similar to those of other proteins 
known to contain this domain. 
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Table 86F. Domain Analysis of NOV86 
gnllSmartlsmart00324 , RhoGAP, GTPase-activator protein for Rho like GTPases; GTPase activator proteins 
towards Rho/Rac/Cdc42-like small GTPases SEQ ID NO : 87 9 

CD-Length = 175 residues, 98.3% aligned 
Score = 157 bits (396), Expect - 4e-39 



NOV8 6 : 


774 


V P F I V KK C VCE I E RRAL RT KG I YRVNG VKTRVE KL CQ AFENG KE L V -ELS Q AS PHD I SNV 


832 






+P IV+KC+ +E+R L T+GIYR +G +RV++L +AF++G + +LS+ HD++ + 




Sbjct : 


3 


IPIIVEKCIEYLEKRGLDTEGIYRKSGSASRVKELREAFDSGPDPDLDLSEYDVHDVAGL 


62 


NOV86 : 


833 


LKL YLRQLPE PL I S FRIjYHEIjVGLAKDSLKAEAEAKAASRGRQDGSE S EAVAVALAGSLi? 


892 






LKL+LR+LPEPLI+F LYE+A +EE+ 




Sbjct : 


63 


LKLFLRELPEPLITFELYEEFIEAA KLEDEEERLRALRE 


101 


NOV86 : 


893 


FLLiiDLPPENRASLQYLLRHLRRIVEVEQDNKMTPGNLGIVFGPTLLRPRPTEATVSLSS 


952 






LL LPP NRA+L+YLL HL R+ E ++NKMT NL IVFGPTLLRP E +S 




Sbj Ct : 


102 


- LLSLLPPANRATLRYLLAHLNRVAEHSEENKMTARNLAI VFGPTLLRPPDGE S AS 


156 


NOV86 : 


953 


LVDYPHQARVIETLIVHY 970 








L D HQ +V+E LI + 




Sbjct: 


157 


LKD I RHQNKWE FL I EN A 174 





Rho GTPases control a variety of cellular processes. There are 3 subtypes of Rho 
GTPases in the Ras superfamily of small G proteins: RHO, RAC, and CDC42. GTPase- 
activating proteins (GAPs) bind activated forms of Rho GTPases and stimulate GTP 
5 hydrolysis. Through this catalytic function, Rho GAPs negatively regulate Rho-mediated 

signals. GAPs may also serve as effector molecules and play a role in signaling downstream of 
Rho and other Ras-like GTPases. 

By screening a Jurkat cDNA library using a yeast 2-hybrid system with an activated 
form of RAC as bait, followed by screening a placenta cDNA library, Toure et al. (1998) 
10 isolated a cDNA encoding RACGAP1, which they called MGCRACGAP. The predicted 527- 
amino acid RACGAP1 protein has a large N-terminal region containing a protein kinase C- 
like cysteine-rich motif. RACGAP1 shares highest homology with the Drosophila RnRacGAP 
and the chimerins of rat and human. Functional analysis showed that the GAP domain of 
RACGAP1 exhibits strong GAP activity towards CDC42, RAC1, and RAC2. Northern blot 
15 analysis detected an approximately 3.2-kb RACGAP1 transcript that was most abundantly 

expressed in testis, with low expression in most other tissues. Western blot analysis detected a 
RACGAP1 protein of 58 kD in testis extracts. In situ hybridization showed that RACGAP1 
expression is restricted to germ cells in mature testis Human breakpoint cluster region (bcr) 
gene product is a member of a group of GTPase-activating proteins that act exclusively on 
20 members of the Ras-related Rho subfamily. 

A complementary DNA was isolated from Caenorhabditis elegans that encoded a 
polypeptide of 1438 amino acid residues, CeGAP, which contains a domain with sequence 
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similarity to the COOH-terminal segment (GTPase-activating protein region) of Bcr and other 
known GTPase-activating proteins of the Rho subfamily. It also contains a "pleckstrin 
homology" motif, present in many signaling proteins including GTPase-activating proteins 
and nucleotide exchange factors. The Bcr-like domain of CeGAP exhibited activity not only 
5 on members of the C. elegans and human Rho subfamily but surprisingly also on C. elegans 
Ras protein (let-60), human Ras, and Rab3A. 

CeGAP is therefore the first GTPase-activating protein acting on Ras-related proteins 
across different subfamilies, studies suggest a central and integrative role for CeGAP in a 
signaling pathway common to Ras and related proteins. 

10 NOV86 is predicted to be expressed in at least the following tissues: pancreas, 

stomach, brain, bone. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV86 is provided in Example 2. 

15 The NOV86 nucleic acids and proteins are useful in potential therapeutic applications 

implicated in various pathological disorders described further herein, for example, Alzheimer's 
disease, stroke, tuberous sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, 
cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, 
leukodystrophies, behavioral disorders, addiction, anxiety, pain, neuroprotection, 

20 hypercalceimia, ulcers, diabetes, Von Hippel-Lindau (VHL) syndrome, pancreatitis, obesity as 
well as other diseases, disorders and conditions. NOV86 nucleic acids encoding the CeGAP- 
like protein of the invention, or fragments thereof, may further be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein are to be 
assessed. 

25 The novel nucleic acid of the invention encoding a GTPase activating protein-like 

protein includes the nucleic acid whose sequence is provided in Table 86A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 86A while still encoding a protein 
that maintains its GTPase activating protein-like activities and physiological functions, or a 

30 fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to the sequence of Table 86 A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 




include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
5 In the mutant or variant nucleic acids, and their complements, up to about 1% of the bases may 
be so changed. 

The novel protein of the invention includes the GTPase activating protein-like protein 

whose sequence is provided in Table 86B. The invention also includes a mutant or variant 

protein any of whose residues may be changed from the corresponding residue shown in Table 
10 86B while still encoding a protein that maintains its GTPase activating protein-like activities 

and physiological functions, or a functional fragment thereof. In the mutant or variant protein, 

up to about 1% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
15 methods. These antibodies may be generated according to methods known in the art, using 

prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 

below. 

NOV87 

NOV87 includes two GTPase activating-like proteins, designated herein as NOV87a 
20 and NOV87b. 

NOV87a 

The disclosed NOV87a (alternatively referred to herein as CG56886-01) includes the 
994 nucleotide sequence (SEQ ID NO:281) shown in Table 87A. A NOV87a ORF begins 
25 with a Kozak consensus ATG initiation codon at nucleotides 16-18 and ends with a TGA 
codon at nucleotides 982-984. The disclosed NOV87a maps to human chromosome 17. 



Table 87A. NOV87a Nucleotide Sequence (SEQ ID NO:281) 

GGGGGCCACTCTCTCATGGCCCCCAAAGACAAATCCAGTAGGAAGAATGTGCTGGAGGTGAGTGGCGGGG 
TTGGGGAGAAGGAGGGAAGGGGCTTAGTGATGCCTGCCAGCTCTCTGACAACTATTGACCTGGAAATGAC 
CTGGGGGTTTTTGACCCTTAATTCCCAGCTACGGAGCCGAGATGGCTCTGAGTACCTGATCCAGCACGAC 
TCGGAGGCCATCATCAGCACCTGGCATAAGGCCATTGCTCAGGGCATCCAGGAGCTGGTAAGCAGAGCCC 
AGGGCCTCAGCGACTTGAGCAAGGTCCGGCACAAGCTCCGCAAGTTCCTCCAGAGGCGGCCCACACTGCA 
GTCGCTGCGGGAGAAGGGCTACATCAAAGACCAGGTGTTCGGCTGCGCGCTGGCCGCGCTGTGTGAGCGC 
GAGAGGAGCCGGGTGCCACGCTTCGTGCAGCAGTGCATCCGCGCCGTCGAGGCCCGCGGTCTGGACATCG 
ACGGGCTGTACCGCATCAGTGGAAACCTGGCCACCATCCAGAAGCTACGCTATAAGGTGGACCACGGTGA 
GGATGAGCGCCTTGACCTGGATGACGGGCGCTGGGAGGACGTCCACGTTATCACCGGAGCCCTGAAGCTC 
TTCTTTCGGGAGCTGCCCGAGCCCCTCTTCCCCTTCTCGCACTTCCGCCAGTTCATTGCGGCCATCAGTG 
AGCAGGACCAGGCCCGGCGCAGCCGCTGTGTGCGTGACTTGGTGCGCTCGCTGCCCGCTCCCAACCACGA 
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CACTCTGCGGATGCTCTTCCAGCACCTCTGCCGGAGGGTGATCGAGCACGGCGAGCAGAACCGCATGTCG 
GTGCAGAGCGTGGCCATTGTGTTCGGGCCCACGCTGCTGCGGCCCGAGGTGGAAGAGACCAGCATGCCCA 
TGACCATGGTGTTCCAGAACCAGGTGGTGGAGCTCATCCTGCAGCAGTGCGCGGACATCTTCCCGCCGCA 

CTGACTGCTGGCCT 

A NOV87a polypeptide (SEQ ID NO:282) encoded by SEQ ID NO:281 is 322 amino 
acids in length and is presented using the one-letter amino acid code in Table 87B. The Psort 
profile for NOV87a predicts that this sequence is likely to be localized to the cytoplasm with a 
5 certainty of 0.6500. 



Table 87B. NOV87a Polypeptide Sequence (SEQ ID 
NO:282) 

MAPKDKSSRKNVLEVSGGVGEKEGRGLVMPASSLTTIDLEMTWGFLTLNSQLRSRDGSEY 
LIQHDSEAIISTWHKAIAQGIQELVSRAQGLSDLSKVRHKLRKFLQRRPTLQSLREKGYI 
KDQVFGCALiAALCERERSRVPRFVQQCI RAVEARGLD IDGLYRI SGNLAT I QKLRYKVDH 
GEDERLDLDDGRWEDVHVITGALKLFFRELPEPLFPFSHFRQFIAAISEQDQARRSRCVR 
DLVRSLPAPNHDTLRMLFQHLCRRVIEHGEQNRMSVQSVAIVFGPTLLRPEVEETSMPMT 
MVFQNQWEL I LQQCADI FPPH . 



NOV87b 

The disclosed NOV87b (alternatively referred to herein as CG56886-02) includes the 
10 985 nucleotide sequence (SEQ ID NO:283) shown in Table 87C. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 7-9 and ends with a TGA codon at 
nucleotides 973-975. The disclosed NOV87b maps to human chromosome 17. 



Table 87C. NOV87b Nucleotide Sequence (SEQ ID NO:283) 

GGGGGCCACTCTCTCATGGCCCCCAAAGACAAATCCAGTAGGAAGAATGTGCTGGAGGTGAGTGGCGGGG 
TTGGGGAGAAGGAGGGAAGGGGCTTAGTGATGCCTGCCAGCTCTCTGACAACTATTGACCTGGAAATGAC 
CTGGGGGTTTTTGACCCTTAATTCCCAGCTACGGAGCCGAGATGGCTCTGAGTACCTGATCCAGCACGAC 
TCGGAGGCCATCATCAGCACCTGGCATAAGGCCATTGCTCAGGGCATCCAGGAGCTGGTAAGCAGAGCCC 
AGGGCCTCAGCGACTTGAGCAAGGTCCGGCACAAGCTCCGCAAGTTCCTCCAGAGGCGGCCCACACTGCA 
GTCGCTGCGGGAGAAGGGCTACATCAAAGACCAGGTGTTCGGCTGCGCGCTGGCCGCGCTGTGTGAGCGC 
GAGAGGAGCCGGGTGCCACGCTTCGTGCAGCAGTGCATCCGCGCCGTCGAGGCCCGCGGTCTGGACATCG 
ACGGGCTGTACCGCATCAGTGGAAACCTGGCCACCATCCAGAAGCTACGCTATAAGGTGGACCACGGTGA 
GGATGAGCGCCTTGACCTGGATGACGGGCGCTGGGAGGACGTCCACGTTATCACCGGAGCCCTGAAGCTC 
TTCTTTCGGGAGCTGCCCGAGCCCCTCTTCCCCTTCTCGCACTTCCGCCAGTTCATTGCGGCCATCAGTG 
AGCAGGACCAGGCCCGGCGCAGCCGCTGTGTGCGTGACTTGGTGCGCTCGCTGCCCGCTCCCAACCACGA 
CACTCTGCGGATGCTCTTCCAGCACCTCTGCCGGAGGGTGATCGAGCACGGCGAGCAGAACCGCATGTCG 
GTGCAGAGCGTGGCCATTGTGTTCGGGCCCACGCTGCTGCGGCCCGAGGTGGAAGAGACCAGCATGCCCA 
TGACCATGGTGTTCCAGAACCAGGTGGTGGAGCTCATCCTGCAGCAGTGCGCGGACATCTTCCCGCCGCA 

CTGACTGCTGGCCT , _ 



1 5 The NOV87b polypeptide (SEQ ID NO:284) encoded by SEQ ID NO:283 is 322 

amino acids in length and is presented using the one-letter amino acid code in Table 87D. The 
Psort profile for NOV87b predicts that this sequence has no signal peptide and is likely to be 
localized to the cytoplasm with a certainty of 0.6500. 
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Table 87D. NOV87b Polypeptide Sequence (SEQ ID 
NO:284) 

MAPKDKSSRKNVLEVSGGVGEKEGRGLVMPASSLTTIDLEMTWGFLTLNSQLRSRDGSEY 
LIQHDSEAIISTWHKAIAQGIQELVSRAQGLSDLSKVRHKLRKFLQRRPTLQSLREKGYI 
KDQVFGCALAALCERERSRVPRFVQQCIRAVEARGLDIDGLYRISGNLATIQKLRYKVDH 
GEDERLDLDDGRWEDVHVITGALKLFFRELPEPLFPFSHFRQFIAAISEQDQARRSRCVR 
DLVRSLPAPNHDTLRMLFQHLCRRVIEHGEQNRMSVQSVAIVFGPTLLRPEVEETSMPMT 
MVFQNQWELILQQCADIFPPH 



A BLAST analysis of NOV87 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV87 had high 
homology to other proteins as shown in Table 87E. 



Table 87E. BLASTX results from PatP database for NOV87 



Smallest 
Sum 

High Probability 

Sequences producing High- scoring Segment Pairs: Score P (N) 

patp: AAU1744 9 Novel signal transduction pathway protein 

patp : AAB68548 Human GTP-binding associated protein 

patp:AAG66505 GTP enzyme Rho family active site 90 

patp : AAB643 87 Amino acid sequence of human intracellular 

patp : AAY9445Q Human inflammation associated protein 

In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 276 of 417 bases (66%) identical to a gb:GENBANK- 
ID:BC006107|acc:BC006 107.1 mRNA from Homo sapiens {Homo sapiens, clone 
MGC: 12959, mRNA). The full amino acid sequence of the protein of the invention was found 
to have 173 of 319 amino acid residues (54%) identical to, and 229 of 319 amino acid residues 
(71%) similar to, the 3 16 amino acid residue ptnr : SPTREMBL-ACC : Q9NT76 protein from 
Homo sapiens (Human) (HYPOTHETICAL 36.4 KDA PROTEIN). NOV87 also has 
homology to the other proteins shown in the BLASTP data in Table 87F. 



Table 87F. NOV87 BLASTP results 




Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%> 


Expect 


gi | 11360091 |p 
ir | | T46471 


hypothetical protein 
DKFZp434L0130.l - human 


316 


168/355 
(47) 


221/355 
(61) 


7e-80 


gi | 13676443 |d 
bj |BAB41146 . 1 
| (AB060206) 


hypothetical protein 
[Macaca f ascicularis] 


847 


164/356 
(46) 


220/356 
(61) 


6e-79 



2 . 9e-82 

3 .5e-77 
3 .5e-77 
5.6e-70 
3 . 0e-62 
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gi | 14245732 |d 
bj |BAB56159.1 
| (AB051853) 


rho-GTPase activating 
protein [Homo sapiens] 


731 


130/328 
(39) 


188/328 
(56) 


9e-57 ; 


gi | 1814683l|d 
bj | BAB83128 . 1 
| (AB030239) 


RGL1 [Homo sapiens] 


547 


130/328 
(39) 


188/328 
(56) 


2e-56 i 


gi | 15080081 |g 
b | AAH11820 . 1 | 
AAH11820 
(BC011820) 


Unknown (protein for 
IMAGE: 3619501) 
[Homo sapiens] 


599 


130/328 
(39) 


188/328 
(56) 


2e-56 



This BLASTP data is displayed graphically in the ClustalW in Table 87G. A multiple 
sequence alignment is given, with the NOV87a and b proteins being shown on lines 1 and 2 in 
a ClustalW analysis comparing the protein of the invention with the related protein sequences 
5 shown in Table 87F. 



Table 87G. ClustalW Alignment of NOV87 



NOV87a (SEQ ID NO: 2 82) 

NOV8 7b (SEQ ID NO: 2 84) 

gi|H36009l| (SEQ ID NO:734) 

gi|l3676443| (SEQ ID NO:735) 

gi|l4245732| (SEQ ID NO:736) 

gi|l814683l| (SEQ ID NO:737) 

gi|l508008l| (SEQ ID NO:738) 



10 20 30 40 50 

|....|....|....|....|....|....|....|....|-.-.| 



MKMADRSGKIIPGQAYIEVEYDYEYEAKDRKIVIKQGERYILVKKTNDDW 
M 



60 70 80 90 100 

|....|....|....|----|...-|....|-...|....|-..-| 



WQVKPDENSKAFYVPAQYVKEVTRKALMPPVKQVAGLPNNSTKIMQSLHL 
LSSRWWPSSWGILGLGPRSPPRGSQLCALYAFTYTGADGQQVSLAEGDRF 



110 120 130 140 150 



QRSTENVNKLPELS S FGKPSS S VQGTGLTRDANQNFGPS YNPGHTVNLSL 
LLLRKTNSDWWLARRLEAPSTSRP I FVPAAYMI EES I PSQS PTTVI PGQL 



160 170 180 190 200 

|....|....|....|.-..|.-..|....|....|-..-|....| 



DLTHNNGKFNNDSHSPKVSSQNRTRLFGHFPGPEFLDVEKTSFSQEQSCD 



NOV8 7a 
NOV8 7b 
gi | 11360091 1 
gi | 13676443 j 
gi j 14245732 j 
gi | 18146831 1 
gi | 15080081 j 



NOV87a 
NOV8 7b 
gi | 11360091 | 
gi j 13676443 | 
gi j 14245732 | 
gi | 18146831 | 
gi | 15080081 | 



NOV87a 

NOV87b 

gi | 11360091 | 

gi j 13676443 | 

gi | 14245732 j 

gi | 18146831 j 

gi | 15080081 j 



NOV87a 
NOV87b 
gi|H36009l| 
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gi | 14245732 | 
gi j 18146831 | 
gi | 15080081 j 



NOV8 7a 

NOV87b 

gi | 11360091 | 

gi | 13676443 j 

gi j 14245732 j 

gi | 18146831 | 

gi | 15080081 j 



LWTPGPKLFHGSLEELSQALPSRAQASSEQPPPLPRKMCRSVSTDNLSPS 



- PPLPRKMCRSVSTDNLSPS 



210 



220 



230 



240 



250 



|....|....|....|....|....|....|...-| 



SAGEGSERIHQDSESGDELSSSSTEQIRATTPPNQGRPDSPVYANLQELK 

LLKPFQEGPSGRSLSQEDLPSEAS ASTAGPQPLMSEPPVYCNLVDLR 

MSEPPVYCNLVDLR 

LLKPFQEGPSGRSLSQEDLPSEAS ASTAGPQPLMSEPPVYCNLVDLR 



260 



270 



280 



290 
..|.. 



300 



NOV87a 
NOV8 7b 

gi 
gi 
gi 
gi 
gi 



11360091 | 
13676443 | 
14245732 j 
18146831 | 
15080081| 



I SQSALPPLPGS PAIQI NGEWETHKD - S SGRC Y YYDRGTQERTWKP PRWT 
RCPRSPPPGPACPLLQRLDAWEQHLDPNSGRCFYINSLTGCKSWKPPRRS 
RCPRSPPPGPACPLLQRLDAWEQHLDPNSGRCFYINSLTGCKSWKPPRRS 
RCPRSPPPGPACPLLQRLDAWEQHLDPNSGRCFYINSLTGCKSWKPPRRS 



310 320 330 340 350 

NOV8 7a 

NOV8 7b 

gi|H36009l| 

gi j 13676443 j RDASISKGDFQSPGDQELLSSEENYYSTSYSQSDSQCGSPPRGWSEELDE 

gi | 14245732 j R SE 

gi j 18146831] R SE 

gi j 15080081 j R SE 

360 370 380 390 400 

NOV8 7a 

NOV8 7b 

gi|ll36009l| 

gi | 13676443 j RGHT L YT S D YTNE KWL KH I DDQGRQ Y Y Y S ADG S R S E WE L P K YNAS S QQQ R 

gi|l4245732| TNPGSMEGTQTLKRNNDVLQPQA 

gi|l814683l| TNPGSMEGTQTLKRNNDVLQPQA 

gi|l508008l| TNPGSMEGTQTLKRNNDVLQPQA 

410 420 430 440 450 

NOV8 7a 

NOV8 7b 

gi|H36009l| 

gi | 13676443 j EIIKSRSLDRRLQEPIVLTKWRHSTIVLDTNDKESPTASKPCFPENESSP 

gi | 14245732 | KGFRSDTGTPEPLDPQGSLSLSQRTSQLDPPALQAP RP 

gi | 1814 6831 j KGFRSDTGTPEPLDPQGSLSLSQRTSQLDPPALQAP RP 

gi j 15080081 | KGFRSDTGTPEPLDPQGSLSLSQRTSQLDPPALQAP RP 



460 



470 



480 



490 



500 



NOV87a 

NOV87b 

gi | 11360091 

gi j 13676443 

gi j 14245732 

gi j 18146831 

gi | 15080081 



NOV87a 
NOV87b 
gilll36009l| 



|....|....|..-.|---.|.-..|....|.-.-|....|--..| 



SSPKHQDTASSPKDQEKYGLLNVTKIAENGKKVRKNWLSSWAVLQGSSLL 

L P QL LDD PHE VE K S GL LNMT K I AQGG R KL RKNWG P S WWLTGNS L V 

LPQLLDDPHEVEKSGLLNMTKIAQGGRKLRKNWGPSWWLTGNSLV 

LPQLLDDPHEVEKSGLLNMTKIAQGGRKLRKNWGPSWWLTGNSLV 



510 
|....|.. 



520 



530 



540 



550 
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gi | 13676443 | FTKT- -QGSSTSWFGSNQSKPEFTVDLKGATIE^0S-gDK 

gi | 1424 5732 | FYREPPPTAPSSGWGPAGSRPESSVDLRG- -AAI^G^HLg 

gi | 18146831 | FYREPPPTAPSSGWGPAGSRPESSVDLRG- -AAJ^G&HL 

gi|l508008l| FYREPPPTAPSAGWGPAGSRPESSVDLRG- -AAISHGB 



NOV8 7a 
NOV87b 

gi 
gi 
gi 
gi 
gi 



11360091 | 
13676443 j 
14245732 | 
18146831 | 
15080081 



NOV8 7a 

NOV87b 

gi | 11360091 | 

gi j 13676443 j 

gi | 14245732 | 

gi | 18146831 | 

gi j 15080081 j 



NOV8 7a 
NOV8 7b 
gi | 11360091 | 
gi | 13676443 " 
gi j 14245732 | 
gi | 18146831 | 
gi | 15080081 j 
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gi 


| 11360091 | 


jG)Sflg 


gi 


| 13676443 j 




gi 


j 14245732 | 


JsSpB 


gi 


| 18146831 j 




gi 


| 15080081 | 


J2>p§ 




Table 87H lists the domain description from DOMAIN analysis results against 
NOV87. This indicates that the NOV87 sequence has properties similar to those of other 
proteins known to contain this domain. 



Table 87H. Domain Analysis of NOV87 



pnl [Smartlsmart00324 , RhoGAP, GTPase-activator protein for Rho-like GTPases; GTPase activator proteins 
towards Rho/Rac/Cdc42-like small GTPases SEQ ID NO: 87 9 

CD-Length = 175 residues, 99.4% aligned 
Score = 144 bits (364), Expect = 6e-36 



NOV 87 : 138 S RVPRF VQQC I RAVE ARGLD I DGL Y R I S GNL AT I Q KLR YK VDHGEDERLDLDDGRWE D VH 197 

+P V++CI +E RGLD +G+YR SG+ + + + +LR D G D LDL + DVH 
Sbjct: 1 KPIPIIVEKCIEYLEKRGLDTEGI YRKSGSASRVKELREAFDSGPDPDLDLSE YDVH 57 ■ 

a o 

NOV87 : 198 V1TGALKLFFRELPEPLFPFSHFHQFT AAI S EQDQARRSRCVRDLVRS LPAPNHDTLRML 257 

+ G LKLF RELPEPL F + +FI A +D+ R R +R+L+ LP N TLR L 
Sbjct : 58 DVAGLLKLFLRELPEPLITFELYEEFIEAAKLEDEEERLRALRELLSLLPPANRATLRYL 117 

o 

N0V87 : 258 FQHLCRRVIEHGEQNRMSVQSVAIVFGPTLLRPEVEETSMPMTMVFQNQWELILQQC 315 

HL RV EH E+N+M+ + + +AI VFGPTLLRP E++ + QN+WE + + + 
Sbjct ■ 118 Li AHL - NR V AE H S E ENKMT ARNL AI VFG PTL LR P PDGE S AS LKD I RHQN K WE FL I ENA 174 



Rho GTPases control a variety of cellular processes. There are 3 subtypes of Rho 
GTPases in the Ras superfamily of small G proteins: RHO, RAC, and CDC42. GTPase- 
activating proteins (GAPs) bind activated forms of Rho GTPases and stimulate GTP 

10 hydrolysis. Through this catalytic function, Rho GAPs negatively regulate Rho-mediated 

signals. GAPs may also serve as effector molecules and play a role in signaling downstream of 
Rho and other Ras-like GTPases. 

By screening a Jurkat cDNA library using a yeast 2-hybrid system with an activated 
form of RAC as bait, followed by screening a placenta cDNA library, Toure et al. (1998) 

15 isolated a cDNA encoding RACGAP1, which they called MGCRACGAP. The predicted 527- 
amino acid RACGAP1 protein has a large N-terminal region containing a protein kinase C- 
like cysteine-rich motif. RACGAP1 shares highest homology with the Drosophila RnRacGAP 
and the chimerins of rat and human. Functional analysis showed that the GAP domain of 
RACGAP1 exhibits strong GAP activity towards CDC42, RAC1, and RAC2. Northern blot 

20 analysis detected an approximately 3.2-kb RACGAP1 transcript that was most abundantly 

expressed in testis, with low expression in most other tissues. Western blot analysis detected a 
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RACGAP1 protein of 58 kD in testis extracts. In situ hybridization showed that RACGAP1 
expression is restricted to germ cells in mature testis Human breakpoint cluster region (bcr) 
gene product is a member of a group of GTPase-activating proteins that act exclusively on 
members of the Ras-related Rho subfamily. 



polypeptide of 1438 amino acid residues, CeGAP, which contains a domain with sequence 
similarity to the COOH-terminal segment (GTPase-activating protein region) of Bcr and other 
known GTPase-activating proteins of the Rho subfamily. It also contains a "pleckstrin 
homology" motif, present in many signaling proteins including GTPase-activating proteins 
10 and nucleotide exchange factors. The Bcr-like domain of CeGAP exhibited activity not only 
on members of the C. elegans and human Rho subfamily but surprisingly also on C. elegans 
Ras protein (let-60), human Ras, and Rab3A. 

CeGAP is therefore the first GTPase-activating protein acting on Ras-related proteins 
across different subfamilies, studies suggest a central and integrative role for CeGAP in a 
1 5 signaling pathway common to Ras and related proteins. 

NOV87 is predicted to be expressed in at least the following tissues: pancreas, 
stomach, brain, bone. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 
20 NOV87 is provided in Example 2. 

The NOV87 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Alzheimer's 
disease, stroke, tuberous sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, 
cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, 
25 leukodystrophies, behavioral disorders, addiction, anxiety, pain, neuroprotection, 

hypercalceimia, ulcers, diabetes, Von Hippel-Lindau (VHL) syndrome, pancreatitis, obesity as 
well as other diseases, disorders and conditions. NOV87 nucleic acids encoding the CeGAP- 
like protein of the invention, or fragments thereof, may further be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein are to be 
30 assessed. 

The novel nucleic acid of the invention encoding a GTPase activating protein-like 
protein includes the nucleic acid whose sequence is provided in Table 87A or 96C, or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 87A or 96C while still 



5 



A complementary DNA was isolated from Caenorhabditis elegans that encoded a 
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encoding a protein that maintains its GTPase activating protein-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to the sequence of Table 87A or 96C, 
including nucleic acid fragments that are complementary to any of the nucleic acids just 
5 described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
10 in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1% of the bases may 
be so changed. 

The novel protein of the invention includes the GTPase activating protein-like protein 
1 5 whose sequence is provided in Table 87B or 96D. The invention also includes a mutant or 

variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 87B or 96D while still encoding a protein that maintains its GTPase activating 
protein-like activities and physiological functions, or a functional fragment thereof. In the 
mutant or variant protein, up to about 1% of the amino acid residues may be so changed. 
20 These materials are further useful in the generation of antibodies that bind 

immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

25 

NOV88 

The disclosed NOV88 (alternatively referred to herein as CG56394-01) includes the 
1092 nucleotide sequence (SEQ ID NO:285) shown in Table 88A. A NOV88 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 25-27 and ends with a stop codon 
30 at nucleotides 1033-1035. The disclosed NOV88 maps to human chromosome 2. 



Table 88A. NOV88 Nucleotide Sequence (SEQ ID NO:285) 

GCACCAGCCACATCCTGAGATACCATGGTTAAGGTGAAGGCCAGAGTCAACAGATTTGGCCACATTGGGC 
ACCAGATCACCAGGGCTGCTTTTAACTCTGGTAAAGTGGATATTGTTGCCATCAGTGACCCCTTCACTGG 
CCTCAACTACATGGTCTACGTGTTCCAGTGTGGTTCTACCCATGGCAAATTCCATGGCACTGTCAAGGCT 
GAGAATGGGAAGCTTGTCATTAACGGAAATCTCATCACCATCTTTCAGGAGCGAGATCCCACCAAAATCA 
AATGGGACAATGTTGACGCTGAGTACATTTGGGTGTCCACCGGTGTCTTCACCACCACAGAGAAGGCTGG 
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GGCTCACTTGCAGCAGGGAGCCAAAAGGGTCATAATCTCTACTCCCTCTGCTGACGCCCCCATGTTCATG 
ATGGGCGTGAACCATAAGAAATATGAAAACAGCCTCAAGATCATCAGCAATGCCTCCTGTACCACCAACT 
TCTTAGCCTCCCTGGCCAAGCTCATCCATGACAACTTTGGTATTGTGGAAGGACTCATGACCACGACCCA 
C ACC ATC ACTG C CAC CC AGAAG ACTGT AGATGG AC C CT C C AGG AAAC TGTGGTGTG ATGG C C ACGGGG CT 
CTCCAGATCATCATCCCTGCATCTACTGGTGCTGCCAAAGCTGTAGGCAAGGTCATCCCCGAGATGAATG 
GGAAGATTACTAGCATGGCCTTCCGTGTCCCCACCACCAATGTGTCGGTCATGCATCTGACCTGCCATCT 
GGAAAATCCTGCCAAATATGATGACATCAAGAAGGTGGTGAAACAGGCATCAGAGGCCCCTCCCCTCAAG 
GGCATCCTGGACTACACTGAGCACCACGTTGTCTCCTCCAGCTTTAACAGTGACACCCACTCTTCCACCT 
TCAATGATGGGGCTGGTATTGCCCTCAATGACCATTTTGTCAAGCTCATTTCCTGTTATGACAATGCATT 
TGGCTACAACAACAGGGCAGTGGACCTCATGGCCCACATGGCCTCCAAGAAGTAAGACCCCCAGACCACC 
AGCCTCAGGCCCTCAGCTGCTAGGAATCCCCTATTGCACTAG 



A NOV88 polypeptide (SEQ ID NO:286) encoded by SEQ ID NO:285 is 336 amino 
acids in length and is presented using the one-letter amino acid code in Table 88B. The Psort 
profile for NOV88 predicts that this sequence has no signal peptide and is likely to be 
5 localized to microbodies with a certainty of 0.4804. In alternative embodiments, a NOV88 
polypeptide is located to the mitochondrial matrix space with a certainty of 0.3600. 



Table 88B. NOV88 Polypeptide Sequence (SEQ ID 
NO:286) 

M VK VKAR VNR FGH I GHQ I TRAAFNS G K VD I VA I S DP FTG LNYM VY VFQCG S THG K FHG TV 
KAENG KL. V INGNL I T I FQERDPT K I KWDNVD AE Y I WVS TG VFTTTE KAG AHL QQG AKR V I 
I S TPS ADAPMFMMGVNHKKYENS LK I I SNASCTTNFLAS LAKL I HDNFG I VEGLMTTTHT 
ITATQKTVDGPSRKLWCDGHGALQI I IPASTGAAKAVGKVIPEMNGKITSMAFRVPTTNV 
SVMHLTCHLENPAKYDDIKKWKQASEAPPLKGILDYTEHHWSSSFNSDTHSSTFNDGA 
G I ALNDH FVKL I S CYDNAFGYNNRAVDLMAHMAS KK 



A BLAST analysis of NOV88 was run against the proprietary PatP GENESEQ Protein 
1 0 Patent database. It was found, for example, that the amino acid sequence of NOV88 had high 
homology to other proteins as shown in Table 88C. 



Table 88C. BLASTX results from PatP database for NOV88 










Smallest 










Sum 








High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P (N) 


patp 


AAY07036 


Breast cancer associated antigen precursor 


1407 


9 . 9e-144 


patp 


AAY05368 


Human HCMV inducible gene protein 


1407 


9 . 9e-144 


patp 


AAG64817 


Human G3PDH fragment - Homo sapiens, 327 aa. 


1377 


1 . 5e-140 


patp 


AAE04373 


Mouse cancer associated antigen OY-MC-2 


1316 


4 . 4e-134 


patp 


AAR12995 


GAP-DH - Aspergillus oryzae (ATCC 4214 9) 


993 


7 . 4e-100 



In a search of sequence databases, it was found, for example, that the nucleic acid 
1 5 sequence of this invention has 928 of 1055 bases (87%) identical to a gb:GENBANK- 

ID:AF261085|acc:AF26 1085.1 mRNA from Homo sapiens (glyceraldehyde-3-phosphate 
dehydrogenase (GADPH) mRNA), The full amino acid sequence of the protein of the 
invention was found to have 278 of 336 amino acid residues (82%) identical to, and 294 of 
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336 amino acid residues (87%) similar to, the 335 amino acid residue ptnnTREMBLNEW- 
ACC:AAG0I996 protein from Homo sapiens (Human) (CLONE CDABP0047 MRNA 
SEQUENCE). NOV88 also has homology to the other proteins shown in the BLASTP data in 
Table 88D. 

5 



Table 88D. NOV88 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi | 7669492 | ref | 
NP_002037 .2 | 
(NM 002046) 


glyceraldehyde- 3 - 
phosphate dehydrogenase 
[Homo sapiens] 


335 


278/336 
(82) 


294/336 
(86) 


e-152 


gi|31645|emb|CA 
A25833.l| 
(X01677) 


glyceraldehyde- 3- 
phosphate 
dehydrogenase 
[Homo sapiens] 


335 


227/336 
(82) 


294/336 
(87) 


e-152 


gi | 2407184 | gb | A 
AB94053 . 1 | 
(AF017079) 


glyceraldehyde 3- 
phosphate dehydrogenase 
[Sus scrofa] 


333 


272/333 
(81) 


290/333 
(86) 


e-149 


gi | 6983849 | dbj | 
BAA90818. 1 | 
(AB038241) 


glyceraldehyde- 3 - 
phosphate dehydrogenase 
[Felis catus] 


333 


267/333 
(80) 


285/333 
(85) 


e-145 


gi | 2506441 | sp | P 
00355 |G3P_PIG 


GLYCERALDEHYDE 3- 
PHOSPHATE DEHYDROGENASE 
(GAPDH) 


333 


265/333 
(79) 


284/333 
(84) 


e-145 



This BLASTP data is displayed graphically in the ClustalW in Table 88E. A multiple 
sequence alignment is given, with the NOV88 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
10 Table 88D. 



Table 88E. ClustalW Alignment of NOV88 



NOV8 8 (SEQ ID NO: 2 86) 

gi|7669492| (SEQ ID NO:739) 

gi|31645| (SEQ ID NO:740) 

gi|2407184| (SEQ ID NO:741) 

gi|6983849| (SEQ ID NO:742) 

gi|250644l| (SEQ ID NO:743) 



MGK 

mg: 



aVK^gVNtaFGiijlG jlga TRAAFNSGKVDIVAI^DPFg^LNYMVYffiFO 
VKVGVNGFGRIGRLVTRAAFNSGKVDIVAINDPFIDLNYMVYMFQYD 
3VKVGVNGFGRIGRLVTRAAFNSGKVDIVAINDPFIDLNYMVYMFQYD 



1vk vg vng fgr i g r l v t raafns g kvd i v a i nd p f i d lju ym v ym fq y d 
1vkvgvngfgrigrlvtraafnsgkvdivaindpfidlnymvymfqyd 
iVkvgvngfgrigrlvtraafnsgkvdivaindpfidlSymvymfqyd 



NOV8 8 

gi I 7669492 ] 
gi|31645| 
gi |2407184 | 
gi|6983849| 
gi (2506441 | 



NOV8 8 

gi [7669492 | 
gi | 31645 | 
gi |2407184 | 



60 
- I • - 



70 
• I - - 



80 



90 



100 



1- , 



sthgkfhgtvkaengklvingnpitifqerdp|kikwgdagaeyvvestg 
sthgkfhgtvkaengklvingnpitifqerdpSkikwgdagaeyvvestg 
sthgkfhgtvkaengklvingnp it i fqerdpfgki kwgdagaeyvvestg 
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1 Gll/'iPi'l 1 



gi | 6983849 | 
gi | 2506441 j 



NOV8 8 

gi | 7669492 | 
gi|31645| 
gi j 2407184 | 
gi | 6983849 j 
gi |250644l| 



NOV8 8 

gi | 7669492 | 
gi |31645 | 
gi | 2407184 | 
gi j 6983849 j 
gi|250644l| 



NOV8 8 

gi | 7669492 | 
gi|31645| 
gi j 2407184 | 
gi | 6933849 | 
gi |2506441 | 



NOV8 8 

gi | 7669492 | 
gi |31645 | 
gi |2407184 | 
gi |6983849 | 
gi (2506441 | 



NOV8 8 

gi |7669492 | 
gi (31645 | 
gi | 2407184 | 
gi j 6983849 | 
gi|250644l| 



sthgkfhgtvkaengklvinggpitifqerdp^blkwgdagaeyvvestg 
sthgkfhgtvkaengklvingEitifqerdpSikwgdagaSSywestg 



110 



120 



130 



140 



150 




160 



170 
. I . . 



180 



190 



S C TTNgL AgL AK[j I HDNFG I VEG LMT TgH|j I TAT QKT VDGP S jgK L WgDGj 

scttnclaplakvihdnfgiveglmttvhaitatqktvdgpsgklwrdgr 
scttnclaplakvihdnfgiveglmttvhaitatqktvdgpsgklwrdgr 
scttnclaplakvihdnfgiveglmttvhaitatqktvdgpsgklwrdgr 
scttnclaplakvihdjgfgiveglmttvhaitatqktvdgpsgklwrdgr 
scttnclaplakvihdSfgiveglmttvhaitatqktvdgpsgklwrdgr 



210 220 230 240 250 
. 1 1 I .... 1 1 — — I — • .(.... I I 



GALQQl I PASTGAAKAVGKVI PE§NGKgT@MAFRVPT^VS vgLTCgiLE 
GALQNI I PASTGAAKAVGKVI PELNGKLTGMAFRVPtBnVS VVDLTCRLE 
GALQNI I PASTGAAKAVGKVI PEL^GKLTGMAFRVPTSNVS VVDLTCRLE 
GALQNI I PASTGAAKAVGKVI PELNGKLTGMAFRVPtBnVSWDLTCRLE 
GA0QNI I PASTGAAKAVGKVI PELNGKLTGMAFRVPtBnVSWDLTCRLE 
GALQNI I PASTGAAKAVGKVI PELNGKLTGMAFRVPtBnVSWDLTCRLE 



260 



270 280 



290 



300 



PAKYDDIKKWKQASEgP 
KPAKYDD I KKWKQASEGP 
KPAKYDDIKKWKQASEGP 
KPAKYDD I KKWKQASEGP 
KPAKYDDIKKWKQASEGP 
KPAKYDDIKKWKQASEGP 




310 320 330 

■ . | ■ . . - - | .... | . . 



GIALNDHFVKLISgYDNgFGY^NRgVDLMAHMASKy 
G I ALNDH FVKLI S WYDNE FG YSNR WDLMAHMAS KE 
G I ALNDH FVKL I S WYDNE FG YSNR WDLMAHMAS KE 
G I ALNDHF VKL I S@YDNE FG Y SNRWDLM^HMAS KE 
GI ALNDHFVKL I S WYDNE FGYSNRWDLMAHMAS KE 
G I ALNDH FVKL I S WYDNE FG Y SNR WDLMffiHMAS KE 



Table 88F lists the domain description from DOMAIN analysis results against NOV88. 
This indicates that the NOV88 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 88F. Domain Analysis of NO V88 



gnllPfamlpfamQ2800 , gpdhC, Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a 

tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed 

al p ha/an ti parallel beta fold. SEQ ID NO: 880 

CD-Length = 163 residues, 100.0% aligned 
Score = 169 bits (427), Expect = 3e-43 



NOV88 : 



153 TTNFLASLAKLIHDNFGIVEGLMTTTHTITATOJCTVDGPSRKLWCDGHGALQIIIPASTG 212 
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TTN LA LAK+++DNFGI +GLMTT H TA QK VDGP K G A IIP STG 




Sbjct : 


1 


TTNCLAPLAKVLNDNFGIEKGLMTTVHAYTADQKLVDGPHHKDLRRGRAAAPNI I PTSTG 


60 


NOV8 8 : 


213 


AAKAVGKVIPEMNGKITSMAFRVPTTNVSVMHLTCHLENPAKYDDIKKWKQASEAPPLK 


272 






AAKAVG V+PE+NGK+T MAFRVPT NVSV+ LT LE P + + I +K+A+E P LK 




Sbjct : 


61 


AAKAVGL VL PE LNGKLTGM AFRVP T PNVS VVDLT VELE K P VT VEE I NAAL KE AAEG P ALK. 


120 


NOV88 : 


273 


G I LiDYTEHHWSSSFNSDTHSS TFNDGAG I ALNDH FVKL I S C Y 315 








GIL YTE +VSS F D HSS F+ A I LND+FVKL++ Y 




Sbjct : 


121 


GI LGYTEDPLVS SDF I GDPHS S I FDAKAT I VLNDNFVKL VAWY 163 





NAD-dependent glycerol-3-phosphate dehydrogenase (EC 1.1.1.8) (GPD) catalyzes 
the reversible reduction of dihydroxyacetone phosphate to glycerol-3-phosphate. It is a 
cytoplasmic protein that is active as a homodimer, with each monomer containing an N- 
5 terminal NAD binding site. In insects, it acts in conjunction with a mitochondrial alpha- 
glycerophosphate oxidase in the alpha-glycerophosphate cycle, which is essential for the 
production of energy used in insect flight. 

Glyceraldehyde-3-phosphate dehydrogenase (EC 1.2.2.12) (GAPDH) mRNA levels, 
protein, and enzymatic activity increase in 3T3-F442A adipocytes after exposure to 

10 physiological concentrations of insulin (Alexander, M., Curtis, G., Avruch, J., and Goodman, 
H. (1985) J. Biol. Chem. 260, 1 1978-1 1985). In order to understand the mechanism of this 
regulation, researchers isolated and sequenced 5.4 kilobase pairs of a 12-kilobase pair human 
genomic clone encoding a functional GAPDH gene. The gene consists of 9 exons and 8 
introns with eukaryotic signals necessary for the transcription and translation of GAPDH 

15 mRNA. The exon sequence confirms previously published cDNA sequences for human 

GAPDH in muscle, liver, and erythrocytes. The organization of the human and the unique 
chicken GAPDH genes is strikingly similar. Although chicken exons VIII-XI have been fused 
into human exon 8, introns which separate exons encoding the NAD binding, catalytic, and 
helical domains of the GAPDH protein have been retained. Stable transfection of rodent cells 

20 with the intact human GAPDH gene resulted in the expression of a correctly initiated human 
GAPDH mRNA and an enzymatically active human GAPDH polypeptide. Thus, the gene 
contains a functional promoter and intact coding sequences. Although many processed 
GAPDH pseudogenes and GAPDH-like sequences are present in the human genome, Southern 
blot analysis of human genomic DNA using a probe derived from the 3 '-untranslated region of 

25 the GAPDH gene detected only two genes, a 1 0-copy processed pseudogene and a single copy 
of the isolated gene. In contrast, a probe derived from an intron segment of the isolated gene 
detected only a single copy of the GAPDH gene. Collectively, these findings strongly suggest 
that the human genome encodes a single functional GAPDH gene. 
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Hopkinson et al. (1974) presented evidence that glycerol-3-phosphate dehydrogenase 
(EC 1 .1 .1 .8) is a dimer of dissimilar subunits. Electrophoretic variants at each of two loci, 
designated GPD1 and GPD2, were described. By the method of somatic cell hybridization, 
Kielty and Povey (1982) assigned the presumed structural gene for alpha-glycerophosphate 
dehydrogenase to chromosome 12. Since this is a liver-specific enzyme, a rat hepatoma cell 
line was used as one of the 'parents 1 in the hybridization. 

NOV88 is predicted to be expressed in at least the following tissues: liver. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, literature 
sources, and/or RACE sources. Further expression data for NOV88 is provided in Example 2. 

The NOV88 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, cirrhosis, transplantation as well as other diseases, disorders 
and conditions. NOV88 nucleic acids encoding the Glycerol-3-Phosphate Dehydrogenase-like 
protein of the invention, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a Glycerol-3-Phosphate 
Dehydrogenase-like protein includes the nucleic acid whose sequence is provided in Table 
88A, or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 88A while still 
encoding a protein that maintains its Glycerol-3-Phosphate Dehydrogenase-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to the sequence of Table 88A, including 
nucleic acid fragments that are complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 13% of the bases 
may be so changed. 

The novel protein of the invention includes the Glycerol-3-Phosphate Dehydrogenase- 
like protein whose sequence is provided in Table 88B. The invention also includes a mutant or 
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variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 88B while still encoding a protein that maintains its Glycerol-3-Phosphate 
Dehydrogenase-like activities and physiological functions, or a functional fragment thereof In 
the mutant or variant protein, up to about 1 8% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV89 

The disclosed NOV89 (alternatively referred to herein as CG56396-01) includes the 
1221 nucleotide sequence (SEQ ID NO:287) shown in Table 89A. A NOV89 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 25-27 and ends with a stop codon 
at nucleotides 1033-1035. The disclosed NOV89 maps to human chromosome 6. 



20 



Table 89 A. NOV89 Nucleotide Sequence (SEQ ID NO:287) 



CTGCATCTTCTCGTGCATTGCCAGCTGCATCCCTGAGACACCATGGTGAAGGTGAAGGCTGGAGTCAACA 
GATTTGGTTGTATTGGCTGCCTGGTCACCAGGGCTGCTTTAAACTCTGGTTTAGTCGATATTGTCGCCAT 
CAATGACCCCTTCATTGACCTCAACAACACTGTCTACATGTTCCAGTATAATTCCGCCCATGGCAAATTC 
CACGGCACCGTCAAGGCTGAGAACGGGAAGCTTGTTATCAATGGAAATCTCATCACTATTTTCCAGGGGC 
AAGATCTCACCAAAATCAAATGGGGCAATGCTGGCACTGAGTACATCATGGAGTTCACCAGCATCTTCAC 
CACCATGGAGAAGGCTGGGGCTCACTTGGAGGGAGGAGCCAAAACGGTCATCATCTCTGCACCCTCTGCT 
GATGCCCCCATGTTCGTGATGGGTGTGAACCATGAGAAATATGACAACAGCTCAAGATTACTCAAGATTA 
TCAGCAATGCCTCCTGCACCACCAGCTGCTTAACGCCCCTGGCCAAGGTCATCCATGACAACTTTGGTAC 
CGTGGAAGGACTCATGACCATCGCTGCCACCCAGAAGACTATGGATGGCTCCTATGGGAAACTGTGGGGT 
GACGGCCATGGGGCTCTCCAGAACATCCTCTCTGCCTCTACTGGTGCTGCCAAGGCTGTGAGGAAGGTCA 
TCCCTGAGCTAAACGGGAAGCTCACTGGCATGGCCTTCCGTGTCCCCACTGCCAACATGTCAGTGGTGGA 
CCTGACCTGCCGTCTGGAAAAACCTACCAAATATGATGACACCAAGAAGGTGGTGAAGCAGGCGTCAGAG 
GACCCCCTCAAAGGCATCCTGGGCTACTCTGAGCACCAGGTGGTCTCCTCCAACTTCAACTCAACAGACA 
CCCACTCTTCCACCTTCGATGCTGGGGCTGGCATTGCCCTCAACGACCACTTTGTCAAGCTCATTTCCTG 
GT ATG AC AATG AATTTGGC TGC AG C AAC AGGGTGGTGG AC CT CT G CC C AC AGTGTGGCT TC C AAGGAGT A 
AGACCCCCAGACCACCAGCCCCAGCGACAGCACGACGGGAAGAGAGCGGCCCTCACTGCTGGAGAGTCCC 
TGCCACACTCAGTCTCCCACCACACTGAGAATCTCCCCTCCTCATAGTTTCCATGCAGACCCCCTAAAAG 

GGAGGAGCCGAGGGAGCCCCACCTTTTCATG , . _ 



A NOV89 polypeptide (SEQ ID NO:288) encoded by SEQ ID NO:287 is 374 amino 
acids in length and is presented using the one-letter amino acid code in Table 89B. The Psort 
profile for NOV89 predicts that this sequence is likely to be localized, to the endoplasmic 
reticulum (membrane) with a certainty of 0.5500, or to lysosomes with a certainty of 0.2630. 



Table 89B. NOV89 Polypeptide Sequence (SEQ ID 
NO:288) 



MVKVKAGVNRFGCIGCLVTRAAIjNSGLVDIVAINDPFIDLNNTVYMFQYNSAHGKFHGTV 
KAENGKLVINGNLITIFQGQDLTKIKWGNAGTEYIMEFTSIFTTMEKAGAHLEGGAKTVI 
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ISAPSADAPMFVMGVNHEKYDNSSRLLKIISNASCTTSCLTPLAKVIHDNFGTVEGLMTI 
AATQKTMDGSYGKLWGDGHGALQNILSASTGAAKAVRKVIPELNGKLTGMAFRVPTANMS 
WDLTCRLEKPTKYDDTKKWKQASEDPLKGILGYSEHQWSSNFNS TDTHSSTFDAGAG 
I ALNDH F VKL I S W YDNE FGC S NRWDLC P QC GFQG VRP P DHQ P QRQHDGKRAALT AGE S L 
PHSVSHHTENLPSS 



A BLAST analysis of NOV89 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV89 had high 
homology to other proteins as shown in Table 89C. 

5 



Table 89C- BLASTX results from PatP database for NOV89 








Smallest 








Sum 






High 


Probability 


Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


patp:AAY07 03 6 


Breast cancer associated antigen precursor 


1393 


3 . Oe-142 


patp:AAY053 68 


Human HCMV inducible gene protein 


1393 


3 . Oe-142 


patp:AAG64 817 


Human G3PDH fragment - Homo sapiens, 221 aa. 


1377 


1.5e-140 


patp:AAE043 73 


Mouse cancer associated antigen OY-MC-2 


1315 


5 . 6e-134 


patp: AAR12995 


GAP - DH - Aspergillus oryzae (ATCC 42149 


984 


6 . 6e-99 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 1066 of 1225 bases (87%) identical to a gb:GENBANK- 
ID:AF261085|acc:AF261085.1 mRNA from Homo sapiens (glyceraldehyde-3 -phosphate 

10 dehydrogenase (GADPH) mRNA). The full amino acid sequence of the protein of the 

invention was found to have 279 of 327 amino acid residues (85%) identical to, and 293 of 
327 amino acid residues (89%) similar to, the 335 amino acid residue ptnnTREMBLNEW- 
ACC:AAG01996 protein from Homo sapiens (Human) (CLONE CDABP0047 MRNA 
SEQUENCE). NOV89 also has homology to the other proteins shown in the BLASTP data in 

15 Table 89D. 



Table 89D. NOV89 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi | 7669492 | ref | 
NP002037 .2 | 
(NM__002046) 


glyceraldehyde-3 - 
phosphate 
dehydrogenas e 
[Homo sapiens] 


335 


279/331 
(84) 


293/331 
(88) 


e-153 


gi | 31645 | emb ) CA 
A25833 . 1 j 
(X01677) 


glyceraldehyde-3 - 
phosphate 
dehydrogenase 
[Homo sapi ens] 


335 


278/331 
(83) 


293/331 
(87) 


e-152 


gi | 24 07184 | gb J A 
AB94053 . 1 | 
(AF017079) 


glyceraldehyde 3- 
phosphate 
dehydrogenase 
[Sus scrofa] 


333 


274/328 
(83) 


289/328 
(87) 


e-149 
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gi | 6 983 84 9 | db j | 
BAA90818 . 1 j 
(AB038241) 


glyceraldehyde - 3 - 
phosphate 
dehydrogenase 
[Felis catus] 


333 


268/328 
(81) 


283/328 
(85) 


e-146 


gi | 2506441 | sp | P 
00355 |G3P_PIG 


GLYCERALDEHYDE 3- 
PHOSPHATE 

DEHYDROGENASE (GAPDH) 


333 


267/328 
(81) 


283/328 
(85) 


e-146 



This BLASTP data is displayed graphically in the ClustalW in Table 89E. A multiple 
sequence alignment is given, with the NOV89 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 89D. 



Table 89E. ClustalW Alignment of NOV89 



NOV8 9 



gi | 7669492 | 
gi | 31645 | 
gi | 2407184 | 
gi | 6983849 j 
gi | 2506441 j 



NOV8 9 

gi | 7669492 | 
gi | 31645 | 
gi j 2407184 | 
gi | 6983849 j 
gi|250644l| 



NOV8 9 

gi 
gi 
gi 
gi 

gi 



7669492 | 
31645 | 
2407184 | 
6983849 | 
2506441 | 



(SEQ ID NO: 28 8) 

(SEQ ID NO: 744) 

(SEQ ID NO: 74 5) 

(SEQ ID NO: 74 6) 

(SEQ ID NO: 74 7) 

{SEQ ID NO: 74 8) 



NOV8 9 


MVK 


gi | 7669492 | 


MGK 


gi | 31645 | 


MGK 


gi | 2407184 | 




gi | 6983849 | 


--M 


gi|250644l| 





NOV8 9 

gi | 7669492 | 
gi j 31645 | 
gi | 2407184 | 
gi | 6983849 j 
gi | 2506441 j 
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20 



30 
. I . - 



I 



■ „ r VTRAAHNSG||VDIVAINDPFIDLNiS|iVYiyiFQY[_ 
VKVGVNGFGRIGRLVTRAAFNSGKVDIVAINDPFIDLNYMVYMFQYD 
VKVGVNGFGRIGRLVTRAAFNSGKVDIVAINDPFIDLNYMVYMFQYD 
VKVGVNGFGRIGRLVTRAAFNSGKVDIVAINDPFIDL^YMVYMFQYD 
IVKVGVNGFGRIGRLVTRAAFNSGKVDIVAINDPFIDLNYMVYMFQYD 
jvKVGVNGFGRIGRLVTRAAFNSGKVDIVAINDPFIDLlSjYMVYMFQYD 



50 

t 



100 




I 



110 

. . I . . 



I 



120 
. . I . . 



I 



130 



140 



150 



_FTTMEKAGAHL1GGAK0VI I SAPS ADAPMFVMGVNHEKYDNS 
VFTTMEKAGAHlIgGAKRVI I SAPS ADAPMFVMGVNHEKYDNS 

vfttmekagahlIggakrviisapsadapmfvmgvnhekydns 
vfttmekagahliggakrvi i sapsadapmfvmgjjjnhekydns 

VFTTMEKAGAHLIGGAKRVI I SAPS ADAPMFVMGVNHEKYDNS 



cagahliSggakrvi i sapsadapm: 



160 



I 



170 



180 



I 



190 
■ J.. 




SNASCTTgCL0PLAKVI 
SNASCTTNCLAPLAKVI 
SNASCTTNCLAPLAKVI 
SNASCTTNCLAPLAKVI 
SNASCTTNCLAPLAKVI 
SNASCTTNCLAPLAKVI 



210 



HDNFG0VEGLMTHH I S ATQKT i DG a3 GKLW ' 

hdnfgiveglmttvhaitatqktvdgpsgklwr 
hdnfgiveglmttvhaitatqktvdgpsgklwr 
hdnfgiveglmttvhaitatqktvdgpsgklwr 
hd^fgiveglmttvhaitatqktvdgpsgklwr 
hdIfgiveglmttvhaitatqktvdgpsgklwr 



220 



230 



240 



250 



716 




DGSjGALQNI^ASTGAAKAViSKVIPELNGKLTGMAFRVPTANUjsVVDLTC 
DGRGALQNI I PASTGAAKAVGKV I PELNGKLTGMAFRVPTANVSWDLTC 
DGRGALQNI I PASTGAAKAVGKVI PElSgKLTGMAFRVPTANVS WDLTC 
DGRGALQNI I PASTGAAKAVGKVI PELNGKLTGMAFRVPTANVSVVDLTC 
DGRGA0QNI I PAS T G AAKAVG K V I P E LNG KL TGMAF R V P tSnVS WD LTC 
DGRGAgQNI I PASTGAAKAVGKVI PELNGKLTGMAFRVPTANVSVVDLTC 



rlekpgkyddykkwkqasegplkgilgygehqvvss^fnshdthsstfd 
rlekpakyddikkwkqasegplkgilgytehqvvssdfnsBdthsstfd 
rlekpakyddikkwkqasegplkgilgytehqvvssdfnsBdthsstfd 
rlekpakyddikkwkqasegplkgilgytehqvvssdfnsbdthsstfd 
rlek^akyddikkwkqasegplkgilgyteSqvvsHdfnsbdthsstfd 
rlekpakyddikkwkqasegplkgilgyteBqvvsSdfnsBdthsstfd 



NOV8 9 

gi I 7669492 | 
gi I 31645 | 
gi | 2407184 | 
gi j 6983849 | 
gi j 2506441 | 



NOV8 9 

gi | 7669492 | 
gi | 31645 | 
gi | 2407184 | 
gi | 6983849 j 
gi |2506441 j 



NOV8 9 

gi | 7669492 | 
gi j 31645 | 
gi j 2407184 | 
gi | 6983849 j 
gi | 2506441 | 



NOV8 9 

gi | 7669492 | 
gi|31645| 
gi | 2407184 | 
gi j 6983849 j 
gi |2506441 | 



310 



320 



330 



340 



350 




I I.... I.... I 

|CPQCGF_^3VRPPDHQPQRQ 



hmaske 

HMASKE 
HMASKE 
HMASKE 



360 



370 



HDGKRAALTAGESLPHSVSHHTENLPSS 



Table 89F lists the domain description from DOMAIN analysis results against NOV89. 
This indicates that the NOV89 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 89F. Domain Analysis of NOV89 



gnl|Pfamipfam02800 , gpdhC, Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a 

tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed 

alpha/antiparallel beta fold. SEQ ID NO : 881 

CD-Length = 163 residues, 100.0% aligned 
Score = 209 bits (531), Expect = 3e-55 



NOV89 : 156 TT S CLTPL AKVI HDNFGTVEGLMT I AATQKTMDGS YGKLWGDGHGALQNI LSASTG 211 

TT+CL PLAKV+ +DNFG +GLMT A QK +DG + K G A NI + STG 

Sbjct: 1 TTNCLAPLAKVLNDNFGIEKGLMTTVHAYTADQKLVDGPHHKDLRRGRAAAPNIIPTSTG 60 

NOV89: 212 AAKAVRKVIPELNGKLTGMAFRVPTANMSWDLTCRLEKPTKYDDTKKWKQASEDP-LK 270 

AAKAV V+PELNGKLTGMAFRVPT N+SWDLT LEKP ++ +K+A+E P LK 

Sbjct: 61 AAKAVGLVLPELNGKLTGMAFRVPTPNVSWDLTVELEKPVTVEEINAALKEAAEGPALK 120 

NOV89: 271 GILGYSEHQWSSNFNSTDTHSSTFDAGAGIALNDHFVKLISWY 314 

GILGY+E +VSS+F D HSS FDA A I LND+FVKL++WY 
Sbjct: 121 GI LGYTEDPLVS SDFIG - DPHS S I FDAKATI VLNDNFVKLVAWY 163 
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NAD-dependent glycerol-3-phosphate dehydrogenase (EC 1.1.1.8) (GPD) catalyzes 
the reversible reduction of dihydroxyacetone phosphate to glycerol-3-phosphate. It is a 
cytoplasmic protein that is active as a homodimer, each monomer containing an N-terminal 
NAD binding site. In insects, it acts in conjunction with a mitochondrial alpha- 
glycerophosphate oxidase in the alpha-glycerophosphate cycle, which is essential for the 
production of energy used in insect flight. 

Glyceraldehyde-3-phosphate dehydrogenase (EC 1.2.2.12) (GAPDH) mRNA levels, 
protein, and enzymatic activity increase in 3T3-F442A adipocytes after exposure to 
physiological concentrations of insulin (Alexander, M., Curtis, G., Avruch, J., and Goodman, 
H. (1985) J. Biol. Chem. 260, 1 1978-1 1985). In order to understand the mechanism of this 
regulation, researchers isolated and sequenced 5.4 kilobase pairs of a 12-kilobase pair human 
genomic clone encoding a functional GAPDH gene. The gene consists of 9 exons and 8 
introns with eukaryotic signals necessary for the transcription and translation of GAPDH 
mRNA. The exon sequence confirms previously published cDNA sequences for human 
GAPDH in muscle, liver, and erythrocytes. The organization of the human and the unique 
chicken GAPDH genes is strikingly similar. Although chicken exons VI1I-X1 have been fused 
into human exon 8, introns which separate exons encoding the NAD binding, catalytic, and 
helical domains of the GAPDH protein have been retained. Stable transfection of rodent cells 
with the intact human GAPDH gene resulted in the expression of a correctly initiated human 
GAPDH mRNA and an enzymatically active human GAPDH polypeptide. Thus, the gene 
contains a functional promoter and intact coding sequences. Although many processed 
GAPDH pseudogenes and GAPDH-like sequences are present in the human genome, Southern 
blot analysis of human genomic DNA using a probe derived from the 3 '-untranslated region of 
the GAPDH gene detected only two genes, a 10-copy processed pseudogene and a single copy 
of the isolated gene. In contrast, a probe derived from an intron segment of the isolated gene 
detected only a single copy of the GAPDH gene. Collectively, these findings strongly suggest 
that the human genome encodes a single functional GAPDH gene. 

Hopkinson et al. (1974) presented evidence that glycerol-3-phosphate dehydrogenase 
(EC 1.1.1.8) is a dimer of dissimilar subunits. Electrophoretic variants at each of two loci, 
designated GPD1 and GPD2, were described. By the method of somatic cell hybridization, 
Kielty and Povey (1982) assigned the presumed structural gene for alpha-glycerophosphate 
dehydrogenase to chromosome 12. Since this is a liver-specific enzyme, a rat hepatoma cell 
line was used as one of the 'parents 1 in the hybridization. 
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NOV89 is predicted to be expressed in at least the following tissues: liver. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, literature 
sources, and/or RACE sources. Further expression data for NOV89 is provided in Example 2. 
5 The NOV89 nucleic acids and proteins are useful in potential therapeutic applications 

implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, cirrhosis, transplantation as well as other diseases, disorders 
and conditions. NOV89 nucleic acids encoding the Glycerol-3-Phosphate Dehydrogenase-like 
protein of the invention, or fragments thereof, may further be useful in diagnostic applications, 

1 0 wherein the presence or amount of the nucleic acid or the protein are to be assessed. 

. The novel nucleic acid of the invention encoding a Glycerol-3-Phosphate 
Dehydrogenase-like protein includes the nucleic acid whose sequence is provided in Table 
89A, or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 89A while still 

1 5 encoding a protein that maintains its Glycerol -3 -Phosphate Dehydrogenase-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to the sequence of Table 89A, including 
nucleic acid fragments that are complementary to any of the nucleic acids just described. The 
invention additionally includes nucleic acids or nucleic acid fragments, or complements 

20 thereto, whose structures include chemical modifications. Such modifications include, by way 
of non-limiting example, modified bases, and nucleic acids whose sugar phosphate backbones 
are modified or derivatized. These modifications are carried out at least in part to enhance the 
chemical stability of the modified nucleic acid, such that they may be used, for example, as 
antisense binding nucleic acids in therapeutic applications in a subject. In the mutant or variant 

25 nucleic acids, and their complements, up to about 13% of the bases may be so changed. 

The novel protein of the invention includes the Glycerol-3-Phosphate Dehydrogenase- 
like protein whose sequence is provided in Table 89B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 89B while still encoding a protein that maintains its Glycerol-3-Phosphate 

30 Dehydrogenase-like activities and physiological functions, or a functional fragment thereof. In 
the mutant or variant protein, up to about 1 5% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
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prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV90 

5 The disclosed NOV90 (alternatively referred to herein as CG56888-01 ) includes the 

1686 nucleotide sequence (SEQ ID NO:289) shown in Table 90A. A NOV90 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 58-60 and ends with a stop codon 
at nucleotides 1630-1632. The disclosed NOV90 maps to human chromosome 3. 



Table 90A. NOV90 Nucleotide Sequence (SEQ ID NO:289) 

GTGCCTCACTGGGGTGGGGAACTTGCCTCACCTGGGGCCATTTCATATTTCTGAATCATGTGTGATAACA 
GAGAACTGGAAGACAAGCCTCCAGCACCTCCCGTGCGAATGAGCAGGACCATCTTTAGCACTGGGGGCAA 
AGACCCTTTGTCAGCCAATCACAGTTTGAAACCTTTGCCTTCTGTTCCAGAGGAGAAAAAGCCCAGGCAT 
AAAATC AT CT CC AT ATT CTC AGG C AC AG AGAAAGGAAGT AAAAAGAAAG AAAAGGAACGGC C AGAAATTT 
CTCCTCCATCTGATTTTGAACACACCATCCATGTTGGCTTTGATGCTGTTACTGGAGAATTCACTGGCAT 
GCCAGAACAGTGGGCTCGATTACTACAGACCTCCAATATCACCAAACTACAGCAAAAGAAGAATCCTCAG 
GCTGTGCTGGATGTCTACGACTCCAACACAGTGAAGCAGAAGTATCTGAGTTTTACTCCTCCTGAGAAAG 
ATGGCTTCCCTTCTGGAACACCAGCACTGAATGCCGAGGGAACAGAAGCACCTGCAGTAGTGACAGAGGA 
GGAGGACGATGATGAAGAGACTGCCCCTCCCATTATTGCCCCACCACCGGATCATATGAAATCAATTTAC 
AC ACGGT CTGT AATTGACCCTGTT C CTG C AC C AGTT GGTGATT C AAATGTTG ATGGTGGTGC C AAGT CTT 
TAGACAAACAGAAAAAGAAGACTAAGATGACAGATGAAGAGATTATGGAGAAACTAAGAACTATTGTGAG 
CATAGGTGACCCTAAGAAAAAAAGAAAAAAATATACAAGATATGAAAAAATTGGACAAGGGGCTTCTGGT 
ACAGTTTTCACTGCTACTGACGTTGCACTGGGACAGAAGGTTGCTATCAAACAAATTAATTTACAGAAAC 
AGCCAAAGAAGGAATTGATCATTAATGAGATTCTGGTAATGAAAGAATTAAAAAATCCCAACATAGTTAA 
CTTCTTGGACAGTTACCTGGTAGGAGATGAATTGTTTGTGGTCGTGGAATACCTTGCTAGGGGGTCACTC 
ACTGATGTGGTAACAGAAACCTGCATGGATGAAGCACAGATTGCCGCTGTATGCAGAGAGAGTTTACAGG 
CATTGGAGTTTTTACATGCTAATCAAGTGATCCACAGAGACATCAAAAGTGACAGTGTACTTTTGGGAAT 
GGAAGGATCGGTTAAGCTCACTGACTTTGGTTTCTGTGCCCAGATCACCCCTGAGCAGAGCAAACGCAGT 
ACCGTGGTCAGAACGCCATACTGGATGGCACCAGAAGTGGTTACACGGAAGGCTTATGGCCCTAAAGTCA 
ATGTATGGTCTCTGGGTATCATGGCTACTGAGATGGTAGAAGGAGAGCCTCCATACCTCAATGAAAATCC 
CTTGAGGGCCTTGTGCCTAATAGCAACTAATGGAATCCCAGAACTTCAGAATCCAGAGACACTTTCCCCA 
ATATTTCGGGATTTCTTAAATCGATGTTTGGAAACAGATGTGGAAAAAAGGGGTTCAGCCAAAGAATTAT 
TACAGCATCTTTTCCTGAAACTAGCCAAACTGTTATCTAGCTTGACACCACTGATCATGGCAGCTAAAGA 
AGCAATGAAGAGTAACCGTTAACATCACTGCTGTGGCCTCATATTCTTTTTTCCATTTTCTACAAGAAGC 
CTTTTA 

10 

A NOV90 polypeptide (SEQ ID NO:290) encoded by SEQ ID NO:289 is 524 amino 
acids in length and is presented using the one-letter amino acid code in Table 90B. The Psort 
profile for NOV90 predicts that this sequence is likely to be localized to the nucleus with a 
certainty of 0.7000. 

15 

Table 90B. NOV90 Polypeptide Sequence (SEQ ID 
NO.290) 

MCDNRELEDKPPAPPVRMSRTIFSTGGKDPIjSANHSLKPLPSVPEEKKPRHKI ISIFSGT 
EKGSKKKEKERPEISPPSDFEHTIHVGFDAVTGEFTGMPEQWARLLQTSNITKLQQKKNP 
QAVLDVYDSNTVKQKYLSFTPPEKDGFPSGTPALNAEGTEAPAWTEEEDDDEETAPPII 
APPPDHMKS I YTRSVIDPVPAPVGDSNVDGGAKSLDKQKKKTKMTDEEIMEKLRTIVS IG 
D P KKKRKK YT RYE K I GQG ASGT V FT ATD VAL GQ KVAI KQ INLQKQPKKEL I INE I LVMKE 
LKNPNIVNFLDSYLVGDELFWVEYLARGSLTDWTETCMDEAQIAAVCRESLQALEFLH 
ANQVIHRDIKSDSVLLGMEGSVKLTDFGFCAQITPEQSKRSTWRTPYWMAPEWTRKAY 
GPKVNVWSLGIMATEMVEGEPPYLNENPLRALCLIATNGIPELQNPETLSPIFRDFLNRC 
LETDVEKRGSAKELLQHLFLKLAKLLSSLTPLIMAAKEAMKSNR 
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A BLAST analysis of NOV90 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV90 had high 
homology to other proteins as shown in Table 90C. 



Table 90C. BLASTX results from PatP database for NOV90 



Smallest 
Sum 

High Probability 



Sequences producing High-scoring Segment Pairs: Score P (N) 

patp:AAB03969 p-21 activated protein kinase (PAK2) 2494 6.5e-259 
patp:AAW95521 Protease activated protein kinase I (PAK I) 2459 3.3e-255 
patp:AAR96296 Human p21-protein activated serine kinase 2361 8.0e-245 
patp:AAWl3379 Human p21 activated serine kinase p65 protein 2361 8.0e-245 
patp:AAW47119 Human p21-activated serine kinase p65 2361 8.0e-245 



In a search of public sequence databases, it was found, for example, that the amino 
acid sequence of the NOV90 protein of the present invention was found to have 503 of 524 
amino acid residues (95%) similar to, the 524 amino acid residue ptnr:SWISSNEW- 
ACC:Q13177 protein from Homo sapiens (Human) (SER1NE/THREONINE-PROTEIN 
KINASE PAK 2 (EC 2.7.1.-) (P21 -ACTIVATED KINASE 2) (PAK-2) (PAK65) (GAMMA- 
PAK) (S6/H4 KINASE)). NOV90 also has homology to the other proteins shown in the 
BLASTP data in Table 90D. 



Table 90D. NOV90 BLASTP results 






Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi | 3041712 | sp 
| Q13177 | PAK2_ 
HUMAN 


SERINE /THREONINE - PROTEIN 
KINASE PAK 2 (P21- 
ACTIVATED KINASE 2) (PAK- 
2) (PAK65) (GAMMA-PAK) 
(S6/H4 KINASE) ' 


524 


493/527 
(93) 


503/527 
(94) 


0.0 


gi |4505599 | re 
f|NP 002568.1 
| (NM_002577) 


p21 (CDKN1A) -activated 
kinase 2; novel serine 
kinase; hPAK6 5 
[Hozno sapiens] 


525 


493/528 
(93) 


503/528 
(94) 


0.0 


gi | 16758002 | r 
ef |NP_445758 . 
1 | (NM 053306) 


p21 (CDKN1A) -activated 
kinase 2 

[Rattus norvegicus] 


524 


483/527 
(91) 


498/527 
(93) 


o.o ! 


gi | 2499647 | sp 
|Q29502|PAK2_ 
RABIT 


SERINE /THREONINE -PROTEIN 

KINASE PAK 2 (P21- 

ACTIVATED KINASE 2) 
(PAK-2) (GAMMA-PAK) 
(P21 -ACTIVATED PROTEIN 

KINASE I) (PAKI) 


524 


484/527 
(91) 


498/527 
(93) 


0.0 


gi | 6288680 | gb 
|AAF06695.1|U 

19967 1 
(U19967) 


PAK 2 [Rattus norvegicus] 


524 


481/527 
(91) 


498/527 
(94) 


.0.0 
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This BLASTP data is displayed graphically in the ClustalW in Table 90E. A multiple 
sequence alignment is given, with the NOV90 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 90D. 



Table 90E. ClustalW Alignment of NOV90 




60 
. I - ■ 



NOV9 0 (SEQ ID NO: 2 90) 

gi | 3041712 | (SEQ ID NO:749) 

gi|4505599| (SEQ ID NO:750) 

gi|l6758002| (SEQ ID NO:751) 

gi|2499647| (SEQ IDNO:752) 

gi|6288680| (SEQ ID NO:753) 



NOV9 0 

gi | 3041712 | 
gi | 4505599 | 
gi ] 16758002 | 
gi j 2499647 | 
gi | 6288680 j 



NOV9 0 

gi | 3041712 | 
gi |4505599 j 
gi | 16758002 | 
gi j 2499647 | 
gi | 6288680 | 



NOV9 0 

gi | 3041712 | 
gi | 4505599 j 
gi | 16758002 | 
gi | 2499647 | 
gi | 6288680 j 



NOV9 0 

gi | 3041712 | 
gi | 4505599 j 
gi | 16758002 | 
gi | 2499647 | 
gi j 6288680 j 



NOV9 0 

gi | 3041712 | 
gi | 4505599 j 
gi | 16758002 J 
gi | 2499647 | 
gi j 6288680 j 



I 



70 
• I - - 



80 
■ I - • 



90 
. I . . 



100 



L 



KIISIFSGTEKGSKKKEKERPEISPPSDFEHTIHVGFDAVTGEFTGMPE 
§KIISIFSGTEKGSKKKEKERPEISPPSDFEHTIHVGFDAVTGEFTGMPE 
iKIISIFSGTEKGSKKKEKERPEISPPSDFEHTIHVGFDAVTGEFTGMPE 
IKIISIFS@TEKGSKKKEKERPEISPPSDFEHTIHVGFDAVTGEFTGMPE 

ikiisifsgtekgskkkekerpeisppsdfehtihvgfdavtgeftgmpe 
kiisifsStekgskkkekerpeisppsdfeStihvgfdavtgeftgmpe 



I 



110 



I 



120 
. . I - - 



130 



I 



I 



140 
. . I . . 



]_ 



150 



qwarllqtsni 
qwarllqtsni 
qwarllqtsni 
qwarllqtsni 
qwarllqtsni 
qwarllqtsni 



tkl|qkknpqavldv^Bydsntvkqkylsftppekdgf 

tkleqkknpqavldvlkfydsntvkqkylsftppekdgI 
tkleqkknpqavldvlkfydsntvkqkylsftppekdgj 
tkl eqkknpqavld vlkfydsntvkqkyls ftp pe kdgf 
tkleqkknpqavldvlkfydsntvkqkylsftppekdgf 
tkleqkknpqavldvlkfydsntvkqkylsftppekdgf 



VVTEE»lDDDEEj 
VVTEEjgDDDEE 
VVTEEsgDDDEE 

vvteeIddde^ 
vvtee|dddeej 
.vvteeSdddeIb 




180 

. . I . . 



190 



I 



_L 



200 



§APP||IAP[|PDHi5jKSIYTRSVID 
PPVIAPRPDHTKSIYTRSVID 
PPVIAPRPDHTKSIYTRSVID 
jAPPVIAPRPDHTKSIYTRSVID 
JAPPVIAPRPDHTKSIYTRSVID 

Sappviaprpdhtksiytrsvid 



230 

■ ■I.. 



kmtdeeimek: 
kmtdeeimek: 
kmtdeeimek 
kmtdeeimek 
kmtdeeimek 
kmtdeeimek 



240 


250 






e 


i 










RK 


lli' 




i 












$ 1 ii 




e 
* 


* 










is 




e 








i 








e 






i 




? 



280 



290 



300 
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NOV90 

gi | 3041712 | 
gi j 4505599 j 
gi | 16758002 | 
gi | 2499647 | 
gi | 6288680 j 



NOV9 0 

gi | 3041712 | 
gi|4505599| 
gi | 16758002 | 
gi j 2499647 | 
gi | 6288680 | 



NOV9 0 

gi | 3041712 | 
gi | 4505599 j 
gi j 16758002 | 
gi j 2499647 | 
gi j 6288680 j 



NOV9 0 

gi | 3041712 | 
gi | 4505599 | 
gi | 16758002 | 
gi |2499647 | 
gi j 6288680 | 



NOV9 0 

gi | 3041712 | 
gi [4505599 | 
gi j 16758002 | 
gi | 2499647 | 
gi | 6288680 | 



NOV9 0 

gi | 3041712 | 
gi (4505599 j 
gi j 16758002 | 
gi [2499647 | 
gi | 6288680 j 




I 



310 
■ - I . • 



320 



330 



340 



350 



I 



MKELKNPNIVNFLDSYLVGDELFWjBEYLASGSLTDWTET 
MKELKNPNIVNFLDSYLVGDELFWMEYLAGGSLTDWTET 
MKELKNPNIVNFLDSYLVGDELFWMEYLAGGSLTDVVTET 
MKELKNPNIVNFLDSYLVGDELFWMEYLAGGSLTDWTET 
MKELKNPNIVNFLDSYLVGDELFWMEYLAGGSLTDVVTET 
MKELKNPNIVNFLDSYLVGDELFWMEYLAGGSLTDWTET 





| 



390 
. . I . . 



400 



EGSVKLTDFGFCAQITPEj 
EGSVKLTDFGFCAQITPE 
EGSVKLTDFGFCAQITPE 
EGSVKLTDFGFCAQITPE 
EGSVKLTDFGFCAQITPE 
EGSVKLTDFGFCAQITPE 



I 



410 
. . I . . 



I 



420 
. - I . - 



I 



430 

1 



440 
..I.. 



450 

:_J_ 



QSKRSTgvjgTl>YWMAPEVVTRKAYGPKViSiWSLGIMAaEMVEGEPPYLNE 
QSKRSTMVGTPYWMAPEWTRKAYGPKVDIWSLGIMAIEMVEGEPPYLNE 
QSKRSTMVGTPYWMAPEWTRKAYGPKVDIWSLGIMAIEMVEGEPPYLNE 
QSKRSTMVGTPYWMAPEWTRKAYGPKVDIWSLGIMAIEMVEGEPPYLNE 
QSKRSTMVGTPYWMAPEWTRKAYGPKVDIWSLGIMAIEMVEGEPPYLNE 
QSKRSTMVGTPYWMAPEWTR^AYGPKVDIWSLGIMAIEMVEGEPPYLNE 



460 



470 



480 




490 



500 



I 



DVEKRGSAKELLQ 
DVEKRGSAKELLQ 
DVEKRGSAKELLQ 
DVEKRGSAKELLQ 
DVEKRGSAKELLQ 
DVEKRGSAKELLQ 



I 



510 
. . I 



I 



520 



HjjFLKLAKJjLSSLTPLIMAAKEAMKSNR 

hpflklakplssltplimaakeamksnr 
hpflklakplssltplimaakeamksnr 
hpflklakplssltpliSJaakeamksnr 
hpflklakplssltplimaakeamksnr 
!hpflklakplssltplii!aakeamksnr 



Table 90F lists the domain description from DOMAIN analysis results against NOV90. 
This indicates that the NOV90 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 90F. Domain Analysis of NOV90 



gnllSmart|smart0022(K SJTKc, Serine/Threonine protein kinases, catalytic domain; Phosphotransferases. 
Serine or threonine-specific kinase subfamily. SEQ ID NO : 8 82 

CD-Length = 256 residues, 94.9% aligned ; 
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Score = 


251 bits (640), Expect - le-67 




NOV90 : 


255 


IGQGASGTVFTATDVALGQKVAIKQINLQKQPKK- -ELIINEILVMKELKNPNIVNFLDS 


312 






+G+GA G V+ A D G+ VAIK I +K KK E 1+ EI ++K+L +PNIV D 




Sbjct : 


7 


LGKGAFGKVYLARDKKTGKLVAIKVIKKEKLKKKKRERILREIKILKKLDHPNIVKLYDV 


66 


NOV90 : 


313 


YLVGDELFVWEYLARGSLTDWTET - CMDEAQ I AAVC RE S LQ ALE FLHANQV I HRD IKS 


371 






+ D+L++V+EY G L D++ + + E + R+ L ALE+LH+ +IHRD+K 




Sbjct : 


67 


FEDDDKLYLVMEYCEGGDLFDLLKKRGRLSEDEARFYARQILSALEYLHSQGIIHRDLKP 


126 


NOV90 : 


372 


DSVLLGMEGSVKLTDFGFCAQITPEQSKRSTVTOTPYWRAPEVWRKAYGPKVNVWSLGI 


431 






+++LL +G VKL DFG Q+ + +T V TP +MAPEV+ K YG V++WSLG+ 




Sbj ct : 


127 


EN I LLD S DGHVKLADFGLAKQIjDS GGT LLT T F VGT PE YMA P E VLLG KG YGKAVD I W S LG V 


186 


NOV90 : 


432 


MATEMVEGEPPYLNENPLRALCLIATNGIPELQNPE-TLSPIFRDFLNRCLETDVEKRGS 


490 






+ E+ + G+PP+ ++ L AL P PE +SP +D + + L D EKR + 




Sbj Ct : 


187 


ILYELLTGKPPFPGDDQLLALFKKIGKPPPPFPPPEWKISPEAKDLIKKLLVKDPEKRLT 


246 


NOV90 : 


491 


AKE 4 93 








A+E 




Sbjct: 


247 


AEE 24 9 





Serine/threonine kinases are an extensive family of enzymes that catalyzes the 
phosphorylation of serine or threonine residues on its target protein. Protein kinases share a 
conserved catalytic core common to both serine/ threonine and tyrosine protein kinases. This 
5 domain contains residues, which are specific to the distinct types of protein kinases 

The S6/H4 kinase purified from human placenta catalyzes phosphorylation of the S6 
ribosomal protein, histone H4, and myelin basic protein. In vitro activation of the p60 S6/H4 
kinase requires removal of an autoinhibitory domain by mild trypsin digestion and 
autophosphorylation of the catalytic domain (p40 S6/H4 kinase). The two 

10 autophosphorylation/autoactivation sites contain the sequences SSMVGTPY (site 1) and 

SVIDPVPAPVGDSHVDGAAK (site 2). These sequences identify S6H4 kinase as the rac- 
activated PAK65 (Martin, G. A., Bollag, G., McCormick, F. and Abo, A. (1995) EMBO J. 14, 
1971-1978). Site 1 phosphorylation is most rapid, but activation does not occur until site 2 is 
autophosphorylated. The site 1 phosphorylation occurs by an intramolecular mechanism 

15 whereas site 2 autophosphorylation occurs by an intermolecular mechanism. A model is 

proposed in which phosphorylation of sites 1 and 2 occurs sequentially. The model proposes 
that trypsin treatment of the inactive holoenzyme removes an inhibitory rac-binding domain 
which blocks MgATP access to the catalytic site. The pseudosubstrate domain at site 1 is 
autophosphorylated and subsequent bimolecular autophosphorylation at site 2 fully opens the 

20 catalytic site. Phosphorylation by a regulatory protein kinase may occur at site 2 in vivo. 

NOV90 is predicted to be expressed in at least the following tissues: brain, cerebellum, 
skeletal muscle, ovary, thymus and spleen. This information was derived by determining the 
tissue sources of the sequences that were included in the invention including but not limited to 
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SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
expression data for NOV90 is provided in Example 2. 

The NOV90 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Von 
5 Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 

hypercalcemia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 
10 immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 
conditions. NOV90 nucleic acids encoding the PAK2-like protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a serine/threonine-protein kinase 
15 PAK 2-like protein includes the nucleic acid whose sequence is provided in Table 90A, or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 90A while still encoding a 
protein that maintains its serine/threonine-protein kinase PAK 2-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 
20 nucleic acids whose sequences are complementary to the sequence of Table 90 A, including 
nucleic acid fragments that are complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
25 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 

The novel protein of the invention includes the serine/threonine-protein kinase PAK 2- 
like protein whose sequence is provided in Table 90B. The invention also includes a mutant or 
30 variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 90B while still encoding a protein that maintains its serine/threonine-protein kinase 
PAK 2-like activities and physiological functions, or a functional fragment thereof. In the 
mutant or variant protein, up to about 6% of the amino acid residues may be so changed. 
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These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 



NOV91 

The disclosed NOV91 (alternatively referred to herein as CG56779-01) includes the 
404 nucleotide sequence (SEQ ID NO:291) shown in Table 91 A. A NOV91 ORF begins with 
10 a Kozak consensus ATG initiation codon at nucleotides 25-27 and ends with a stop codon at 
nucleotides 373-375. The disclosed NOV91n maps to human chromosome 3. 



AATGGGTCTCTTTTTCTACAGCCCATGCCCTTCCTAGAGCTAGACACGAACTTGCCTGCCAACCAAGTGC 
CTGCAGGGTTGGAGAAATGGCTCTGCGCCACAGCCTCCATCCTGGGCAAACCCAAGGATCATGTGAACAT 
GATGGGTGTAGCGGGCCTGACCATGGTGCTGAGTAGGTCCACTGAGCCCTGGGCGCAGCTGTTCATCTCC 
TCCACCAGCATGATGGACACCACTGAGGAGAACCGCAGCCACAGCACCCACTTCTTCGAGTTCCTCACCG 
AGGAGCTGGCCCTGGGCCAGGACCAGATAATTTTCCACTTTTCCCCCCTGGAGCCCTGGCAGACTGGCAA 
GAAGGGGATGGTCATAACTTTTTAGTGACTGGCCTCGAGGGATCCAGGGCATCT 



A NOV91 polypeptide (SEQ ID NO:292) encoded by SEQ ID NO:291 is 1 16 amino 



15 acids in length and is presented using the one-letter amino acid code in Table 9 IB. The Psort 
profile for NOV91 predicts that this sequence has no signal peptide and is likely to be 
localized to microbodies with a certainty of 0.6400. In alternative embodiments, a NOV91 
polypeptide is located to the cytoplasm with a certainty of 0.4500. 



A BLAST analysis of NOV91 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV91 had high 
homology to other proteins as shown in Table 91C. 



below. 



Table 91 A. NOV91 Nucleotide Sequence (SEQ ID NO:291) 



Table 91B. NOV91 Polypeptide Sequence (SEQ ID 
NO:292) 



M P FLE LDTNL P ANQVP AGLE K WL CAT AS I LGKPKDHVNMMGVAGLTMVLSRSTEPWAQLF 
ISSTSMMDTTEENRSHSTHFFEFLTEEIALGQDQIIFHFSPLEPWQTGKKGMVITF 



Table 91C. BLASTX results from PatP database for NOV91 



Sequences producing High-scoring Segment: Pairs: 



Smallest 
Sum 

High Probab ability- 
Score P(N) 
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patp:AAR83 04 8 Human macrophage migration inhibitory factor 
patp : AAY44 9 97 Human D-dopachrome tautomerase (DDT) 
patp :AAB43733 Human cancer associated protein sequence 
patp:AAM22llO Peptide #8544 encoded by probe 
patp : AAM38563 Peptide #126 00 encoded by probe 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 283 of 289 bases (97%) identical to a gb:GENBANK- 
ID:AP000500|acc:AP000500.1 mRNA from Homo sapiens (genomic DNA, chromosome 
5 3p21 .3, clone:603 to 320, anti-oncogene region, section 3/3). The full amino acid sequence of 
the protein of the invention was found to have 82 of 1 17 amino acid residues (70%) identical 
to, and 91 of 1 17 amino acid residues (77%) similar to, the 1 18 amino acid residue ptnnpir- 
id:JE0162 protein from human (dopachrome Delta-isomerase (EC 5.3.3.12)). NOV91 also has 
homology to the other proteins shown in the BLASTP data in Table 9 ID. 

10 



Table 91D. NOV91 BLASTP results 




Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 

(%) 


Expect 


gi | 4503291 | re 
f |NP_001346 . 1 
| (NM 001355) 


D- dopachrome tautomerase 
[Homo sapiens] 


118 


82/117 
(70) 


91/117 
(77) 


le-38 


gi | 4699610 | pd 
b 1 1DPT 1 A 


Chain A, D-Dopachrome 
Tautomerase 


117 


81/116 
(69) 


90/116 
(76) 


5e-38 


gi 

r\ 


7512375 |pi 
G02438 


D- dopachrome tautomerase 
- human 


118 


80/117 
(68) 


89/117 
(75) 


7e-38 


gi 
ef 

1| 

(NT 


13162287 | r 
NP_077045 . 

\ 024131) 


D-dopachrome tautomerase 
[Rattus norvegicus] 


118 


71/117 
(60) 


94/117 
(79) 


6e-34 


gi | 6753618 | re 
f |NP_034157 . 1 
| (NM 010027) 


D-dopachrome tautomerase 
[Mus musculus] 


118 


67/117 
(57) 


92/117 
(78) 


2e-32 



This BLASTP data is displayed graphically in the ClustalW in Table 9 IE. A multiple 
sequence alignment is given, with the NOV91 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
15 Table 9 ID. 



399 6.5e-37 

369 9.8e-34 

262 2.1e-22 

183 5.0e-14 

183 5.0e-14 



Table 91E. ClustalW Alignment of NOV91 


NOV91 


(SEQ ID NO: 2 92) 




gi |450329l| 


(SEQ ID NO: 754) 




gi |4699610 j 


(SEQ ID NO: 755) 




gi | 7512375 | 


(SEQ ID NO: 756) 




gi | 13162287 


| (SEQ ID NO:757) 




gi | 6753618 | 


(SEQ ID NO:758) 






10 20 30 40 50 





NOV91 

gi | 4503291 | 
gi |4699610 | 
gi | 7512375 | 
gi j 13162287 | 
gi | 6753618 | 



NOV91 

gi 
gi 
gi 
gi 
gi 



4503291 I 
4699610 | 
7512375 | 
13162287 | 
6753618 I 



NOV91 S 
gi | 4503291 | 
gi|4699610| 
gi | 7512375 j 
gi | 13162287 | 
gi | 6753618 | 




Table 9 IF lists the domain description from DOMAIN analysis results against NOV91. 
This indicates that the NOV91 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 91F. Domain Analysis of NOV91 



gnllPfam|pfam01187 , MIF, Macrophage migration inhibitory factor (MIF). SEQ ID NO : 883 

CD-Length = 114 residues, 100.0% aligned 
Score = 126 bits (316), Expect = 8e-31 



NOV91: 2 P FLE LDTNL P ANQ V P AGL E KWL C A - T AS I LG K P KDHVNMMG VAGLTMVL S R S TE P WAQL F 6 0 

P +DTNLPAN VPAG EK L A A LGKP+D + + G MV ST+P A 

Sbjct: 1 PMFTIDTNLPANSVPAGFEKRLTAALAKALGKPEDRIAVHIRPGQAMVFGGSTDPCAVCS 6 0 

NOV91 : 61 ISSTSMMDTTEENRSHSTHFFEFLTEELALGQDQIIFHFSPLEPWQTGKKGMVIT 115 

IS ++ E+NRSHS F+FL +EL L +D++ F LE Q G G + 
Sbjct : 61 IKSIGW-GAEQNRSHSALLFKFLAKELGLPKDRVYIRFFDLEAAQVGFNGTTMA 114 



D-Dopachrome tautomerase (DDT) shares a low homologous amino acid sequence 
(33% homology) with the macrophage migration inhibitory factor (MIF) yet possesses similar 
tautomerase activity. MIF is a cytokine involved in inflammatory reactions and immune 
10 responses. While MIF is a secreted protein, it is not processed from a larger precursor. 

Whereas recent studies have identified MIF as a pituitary hormone and immunoregulator, less 
is known about the structural basis of these physiological functions and the real significance of 
tautomerase activity. D-dopachrome tautomerase, which is related to MIF, is a mammalian 
cytoplasmic enzyme involved in melanin biosynthesis that tautomerizes 2-carboxy-2,3- 
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dihydroindole-5 ? 6-quinone (D-dopachrome) with concomitant decarboxylation to give 5,6- 
dihydroxyindole (DHI). It is a protein of 1 17 residues, and acts as a homotrimer. 

NOV91 is predicted to be expressed in at least the following tissues: largely in the 
liver, and to lesser extent in other organs, including the heart, lung, pancreas; and placenta. 
5 This information was derived by determining the tissue sources of the sequences that were 

included in the invention including but not limited to SeqCalling sources, public EST sources, 
literature sources, and/or RACE sources. Further expression data for NOV91 is provided in 
Example 2. 

The NOV91 nucleic acids and proteins are useful in potential therapeutic applications 

10 implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 

15 thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 

immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 
conditions. NOV91 nucleic acids encoding the D-Dopachrome tautomerase-like protein of the 
invention, or fragments thereof, may further be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. 

20 The novel nucleic acid of the invention encoding a D-dopachrome tautomerase-like 

protein includes the nucleic acid whose sequence is provided in Table 91 A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 91 A while still encoding a protein 
that maintains its D-dopachrome tautomerase-like activities and physiological functions, or a 

25 fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to those just described, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 

30 include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
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In the mutant or variant nucleic acids, and their complements, up to about 3% of the residues 
may be so changed. 

The novel protein of the invention includes the D-dopachrome tautomerase-like protein 
whose sequence is provided in Table 9 IB. The invention also includes a mutant or variant 
protein any of whose residues may be changed from the corresponding residue shown in Table 
9 IB while still encoding a protein that maintains its D-dopachrome tautomerase-like activities 
and physiological functions, or a functional fragment thereof In the mutant or variant protein, 
up to about 30% of the bases may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV92 

The disclosed NOV92 (alternatively referred to herein as CG56904-01) includes the 
1311 nucleotide sequence (SEQ ID NO:293) shown in Table 92 A. A NOV92 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 19-21 and ends with a stop codon 
at nucleotides 1282-1284. 



Table 92A. NOV92 Nucleotide Sequence (SEQ ID NO:293) 

GGAGCTCCCACACTTTCAATGGGGAGGCCCACCCAGTGGCCGAGCCTGCTGCTGCTCCTGCTGTTGCCGG 
GGCCCCCGCCCGTCGCCGGCTTGGAAGACGCTGCCTTCCCCCACCTGGGGGAGAGCTTGCAGCCCCTGCC 
CCGGGCCTGTCCCCTGCGCTGCTCCTGCCCCCGAGTCGACACTGTGGACTGTGATGGCTTGGACCTTCGA 
GTGTTCCCGGACAACATCACCAGAGCCGCTCAGCACCTCTCCCTGCAGAACAACCAGCTCCAGGAACTCC 
CCTACAATGAGCTGTCCCGCCTCAGTGGCCTGCGAACCCTCAACCTCCACAACAACCTCATCTCCTCCGA 
AGGCCTGCCTGACGAGGCCTTCGAGTCCCTCACCCAGCTGCAGCACCTCTGCGTGGCTCACAACAAGAAC 
AATCTCATCTCCAAGGTGCCCCGAGGAGCCCTGAGCCGCCAGACTCAACTCCGTGAGCTCTACCTCCAGC 
ACAACCAGCTGACAGACAGTGG CCTGGATGC CAC C AC CTT CAGCAAG C TGC ATAG CCTTGAATACCTGGA 
TCTCTCCCACAACCAGCTGACCACAGTGCCCGCCGGCCTGCCCCGGACCCTGGCTATCCTGCACCTGGGC 
CGCAACCGCATCCGGCAGGTGGAGGCGGCTCGGCTGCACGGGGCGCGTGGTCTGCGCTATTTGTTGCTGC 
AGCACAACCAGCTGGGGAGCTCAGGGCTGCCCGCCGGGGCTCTGCGGCCGCTGCGGGGCCTGCACACGCT 
GCACCTCGATGGCAATGGGCTGGACCGCGTGCCTCCAGCCCTGCCCCGCCGCCTGCGTGCCCTGGTGCTG 
CCCCACAACCACGTGGCCGCGCTGGGTGCCCGTGACCTGGTCGCCACACCGGGCCTGACGGAGCTTAACC 
TGGCCTATAACCGCCTGGCCAGCGCCCGTGTGCACCACCGGGCCTTCCGCCGGTTGCGTGCCCTGCGCAG 
CCTCGACCTGGCAGGGAATCAGCTAACCCGGCTGCCCATGGGCCTGCCCACTGGCCTGCGCACCCTGCAG 
CTGCAACGCAACCAGCTGCGGATGCTCGAGCCCGAGCCTCTGGCCGGCCTGGACCAACTGCGGGAGCTCA 
GCCTGGCGCACAACCGGCTCCGGGTCGGCGACATCGGGCCAGGCACCTGGCATGAGCTCCAAGCCCTCCA 
GGTCAGGCACAGGCTGGTTAGCCACACTGTCCCCAGGGCCCCTCCATCCCCCTGCCTGCCCTGCCACGTC 
CCAAACATTCTAGTTAGCTGGTAAAGCAATCAGAACAAGAAAATGATAAGA 



A NOV92 polypeptide (SEQ ID NO:294) encoded by SEQ ID NO:293 is 421 amino 
acids in length and is presented using the one-letter amino acid code in Table 92B. The Psort 
profile for NOV92 predicts that this sequence has a signal peptide and is likely to be secreted 
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with a certainty of 0.4419. The Signal P predicts a likely cleavage site for a NOV92 peptide is 
between positions 24 and 25, i.e., at the dash in the sequence VAG-LE. 



Table 92B, NOV92 Polypeptide Sequence (SEQ ID 
NO:294) 



MGRPTQWPSLLLLLLLPGPPPVAGLEDAAFPHLGESLQPLPRACPLRCSCPRVDTVDCDGL 
DLRVFPDNITRAAQHLSLQNNQLQELPYNELSRLSGLRTLNLHNNLISSEGLPDEAFESLT 
QLQHLCVAHNKNNLISKVPRGALSRQTQLRELYLQHNQLTDSGLDATTFSKLHSLEYLDLS 
HNQLTTVPAGLPRTLAILHLGRNRIRQVEAARLHGARGLRYLLLQHNQLGSSGLPAGALRP 
LRGLHTLHLDGNGLDRVPPALPRRLRALVLPHNHVAALGARDLVATPGLTELNLAYNRLAS 
ARVHHRAFRRLRALRSLDLAGNQLTRLPMGLPTGLRTLQLQRNQLRMLEPEPLAGLDQLRE 
LSLAHNRLRVGDIGPGTWHELQALQVRHRLVSHTVPRAPPSPCLPCHVPISriLVSW 



A BLAST analysis of NOV92 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV92 had high 
homology to other proteins as shown in Table 92C. 



Table 92C. BLASTX results from PatP database for NOV92 



Sequences producing High-scoring Segment Pairs: 

patp:AAY13396 Amino acid sequence of protein PR0332 
patp : AAB3 3 425 Human PR03 32 protein 

patp:AAB8 0264 Human PR03 32 protein - Homo sapiens, 642 aa. 
patp : AAU12 3 56 Human PR0332 polypeptide sequence 
patp:AAM41258 Human polypeptide SEQ ID NO 6189 



Smallest 
Sum 

High Probability 
Score P (N) 

690 4.9e-70 

690 4.9e-70 

690 4.9e-70 

690 4.9e-70 

334 5.0e-30 



10 In a search of sequence databases, it was found, for example, that the nucleic acid 

sequence of this invention has 1310 of 1312 bases (99%) identical to a gb:GENBANK- 
ID:AK027100|acc:AK027100.1 mRNA from Homo sapiens (cDNA: FLJ23447 fis, clone 
HSI03346). The full amino acid sequence of the protein of the invention was found to have 
290 of 291 amino acid residues (99%) identical to, and 290 of 291 amino acid residues (99%) 

15 similar to, the 363 amino acid residue ptnr:TREMBLNEW-ACC:BAB 15657 protein from 
Homo sapiens (Human) (CDNA: FLJ23447 FIS, CLONE HSI03346). NOV92 also has 
homology to the other proteins shown in the BLASTP data in Table 92D. 



Table 92D. NOV92 BLASTP results 


Gene Index / 
Identifier 


.Protein / Organism 


Length 
(aa) 


Identity 

<%) 


Positive 

(%) 


Expect 



731 




gi | 13376224 | r 
ef |NP_079101 . 
1 | (NM 024825) 


hypothetical protein 
FLJ2 344 7 [Homo sapiens] 


363 


290/291 
(99) 


290/291 
(99) 


e-136 


gi | 11761721 jg 
b| AAG4 0157 . 1 | 
AF247822_1 
(AF247822) 


decorin 

[Oreochromis niloticus] 


359 


87/281 
(30 ) 


137/281 
( a. n \ 


4e-27 


gi | 6759315 | db 
j | BAA9 024 6 . 1 | 
(AB037269) 


biglycan [Xenopus 
laevis] 


368 


90/289 
(31) 


134/289 
(46) 


le-26 


>gi | 8134605 | s 
p | Q9XSD9 | PGS2 
PIG* 


BONE PROTEOGLYCAN II 
PRECURSOR (PG-S2) 
(DECORIN) 


360 


94/298 
(31) 


144/298 
(47) 


4e-25 


gi | 129949 | sp | 
P21793 | PGS2_B 
OVIN 


BONE PROTEOGLYCAN II 
PRECURSOR (PG-S2) 


360 


88/292 
(30) 


140/292 
(47) 


6e-25 



This BLASTP data is displayed graphically in the ClustalW in Table 92E. A multiple 
sequence alignment is given, with the NOV92 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
5 Table 92D. 



Table 92E. ClustalW Alignment of NOV92 



NOV92 

gi | 13376224 | 
gi | 11761721 | 
gi | 6759315 | 
gi j 8134605 | 
gi | 129949 | 



(SEQ ID NO: 294) 
(SEQ ID NO: 75 9) 
(SEQ ID NO: 76 0) 
(SEQ ID NO:761) 
(SEQ ID NO: 7 62) 
(SEQ ID NO: 763) 



NOV92 

gi | 13376224 | 
gi | 11761721 j 
gi | 6759315 | 
gi j 8134605 j 
gi | 129949| 



NOV 9 2 

gi | 13376224 | 
gi | 11761721 | 
gi | 6759315 | 
gi | 8134605 j 
gi | 129949 | 



NOV 9 2 

gi | 13376224 | 
gi j 11761721 | 
gi | 6759315 | 
gi j 8134605 j 
gi j 129949 | 




10 

rptqwpslllQll||pgpp| 

WTFECS RTTS-PE 

£ Acffs LL gVTAC W0L 

LlLCS CILpI^ 

T IlFLL pAQjIs Wiol 

Tl|FLL- - -^AQ^SWgG 



30 40 50 

....|....|....|....|.-..| 

d AApp hSg^s l q p l prac p lrcs C 

- TS~ - PCRTTS SRNS P T-MSC 
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70 
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80 

- I 1 I I I ^ 1 - ■ ■ ra 

PRVgTVDCDGLDLRVF^NITRAAQH]dSLQNNQLQEL[^ELSRaSGL^T 

|S - - TSTTTSS - - P " ' m " ' 

SvTESPLPPVgG GP: 

SlPTESIPDVGLPP MDL[ 

aEDRFPEVPEgEPL GP1 



100 



P-- 



KDE 



SVACE- 



| IG Qeehfpevpeiepm- 



-GP^ 




110 

l^hnnlIss^g 

SPS- 



120 



3Eafesltqlqhlc|| 

J S--CSTgA 



SDLGL 
SDLGL 
SDLGL 
SDLGL 




160 



170 



180 



190 



200 
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NOV92 

gi | 13376224 | 
gi j 11761721 j 
gi | 6759315 | 
gi | 8134605 j 
gi | 129949 | 




NOV92 

gi | 13376224 | 
gi|ll76172l| 
gi j 6759315 | 
gi | 8134605 | 
gi | 129949 | 



NOV92 

gi | 13376224 | 
gi j 11761721 j 
gi | 6759315 | 
gi j 8134605 | 
gi | 129949 | 



NOV92 

gi | 13376224 | 
gi j 11761721 j 
gi j 6759315 | 
gi j 8134605 | 
gi | 129949 | 



NOV92 

gi | 13376224 | 
gi j 11761721 j 
gi j 6759315 | 
gi | 8134605 j 
gi|l29949| 



NOV92 

gi | 13376224 | 
gi j 11761721 1 
gi j 6759315 | 
gi j 8134605 | 
gi | 129949 | 



Leucine-rich repeats (LRRs) are relatively short motifs (22-28 residues in length) 
found in a variety of cytoplasmic, membrane and extracellular proteins. Although these 
proteins are associated with widely different functions, a common property involves protein- 
5 protein interaction. Little is known about the 3D structure of LRRs, although it is believed that 
they can form amphipathic structures with hydrophobic surfaces capable of interacting with 
membranes. In vitro studies of a synthetic LRR from Drosophila Toll protein have indicated 
that the peptides form gels by adopting beta-sheet structures that form extended filaments. 
These results are consistent with the idea that LRRs mediate protein-protein interactions and 
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cellular adhesion. Other functions of LRR-containing proteins include, for example, binding to 
enzymes and vascular repair. The 3-D structure of ribonuclease inhibitor, a protein containing 
15 LRRs, has been determined, revealing LRRs to be a new class of alpha/beta fold. LRRs 
form elongated non-globular structures and are often flanked by cysteine rich domains. 
5 NOV92 is predicted to be expressed in at least the following tissues: colon, brain. This 

information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, literature 
sources, and/or RACE sources. Further expression data for NOV92 is provided in Example 2. 
The NOV92 nucleic acids and proteins are useful in potential therapeutic applications 

10 implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 

15 thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 

immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 
conditions. NOV92 nucleic acids encoding the LRR-like protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. 

20 The novel nucleic acid of the invention encoding a secreted leucine-rich repeat (LRR) 

protein-like protein includes the nucleic acid whose sequence is provided in Table 92A, or a 
fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 92A while still encoding a 
protein that maintains its secreted leucine-rich repeat (LRR) protein-like activities and 

25 physiological functions, or a fragment of such a nucleic acid. The invention further includes 
nucleic acids whose sequences are complementary to the sequence of Table 92A, including 
nucleic acid fragments that are complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 

30 include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
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In the mutant or variant nucleic acids, and their complements, up to about 1% of the bases may 
be so changed. 

The novel protein of the invention includes the secreted leucine-rich repeat (LRR) 
protein-like protein whose sequence is provided in Table 92B. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 92B while still encoding a protein that maintains its Secreted leucine- 
rich repeat (LRR) protein-like activities and physiological functions, or a functional fragment 
thereof. In the mutant or variant protein, up to about 1% of the amino acid residues may be so 
changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV93 

The disclosed NOV93 (alternatively referred to herein as CG56277-01) includes the 
1518 nucleotide sequence (SEQ ID NO:295) shown in Table 93 A. A NOV93 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 2-4 and ends with a stop codon at 
nucleotides 1556-1558. The disclosed NOV93 maps to human chromosome 2. 



Table 93A. NOV93 Nucleotide Sequence (SEQ ID NO:295) 

CATGAGGGGACTACCTATCAGCAGCAGCACCGGCTACGTGGTCGAGGACGGGTTCACTGCGACCGTGCAG 
CAGCTCTTCGCCAGTGACCAGGGACTCACCTACAACGACTTCTTGATTCTCCCAGGATTCATAGACTTCA 
TAGCTGATGAGGTGGACCTGACCTCAGCCCTGACCCAACCGGTCACTCTGAAGACGCCGCTGATCTCCTC 
CCCCATGGACACTGTGACAGAGGCCGACCTGGCCATCGTGATGGCTCTGATGGGAGGTACTGGTTTCATT 
CACCACAACTGCACCCCAGAGTTCCAGGCCAGTGAGGTGCAGAAGGTCAAGAAGTTTGAACCGGGCTTTA 
TCACACACCCCGTGGTGCTGAGCCCCTTGCACACTGTGGGTGATGTGTTGGAGGCCAAGATGCGTCATGG 
CTTCTCTGGCATCCCCATCACTGAGACGGGTACCATGGGCAGCAAGCTGGTGGGCATCGTCACCTCCCGA 
GACATCGACTTTCTTGCTGAGAAGGACCACACCACCCTCCTCAGTGAGGTGATGATGCCAAGGATCAAGC 
TAGTGGTGGCTCCAGCAAGCAGTGTGAGGTTGAAAGAGGCAAATGAGATCCTGCAGCTTAGTAAGAAAGG 
AAAGCTGCCTATCGTCAATGATCGCGATGAGCTGGTGGCCATTATCACCTGCACCGCGCTGAAGAACCGA 
GACTACCCTGTGGCCTCCAAGGATTCCCATGAGCAGCTGCTGGGCGGGGCAGCTGTGGGTACCCATGAGG 
ATGACAAATACCACCTGGACCTGCTCACCCAGGTAGGCGTCAATGTCATAGGCTTGGACTCGTCCCAAGG 
GAACTCGGTGTATCAGATCGCCATGGTGCATTACATCAAACAAAAGTACCCCCACCTCCAGGTGATTGGG 
GGGAACGTGGTGACAGCAGCCCAGGCCAACAACCTGATTGACGCTGGTGTGGATGGGCTGGGCAGGGGCA 
TGGACTGCGCGGCTGGCTCCATCTACATCAACCAGGAAGTGATAGCCTGCAGTCAGCCCCAGGGCACTGC 
TGTGTACAAGGTGGCCAAGCATACCCAGAACTTTGGTGTGCCCATCATAGCCGATGGTGGCATCCAGACC 
ATGGGGCATGTGGTCAAGGCCCTGGCCCTAGGAGCCTCCACAGTGATGATGGGCTCCCTGCTGGCCGCCA 
CCATGGAGGCCCCCGGCGAGTGCTTCTTCTCAGACGGAATGCAGCTCAAGAAGTACCAGGGCATGGGCTC 
ACTGGATG C C ATGGAGAAG AGC AGC AGC AGC C AGAAAC AAT ACTTC AAC GACGGGG ATAAGG C GAAG AT C 
ACGCAGGATGTCTTGGGCTCCATCCAGGACAAAGGGTCCATTCAGAAGTTCGTGCCCTACCTCATAGTGG 
GCATCCAGCATGGCTGCCAGGATATCGGGGCCCACAGCCTGTCTGTCCTTCGGTCCATGATGTACTCAGG 
GGAGCTCAAGTTTGAGAAGCAGACCATGTCAGCCCAGATCGACGGTGGCATCCATGGCCTGCACTCTTAC 
GAGAAGTGGCTGTACTGAGGACAGCGGTGCAGGGCGAGATG 
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A NOV93 polypeptide (SEQ ID NO:296) encoded by SEQ ID NO:295 is 518 amino 
acids in length and is presented using the one-letter amino acid code in Table 93B. The Psort 
profile for NOV93 predicts that this sequence is likely to be localized to the cytoplasm with a 
certainty of 0.4500. In alternative embodiments, a NOV93 polypeptide is located to 
5 lysosomes with a certainty of 0. 1 92 1 , or, to microbodies with a certainty of 0.3346. 



Table 93B. NOV93 Polypeptide Sequence (SEQ ID 
NO:296) 

MRGLPISSSTGYWEDGFTATVQQLFASDQGLTYNDFLILPGFIDFIADEVDLTSALTQP 
VTLKTPLISSPMDTVTEADIAIVMALMGGTGFIHHNCTPEFQASEVQKVKKFEPGFITHP 
WL S P LHT VGDVLE AKMRHG FS G I P I TE TGTMGS KLVG I VT S RD I D FL AE KDHTT LL S E V 
MMPRI KLWAPASSVRLKEANEILQLSKKGKLPI VNDRDELVAI ITCTALKNRDYPVASK 
DSHEQLLGGAAVGTHEDDKYHLDLLTQVGVNVIGLDSSQGNSVYQIAMVHYIKQKYPHLQ 
VI GGNWTAAQANNL I D AGVDGLGRGMDC AAGS I YINQEVI ACSQPQGTAVYKVAKHTQN 
FG VP 1 1 ADGG I QTMGHWKALAL GAS T VMMG S L L AATME APGEC F FS DGM QL.KK Y QGMG S 
LDAMEKSSSSQKQYFNDGDKAKITQDVLGSIQDKGSIQKFVPYLIVGIQHGCQDIGAHSL 
SVLRSMMYSGELKFEKQTMSAQIDGGIHGLHSYEKWLY 



A BLAST analysis of NOV93 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV93 had high 
10 homology to other proteins as shown in Table 93C. 



Table 93C. BLASTX results from PatP database for NOV93 








Smallest 








Sum 






High 


Probability 


Sequences producing High-scoring Segment Pairs: Score 


P(N) 


patp 


:AAR05431 


Chinese hamster IMPDH - Cricetulus sp, 514 aa. 1874 


3 .2e-193 


patp 


:AAR05432 


Human IMPDH - Homo sapiens, 514 aa . 1872 


5 .3e-l93 


patp 


:AAY08965 


A. gossypii inosine- monophosphate dehydrogenase 980 


1.8e-98 


patp 


:AAG30888 


Arabidopsis thaliana protein fragment 973 


9 . 7e-98 


patp 


:AAG43108 


Arabidopsis thaliana protein fragment 94 9 


3 .4e-95 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 1402 of 1567 bases (89%) identical to a gb:GENBANK- 

15 ID:HUMIMPH|acc:J05272.1 mRNA from Homo sapiens (Human IMP dehydrogenase type 1 
mRNA). The full amino acid sequence of the protein of the invention was found to have 438 
of 513 amino acid residues (85%) identical to, and 462 of 513 amino acid residues (90%) 
similar to, the 514 amino acid residue ptnr:SWISSNEW-ACC:P20839 protein from Homo 
sapiens (Human) (INOSINE-5'-MONOPHOSPHATE DEHYDROGENASE 1 (EC LI. 1.205) 

20 (IMP DEHYDROGENASE 1) (IMPDH-I) (IMPD 1)). NOV93 also has homology to the other 
proteins shown in the BLASTP data in Table 93D. 
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Table 93D. NOV93 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi | 17453301 | r 
ef |XP_004627. 
5 | (XM_004627) 


IMP (inosine 
monophosphate ) 
dehydrogenase 1 
[Homo sapiens] 


514 


440/514 
(85) 


464/514 
(89) 


0 . 0 


gi | 124417 | sp | 
P2 0 83 9 | IMD1_H 
UMAN 


INOSINE- 5 ' -MONOPHOSPHATE 
DEHYDROGENASE 1 (IMP 
DEHYDROGENASE 1) (IMPDH- 
I) (IMPD 1) 


514 


438/514 
(85) 


462/514 
(89) 


0.0 


gi | 4504687 | re 
f|NP 000874.1 
| (NM_000883) 


IMP (inosine 
monophosphate ) 
dehydrogenase 1; 
SWSS2 6 08 [Homo sapiens] 


514 


434/514 
(84) 


458/514 
(88) 


0.0 


gi | 6754344 | re 
f |NP_035959 . 1 
| (NM 011829) 


inosine 5 1 -phosphate 
dehydrogenase 1 
[Mus musculus] 


514 


430/514 
(83) 


462/514 
(89) 


0 . 0 


gi | 16549223 | d 
bj | BAB70780 . 1 
| (AK054640) 


unnamed protein product 
[Homo sapiens] 


489 


418/514 
(81) 


440/514 
(85) 


0.0 



This BLASTP data is displayed graphically in the Clustal W in Table 93E. A multiple 
sequence alignment is given, with the NOV93 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 93D. 



Table 93E. ClustalW Alignment of NOV93 



NOV93 


(SEQ ID NO: 2 


96) 


gi 


1745330l| 


(SEQ 


ID 


NO: 


764) 


gi 


124417 | 


(SEQ 


ID 


NO : 


765) 


gi 


4504687 | 


(SEQ 


ID 


NO: 


766) 


gi 


6754344 | 


(SEQ 


ID 


NO: 


767) 


gi 


16549223 | 


(SEQ 


ID 


NO: 


768) 



NOV93 

gi | 17453301 | 
gi j 124417 | 
gi [4504687 | 
gi j 6754344 j 
gi j 16549223 | 




NOV9 3 

gi | 17453301 1 
gi | 124417 | 
gi|4504687| 
gi | 6754344 j 
gi | 16549223 | 
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VD L.T S AL TjSjgfjTLKT PL I S S PMDTVT E ADijA I ^M ALMGG0GF I HHNCT P E 
VDLTSALTRKITLKTPLISSPMDTVTEADMAIAMALMGGIGFIHHNCTPE 
VDLTSALTRKITLKTPL I SS PMDTVTEADMAI AMALMGGIGFIHHNCTPE 
VDLTSALTRKITLKTPLISSPMDTVTEADMAIAMALMGGIGFIHHNCTPE 
VDLTSALTRKITLKTPL I SS PMDTVTEADMAI AMA LMGGIGFIHHNCTPE 
VDLTSALTRKITLKTPL I SS PMDTVTEADMAI AMA 



NOV 9 3 
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110 
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140 
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150 



FQAfflEV^KVKKFEj3GFITj5jPVVLSP[lHTVGDVLEAKMRHGFSGI PITETG 
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fqanevrkvkkfeqgfitdpwlspshtvgdvleakmrhgfsgipitetg 
fqanevrkvkffifeqgfi tdp wls p sht vgdvleakmrhgfsgi p i tetg 
fqanevrkvksjfeqgfitdpwlspshtvgdvleakmrhgfsgipitetg 
fqanevrkvk kfeqgfitdpwlspshtvgdvleak^5hgfsgipitBtg 
b^^^^^^ h vtrtrn gfitdpvvlsp sht vgdvleakmrhgfsgi pi tetg 



gi | 17453301 | 
gi j 124417 | 
gi | 4504687 | 
gi j 6754344 | 
gi j 16549223 | 



NOV93 

gi | 17453301 | 
gi | 124417 | 
gi | 4504687 | 
gi j 6754344 j 
gi j 16549223 | 



NOV9 3 

gi | 17453301 | 
gi j 124417 | 
gi |4504687 | 
gi j 6754344 j 
gi j 16549223 | 



NOV93 

gi | 17453301 | 
gi | 124417 | 
gi j 4504687 | 
gi j 6754344 | 
gi | 16549223 | 



NOV93 

gi | 17453301 | 
gi j 124417 | 
gi | 4504687 | 
gi j 6754344 | 
gi j 16549223 | 



160 170 180 190 200 

| .... | .... | ... . | .. . . I .... 1 . ... I • ■■■ I 



TMGSKLVGIVTSRDIDFLAEKDHTTLLSEVMjJjPRIgjLWAP 
TMGSKLVGIVTSRDIDFLAEKDHTTLLSEVMTPRIELVVAPA 
TMGS KLVGIVTSRD I DFLAE KDHTTLLS EVMTPR I ELVVAPAGj 
TMGSKLVGIVTSRDIDFLAEKDHTTLLSEVMTPRIELWAPAG: 
TMGS KLVGIVTSRD I DFLAE KDHTTLLS EVMTPR^ELVVAPAG 
TMGS KLVG I VT S RD I D F L AE KDHTTLL S E VMT PR I E L WAP AG 



210 



220 



230 





1 



260 
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300 
-I 



VGTgjEDDKYigLDLLTQBGViVIg: 
VGTREDDKYRLDLLTQAGVDVI" 
VGTREDDKYRLDLLTQAGVDVI' 
VGTREDDKYRLDLLTQAGVDVI' 
VGTREDDKYRLDLLTQAGVDVI- 
VGTREDDKYRLDLLTQAGVDVI' 



I 



jLDSSQGNSVYQIAMVHYIKQKYPHL 
VLDS SQGNSVYQI AMVHYI KQKYPHL 
VLDSSQGNSVYQIAMVHYIKQKYPHL 
vjSfisSQGNSVYOIAMVHY I KQKYPHL 
VLDSSQGNSVYQIAMVHYIKQKYPHL 
VLDSSQGNSVYQIAMVHYIKQKYPHL 



310 



320 



330 



QV I GGNV VT AAQAjJjNL I D AGVDGLggjGMjgC 
Q VI GGN WTAAQ AKNL I D AG VDGLRVGMGC 
QVIGGNWTAAQAKNL IDAGVDGLRVGMGC 
QVIGGN WTAAQ AKNL I DAGVDGLRVGMGC 
QVIGGNWTAAQAKNL IDAGVDGLRVGMGC 
Q VI GGN WTAAQ AKNL I DAGVDGLRVGMGC 
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NOV93 

gi | 17453301 | 
gi | 124417 | 
gi j 4504687 | 
gi j 6754344 j 
gi | 16549223 | 



NOV 9 3 

gi | 17453301 | 
gi j 124417 | 
gi j 4504687 | 
gi |6754344 | 
gi j 16549223 | 



NOV93 

gi | 17453301 | 
gi j 124417 | 
gi |4504687 | 
gi j 6754344 | 
gi | 16549223 | 



VYKVA 1 S ^ ig^iFGVPIIADGGIQTiSjGHWKALALGASTVMMGSLLAATiaiE| 

vykvae yarrfgvp i iadggiqtvghwkalalgastvmmgsllaatte 
vykvaeyarrfgvpi i7adggiqtvghvvkalalgastvmmgsllaatte 
vykvae yarrfgvp i i adgg i qtvghvvkalalgastvmmgsllaatte 
avykvaeyarrfgvpEiadggiqtvghvvkalalgastvmmgsllaatte 
avykvaeyarrfgvpi i adggi qtvghvvkalalgastvmmgsllaatte 
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•J 



PGEgFFSDG'^LKKYgGMGSLDAMEKSS. 

PGEYFFSDGVRLKKYRGMGSLDAMEKS 
A.PGEYFFSDGVRLKKYRGMGSLDAMEKS 
PGEYFFSDGVRLKKYRGMGSLDAMEKS 
PGEYFFSDGVRLKKYRGMGSLDAMEKS 
PGEYFFSDGVRLKKYRGMGSLDAMEKS 



! 



S S QKygY FggGDKgK I UQlSVaG! 
SSSQKRYFSEGDKVKIAQGVSG 
SSSQKRYFSEGDKVKIAQGVSG 
SSSQKRYFSEGDKVKIAQGVSG 
SSSQKRYFSEGDKVKIAQGVSG 
SSSQKRYFSEGDKVKIAQGVSG 



460 
. . | . . 



470 
I . . 



480 



490 



500 



IIQDKGSIQKFVPYLlSGIQHGCQDIGAgjSLSVLRSMMYSGELKFEKgTMB 
SIQDKGSIQKFVPYLIAGIQHGCQDIGARSLSVLRSMMYSGELKFEKRTM 
SIQDKGSIQKFVPYLIAGIQHGCQDIGARSLSVLRSMMYSGELKFEKRTM 
SIQDKGSIQKFVPYLIAGIQHGCQDIGARSLSVLRSMMYSGELKFEKRTM 
SIQDKGS I QKFVPYLI AGIQHGCQDIGA^SLSVLRSMMYSGELKFEKRTM 
SIQDKGSIQKFVPYLIAGIQHGCQDIGARSLSVLRSMMYSGELKFEKRTM 
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IAQ I gGGSHGLHS YE KgjLy ■ 
SAQIEGGVHGLHSYEKRLY 
SAQIEGGVHGLHSYEKRLY 
SAQIEGGVHGLHSYEKRLY 
SAQIEGGVHGLHSYEKRLY 
SAQIEGGVHGLHSYEKRLY 



NOV9 3 

gi | 17453301 | 
gi | 124417 | 
gi | 4504687 | 
gi | 6754344 | 
gi j 16549223 | 



Table 93F lists the domain description from DOMAIN analysis results against NOV93. 
This indicates that the NOV93 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 93F. Domain Analysis of NOV93 



gnllPfam|pfam00478 , IMPDH_C, IMP dehydrogenase / GMP reductase C terminus. This family is involved 
in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. 
The alignment does not contain the whole TIM barrel domain. The alignment is truncated after the insert 
domain (2 CBS domains pfam00571) found in the inosine-S'-monophosphate dehydrogenase structure. This 
family should always be associated with pfam01574. This family is a member of the common phosphate 
binding site TIM barrel family. SEQ ID NO i 884 

CD-Length = 222 residues, 100.0% aligned 
Score = 261 bits (666), Expect = 9e-71 



NOV93 : 264 LLTQVGVNVIGLDSSQGNSVYQIAMVHYIKQKYPHLQVIGGNWTAAQANNLIDAGVDGL 323 

L + GV+VI LDSS G S QI + +I++KYP +QVI GNWT A LI DAG D + 
Sbjct : 1 ALVEAGVDVICLDSSNGYSEVQIDFIRWIREKYPTVQVIAGNWTGEMAEELIDAGADAV 6 0 

NOV93 : 324 GRGMDCAAGS I YINQEVIACSQPQGTAVYKVAKHTQNFGVPI I ADGGIQTMGHVVKALAL 3 83 

G+ GSI I +EV +PQ TAV +VA + +P+I+DGGI GH+ KALA 

Sbjct : 61 KVGI - -GPGSICITREVAGIGRPQATAVLEVADASHGLNIPVISDGGITNPGHMAKALAG 118 

NOV93 : 384 GASTVMMGSLLAATMEAPGECFFSDGMQLKKYQGMGSLDAMEKSSSSQKQYFNDGDKAKI 443 

GA VM+GSLLA T EAPGE F DG + K Y+GMGSL AM+K S +YF K + 

sb j ct . 119 GADFVMIGSLLAGTEEAPGEWFKDGKKYKLYRGMGSLTAMKKYQGSVARYFASKQKLSV 178 

NOV93 : 444 TQDVLGSIQDKGSIQKFVPYLIVGIQHGCQDIGAHSLSVLRSMM 487 

+VG+ KG++V L+ G++ C IGA L LR 
Sbjct : 179 EEGVTGYVPYKGDVSRTVHDLLGGLRSSCTYIGATKLKQLRKRA 222 



Inosine-5-prime-monophosphate dehydrogenase (EC 1.1.1.205) catalyzes the 
formation of xanthine monophosphate (XMP) from IMP. In the purine de novo synthetic 
pathway, IMP dehydrogenase is positioned at the branch point in the synthesis of adenine and 
guanine nucleotides and is thus the rate-limiting enzyme in the de novo synthesis of guanine 
nucleotides. Inhibition of cellular IMP dehydrogenase activity results in an abrupt cessation of 
DNA synthesis and a cell-cycle block at the Gl-S interface. Collart and Huberman (1988) 
used a polyclonal antibody directed against the purified protein to isolate human and Chinese 
hamster IMP dehydrogenase cDNA clones. The sequence of these clones demonstrated an 
open reading frame for a protein containing 514 amino acids. The molecular mass of the 
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produced protein was 56 kD, which is the observed molecular mass of the purified protein and 
of the immunoprecipitated in vitro translation product. 

A high order of conservation of the IMP dehydrogenase protein was indicated by the 
finding that human and Chinese hamster cDNA clones differed by only 8 amino acids. 
Natsumeda et al. (1990) isolated two distinct cDNAs (types I and II) encoding IMP 
dehydrogenase from a human spleen cDNA library. Both clones encode proteins of 514 
residues showing 84% sequence identity. Type I mRNA was found to be the main species in 
normal leukocytes, and type II (146691) predominated in human ovarian tumors. Using PCR 
primers specific for type II 1MPDH, Glesne et al. (1993) screened a panel of human/Chinese 
hamster cell somatic hybrids and a separate deletion panel of chromosome 3 hybrids and 
localized the gene to 3p24.2-p21.2. 

The gene was also localized on a map of two overlapping YACs and found to span no 
more than 12.5 kb of genomic DNA. From cloning and sequencing IMPD2, Glesne and 
Huberman (1994) determined that the gene spans approximately 5 kb and is interrupted by 12 
introns. The transcriptional start sites were determined by SI nuclease mapping to be 
somewhat heterogeneous but the predominant mRNA species showed a 5-prime end at 102 
and 85 nucleotides from the translational initiation codon. Zimmermann et al. (1995) also 
cloned the human gene and noted that it has 14 exons spanning approximately 5.8 kb. They 
also characterized regulatory elements in the 5-prime flanking region of the gene. 

NOV93 is predicted to be expressed in at least the following tissues: brain, 
prosencephalon/forebrain, diencephalon, pituitary gland. This information was derived by 
determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, public EST sources, literature sources, and/or RACE 
sources. Further expression data for NOV93 is provided in Example 2. 

The NOV93 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 
immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 
conditions. NOV93 nucleic acids encoding the inosine-5-prime-monophosphate 
dehydrogenase-like protein of the invention, or fragments thereof, may further be useful in 
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diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed. 

The novel nucleic acid of the invention encoding a inosine-5 -monophosphate 
dehydrogenase-like protein includes the nucleic acid whose sequence is provided in Table 

5 93 A, or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 
whose bases may be changed from the corresponding base shown in Table 93A while still 
encoding a protein that maintains its inosine-5'-monophosphate dehydrogenase-like activities 
and physiological functions, or a fragment of such a nucleic acid. The invention further 
includes nucleic acids whose sequences are complementary to the sequence of Table 93 A, 

1 0 including nucleic acid fragments that are complementary to any of the nucleic acids just 



The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
1 5 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1 1% of the bases 
may be so changed. 

20 The novel protein of the invention includes the inosine-5'-monophosphate 

dehydrogenase-like protein whose sequence is provided in Table 93B. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 93B while still encoding a protein that maintains its 
inosine-5'-monophosphate dehydrogenase-like activities and physiological functions, or a 

25 functional fragment thereof. In the mutant or variant protein, up to about 15% of the amino 
acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 

30 prediction from hydrophobic ity charts, as described in the "Anti-NOVX Antibodies" section 



NOV94 

The disclosed NOV94 (alternatively referred to herein as CG56281-01) includes the 

35 1 573 nucleotide sequence (SEQ ID NO:297) shown in Table 94A. A NOV94 ORF begins 
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described. 



below. 




with a Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a stop codon at 
nucleotides 1 564-1566. The disclosed NOV94 maps to human chromosome 3. 



Table 94 A. NOV94 Nucleotide Sequence (SEQ ID NO:297) 

ATGGCGCGGAAGCAGGACCCGAAGCCTAAATTCCAGGAGGGTGAGCGAGTGCTGTGCTTTCATGGGCCTC 
TTCTTTATGAAGCAAAGTGTGTAAAGGTTGCCATAAAGGACAAACAAGTGAAATACTTCATACATTACAG 
TGGTTGGAATAAAAATTGGGATGAGTGGGTTCCGGAGAGCAGAGTACTCAAATACGTGGACACCAATGAA 
AATCGTAGATTGGCCAGGGAAATTCGTAGATTACAGCATAAATTGGCAAGAAATGCTGTAGCTCACCTGA 
GGAGCAAGAGAGAAAGAAGCAGCCGCTCCAGGTTGCTTGGTGCTGACTCTGTCTTAAAAGGCCTCTCCAT 
CG AAG AAAAAAATGAAAATGATGAAAACT CAT T AAGC AGTTC C T C TGAC AGT AGTG AAGAC AAGGATGAA 
AAAATAAGTGAAGAATGTGATATTGAAGAAAAGACTGAAGTGAAAGAAGAACCGGAGCTTCAAACAAAAA 
GGGAAATGGAAGAAAGAACAGTAACTCTAGAAATCCCTGAAGTTCTGAAGAGGCAGCTGGAGGATGATTG 
TTACTACATTAATCGGAGGAAACGGTTAGTGCAACTTCCATGCCACACCAACATCATAACGATTTTGGAA 
TCCTATGTGAAGCATTTTGCTATCAGTGCAGCCTTTTCAGCCAATGAGAGGCCTCGTCACCATCACGCTA 
TGCCACATGCCAGCATGAACGTGCCTTATATCCCAGCAGAAAAGAATATTGACCTTTGTAAGGAGATGGT 
GGATGGATTAAGAATAACCTTTGATTACACTCTCCCGTTGGTTTTACTCTATCCCTATGAACAAGCTCAG 
TATAAAAAGGTGACTGCATCTAAGGTTTTTCTTGCAATTAAGGAAAGTGCCACAAATACTAATAGGAGCC 
AGGAGAAGCTCTCTCCCAGCTTACGTTTGTTGAATCCATCCAGGCCGCAGTCTACAGAGAGTCAGTCGAC 
CAGCGGTGAACCAGCCACCCCTAAAAGGCGCAAAGCCGAGCCGCAAGCAGTGCAGTCTCTGAGGCGGTCC 
TCGCCCCACACCGCCAACTGTGACAGGCTTTCTAAGAGCAGCACCTCACCTCAGCCCAAGCGCTGGCAGC 
AGGACATGTCCACCAGTGTGCCCAAGCTGTTCCTGCACCTGGAAAAGAAGACACCTGTGCATAGCAGATC 
ATCTTCACCTACTCTGACTCCTAGCCAGGAAGGGAGTCCTGTGTTTGCTGGCTTTGAAGGGAGAAGAACT 
AATGAAATAAATGAGGTCCTCTCCTGGAAGCTCGTACCTGACAATTACCCACCAGGTGACCAGCCACCTC 
CACCCTCTTACATTTACGGGGCGCAACATTTGCTGCGATTGTTTGTCAAACTTCCAGAAATTCTTGGAAA 
AATGTCCTTTACTGAGAAGAATCTGAAGGCTTTATTGAAGCACTTTGATCTCTTTGTGAGGTTTTTAGCA 
GAATACCACGATGACTTCTTCCCAGAGTCAGCTTACGTCGCTGCCTCTGAGGTGCATTACAGCACCAGGA 
AC C C C C AGGCAGT C AAT AAGTGTT GAT GGTTC T 

5 A NOV94 polypeptide (SEQ ID NO:298) encoded by SEQ ID NO:297 is 521 amino 

acids in length and is presented using the one-letter amino acid code in Table 94B. The Psort 
profile for NOV94 predicts that this sequence is likely to be localized to the nucleus with a 
certainty of 0.9700. In alternative embodiments, a NOV94 polypeptide is localized to 
microbodies with a certainty of 0.3000. 

10 

Table 94B. NOV94 Polypeptide Sequence (SEQ ID 
NO:298) 

MARKQDPKPKFQEGERVLCFHGPLLYEAKCVKVAIKDKQVKYFIHYSGWNKNWDEWVPES 
R VL K Y VDTNENRRL ARE I RRL QH KLARNAVAHLR S KRER S S R SRLLG ADS VL KGL S I E E K 
NENDENSLSSSSDSSEDKDEKISEECDIEEKTEVKEEPELQTKREMEERTVTLEIPEVLK 
RQLEDDCYYINRRKRLVQLPCHTNIITILESYVKHFAISAAFSANERPRHHHAMPHASMN 
VPYIPAEKNIDLCKEMVDGLRITFDYTLPLVLLYPYEQAQYKKVTASKVFLAIKESATNT 
NRSQEKLSPSLRLLNPSRPQSTESQSTSGEPATPKRRKAEPQAVQSLRRSSPHTANCDRL 
SKSSTSPQPKRWQQDMSTSVPKLFLHLEKKTPVHSRSSSPTLTPSQEGSPVFAGFEGRRT 
NEINEVLSWKLVPDNYPPGDQPPPPSYIYGAQHLLRLFVKLPEILGKMSFTEKNLKALLK 
H FPL F VR FLAE YHDD F FP E S AY VAAS E VH Y S TRN PQ AVN KC 



A BLAST analysis of NOV94 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV94 had high 
homology to other proteins as shown in Table 94C. 

15 
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Table 94C. BLASTX results from PatP database for NOV94 










Smallest 










Sum 








High 


Probability 


Sequences producing High- scoring Segment Pairs: 


Score 


P(N) 


patp 


AAW96184 


Senescence protein derived from human 


465 


2 . 9e-51 


patp 


AAB60085 


Human transport protein TPPT-5 


465 


2 . 9e-51 


patp 


AAU3 22 95 


Novel human secreted protein #2786 


430 


3 .4e-40 


patp 


AAM64801 


Human brain expressed single exon probe 


239 


3 . 5e-19 


patp 


.AAM77558 


Human bone marrow expressed probe 


239 


3 .5e-19 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 1381 of 1581 bases (87%) identical to a gb:GENBANK- 
ID:AF117065|acc:AFl 17065.1 mRNA from Homo sapiens (male-specific lethal-3 homolog 1 
5 (MSL3L1) mRNA). The full amino acid sequence of the protein of the invention was found to 
have 414 of 520 amino acid residues (79%) identical to, and 457 of 520 amino acid residues 
(87%) similar to, the 521 amino acid residue ptnr:SPTREMBL-ACC:Q9Y5Z8 protein from 
Homo sapiens (Human) (MALE-SPECIFIC LETHAL-3 HOMOLOG 1). NOV94 also has 
homology to the other proteins shown in the BLASTP data in Table 94D. 
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Table 94D. NOV94 BLASTP results 




Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi | 17975757 | re 
f |NP_523353 . 1 | 
(NM_078629) 


male-specific lethal 3- 
like 1 isoform a ; 
drosophila MSL3-like 1 
[Homo sapiens] 


521 


413/529 
(78) 


456/529 
(86) 


0.0 


gi | 17975759 | re 
f |NP_523354 . 1 | 
(NM_078630) 


male- specif ic lethal 3- 
like 1 isoform b; 
drosophila 
MSL3-like 1 
[Homo sapiens] 


462 


389/468 
(83) 


420/468 
(89) 


0.0 


gi | 14764458 | re 
f |XP_045715.l| 
(XM 045715) 


male-specific lethal-3 
(Drosophila) - like 1 
[Homo sapiens] 


496 


387/498 
(77) 


427/498 
(85) 


0.0 


gi | 11545735 | re 
f |NP_034962.2| 
(NM 010832) 


male- specif ic lethal-3 
homolog 1 (Drosophila) 
[Mus musculus] 


466 


346/472 
(73) 


396/472 
(83) 


e-168 


gi | 17975761 | re 
f |NP_006791.2 | 
(NM_006800) 


male-specific lethal 3- 
like 1 isoform c;' 
drosophila MSL3-like 1 
[Homo sapiens] 


355 


312/354 
(88) 


335/354 
(94) 


e-162 



This BLASTP data is displayed graphically in the ClustalW in Table 94E. A multiple 
sequence alignment is given, with the NOV94 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
15 Table 94D. 
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Table 94E. ClustalW Alignment of NOV94 



NOV94 

gi | 17975757 
gi | 17975759 
gi | 14764458 
gi | 11545735 
gi | 17975761 



(SEQ ID NO: 2 98) 
(SEQ ID NO: 76 9) 
(SEQ ID NO: 770) 
(SEQ ID NO: 771) 
(SEQ ID NO: 772) 
(SEQ ID NO: 773) 
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NOV94 

gi | 17975757| 
gi j 17975759 I 
gi | 14764458 | 
gi j 11545735 | 
gi|l797576l| 



NOV94 

gi | 17975757 | 
gi j 17975759 j 
gi j 14764458 | 
gi j 11545735 | 
gi | 17975761 j 



NOV94 

gi | 17975757 | 
gi | 17975759 j 
gi j 14764458 | 
gi | 11545735 j 
gi | 17975761 j 



NOV94 

gi | 17975757 | 
gi | 17975759 j 
gi | 14764458 j 
gi | 11545735 | 
gi jl797576l| 



10 20 30 40 

- I . | . _ . - | | _ . | | 1 1 1 | 

MARKQDP KPKFQEGERVLC FHG PLL YE AKCVKVAI K - D KQV 

MSASEGMKFKFHSGEKVLCFEPDPTKARVXiYDAKIVWIVGKDEKGRKIP 

MSASEGMKFKFHSGEKVLCFEPDPTKARVLYDAKIVDVIVGKDEKGRKIP 
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kyfihysgwn: 
eylihfngwn] 

MP] 

EYLIHFNGWN: 
M 




140 



150 




160 



170 



180 



190 



200 




I 



ELQT^jREMEERTgT||E I PEVLKjgQLEDD) 
ELQTRREMEERTITIEIPEVLKKQLEDD 
ELQTRREMEERTITIEIPEVLKKQLEDD 
ELQTRREMEERTITIEIPEVLKKQLEDD 
L^R^EM!|ERT I T 1 3 1 PtgVLKKQLEDD 
JVIEERTITIEIPEVLKKQLEDD 



NOV94 


gi 


17975757 | 


gi 


17975759 | 


gi 


14764458 | 


gi 


11545735 j 


gi 


17975761| 



210 
. - I . . 



I 



220 
. . I . . 



I 



230 
. . I . . 



240 



250 



CYYINRRKRLVgLPCglTNIITILESYVKHFAigAAFSANERPRHHHSMPHI 
CYYINRRKRLVKLPCQTNIITILESYVKHFAINAAFSANERPRHHHVMPHj 
CYYINRRKRLVKLPCQTNIITILESYVKHFAINAAFSANERPRHHHVMPH 
CYYINRRKRLVKLPCQTNIITILESYVKHFAINAAFSANERPRHHHVMPH 
CYYINRRKRLVKLPCQTNIITILESYVKHFAINAAFSANERPRHHHgMj5]H 
CYYINRRKRLVKLPCQTNIITILESYVKHFAINAAFSANERPRHHHVMPH 



I 



260 
. . I - . 



270 



I 



I 



280 



290 



300 



JL 



NOV94 

gi | 17975757 | 
gi j 17975759 | 
gi | 14764458 j 
gi | 11545735 j 
gi | 17975761 j 



NOV94 

gi | 17975757 | 
gi 1 17975759 [ 



{MNV[|YIPAEKN|jDLCKEMVDGLRITFDYTLPLVLLYPYEQAQYKKVTl 
ANMNVHYIPAEKNVDLCKEMVDGLRITFDYTLPLVLLYPYEQAQYKKVTS 
ANMNVHYIPAEKNVDLCKEMVDGLRITFDYTLPLVLLYPYEQAQYKKVTS 
WNVHYIPAEKNVDLCKEMVDGLRITFDYTLPLVLLYPYEQAQYKKVTS 

jMNVHYEPAEKNVDLCKEMVDGLRITFDYTLPLVLLYPYEQSQYKfgVTS 

Ab^NVHYIPAEKNVDLCKEMVDGLRITFDYTLPLVLLYPYEQAQYKKVTS 



I 



310 
. - I - - 



320 



I 



330 



340 



350 



SK^FLglKESATjgTNRSQEjgLSPSjflgLLNPSjgPQSTESQgTgjGEPATPKR 
SKFFLPIKESATSTNRSQEELSPSPPLLNPSTPQSTESQPTTGEPATPKR| 
SKFFLPIKESATSTNRSQEELSPSPPLLNPSTPQSTESQPTTGEPATPKR 
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skfflpikesatstnrsqeelspsppllnpstpqstesqpttgepatpkr 
skfflpikesStStnrsqeelspsppllnpstpqstesqpStgepatpkr 
skfflpikesatstnr5qeelspsppllnpstpqstesqpttgepatpkr 



gi | 14764458 
gi j 11545735 
gi | 17975761 



NOV94 

gi | 17975757 | 
gi j 17975759 | 
gi | 14764458 | 
gi |11545735| 
gi j 17975761 | 



NOV94 

gi | 17975757 | 
gi j 17975759 | 
gi | 14764458 | 
gi j 11545735 j 
gi | 17975761 | 



NOV94 

gi | 17975757 | 
gi j 17975759 j 
gi | 14764458 | 
gi | 11545735 j 
gi | 17975761 j 



NOV94 

gi | 17975757 | 
gi j 17975759 j 
gi | 14764458 | 
gi|ll545735| 
gi | 17975761 | 



360 
. . I . . 



370 
• - I . . 



I 



380 



I 



390 
. .1.. 



400 
-I 



RKAEPgAgQSLRRS^H|SANCDRLS|SSQSPQPKRiS|QQDiJ|SasSPKLFLH^ 
RKAEPEALQSLRRSTRHSANCDRLSESSASPQPKRRQQDTSASMPKLFLH 
RKAEPEALQSLRRSTRHSANCDRLSESSASPQPKRRQQDTSASMPKLFLH 
RKAEPEALQSLRRSTRHSANCDRLSESSASPQPKRRQQDTSASMPKLFLH 
RKAEPEALQSLRRSTRHSjjNCDRLSESSgSPQPKRRQQDTSASMPKLFLH 
RKAEPEALQSLRRSTRHSANCDRLSESSASPQPKRRQQPTSASMPKLFLH 




460 



[ . 



470 

- - L_- 



480 



490 



500 



I 



YPPGDQPPPPSYIYGAQHLLRLFVKLPEILGKMSFyEKNLKALLKHFDLF] 
YPPGDQPPPPSYIYGAQHLLRLFVKLPEILGKMSFSEKNLKALLKHFDLF 
YPPGDQPPPPSYIYGAQHLLRLFVKLPEILGKMSFSEKNLKALLKHFDLF 
YPPGDQPPPPSYIYGAQHLLRLFVKLPEILGKMSFSEKNLKALLKHFDLF 
YPPGDQPPPPSYIYGAQHLLRLFVKLPEILGKMSFSEKNLKALLKHFDLF 
YPPGDQPPPPSYIYGAQHLLRLFVKLPEILGKMSFSEKNLKALLKHFDLF 



I 



510 
. . I . . 



I 



520 

ml' 



530 

qSPnkc 



RFLAEYHDDFFPESAYVAA@EflHYST|2NP 
LRFLAEYHDDFFPESAYVAACEAHYSTKNPRAIY 
LRFLAEYHDDFFPESAYVAACEAHYSTKNPRAIY 



-VF@R- 



i[3a- 



LRFLAEYHDDFFPESAYVAACEAHYSTKNPRAIY 
LRFLAEYHDDFFPESAYVAACEAHYSTKNPRAIY 



Table 94F lists the domain description from DOMAIN analysis results against NOV94. 
This indicates that the NOV94 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 94F. Domain Analysis of NOV94 



gnllSmart|smart00298 , CHROMO, Chromatin organization modifier domain SEQ /ID NO: 885 

CD-Length = 55 residues, 
Score = 42.0 bits (97), Expect = 9e-05 



NOV94: 36 KDKQ VK Y F I H Y S GWNKNWDE WVPE S RVL K 64 

K +++Y + + G++ D W PE +L 
Sbjct: 14 KKGELEYLVKWKGYSYREDTWEPEENLLN 42 
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The Drosophila male-specific lethal (msl) genes regulate transcription from the male X 

chromosome in a dosage compensation pathway that equalizes X-linked gene expression in 

males and females. The members of this gene family, including msll, msl2, msB, mle, and 

mof, encode proteins with no sequence similarity to known proteins. However, mutations in 
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each of these genes produce a similar phenotype: sex-specific lethality of male embryos 
caused by the failure of mutants to increase transcription from the single male X chromosome. 

The MSL gene products assemble into a multiprotein transcriptional activation 
complex at hundreds of sites along the chromatin of the X chromosome. By searching 
5 sequence databases with the sequence of a BAC clone that maps to Xp22.3, Prakash et al. 
(1999) identified a human homolog of Drosophila msl3, MSL3-like-l (MSL3L1). They 
isolated a cDNA containing a complete MSL3L1 coding sequence. The deduced 521 -amino 
acid MSL3L1 protein shares 30% overall sequence identity with Drosophila MSL3 and 86% 
identity with mouse MsBll . Three segments of the Drosophila MSL3 protein are highly 

10 conserved in MSL3L1, including two putative chromodomains, one at the N terminus and the 
other at the C terminus. Chromodomains, which form a characteristic tertiary structure and can 
interact with components of chromatin, have been implicated to play roles in chromatin 
organization and transcriptional regulation. MSL3L1 also contains a putative nuclear 
localization signal, a putative leucine zipper motif within the second chromodomain, and two 

15 potential tyrosine kinase phosphorylation sites. 

Prakash et al. (1999) identified human fetal kidney cDNAs representing an 
alternatively spliced MSL3L1 transcript that lacks exon 2. The predicted protein, which is 
referred to as isoform 2, is identical to the first isoform from amino acid 62 to the C terminus 
but does not contain the first 26 amino acids of the N-terminal chromodomain. Northern blot 

20 analysis detected a major 2.4-kb MSL3L1 transcript in all tissues examined, namely liver, 

pancreas, heart, lung, kidney, skeletal muscle, brain, and placenta, with highest expression in 
skeletal muscle and heart. A 2.6-kb transcript unique to skeletal muscle was also found. 
Northern blot analysis of E7, El 1, El 5, and El 7 mouse embryos detected approximately equal 
levels of Msl311 expression in all embryos. The MSL3L1 gene spans 17 kb and contains 13 

25 exons. It is transcribed from telomere to centromere. Prakash et al. (1999) showed that the 
MSL3L1 gene undergoes X inactivation. 

NOV94 is predicted to be expressed in at least the following tissues: lung, testis, B- 
cell. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, public EST 

30 sources, literature sources, and/or RACE sources. Further expression data for NOV94 is 
provided in Example 2. 

The NOV 94 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 




hypercalcemia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 
5 immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 

conditions. NOV94 nucleic acids encoding the MSL3Ll-like protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a male-specific lethal 3-like 1-like 
10 protein includes the nucleic acid whose sequence is provided in Table 94A, or a fragment 

thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 94A while still encoding a protein 
that maintains its male-specific lethal 3-like 1-like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
1 5 are complementary to the sequence of Table 94A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
20 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 13% of the bases 
may be so changed. 

25 The novel protein of the invention includes the male-specific lethal 3-like 1-like 

protein whose sequence is provided in Table 94B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 
in Table 94B while still encoding a protein that maintains its male-specific lethal 3-like 1-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 

30 variant protein, up to about 21% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
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prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV95 

The disclosed NOV95 (alternatively referred to herein as CG56975-01) includes the 
1323 nucleotide sequence (SEQ ID NO:299) shown in Table 95 A. A NOV95 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 35-37 and ends with a stop codon 
at nucleotides 1301-1303. The disclosed NOV95 maps to human chromosome 1. 



Table 95A. NOV95 Nucleotide Sequence (SEQ ID NO:299) 



GCAATTCCTTTTCAATATTTATATATTTCAGAAAATGTCACTGAAATTCACAAATGCAAAACGGATTGAA 
GGACTTGATAGTAATGTGTGGATTGAATTTACCAAATTGGCTGCAGACCCTTCTGTTGTGAATCTTGGCC 
AAGGCTT T CCAGAT AT ATC C C C TC CT AC AT AT GT AAAAGAAGAATT AT C AAAG ATTGC AGC AAT CGATAG 
CCTGAATCAGTATACACGAGGCTTTGGCCATCCATCACTTGTGAAAGCTCTGTCCTATCTGTATGAAAAG 
CTTTATCAAAAGCAAATTGATTCAAATAAAGAAATCCTTGTGACAGTAGGAGCATATGGATCTCTTTTTA 
ACACCATTCAAGCATTAATTGATGAGGGACAGGTCATACTAATAGTGCCTTTCTATGACTGCTATGAGCC 
CATGGTGAGAATGGCTGGAGCAACACCTGTTTTTATTCCCCTGAGATCTGTAAGTTTGGGAAAAAGATGG 
T CT AGTT CTGACTGGAC AT TAG AT C CT C AAGAACT GG AAAGTAAAT TT AAT T C C AAAACC AAAGCT ATT A 
TACTAAATACTCCACATAACCCACTTGGCAAGGTATATAACAGAGAGGAACTGCAAGTAATTGCTGACCT 
TTGCATCAAATATGACACACTCTGCATCAGCGATGAGGTTTATGAATGGCTTGTATATTCTGGAAATAAG 
CACTTAAAAATAGCTACTTTTCCAGGTATGTGGGAGAGAACAATAACAATAGGAAGTGCTGGAAAGACTT 
TCAGTGTAACTGGCTGGAAGGTAGGCTGGTCCATTGGTCCAAATCATTTGATAAAACATTTACAGACAGT 
TCAACAAAACACGATTTATACTTGTGCAACTCCTTTACAGGAAGCCTTGGCTCAAGCTTTCTGGATTGAC 
ATC AAGCGC ATGGATG ACC C AG AATGT T ACTT T AAT TCT T TG C C AAAAGAGTT AG AAGT AAAAAGAGAT C 
GGATGGTACGTTTACTTGAAAGTGTTGGCCTAAAACCCATAGTTCCTGATGGAGGATACTTCATCATCGC 
TGATGTGTCTATTTTCATTGTGGTTTTAGATCCAGACCTCTCTGATATGAAGAATAATGAGCCTTATGAC 
TATAAGTTTGTGAAATGGATGACTAAACATCAGAAACTATCAGCCATCCCCGTTTCAGCATTCTGTAACT 
CAGAGACTAAATCACAGTTTGAGAAGTTTGTGCGTTTTTGCTTCATTAAAGTAAGTTCCCTGCTCGATGC 
TGCTGAAGAAATCATCAAGGCATGGAGTGTACAGAAGTCTTGATTTGTGCAGAATGGATTAAT 

A NOV95 polypeptide (SEQ ID NO:300) encoded by SEQ ID NO:299 is 422 amino 
acids in length and is presented using the one-letter amino acid code in Table 95B. The Psort 
profile for NOV95 predicts that this sequence is a Type II membrane protein, and is likely to 
be localized at the plasma membrane with a certainty of 0.4400. In alternative embodiments, a 
NOV95 polypeptide is located to microbodies with a certainty of 0.3691. 



Table 95B. NOV95 Polypeptide Sequence (SEQ ID 
NO:300) 

MSLKFTNAKRIEGLDSNVWIEFTKLAADPSWNLGQGFPDISPPTYVKEELSKIAAIDSL 
NQYTRG FGHP S L. VKAL S Y L Y E KL YQKQ I D SN KE I L VT VG A YG S L FNT I Q AL I DEGQ V I L I 
VPFYDCYEPMVRMAGATPVFIPLRSVSLGKRWSSSDWTLDPQELESKFNSKTKAIILNTP 
HNPLGKVYNREELQVIADLCIKYDTLCISDEVYEWLVYSGNKHLKIATFPGMWERTITIG 
S AGKT F S VTG W KVGW S I G PNH L I KHLQT VQQNT I YTC AT P LQE AL AQ AF W I D I KRMDD P E 
CYFNSLPKELEVKRDRMVRLLESVGLKPI VPDGGYFI IADVSIFIVVLDPDLSDMKNNEP 
YDYKFVKWMTKHQKLSAI PVSAFCNSETKSQFEKFVRFCFIKVSSLLDAAEEI IKAWSVQ 
KS 
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A BLAST analysis of NOV95 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV95 had high 
homology to other proteins as shown in Table 95C. 



Table 95C. BLASTX results from PatP database for NOV95 



Sequences producing High-scoring Segment Pairs: 

patp : AAY54 5 91 Amino acid sequence of a human transferase 
patp:AAR8 9 906 Human kynurenine aminotransferase (KAT) 
patp :AAR89896 Rat kynurenine aminotransferase (KAT) 
patp:AAR8 9 897 Rat kynurenine aminotransferase (KAT) 
patp : AAR89898 Rat kynurenine aminotransferase (KAT) 



Smallest 
Sum 

High Probability 
Score P (N) 

2121 2.2e-219 

1171 1.0e-118 

1130 2.2e-114 

1130 2.2e-114 

1130 2.2e-114 



10 



15 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 806 of 915 bases (88%) identical to a gb:GENBANK- 
ID:AF091090|acc:AF09 1090.1 mRNA from Homo sapiens (clone 669 unknown mRNA, 
complete sequence). The full amino acid sequence of the protein of the invention was found to 
have 229 of 415 amino acid residues (55%) identical to, and 300 of 415 amino acid residues 
(72%) similar to, the 419 amino acid residue ptnr:SPTREMBL-ACC:Q9W6U2 protein from 
Fugu rubripes (Japanese pufferfish) (Takifugu rubripes) (CYSTEINE CONJUGATE BETA- 
LYASE). NOV95 also has homology to the other proteins shown in the BLASTP data in 
Table 95D. 



Table 95D. NOV95 BLASTP results 




Gene Index / 
Identifier 


Protein / Organism 


Lengt 
h(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi | 12654031 | g 
b|AAH00819.l| 
AAH00819 
(BC000819) 


Similar to CG6950 gene 
product [Homo sapiens] 


290 


280/294 
(95) 


284/294 
(96) 


e-162 


gi | 5002565 | em 
b|CAB44334 . 1 | 
(Y17462) 


cysteine conjugate beta- 
lyase [Takifugu rubripes] 


419 


230/418 
(55) 


301/418 
(71) 


e-134 


gi | 4757928 | re 
f|NP 004050.1 
| (NM__004059) 


cytoplasmic cysteine 
conjugate-beta lyase ; 
glut amine -phenylpyruvate 
aminotransferase 
[Homo sapiens] 


422 


215/421 
(51) 


299/421 
(70) 


e-128 


gi | 15425868 | g 
b|AAK97625.l| 
AF395204_1 
(AF395204) 


kynurenine 

aminotransferase [Aedes 
aegypti] 


477 


224/420 
(53) 


284/420 
(67) 


e-126 


gi | 7299520 | gb 
| AAF54707 . 1 | 
(AE003693) 


CG6 95 0 gene product [alt 
3] 

[Drosophila melanogaster] 


417 


220/417 
(52) 


289/417 
(68) 


e-125 
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This BLASTP data is displayed graphically in the ClustalW in Table 95E. A multiple 
sequence alignment is given, with the NOV95 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 95D. 



Table 95E. ClustalW Alignment of NOV95 



NOV95 (SEQ ID NO: 300) 

gi|l265403l| (SEQ ID NO:774) 
gi|5002565| (SEQ ID NO:775) 
gi|4757928| (SEQ ID NO:776) 
gi|l5425868| (SEQ ID NO:777) 
gi|7299520| (SEQ ID NO:778) 



10 



20 



30 



40 
-I-- 



50 



NOV9 5 

gi | 12654031 | 
gi j 5002565 | 
gi j 4757928 j 
gi | 15425868 | 
gi j 7299520 | 



NOV95 

gi | 12654031 | 
gi | 5002565 | 
gi | 4757928 j 
gi j 15425868 | 
gi | 7299520 | 



NOV9 5 

gi | 12654031 | 
gi j 5002565 | 
gi |4757928 j 
gi | 15425868 | 
gi | 7299520 | 



NOV95 

gi | 12654031 | 
gi | 5002565 | 
gi |4757928 j 
gi j 15425868 | 
gi j 7299520 | 



NOV9 5 

gi | 12654031 | 
gi | 5002565 | 
gi | 4757928 | 
gi j 15425868 | 
gi j 7299520 | 



MMFLRNHNSVGGAIRTAWLQDLQFIVSNKSSALTGAVSSVHRQQIRTMS 



MSL0FTN. 



70 
IE@LDS! 



80 




-msIrihanQtis 



Sid: _ 
Sidy^p] 

STSNETMHNraFDLP^KgYQgSTK 

- me|3fdlp|(@lqSstp! 




110 



120 



130 



■QgKpy 

863 t 1 

qSkpf 

140 




150 



|....|....| 
T^VKEELSKIAAIDS- -j 



k^qeKfchalnegp 
d£avej|fqhavsgdf - ft. 

- <JSPDP^ 
E Y V V HSjLADIAKEQNP; 

160 




170 



180 



190 



200 




260 



270 



280 



290 



300 
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NOV9 5 

gi | 12654031 | 
gi | 5002565 | 
gi |4757928 
gi j 15425868 | 
gi | 7299520 | 



NOV95 

gi | 12654031 | 
gi |5002565 | 
gi |4757928 | 
gi I 15425868 
gi j 7299520 | 



NOV 9 5 

gi | 12654031 
gi j 5002565 
gi | 4757928 
gi | 15425868 | 
gi j 7299520 | 



NOV9 5 

gi | 12654031 | 
gi | 5002565 
gi | 4757928 
gi | 15425868 
gi | 7299520 | 



NOV95 

gi | 12654031 | 
gi |5002565 | 
gi | 4757928 j 
gi | 15425868 | 
gi j 7299520 | 




Table 95F lists the domain description from DOMAIN analysis results against NOV95. 
This indicates that the NOV95 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 95F. Domain Analysis of NOV95 



gnl | Pf am | pf am00155 , aminotran_l_2 , Aminotransferase class I and II SEQ ID NO: 886 



CD-Length = 316 residues, 94.3% aligned 
Score = 117 bits (292), Expect = 2e-27 



151 



NOV95 : 93 EILVTVGAYGSLFNTIQALIDEGQVILI - VPFYDCYEPMVRMAGATPVFIPLRSVSLGKR 

+ + L GA I G +L+ PYY++AG +LVL 
Sbjct : 17 QVLAGTGAKEVAALFISCFAAPGDAVLVPDPTYPIYSDVIjNHAGGI VRLYPVPLRSS 73 

NOV95 : 152 WS S SDWTLDPQELESKFNSKTKAI I LiNTPHNPLGKVYNREELQVI ADLC I KYDTL.C I SDE 211 

+ D+ + LE +K +++ PHNP G +L+ + DL +++ L + DE 

SbjcC: 74 NHN-DFKALEEALEEA- PEGSKWLVANPHNPTGMDGTLADLEKLLDLAKEHNILLLVDE 131 
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NOV95 : 212 VYEWLVYSGNKHLKIATFPGMWERTITIGSAGKTFSVTGWKVGWSIG PNHL 262 

Y V+G IA ++++SKF+G ++G + G + + 

Sb j Ct : 13 2 AYAGGVFGGLDGASI AELLDEYDNLLWQSLSKNFGLAGKRLGGAAGGIVAGSAASFDRV 191 

NOV95 : 263 IKHLQTVQQNTIYTCATPLQEALAQAFWIDIKRMDDPECYFNSLPKELEVKRDRMVRLLE 322 

+ + AT A++D E +L + 

Sbjct : 192 SSQSRALLF AT SS APPAVGAAI VAL ILQD KERLERWLKELKK 233 

NOV95: 323 SVGLKPIVPDGGYFIIADVSIFI WLDPDLSDMKNNEPYDYKFVKWMTKHQKLSAI PVSA 382 

+GL+ ++ G+ + D S 1+ L + + +P S 

Sbjct : 234 MLGLRVLLSRAGFVLWLDPS-GILPLWTFEDQA GLFSALLLEEHGVWPGSE 284 

NOV95: 383 FCNSETKSQFEKFVRFCFI KVS SL - LDAAEE 1 1 KA 416 

F R ++ LD E I+A 
Sbjct: 285 FPTVPPGW GRISLAGLTDETLDELLEAIRA 314 ' 



Aminotransferases share certain mechanistic features with other pyridoxal-phosphate 
dependent enzymes, such as the covalent binding of the pyridoxal-phosphate group to a lysine 
residue. On the basis of sequence similarity, these various enzymes can be grouped into 
5 subfamilies, one of which is the class I. Class I aminotransferases include cysteine conjugate 
beta-lyase. Living organisms employ a variety of metabolic pathways when detoxifying 
xenobiotic compounds, including the formation of cysteine S-conjugates via glutathione 
conjugation. Kidney cysteine conjugate beta-lyase (glutamine transaminase K, kyneurenine 
aminotransferase, EC 2.6.1.64) metabolises the cysteine conjugates of certain halogenated 

10 alkenes and alkanes to form reactive metabolites which can produce nephrotoxicity and 
neurotoxicity in experimental animals and man. 

NOV95 is predicted to be expressed in at least the following tissues: kidney, liver, 
colon, gall bladder. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 

15 public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV95 is provided in Example 2. 

The NOV95 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 

20 hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 
immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 

25 conditions. NOV95 nucleic acids encoding the aminotransferase-like protein of the invention, 
or fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. 
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The novel nucleic acid of the invention encoding a cysteine conjugate beta- lyase- like 
protein includes the nucleic acid whose sequence is provided in Table 95A, or a fragment 
thereof. The invention also includes a mutant or variant nucleic acid any of whose bases may 
be changed from the corresponding base shown in Table 95A while still encoding a protein 
5 that maintains its cysteine conjugate beta-lyase-like activities and physiological functions, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose sequences 
are complementary to the sequence of Table 95 A, including nucleic acid fragments that are 
complementary to any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 

10 complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 

1 5 In the mutant or variant nucleic acids, and their complements, up to about 12% of the bases 
may be so changed. 

The novel protein of the invention includes the cysteine conjugate beta-lyase-like 
protein whose sequence is provided in Table 95B. The invention also includes a mutant or 
variant protein any of whose residues may be changed from the corresponding residue shown 

20 in Table 95B while still encoding a protein that maintains its cysteine conjugate beta-lyase-like 
activities and physiological functions, or a functional fragment thereof. In the mutant or 
variant protein, up to about 45% of the amino acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 

25 methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 

NOV96 

NOV96 includes three monocarboxylate transporter-like proteins, designated herein as 
30 NOV96a, NOV96b and NOV96c. 

NOV96a 
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The disclosed NOV96a (alternatively referred to herein as CG5691 8-01) includes the 
1302 nucleotide sequence (SEQ ID NO:301) shown in Table 96A. A NOV96a ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 9-1 1 and ends with a stop codon 
at nucleotides 1287-1289. The disclosed NOV96a maps to human chromosome 17. 



CCGCTTAGATGGCGCGCAGGACAGAGCCCCCCGACGGGGGCTGGGGATGGGTGGTGGTGC 
TCTCAGCGTTCTTCCAGTCGGCGCTTGTGTTTGGGGTGCTCCGCTCCTTTGGGGTCTTCT 
TCGTGGAGTTTGTGGCGGCGTTTGAGGAGCAGGCAGCGCGCGTCTCCTGGATCGCCTCCA 
TAGGAATCGCGGTGCAGCAGTTTGGGAGCCCGGTAGGCAGTGCCCTGAGCACGAAGTTCG 
GGCCCAGGCCCGTGGTGATGACTGGAGGCATCTTGGCTGCGCTGGGGATGCTGCTCGCCT 
CTTTTGCTACTTCCTTGACCCACCTATACCTGAGTATTGGGTTGCTGTCAGGCTCTGGCT 
GGGCTTTGACCTTCGCTCCGACCCTGGCCTGCCTGTCCTGTTATTTTTCTCGCCGACGAT 
CCCTGGCCACCGGGCTGGCACTGACAGGCGTGGGCCTCTCCTCCTTCACATTTGCCCCCT 
TTTTCCAGTGGCTGCTCAGCCACTACGCCTGGAGGGGGTCCCTGCTGCTGGTGTCTGCCC 
TCTCCCTCCACCTAGTGGCCTGTGGTGCTCTCCTCCGCCCACCCTCCCTGGCTGAGGACC 
CTGCTGTGGGTGGTCCCAGGGCCCAACTCACCTCTCTCCTCCATCATGGCCCCTTCCTCC 
GTTACACTGTTGCCCTCACCCTGATCAACACTGGCTACTTCATTCCCTACCTCCACCTGG 
TGGCCCATCTCCAGGACCTGGATTGGGACCCACTACCTGCTGCCTTCCTACTCTCAGTTG 
TTGCTATTTCTGACCTCGTGGGGCGTGTGGTCTCCGGATGGCTGGGAGATGCAGTCCCAG 
GGCCTGTGACACGACTCCTGATGCTCTGGACCACCTTGACTGGGGTGTCACTAGCCCTGT 
TCCCTGTAGCTCAGGCTCCCACAGCCCTGGTGGCTCTGGCTGTGGCCTACGGCTTCACAT 
CAGGGGCTCTGGCCCCACTGGCCTTCTCCGTGCTGCCTGAACTAATAGGGACTAGAAGGA 
TTTACTGTGGCCTGGGACTGTTGCAGATGGTAGAGAGCATCGGGGGGCTGCTGGGGCCTC 
CTCTCTCAGGCTACCTCCGGGATGTGACAGGCAACTACACGGCTTCTTTTGTGGTGGCTG 
GGGCCTTCCTTCTTTCAGGGAGTGGCATTCTCCTCACCCTGCCCCACTTCTTCTGCTCCT 
CAACTACTACCTCCGGGCCCCAGGACCTTGTAACAGAAGCACTAGATACTAAAGTTCCCC 
T AC C C AAGGAGGGGCTGGAAG AGGACTGAACT CC AC AG AGT C 



A NOV96a polypeptide (SEQ ID NO.302) encoded by SEQ ID NO:301 is 426 amino 
acids in length and is presented using the one-letter amino acid code in Table 96B. The Psort 
profile for NOV96a predicts that this sequence is a Type Ilia membrane protein, has a signal 
10 peptide, and is likely to be localized to the plasma membrane with a certainty of 0.6400. In 
alternative embodiments, a NOV96a polypeptide is located to the Golgi with a certainty of 
0.4600, or to the endoplasmic reticulum (membrane) with a certainty of 0.3700. The Signal P 
predicts a likely cleavage site for a NOV96a peptide is between positions 34 and 35, i.e., at the 
dash in the sequence VAA-FE. 



MARRTE P PDGGWG WVWL S AF FQ S AL V FG VL RS FG VF FVE FV AAF E E QAAR VS W I AS I G I 
AVQQFGSPVGSALSTKFGPRPWMTGGILAALGMLLASFATSLTHLYLSIGLLSGSGWAL 
TFAPTLACLSCYFSRRRSLATGLALTGVGLSSFTFAPFFQWLLSHYAWRGSLLLVSALSL 
HLVACGALLRPPSLAEDPAVGGPRAQLTSLLHHGPFLRYTVALTLINTGYFIPYLHLVAH 
LQDLDWDPLPAAFLLSWAISDLVGRWSGWLGDAVPGPVTRLLMLWTTLTGVSLALFPV 
AQAPTALVAJLAVAYGFTSGALAPLAFSVLPELIGTRRIYCGLGLLQMVESIGGLLGPPLS 
GYLRDVTGNYTAS FWAGAFLLSGSGILLTLPHFFCSSTTTSGPQDLVTEALDTKVPLPK 
EGL.EED 
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Table 96A. NOV96a Nucleotide Sequence (SEQ ID 
NO:301) 



15 



Table 96B. NOV96a Polypeptide Sequence (SEQ ID 
NO:302) 
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NOV96b 



10 



The disclosed NOV96b (alternatively referred to herein as CG56918-02) includes the 
1294 nucleotide sequence (SEQ ID NO:303) shown in Table 96C. A SEC2 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a TGA codon at 
nucleotides 1279-1281 . The disclosed NOV96b maps to human chromosome 17. 



ATGGCGCGCAGGACAGAGCCCCCCGACGGGGGCTGGGGATGGGTGGTGGTGCTCTCAGCG 
TTCTTCCAGTCGGCGCTTGTGTTTGGGGTGCTCCGCTCCTTTGGGGTCTTCTTCGTGGAG 
TTTGTGGCGGCGTTTGAGGAGCAGGCAGCGCGCGTCTCCTGGATCGCCTCCATAGGAATC 
GCGGTGCAGCAGTTTGGGAGCCCGGTAGGCAGTGCCCTGAGCACGAAGTTCGGGCCCAGG 
CCCGTGGTGATGACTGGAGGCATCTTGGCTGCGCTGGGGATGCTGCTCGCCTCTTTTGCT 
ACTTCCTTGACCCACCTATACCTGAGTATTGGGTTGCTGTCAGGCTCTGGCTGGGCTTTG 
ACCTTCGCTCCGACCCTGGCCTGCCTGTCCTGTTATTTTTCTCGCCGACGATCCCTGGCC 
ACCGGGCTGGCACTGACAGGCGTGGGCCTCTCCTCCTTCACATTTGCCCCCTTTTTCCAG 
TGGCTGCTCAGCCACTACGCCTGGAGGGGGTCCCTGCTGCTGGTGTCTGCCCTCTCCCTC 
CACCTAGTGGCCTGTGGTGCTCTCCTCCGCCCACCCTCCCTGGCTGAGGACCCTGCTGTG 
GGTGGTCCCAGGGCCCAACTCACCTCTCTCCTCCATCATGGCCCCTTCCTCCGTTACACT 
GTTGCCCTCACCCTGATCAACACTGGCTACTTCATTCCCTACCTCCACCTGGTGGCCCAT 
CTCCAGGACCTGGATTGGGACCCACTACCTGCTGCCTTCCTACTCTCAGTTGTTGCTATT 
TCTGACCTCGTGGGGCGTGTGGTCTCCGGATGGCTGGGAGATGCAGTCCCAGGGCCTGTG 
ACACGACTCCTGATGCTCTGGACCACCTTGACTGGGGTGTCACTAGCCCTGTTCCCTGTA 
GCTCAGGCTCCCACAGCCCTGGTGGCTCTGGCTGTGGCCTACGGCTTCACATCAGGGGCT 
CTGGCCCCACTGGCCTTCTCCGTGCTGCCTGAACTAATAGGGA'CTAGAAGGATTTACTGT 
GGC CTGGG ACTGT TGC AGATGGT AGAG AGC AT CGGGGGGC TGCTGGGGC CT C C TCT CTC A 
GGCTACCTCCGGGATGTGACAGGCAACTACACGGCTTCTTTTGTGGTGGCTGGGGCCTTC 
CTTCTTTCAGGGAGTGGCATTCTCCTCACCCTGCCCCACTTCTTCTGCTCCTCAACTACT 
ACCTCCGGGCCCCAGGACCTTGTAACAGAAGCACTAGATACTAAAGTTCCCCTACCCAAG 
GAGGGACTGGAAGAGGACTGAACTCCACAGAGTC 



A NOV96b polypeptide (SEQ ID NO:304) encoded by SEQ ID NO:303 is 426 amino 
acids in length and is presented using the one-letter amino acid code in Table 96D. The Psort 
profile for NOV96b predicts that this sequence is likely to be a Type Ilia membrane protein, 
has a signal peptide, and is likely to be localized to the plasma membrane with a certainty of 
0.6400. In alternative embodiments, a NOV96b polypeptide is located to the Golgi with a 
certainty of 0.4600, or to the endoplasmic reticulum (membrane) with a certainty of 0.3700. 
The Signal P predicts a likely cleavage site for aNOV96b peptide is between positions 44 and 
45, i.e., at the dash in the sequence VAA-FE. 



MARRTEPPDGGWGWVWLSAFFQSALVFGVLRSFGVFFVEFVAAFEEQAARVSWIASIGI 
AVQQFGSPVGSALSTKFGPRPWMTGGILAALGMLLASFATSLTHLYLSIGLLSGSGWAL 
TFAPTLACLSCYFSRRRSLATGLALTGVGLSSFTFAPFFQWLLSHYAWRGSLLLVSALSL 
HLVACGALLRPPSLAEDPAVGGPRAQLTSLLHHGPFLRYTVALTLINTGYFIPYLHLVAH 
LQDLDWDPLPAAFLLSWAISDLVGRWSGWLGDAVPGPVTRLLMLWTTLTGVSLALFPV 
AQAPTALVALAVAYGFTSGALAPLAFSVLPELIGTRRIYCGLGLLQMVESIGGLLGPPIiS 
GYLRDVTGNYTASFWAGAFLLSGSGILLTLPHFFCSSTTTSGPQDLVTEALDTKVPLPK 



Table 96C. NOV96b Nucleotide Sequence (SEQ ID 
NO:303) 



Table 96D. NOV96b Polypeptide Sequence (SEQ ID 
NO:304) 
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[ EGLEED I 

NQV96c 

The disclosed NOV96c (alternatively referred to herein as CG56918-03) includes the 
1445 nucleotide sequence (SEQ ID NO:305) shown in Table 96E. A SEC2 ORF begins with a 
5 Kozak consensus ATG initiation codon at nucleotides 9-1 1 and ends with a stop codon at 
nucleotides 1287-1289. The disclosed NOV96c maps to human chromosome 17. 



Table 96E. NOV96c Nucleotide Sequence (SEQ ID 
NO:305) 

CCGCTTAGATGGCGCGCAGGACAGAGCCCCCCGACGGGGGCTGGGGATGGGTGGTGGTGC 
TCTCAGCGTTCTTCCAGTCGGCGCTTGTGTTTGGGGTGCTCCGCTCCTTTGGGGTCTTCT 
TCGTGGAGTTTGTGGCGGCGTTTGAGGAGCAGGCAGCGCGCGTCTCCTGGATCGCCTCCA 
TAGGAATCGCGGTGCAGCAGTTTGGGAGCCCGGTAGGCAGTGCCCTGAGCACGAAGTTCG 
GGCCCAGGCCCGTGGTGATGACTGGAGGCATCTTGGCTGCGCTGGGGATGCTGCTCGCCT 
CTTTTGCTACTTCCTTGACCCACCTATACCTGAGTATTGGGTTGCTGTCAGGCTCTGGCT 
GGGCTTTGACCTTCGCTCCGAGCCTGGCCTGCCTGTCCTGTTATTTCTCTCGCCGACGAT 
CCCTGGCCACCGGGCTGGCACTGACAGGCGTGGGCCTCTCCTCCTTCACATTTGCCCCCT 
TTTTCCAGTGGCTGCTCAGCCACTACGCCTGGAGGGGGTCCCTGCTGCTGGTGTCTGCCC 
TCTCCCTCCACCTAGTGGCCTGTGGTGCTCTCCTCCGCCCACCCTCCCTGGCTGAGGACC 
CTGCTGTGGGTGGTCCCAGGGCCCAACTCACCTCTCTCCTCCATCATGGCCCCTTCCTCC 
GTTACACTGTTGCCCTCACCCTGATCAACACTGGCTACTTCATTCCCTACCTCCACCTGG 
TGGCC C AT CT C C AGG AC CT GG ATTGGG AC C C ACT ACC TGC TGC CTT C CT AC TCT C AGTTG 
TTGCTATTTCTGACCTCGTGGGGCGTGTGGTCTCCGGATGGCTGGGAGATGCAGTCCCAG 
GGC CTGTG AC ACG ACT CCTG ATGC TCT GG AC C AC CTTGACTGGGGTGT C AC T AGCC C TGT 
TCCCTGTAGCTCAGGCTCCCACAGCCCTGGTGGCTCTGGCTGTGGCCTACGGCTTCACAT 
CAGGGGCTCTGGCCCCACTGGCCTTCTCCGTGCTGCCTGAACTAATAGGGACTAGAAGGA 
TTTACTGTGGCCTGGGACTGTTGCAGATGATAGAGAGCATCGGGGGGCTGCTGGGGCCTC 
CTCTCTCAGGCTACCTCCGGGATGTGTCAGGCAACTACACGGCTTCTTTTGTGGTGGCTG 
GGGCCTTCCTTCTTTCAGGGAGTGGCATTCTCCTCACCCTGCCCCACTTCTTCTGCTTCT 
CAACTACTACCTCCGGGCCTCAGGACCTTGTAACAGAAGCACTAGATACTAAAGTTCCCC 
TACCCAAGGAGGGGCTGGAAGAGGACTGAACTCCACAGAGTCAGGCCCAGAAAGCCAAAG 
CTTGACAGCTCCAGGTCTTCTCTTGCCACGTCTTGGTCTCCACAGAACCACAGTGCCTTA 
AGATTCTTGATCTGCCTCCCCCTAGAGCAGGCCTGGGGCTCCTGCAATGTGTGTGCCAAC 
CCTTT 



The NOV96c polypeptide (SEQ ID NO:306) encoded by SEQ ID NO:305 is 426 
10 amino acids in length and is presented using the one-letter amino acid code in Table 96F. The 
Psort profile for NOV96c predicts that this sequence is a Type III a membrane protein, has a 
signal peptide, and is likely to be localized to the plasma membrane with a certainty of 0.6400. 
In alternative embodiments, a NOV96c polypeptide is located to the Golgi with a certainty of 
0.4600, or to the endoplasmic reticulum (membrane) with a certainty of 0.3700. The Signal P 
15 predicts a likely cleavage site for a NOV96c peptide is between positions 34 and 35, i.e., at the 
dash in the sequence VAA-FE. 
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Table 96F. NOV96c Polypeptide Sequence (SEQ ID 
NO:306) 



MARRTEPPDGGWGWVWLSAFFQSALVFGVLRSFGVFFVEFVAAFEEQAARVSWIASIGI 
AVQQFGSPVGSALSTKFGPRPWMTGGILAALGMLLASFATSLTHLYLSIGLLSGSGWAL 
TFAPSLACLSCYFSRRRSLATGLALTGVGLSSFTFAPFFQWLLSHYAWRGSLLLVSALSL 
HLVACGALLRPPSLAEDPAVGGPRAQLTSLLHHGPFLRYTVALTLINTGYFIPYLHLVAH 
LQDLDWDPLPAAFLLSWAISDLVGRWSGWLGDAVPGPVTRLLMLWTTLTGVSLALFPV 
AQAPTALVALAVAYGFTSGALAPLAFSVLPELIGTRRIYCGLGLLQMIESIGGLLGPPLS 
GYLRDVSGNYTASFWAGAFLLSGSGILLTLPHFFCFSTTTSGPQDLVTEALDTKVPLPK 
EGLEED 



A BLAST analysis of NOV96 was run against the proprietary PatP GENESEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV96a had high 
homology to other proteins as shown in Table 96G. 



Table 96G. BLASTX results from PatP database for NOV96a 










Smallest 










Sum 








High 


Probability 


Sequences producing High- scoring Segment Pairs: 


Score 


P(N) 


patp 


AAU01618 


Human secreted protein 


940 


3 .Oe-94 


patp 


AAM93737 


Human polypeptide 


940 


3 . Oe-94 


patp 


AAB88570 


Human hydrophobic domain containing protein 


620 


2 . 5e-60 


patp 


AAY31642 


Human transport -associated protein-4 


602 


2 . 0e-58 


patp 


AAU01586 


Human secreted protein related to gene #26 


357 


1.8e-32 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 420 of 702 bases (59%) identical to a gb:GENBANK- 
ID:RNU87627|acc:U87627.1 mRNA from Rattus norvegicus (Rattus norvegicus putative 

10 monocarboxylate transporter (MCT3) mRNA). The full amino acid sequence of the protein of 
the invention was found to have 89 of 191 amino acid residues (46%) identical to, and 1 19 of 
191 amino acid residues (62%) similar to, the 504 amino acid residue ptnr: SPTREMBL- 
ACC:O95907 protein from Homo sapiens (Human) (DJ1039K5.2 (SIMILAR TO 
MONOCARBOXYLATE TRANSPORTER (MCT3)). NOV96 also has homology to the 

1 5 other proteins shown in the BLASTP data in Table 96H. 



Table 96H. NOV96 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi | 17491104 | re 
f |XP_064368 . 1 | 
(XM_064368) 


similar to solute 
carrier family 16 
(monocarboxylic acid 
transporters) , member 8 
(H. sapiens) 
[Homo sapiens] 


427 


424/427 
(99) 


425/427 
(99) 


0.0 



.. II L IS 



'fi i." ' , . \l M v 



gi | 14754801 | re 
f |XP_044738 . 1 | 
(XM_044738) 


solute carrier family 
16 (monocarboxylic acid 
transporters) , member 7 

[Homo S3.pj.GI2S] 


478 


141/410 
(34) 


213/410 
(51) 


2e-57 


gi | 4759120 | ref 
|NP_004722 . 1 | 
(NM_004731) 


solute carrier family 
16 (monocarboxylic acid 
transporters) , member 
7; monocarboxylate 
transporter 2 
[Homo sapiens} 


478 


141/410 
(34) 


213/410 
(51) 


2e-57 


gi|2497855|sp| 
Q6 3 3 44 |MOT2_RA 
T 


MONOCARBOXYLATE 
TRANSPORTER 2 (MCT 2) 


489 


140/410 
(34) 


215/410 
(52) 


7e-57 


gi | 1432167 | gb | 
AAB04023 . 1 j 
(U62316) 


monocarboxylate 
transporter 2 
[Rattus norvegicus] 


489 


141/410 
(34) 


215/410 
(52) 


8e-57 



This BLASTP data is displayed graphically in the ClustalW in Table 961. A multiple 
sequence alignment is given, with the NOV96a, b, and c proteins being shown on lines 1, 2, 
and 3 in a ClustalW analysis comparing the protein of the invention with the related protein 
5 sequences shown in Table 96H. 



Table 96L ClustalW Alignment of NOV96 



NOV9 6a 
NOV9 6b 
NOV9 6C 
gi | 17491104 | 
gi | 14754801 1 
gi | 4759120 | 
gi|2497855| 
gi | 1432167 j 



NOV96a 
NOV96b 
NOV9 6C 
gi | 17491104 | 
gi j 14754801 | 
gi | 4759120 | 
gi | 2497855 j 
gi | 1432167 j 



NOV9 6a 
NOV 9 6b 
N0V96C 
gi | 17491104 | 
gi j 14754801 j 
gi j 4759120 | 
gi j 2497855 | 
gi j 1432167 j 



NOV9 6a 
NOV9 6b 
N0V96C 
gi | 17491104 | 



(SEQ ID NO: 3 02) 
(SEQ ID NO: 3 04) 
(SEQ ID NO: 306) 
(SEQ ID NO:779) 
(SEQ ID NO:780) 
(SEQ ID NO: 781) 
(SEQ ID NO: 782 
(SEQ ID NO: 7 83) 
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gi | 14754801 | 
gi |4759120 | 
gi j 2497855 j 
gi j 1432167 j 



NOV9 6a 
NOV96b 
NOV9 6C 
gi | 17491104 | 
gi j 14754801 | 
gi |4759120 | 
gi |2497855 j 
gi | 1432167 j 



NOV9 6a 
NOV9 6b 
NOV9 6C 
gi | 17491104 | 
gi j 14754801 j 
gi | 4759120 | 
gi | 2497855 j 
gi j 1432167 | 



NOV9 6a 
NOV9 6b 
NOV96C 
gi | 17491104 | 
gi | 14754801 j 
gi |4759120 | 
gi |2497855| 
gi | 1432167 j 



NOV9 6a 
NOV9 6b 
NOV9 6c 
gi | 17491104 | 
gi j 14754801 j 
gi j 4759120 | 
gi | 2497855 j 
gi | 1432167 j 



NOV9 6a 
NOV9 6b 
NOV9 6C 
gi | 17491104 | 
gi j 14754801 | 
gi |4759120 | 
gi j 2497855 | 
gi j 1432167 | 



NOV9 6a 
NOV9 6b 
NOV9 6C 
gi | 17491104 | 
gi j 14754801 | 
gi | 4759120 | 
gi | 2497855 | 




210 



220 




230 240 
|....| |. % ..| |....|....|.-..|-...| 
PSLppP^GGPRAQLHS 

pslSdpBvggpI|aqlHs 

pslSedpBvggp^IaqlHs 

psiHdpHvggp^aqlHs 

Ilgpi^tt^ksknIstgkHeddsspkkiktkkstwekvnkyldfs 

,GPN^TT^KS KN^TGkBeDDS S PKKI KTKKST WEKVNKYLDFS 
GP|PRAg|KSKS^VGSRQDSSTKR- -LSKVSTAEKINRFLDFG IFT 
Gp|pRA§KSKS|§VGSRQDSSTKR- - LSKVSTAEKINRFLDFGgFT 



250 

FK 
FK 



270 



280 



290 



300 
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gi | 1432167 | ^K^L^i ^3Q0 K Y L SW!?S 1 V MB S G 1 YjJJ I CNA I NY RLLE KE RKkEKARR 



NOV96a 
NOV9 6b 
NOV9 6C 
gi | 17491104 | 
gi | 14754801 j 
gi j 4759120 | 
gi j 2497855 j 
gi | 1432167 | 



460 470 

....I.... I. | i 

A LDTKV^PgEGLE[3 

A LDTK *"* 

A LDTK' 

A LDTKV 

K TRESE 

K SRESE 



480 
..|.. 



I 



pHegleB I 

lEGLEg h 

:gle| S 

ISKHSy SvNVKVSNAQSVTSER- -ETNI 
JsKHS| BVNVKVSNAQSVTSER- -ETNI 
KKSASQASKEMEAfflsMSKQ-bSvTVKVSNTHNPPSDRDKESSI 
KKSASQASKEMEAaslsKQ-jC^TVKVSNTHNPPSDRDKESSI 



Monocarboxylates such as lactate and pyruvate play a central role in cellular 
metabolism and metabolic communication between tissues. Essential to these roles is their 
rapid transport across the plasma membrane, which is catalysed by a recently identified family 
5 of proton-linked monocarboxylate transporters (MCTs). Nine MCT-related sequences have so 
far been identified in mammals, each having a different tissue distribution, whereas six related 
proteins can be recognized in Caenorhabditis elegans and four in Saccharomyces cerevisiae. 

Direct demonstration of proton-linked lactate and pyruvate transport has been 
demonstrated for mammalian MCT1-MCT4, but only for MCT1 and MCT2 have detailed 

10 analyses of substrate and inhibitor kinetics been described following heterologous expression 
in Xenopus oocytes. MCT1 is ubiquitously expressed, but is especially prominent in heart and 
red muscle, where it is up-regulated in response to increased work, suggesting a special role in 
lactic acid oxidation. By contrast, MCT4 is most evident in white muscle and other cells with 
a high glycolytic rate, such as tumour cells and white blood cells, suggesting it is expressed 

15 where lactic acid efflux predominates. MCT2 has a ten-fold higher affinity for substrates than 
MCT1 and MCT4 and is found in cells where rapid uptake at low substrate concentrations 
may be required, including the proximal kidney tubules, neurons and sperm tails. MCT3 is 
uniquely expressed in the retinal pigment epithelium. MCT1 and MCT4 have been shown to 
interact specifically with OX-47 (CD147), a member of the immunoglobulin superfamily with 

20 a single transmembrane helix. This interaction appears to assist MCT expression at the cell 
surface (Halestrap and Price, 1999, Biochem. J. vol.343: 281-99). 

NOV96 is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, foreskin, 

25 heart, kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
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public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV96 is provided in Example 2. 

The NOV96 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Von 
5 Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 

hypercalcemia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 

10 immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 

conditions. NOV96 nucleic acids encoding the monocarboxylate transporter-like protein of 
the invention, or fragments thereof, may further be useful in diagnostic applications, wherein 
the presence or amount of the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a monocarboxylate transporter-like 

15 protein includes the nucleic acid whose sequence is provided in Table 96 A, 105C, or 105E, or 
a fragment thereof. The invention also includes a mutant or variant nucleic acid any of whose 
bases may be changed from the corresponding base shown in Table 96A, 105C, or 105E while 
1 still encoding a protein that maintains its monocarboxylate transporter-like activities and 
physiological functions, or a fragment of such a nucleic acid. The invention further includes 

20 nucleic acids whose sequences are complementary to the sequence of Table 96A, 105C, or 

105E, including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

25 phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 41% of the bases 
may be so changed. 

30 The novel protein of the invention includes the monocarboxylate transporter-like 

protein whose sequence is provided in Table 96B, 105D, or 105F. The invention also includes 
a mutant or variant protein any of whose residues may be changed from the corresponding 
residue shown in Table 96B, 105D, or 105F while still encoding a protein that maintains its 
monocarboxylate transporter-like activities and physiological functions, or a functional 
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fragment thereof. In the mutant or variant protein, up to about 54% of the amino acid residues 
may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
5 methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV97 

NOV97 includes six carboxypeptidase-like proteins, designated herein as NOV97a, 
1 0 NO V97b, NOV97c, NO V97d, NO V97e, and NOV97f. 



NOV97a 

The disclosed NOV97a (alternatively referred to herein as CG57070-01) includes the 
1279 nucleotide sequence (SEQ ID NO:307) shown in Table 97A. A NOV97a ORF begins 
15 with a Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a stop codon at 
nucleotides 1258-1260. The disclosed NOV97a maps to human chromosome 7q 31. 



Table 97A. NOV97a Nucleotide Sequence (SEQ ID 
NO:307) 

ATGCGGGGGTTGCTGGTGTTGAGTGTCCTGTTGGGGGCTGTCTTTGGCAAGGAGGACTTT 
GTGGGGCATCAGGTGCTCCGAATCTCTGTAGCCGATGAGGCCCAGGTACAGAAGGTGAAG 
GAGCTGGAGGACCTGGAGCACCTGCAGCTGGACTTCTGGCGGGGGCCTGCCCACCCTGGC 
TCCCCCATCGACGTCCGAGTGCCCTTCCCCAGCATCCAGGCGGTCAAGATCTTTCTGGAG 
T C C C ACGGCAT C AGCT ATGAGAC C ATG AT CGAGG ACGTGC AGT CG CTG CTGG ACGAGGAG 
CAGGAGCAGATGTTCGCCTTCCGGTCCCGGGCGCGCTCCACCGACACTTTTAACTACGCC 
ACCTACCACACCCTGGAGGAGGTGTATAGCTGGATTGACAACTTTGTAATGGAGCATTCC 
GATATTGTCTCAAAAATTCAGATTGGCAACAGCTTTGAAAACCAGTCCATTCTTGTCCTG 
AAGTTCAGCACTGGAGGTTCTCGGCACCCAGCCATCTGGATCGACACTGGAATTCACTCC 
CGGGAGTGGATCACCCATGCCACCGGCATCTGGACTGCCAATAAGATTGTCAGTGATTAT 
GGCAAAGACCGTGTCCTGACAGACATACTGAATGCCATGGACATCTTCATAGAGCTCGTC 
ACAAACCCTGATGGGTTTGCTTTTACCCACAGCATGAACCGCTTATGGCGGAAGAACAAG 
TCCATCAGACCTGGAATCTTCTGCATCGGCGTGGATCTCAACAGGAACTGGAAGTCGGGT 
TTTGGAGGAAATGGTTCTAACAGCAACCCCTGCTCAGAAACTTATCACGGGCCCTCCCCT 
CAGTCGGAGCCGGAGGTGGCTGCCATAGTGAACTTCATCACAGCCCATGGCAACTTCAAG 
GCTCTGATCTCCATCCACAGCTACTCTCAGATGCTTATGTACCCTTACGGCCGATTGCTG 
GAGCCCGTTTCAAATCAGAGGGAGTTGTACGATCTTGCCAAGGATGCGGTGGAGGCCTTG 
TATAAGGTCCATGGGATCGAGTACATTTTTGGCAGCATCAGCACCACCCTCTATGTGGCC 
AGTGGGATCACCGTCGACTGGGCCTATGACAGTGGCATCAAGTACGCCTTCAGCTTTGAG 
CTCCGGGACACTGGGCAGTATGGCTTCCTGCTGCCGGCCACACAGATCATCCCCACGGCC 
CAGGAGACGTGGATGGCGCTTCGGACCATCATGGAGCACACCCTGAATCACCCCTACTAG 
CAGCACGACTGAGGGCAGG 



A NOV97a polypeptide (SEQ ID NO:308) encoded by SEQ ID NO:307 is 419 amino 
20 acids in length and is presented using the one-letter amino acid code in Table 97B. The Psort 
profile for NOV97a predicts that this sequence has a signal peptide and is likely to be 

762 



localized outside the cell with a certainty of 03703. In alternative embodiments, a NOV97a 
polypeptide is located to lysosomes with a certainty of 0.46200. The Signal P predicts a likely 
cleavage site for a NOV97a peptide is between positions 16 and 17, Le. 9 at the dash in the 
sequence VFG-KE. 



Table 97B. NOV97a Polypeptide Sequence (SEQ ID 
NO:308) 

MRGLLVLSVLLGAVFGKEDFVGHQVLRISVADEAQVQKVKELEDLEHLQLDFWRGPAHPG 
S P I D VRV P F P S I Q A VK I FLE S HG I S Y E TM I ED VQ S LLDE E QE QM F AFR S RARS TDT FN Y A 
TYHTLEEVYSWIDNFVMEHSDIVSKIQIGNSFENQSILVLKFSTGGSRHPAIWIDTGIHS 
REWITHATGIWTANKIVSDYGKDRVLTDILNAMDIFIELVTNPDGFAFTHSMNRLWRKNK 
SIRPGIFCIGVDLNRNWKSGFGGNGSNSNPCSETYHGPSPQSEPEVAAIVNFITAHGNFK 
ALISIHSYSQMLMYPYGRLLEPVSNQRELYDLAKDAVEALYKVHGIEYIFGSISTTLYVA 
SGITVDWAYDSGIKYAFSFELRDTGQYGFLLPATQIIPTAQETWMALRTIMEHTLNHPY 



NOV97b 

The disclosed NOV97b (alternatively referred to herein as CG57070-02) includes the 
1291 nucleotide sequence (SEQ ID NO:309) shown in Table 97C. A SEC2 ORF begins with 
a Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a stop codon at 
nucleotides 1270-1272. The disclosed NOV97b maps to human chromosome 7q3 1 . 



Table 97C. NOV97b Nucleotide Sequence (SEQ ID 
NO:309) 

ATGCGGGGGTTGCTGGTGTTGAGTGTCCTGTTGGGGGCTGTCTTTGGCAAGGAGGACTTT 
GTGGGGCATCAGGTGCTCCGAATCTCTGTAGCCGATGAGGCCCAGGTACAGAAGGTGAAG 
GAGCTGGAGGACCTGGAGCACCTGCAGGTGGACTTCTGGCGTGGCCCAGCCAGGCCCAGC 
CTCCCTGTGGATATGAGAGTTCCTTTCTCTGAACTGAAAGACATCAAAGCTTATCTGGAG 
TCTCATGGACTTGCTTACAGCATCATGATAAAGGACATCCAGGTGAAGCCCCAGGTGCTG 
CTGGATGAGGAAAGACAGGCCATGGCGAAATCCCGCCGGCTGGAGCGCAGCACCAACAGC 
TTCAGTTACTCATCATACCACACCCTGGAGGAGGTATATAGCTGGATTGACAACTTTGTA 
ATGGAGCATTCCGATATTGTCTCAAAAATTCAGATTGGCAACAGCTTTGAAAACCAGTCC 
ATTCTTGTCCTGAAGTTCAGCACTGGAGGTTCTCGGCACCCAGCCATCTGGATCGACACT 
GGAATTCACTCCCGGGAGTGGATCACCCATGCCACCGGCATCTGGACTGCCAATAAGATT 
GTCAGTGATTATGGCAAAGACCGTGTCCTGACAGACATACTGAATGCCATGGACATCTTC 
ATAGAGCTCGTCACAAACCCTGATGGGTTTGCTTTTACCCACAGCATGAACCGCTTATGG 
CGGAAGAACAAGTCCATCAGACCTGGAATCTTCTGCATCGGCGTGGATCTCAACAGGAAC 
TGGAAGTCGGGTTTTGGAGGAAATGGTTCTAACAGCAACCCCTGCTCAGAAACTTATCAC 
GGGCCCTCCCCTCAGTCGGAGCCGGAGGTGGCTGCCATAGTGAACTTCATCACAGCCCAT 
GGCAACTTCAAGGCTCTGATCTCCATCCACAGCTACTCTCAGATGCTTATGTACCCTTAC 
GGCCGATTGCTGGAGCCCGTTTCAAATCAGAGGGAGTTGTACGATCTTGCCAAGGATGCG 
GTGGAGGCCTTGTATAAGGTCCATGGGATCGAGTACATTTTTGGCAGCATCAGCACCACC 
CTCTATGTGGCCAGTGGGATCACCGTCGACTGGGCCTATGACAGTGGCATCAAGTACGCC 
TTCAGCTTTGAGCTCCGGGACACTGGGCAGTATGGCTTCCTGCTGCCGGCCACACAGATC 
ATCCCCACGGCCCAGGAGACGTGGATGGCGCTTCGGACCATCATGGAGCACACCCTGAAT 
CACCCCTACTAGCAGCACGACTGAGGGCAGG . 



A NOV97b polypeptide (SEQ ID NO:310) encoded by SEQ ID NO:309 is 423 amino 
acids in length and is presented using the one-letter amino acid code in Table 97D. The Psort 
profile for NOV97b predicts that this sequence has a signal peptide and is likely to be 
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localized outside the cell with a certainty of 0.3703. In alternative embodiments, a NOV97b 
polypeptide is located to lysosomes with a certainty of 0.1900. The Signal P predicts a likely 
cleavage site for a NOV97b peptide is between positions 16 and 17, i.e., at the dash in the 
sequence VFG-KE. 

5 

Table 97D. NOV97b Polypeptide Sequence (SEQ ID 
NO:310) 

MRGLLVLSVLLGAVFGKEDFVGHQVLRISV7VDEAQVQKVKELEDLEHLQVDFWRGPARPS 
LPVDMRVPFSELKDIKAYLESHGLAYSIMIKDIQVKPQVLLDEERQAMAKSRRLERSTNS 
FSYSSYHTLEEVYSWIDNFVMEHSDIVSKIQIGNSFENQSILVLKFSTGGSRHPAIWIDT 
GIHSREWI THATG I WT ANK I VS D YGKDR VLTD I LNAMD I FIE L VTNPDG F AFT HS MNRL W 
RKNKSIRPGIFCIGVDLNRNWKSGFGGNGSNSNPCSETYHGPSPQSEPEVAAIVNFITAH 
GNFKALI S IHS YSQMLMYP YGRLLEPVSNQRELYDLAKDAVEALYKVHGI EYI FGS I STT 
LYVASGI TVDWAYDSG I KYAFS FELRDTGQYGFLLPATQI I PTAQETWMALRT IMEHTLN 
HPY 



NOV97c 

The disclosed NOV97c (alternatively referred to herein as CG57070-03) includes the 
1 344 nucleotide sequence (SEQ ID NO:3 1 1) shown in Table 97E. A NOV97a ORF begins 
10 with a Kozak consensus ATG initiation codon at nucleotides 25-27 and ends with a stop codon 
at nucleotides 1327-1329. 



Table 97E. NO V97c Nucleotide Sequence (SEQ ID 
NO:311) 

TGAAGCTCACCAGGAGGAAGAAGCATGCAGGGCACCCCTGGAGGCGGGACGCGCCCTGGG 
CCATCCCCCGTGGACAGGCGGACACTCCTGGTCTTCAGCTTTATCCTGGCAGCAGCTTTG 
GGCCAAATGAATTTCACAGGGCAGGTTCTTCGAGTCCTGGCCAAAGATGAGAAGCAGCTT 
TCACTTCTCGGGGATCTGGAGGGCCTGAAACCCCAGAAGGTGGACTTCTGGCGTGGCCCA 
GCCAGGCCCAGCCTCCCTGTGGATATGAGAGTTCCTTTCTCTGAACTGAAAGACATCAAA 
GCTTATCTGGAGTCTCATGGACTTGCTTACAGCATCATGATAAAGGACATCCAGGTGCTG 
CTGGATGAGGAAAGACAGGCCATGGCGAAATCCCGCCGGCTGGAGCGCAGCACCAACAGC 
TTCAGTTACTCATCATACCACACCCTGGAGGAGATATATAGCTGGATTGACAACTTTGTA 
ATGGAGCATTCCGATATTGTCTCAAAAATTCAGATTGGCAACAGCTTTGAAAACCAGTCC 
ATTCTTGTCCTGAAGTTCAGCACTGGAGGTTCTCGGCACCCAGCCATCTGGATTGACACT 
GGAATTCACTCCCGGGAGTGGATCACCCATGCCACCGGCATCTGGACTGCCAATAAGATT 
GTCAGTGATTATGGCAAAGACCGTGTCCTGACAGACATACTGAATGCCATGGACATCTTC 
ATAGAGCTCGTCACAAACCCTGATGGGTTTGCTTTTACCCACAGCATGAACCGCTTATGG 
CGGAAGAACAAGTCCATCAGACCTGGAAT CTTCTGCATCGGCGTGGAT CTCAACAGGAAC 
TGG AAGT C GGGTT TTGGAAATGGT T C T AAC AG C AACC C CTGCT C AGAAAC TT ATC ACGGG 
CCCTCCCCTCAGTCGGAGCCGGAGGTGGCTGCCATAGTGAACTTCATCACAGCCCATGGC 
AACTTCAAGGCTCTGATCTCCATCCACAGCTACTCTCAGATGCTTATGTACCCTTACGGC 
CGATTGCTGGAGCCCGTTTCAAATCAGAGGGAGTTGTACGATCTTGCCAAGGATGCGGTG 
GAGGCCTTGTATAAGGTCCATGGGATCGAGTACATTTTTGGCAGCATCAGCACCACCCTC 
GATGTGGCCAGTGGGATCACCGTCGACTGGGCCTATGACAGTGGCATCAAGTACGCCTTC 
AGCTTTGAGCTCCGGGACACTGGGCAGTATGGCTTCCTGCTGCCGGCCACACAGATCATC 
CCCACGGCCCAGGAGACGTGGATGGCGCTTCGGACCATCATGGAGCACACCCTGAATCAC 
CCCTACTAGCAGCACGACTGAGGG 

A NOV97c polypeptide (SEQ ID NO:312) encoded by SEQ ID NO:31 1 is 434 amino 
15 acids in length and is presented using the one-letter amino acid code in Table 97F. The Psort 
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profile for NOV97c predicts that this sequence has a signal peptide and is likely to be 
localized outside the cell with a certainty of 0.5851 . In alternative embodiments, a NOV97c 
polypeptide is located to lysosomes with a certainty of 0.4366. The Signal P predicts a likely 
cleavage site for a NOV97c peptide is between positions 33 and 34, i.e., at the dash in the 
5 sequence ALG-QM. 



Table 97F. NOV97c Polypeptide Sequence (SEQ ID 
NO:312) 



MQGTPGGGTRPGPSPVDRRTLLVFSFILAAALGQMNFTGQVLRVLAKDEKQLSLLGDLEG 
LKPQKVDFWRGPARPSLPVDMRVPFSELKDIKAYLESHGLAYSIMIKDIQVLLDEERQAM 
AKSRRLERSTNSFSYSSYHTL.EE I YSWIDNFVMEHSDIVSKIQIGNSFENQSILVLKFST 
GGSRHPAIWIDTGIHSREWITHATGIWTANKIVSDYGKDRVLTDILNAMDIFIELVTNPD 
GFAFTHSMNRLWRKNKSIRPGIFCIGVDLNRNWKSGFGNGSNSNPCSETYHGPSPQSEPE 
VAAIVNFITAHGNFKAL,ISIHSYSQMLMYPYGRLL,EPVSNQRELYDLAKDAVEAL,YKVHG 
I E YI FGS I STTLDVASGI TVDWAYDSGI K YAFS FELRDTGQYGFLLPATQI I PTAQETWM 
ALRTIMEHTLNHPY . 



NOV97d 

The disclosed NOV97d (alternatively referred to herein as CG57070-04) includes the 
1 0 988 nucleotide sequence (SEQ ID NO:3 1 3) shown in Table 97G. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a TGA codon at 
nucleotides 973-975. The disclosed NOV97d maps to human chromosomes 7q31. 



Table 97G. NOV97d Nucleotide Sequence (SEQ ID 
NO:313) 

ATGCGGGGGTTGCTGGTGTTGAGTGTCCTGTTGGGGGCTGTCTTTGGCAAGGAGGACTTT 
GTGGGGCATCAGGTGCTCCGAATCTCTGTAGCCGATGAGGCCCAGGTACAGAAGGTGAAG 
GAGCTGGAGGACCTGGAGCACCTGCAGCTGGACTTCTGGCGGGGGCCTGCCCACCCTGGC 
TCCCCCATCGACGTCCGAGTGCCCTTCCCCAGCATCCAGGCGGTCAAGATCTTTCTGGAG 
TCCCACGGCATCAGCTATGAGACCATGATCGAGGACGTGCAGTCGCTGCTGGACGAGGAG 
CAGGAGCAGATGTTCGCCTTCCGGTCCCGGGCGCGCTCCACCGACACTTTTAACTACGCC 
ACCTACCACACCCTGGAGGAGGTGTATAGCTGGATTGACAACTTTGTAATGGAGCATTCC 
GATATTGTCTCAAAAATTCAGATTGGCAACAGCTTTGAAAACCAGTCCATTCTTGTCCTG 
AAGTTCAGCACTGGAGGTTCTCGGCACCCAGCCATCTGGATTGACACTGGAATTCACTCC 
CGGGAGTGGATCACCCATGCCACCGGCATCTGGACTGCCAATAAGATTGTCAGTGATTAT 
GGCAAAGACCGTGTCCTGACAGACATACTGAATGCCATGGACATCTTCATAGAGCTCGTC 
ACAAACCCTGATGGGTTTGCTTTTACCCACAGCATGAACCGCTTATGGCGGAAGAACAAG 
TCCATCAGACCTGGAATCTTCTGCATCGGCGTGGATCTCAACAGGAACTGGAAGTCGGGT 
TTTGGAGATGTGGCCAGTGGGATCACCGTCGACTGGGCCTACGACAGTGGCATCAAGTAC 
GCCTTCAGCTTTGAGCTCCGGGACACTGGGCAGTATGGCTTCCTGCTGCCGGCCACACAG 
ATCATCCCCACGGCCCAGGAGACGTGGATGGCGCTTCGGACCATCATGGAGCACACCCTG 
AATCACCCCTACTAGCAGCACGACTGAG , 



1 5 A NOV97d polypeptide (SEQ ID NO:3 14) encoded by SEQ ID NO:3 1 3 is 324 amino 

acids in length and is presented using the one-letter amino acid code in Table 97H. The Psort 
profile for NOV97d predicts that this sequence has a signal peptide and is likely to be 
localized to lysosomes with a certainty of 0.4757, or outside the cell with a certainty of 
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0.3703. The Signal P predicts a likely cleavage site for a NOV97d peptide is between 
positions 16 and 17, i.e., at the dash in the sequence VFG-KE. 



Table 97H. NOV97d Polypeptide Sequence (SEQ ID 
NO:314) 

MRGLLVLSVLLGAVFGKEDFVGHQVLRISVADEAQVQKVKELEDLEHLQLDFWRGPAHPG 
SPIDVRVPFPSIQAVKIFLESHGISYETMIEDVQSLLDEEQEQMFAFRSRARSTDTFNYA 
TYHTLEEVYSWIDNFVMEHSDIVSKIQIGNSFENQSILVLKFSTGGSRHPAIWIDTGIHS 
RE W I THATGI WTANKI VSD YGKDRVLTD I LNAMD I F I EL VTNPDGFAFTHSMNRLWRKNK 
SIRPGIFCIGVDLNRNWKSGFGDVASGITVDWAYDSGIKYAFSFELRDTGQYGFLLPATQ 
1 1 PTAQETWMALRTIMEHTLNHPY 



NOV97e 

The disclosed NOV97e (alternatively referred to herein as CG57070-05) includes the 
1348 nucleotide sequence (SEQ ID NO:315) shown in Table 971. A NOV97e ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 25-27 and ends with a stop codon 
at nucleotides 1333-1335. The disclosed NOV97e maps to human chromosome 7. 



Table 971. NOV97e Nucleotide Sequence (SEQ ID 
NO:315) 

TGAAGCTCACCAGGAGGAAGAAGCATGCAGGGTACTCCTGGAGGCGGGACGCGCCCTGGG 
CCATCCCCCGTGGACAGGCGGACACTCCTGGTCTTCAGCTTTATCCTGGCAGCAGCTTTG 
GGCC AAATGAATTT CAC AGGGGAC C AGGTT C TT CG AGTC C TGG C C AAAGATG AG AAGC AG 
CTTTCACTTCTCGGGGATCTGGAGGGCCTGAAACCCCAGAAGGTGGACTTCTGGCGTGGC 
CCAGCCAGGCCCAGCCTCCCTGTGGATATGAGAGTTCCTTTCTCCGAACTGAAAGACATC 
AAAGCTTATCTGGAGTCTCATGGACTTGCTTACAGCATCATGATAAAGGACATCCAGGTG 
CTGCTGGATGAGGAAAGACAGGCCATGGCGAAATCCCGCCGGCTGGAGCGCAGCACCAAC 
AGCTT C AGTTACT CATC AT AC CAC AC C CTGG AGG AGAT AT AT AGCTGG ATTGACAACTT T 
GTAATGGAGCATTCCGATATTGTCTCAAAAATTCAGATTGGCAACAGCTTTGAAAACCAG 
TCCATTCTTGTCCTGAAGTTCAGCACTGGAGGTTCTCGGCACCCAGCCATCTGGATCGAC 
ACTGG AAT T C ACT CC C GGGAGTGG AT C ACCC ATGC C ACCGGC AT CTGG ACTGC C AAT AAG 
ATTGTCAGTGATTATGGCAAAGACCGTGTCCTGACAGACATACTGAATGCCATGGACATC 
TTCATAGAGCTCGTCACAAACCCTGATGGGTTTGCTTTTACCCACAGCATGAACCGCTTA 
TGGCGGAAGAACAAGTCCATCAGACCTGGAATCTTCTGCATCGGCGTGGATCTCAACAGG 
AACTGGAAGTCGGGTTTTGGAGGAAATGGTTCTAACAGCAACCCCTGCTCAGAAACTTAT 
CACGGGCCCTCCCCTCAGTCGGAGCCGGAGGTGGCTGCCATAGTGAACTTCATCACAGCC 
CATGGCAACTTCAAGGCTCTGATCTCCATCCACAGCTACTCTCAGATGCTTATGTACCCT 
TACGGCCGATTGCTGGAGCCCGTTTCAAATCAGAGGGAGTTGTACGATCTTGCCAAGGAT 
GCGGTGGAGGCCTTGTATAAGGTCCATGGGATCGAGTACATTTTTGGCAGCATCAGCACC 
ACCCTCTATGTGGCCAGTGGGATCACCGTCGACTGGGCCTATGACAGTGGCATCAAGTAC 
GCCTTCAGCTTTGAGCTCCGGGACACTGGGCAGTATGGCTTCCTGCTGCCGGCCACACAG 
ATCATCCCCACGGCCCAGGAGACGTGGATGGCGCTTCGGACCATCATGGAGCACAACCTG 
AATCACCCCTACTAGCAGCACGACTGAG 

A NOV97e polypeptide (SEQ ID NO:316) encoded by SEQ ID NO:315 is 436 amino 
acids in length and is presented using the one-letter amino acid code in Table 97J. The Psort 
profile for NOV97e predicts that this sequence has a signal peptide and is likely to be 
localized outside the cell with a certainty of 0.5851 . In alternative embodiments, a NOV97e 
polypeptide is located to lysosomes with a certainty of 0.4421. The Signal P predicts a likely 
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cleavage site for a NOV97e peptide is between positions 33 and 34, i.e., at the dash in the 
sequence ALG-QM. 



Table 97 J. NOV97e Polypeptide Sequence (SEQ ID 
NO:316) 



MQGTPGGGTRPGPSPVDRRTLLVFSFILAAALGQMNFTGDQVLRVLAKDEKQLSLLGDLE 
GLKPQKVDFWRGPARPSLPVDMRVPFSELKDIKAYLESHGLAYSIMIKDIQVLLDEERQA 
MAKSRRLERSTNSFSYSSYHTLEEIYSWIDNFVMEHSDIVSKIQIGNSFENQSILVLKFS 
TGGSRHPAIWIDTGIHSREWITHATGIWTANKIVSDYGKDRVLTDILNAMDIFIELVTNP 
DGFAFTHSMNRLWRKNKSIRPGIFCIGVDLNRNWKSGFGGNGSNSNPCSETYHGPSPQSE 
PEVAAIVNFITAHGNFKALISIHSYSQMLMYPYGRLLEPVSNQRELYDLAKDAVEALYKV 
HGIEYIFGSISTTLYVASGITVDWAYDSGIKYAFSFELRDTGQYGFLLPATQIIPTAQET 

WMALRTIMEHNLNHPY . „ 



NOV97f 

The disclosed NOV97f (alternatively referred to herein as CG57070-06) includes the 
975 nucleotide sequence (SEQ ID NO:317) shown in Table 97K. A SEC2 ORF begins with a 
Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a TAG codon at 
nucleotides 973-975. The disclosed NOV97f maps to human chromosome 7q31 . 



Table 97K. NO V97f Nucleotide Sequence (SEQ ID 
NO:317 ) 



ATGCGGGGGTTGCTGGTGTTGAGTGTCCTGTTGGGGGCTGTCTTTGGCAAGGAGGACTTT 
GTGGGGCATCAGGTGCTCCGAATCTCTGTAGCCGATGAGGCCCAGGTACAGAAGGTGAAG 
GAGCTGGAGGACCTGGAGCACCTGCAGCTGGACTTCTGGCGGGGGCCTGCCCACCCTGGC 
TCCCCCATCGACGTCCGAGTGCCCTTCCCCAGCATCCAGGCGGTCAAGATCTTTCTGGAG 
TCCCACGGCATCAGCTATGAGACCATGATCGAGGACGTGCAGTCGCTGCTGGACGAGGAG 
CAGGAGCAGATGTTCGCCTTCCGGTCCCGGGCGCGCTCCACCGACACTTTTAACTACGCC 
ACCTACCACACCCTGGAGGAGGTGTATAGCTGGATTGACAACTTTGTAATGGAGCATTCC 
GATATTGTCTCAAATATTCAGATTGGCAACAGCTTTGAAAACCAGTCCATTCTTGTCCTG 
AAGTTCAGCACTGGAGGTTCTCGGCACCCAGCCATCTGGATCGACACTGGAATTCACTCC 
CGGGAGTGGATCACCCGTGCCACCGGCATCTGGACTGCCAATAAGATTGTCAGTGATTAT 
GGCAAAGACCGTGTCCTGACAGACATACTGAATGCCATGGACATCTTCATAGGGCTCGTC 
AC AAAC CC TG ATGGGTTTG CT TTT AC C CACAG C ATGAAC CGCT T ATGGCGG AAG AAC AAG 
TCCATCAGACCTGGAATCTTCTGCATCGGCGTGGATCTCAACAGGAACTGGAAGTCGGGT 
TTTGGAGATGTGGCCAGTGGGATCACCGTCGACTGGGCCTATGACAGTGGCATCAAGTAC 
GCCTTCAGCTTTGAGCTCCGGGACACTGGGCAGTATGGCTTCCTGCTGCCGGCCACACAG 
ATCATCCCCACGGCCCAGGAGACGTGGATGGCGCTTCGGACCATCATGGAGCACATCCTG 

AATCACCCCTACTAG , . 



A NOV97f polypeptide (SEQ ID NO:318) encoded by SEQ ID NO:317 is 324 amino 
acids in length and is presented using the one-letter amino acid code in Table 97L. The Psort 
profile for NOV97f predicts that this sequence has a signal peptide and is likely to be localized 
outside the cell with a certainty of 0.3989. In alternative embodiments, a NOV97f polypeptide 
is located to lysosomes with a certainty of 0.5061. The Signal P predicts a likely cleavage site 
for a NOV97f peptide is between positions 16 and 17, i.e., at the dash in the sequence VFG- 
KE. 
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Table 97L. NOV97f Polypeptide Sequence (SEQ ID 
NO:318) 



MRGLLVLSVLLGAVFGKEDFVGHQVLRISVADEAQVQKVKELEDLEHLQLDFWRGPAHPG 
SPIDVRVPFPSIQAVKIFLESHGISYETMIEDVQSLLDEEQEQMFAFRSRARSTDTFNYA 
TYHTLEEVYSWIDNFVMEHSDIVSNIQIGNSFENQSILVLKFSTGGSRHPAIWIDTGIHS 
REWITRATGIWTANKIVSDYGKDRVLTDILNAMDIFIGLVTNPDGFAFTHSMNRLWRKNK 
SIRPGIFCIGVDLNRNWKSGFGDVASGITVDWAYDSGIKYAFSFELRDTGQYGFLLPATQ 
1 1 PTAQETWMALRTIMEHILNHPY _ _ 



A BLAST analysis of NOV97 was run against the proprietary PatP GENE SEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV97 had high 
homology to other proteins as shown in Table 97M. 



Table 97M. BLASTX results from PatP database for NOV97 



Sequences producing High- scoring Segment Pairs: 

patp :AAE01663 Novel human protease #2 

patp: AAB47565 Protease PRTS-7 

patp : AAE016 64 Novel human protease #3 

patp : AAR97618 Human carboxypeptidase Al 

patp: AAY28 915 Human regulatory protein HRGP-1 



Smallest 
Sum 

High Probability 
Score P (N) 

1898 9.3e-196 

1898 9.3e-196 

1013 1.8e-174 

1682 7.2e-173 

1682 7.2e-173 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 1017 of 1264 bases (80%) identical to a gb:GENBANK- 
ID:HSPCBXAl|acc:X67318.1 mRNA from Homo sapiens (H.sapiens mRNA for 
procarboxypeptidase Al). The full amino acid sequence of the protein of the invention was 
found to have 315 of 419 amino acid residues (75%) identical to, and 357 of 419 amino acid 
residues (85%) similar to, the 419 amino acid residue ptnr:SWISSNEW-ACC:P15085 protein 
from Homo sapiens (Human) (CARBOXYPEPTIDASE Al PRECURSOR (EC 3.4.17.1)). 
NOV97 also has homology to the other proteins shown in the BLASTP data in Table 97N. 



Table 97N. NOV97 BLASTP results 




Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi | 17939400 | ref 
|NP 525124. l| 
<NM 080385) 


carboxypeptidase AS 
[Homo sapiens] 


436 


354/418 
(84) 


382/418 
(90) 


0 . 0 


gi | 4502997 | ref | 
NP_001859 . 1 | 
(NM_001868) 


pancreatic 
carboxypeptidase Al 
precursor ; 
Carboxypeptidase A 
[Homo sapiens] 


419 


315/419 
(75) 


357/419 
(85) 


0 . 0 
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gi | 13528975|gb| 
AAH05279 . 1 | AAHO 
5279 (BC005279) 


carboxypeptidase Al 
(pancreatic) 
[Homo sapiens] 


419 


314/419 
(74) 


356/419 
(84) 


0.0 


gi [4336196 | gb | A 
AD17690 . 1 | 
(AF076222) 


carboxypeptidase Al 
precursor [Sus scrofa] 


419 


295/419 
(68) 


345/419 | 
(81) 


e-177 


gi | 115878 | sp | PO 
073 0 | CBPA BOVIN 


CARBOXYPEPTIDASE A 
PRECURSOR 


419 


289/419 
(68) 


345/419 
(81) 


e-176 



This BLASTP data is displayed graphically in the ClustalW in Table 970. A multiple 
sequence alignment is given, with the NOV97a-f proteins being shown on lines 1-6 in a 
ClustalW analysis comparing the protein of the invention with the related protein sequences 
shown in Table 97N. 



Table 970. ClustalW Alignment of NOV97 



NOV9 7a 


(SEQ 


ID 


NO 


308) 


NOV9 7b 


(SEQ 


ID 


NO 


:310) 


NOV97C 


(SEQ 


ID 


NO 


:312) 


NOV9 7d 


(SEQ 


ID 


NO 


:314) 


NOV97e 


(SEQ 


ID 


NO 


:316) 


NOV9 7f 


(SEQ 


ID 


NO 


:318) 


gi | 17939400 | 


(SEQ 


ID 


NO 


:784) 


gi | 4502997 | 


(SEQ 


ID 


NO 


:785) 


gi | 13528975 | 


(SEQ 


ID 


NO 


:786) 


gi | 4336196 | 


(SEQ 


ID 


NO 


:787) 


gi |115878| 


(SEQ 


ID 


NO 


:788) 
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NOV97a 
NOV97b 
NOV97C 
NOV97d 
NOV97e 
NOV97f 

gi 
gi 
gi 
gi 
gi 



17939400 | 
4502997 | 
13528975 | 
4336196 | 
115878 | 



NOV97a 

NOV97b 

NOV97C 

NOV97d 

NOV97e 

NOV97f 

gi | 17939400 | 

gi |4502997 | 

gi [13528975 | 

gi |4336196 | 

gi|ll5878| 



NOV9 7a 
NOV97b 
NOV97C 
NOV9 7d 
NOV97e 



20 
-I 



I 



C^GTPgGGTRPGPSPVDRRT 

i 

QGTP 



B GGTRPGPSPVDRRT 



30 
. | . . . 

55f 



50 
J. 



QGTPgGGTRPGPSPVDRRT 
1- 



LL VLS VLLGAVgGKEDFVGHQVLRI S VADE 
LLVL S VLLGAvSgKEDFVGHQVLR I S VA DE 
T J,^s{gLgA|jflGg^FBGBQ VLRiMBflaDE- 

llvlsvllgavBgkedfvghqvlr isva de 

T .T A7fj5^[|HLBlAMGjBg^FBGB0VLR>8iBaaDE 



llvlsvllgavBgkedfvghqvlrisvade 

LLVl3slS3LSAgiG5SSiF3G|gQVLRSHSDE 



LLV[5s|liLSAOTGg5SlFaGi3QVLRSS2DE| 

llvlsvllgavBgkedfvghqvlrisvade 
llvlsvllgavBgkedfvghqvlri svade 
llBIs vllgSv^kedfvghqvlri SvjgDE 
llRlsvllgaotgkedfvghqvlri^ade 



60 



70 



80 



90 



100 
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NOV97f 

gi | 17939400 | 

gi j 4502997 | 

gi | 13528975 | 

gi|4336196| 

gi | 115878 | 



NOV97a 

NOV97b 

NOV97C 

NOV97d 

NOV97e 

NOV97f 

gi | 17939400 | 

gi |4502997 | 

gi | 13528975 | 

gi |4336196 | 

gi | 115878 | 



NOV97a 

NOV97b 

NOV97C 

NOV97d 

NOV97e 

NOV9 7 f 

gi | 17939400 | 

gi|4502997| 

gi j 13528975 | 

gi [4336196 | 

gi j 115878 | 



NOV9 7a 
NOV97b 
NOV97C 
NOV9 7d 
NOV97e 
NOV97f 

gi 
gi 
gi 
gi 
gi 



17939400 | 
4502997 | 
13528975 | 
4336196 | 
115878 | 



NOV97a 

NOV9 7b 

NOV97C 

NOV97d 

NOV97e 

NOV97f 

gi 1 17939400 | 

gi |4502997 | 

gi | 13528975 | 

gi |4336196 | 

gi |115878| 



NOV97a 
NOV9 7b 
NOV9 7C 
NOV97d 




JrarstBtfnyatyhtleeHy 



AKSgRLEj^^S{|SB§g| 

!f ' 

F 

!SjC|G 



RgRARSTgTFNYATYHTLEEIY 

rBrarstItfnyatyhtleeiy 
"rar^tSt FNYAT YHTLEE I Y 



rarstStfnyatyhtlweiy 



160 



170 



180 



190 



200 




srewitsatgiwtanki 
srewitgatgiwtanki' 
srewitSatgiwtanki 
srewitSatgiwtanki 
srewitSatgiwtanki 
srewitSatgiwtanki 
srewitSatgiwtanki 
srewStSaSgSw 



srewwt gAgGgw 
srew3tSa§g|w 

srewitsatgSw 



vsdygkdrvltdilnamdi 
:vsdygkdrvltdilnamdi 
: vsdygkdrvltdilnamdi 
: vsdygkdrvltdilnamdi 
:vsdygkdrvltdilnamdi 
: vsdygkdrvltdilnamdi 
:vsdygkdrvltdilnamdi 
dygSd^^tSil^Edi 
dygBd ^S tHi l|BIdi 
tHil^Sdi 
tBilSImdi 



250 



fielvtnpdgfaft 
fi elvtnpdgfaft 
f I elvtnpdgfaft 
fi elvtnpdgfaft 
. fi elvtnpdgfaft 
: f i^l vtnpdgfaft 
: fi elvtnpdgfaft 
: fjIeSvtnpdgfaft 
: f9e| vtnpdgfaft 
:f|e| vtnpdgfaft 
:fSei vtnpdgfaft 





ISIHSYSQ|LMYPYG 

isihsysqSlmypyg 

ISIHSYSQgLMYPYG 

isihsysq|l|ypyg 

sis IHSYSOfflLgYPYG 




380 

. . I . . 



I 



390 

. . I . . 



I 



400 

- - 1 



ttFIyvasgitvdwaydsgikyafsf 
ttSyvasgitvdwaydsgi kyafs f 
ttSjSvasgitvdwaydsgi kyafs f 
vasgitvdwaydsgikyafsf 
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NOV97e 
NOV9 7f 

gi 
gi 
gi 
gi 

gi 



17939400 | 
4502997 | 
13528975 | 
4336196 | 
115878 | 



NOV97a 

NOV97b 

NOV97C 

NOV9 7d 

NOV97e 

NOV97f 

gi | 17939400 | 

gi j 4502997 | 

gi | 13528975 | 

gi|4336196| 

gi | 115878 | 




410 



420 



I 



430 



440 



JL 



ELRDTGQYGFLLPATQI 
ELRDTGQYGFLLPATQI 
ELRDTGQYGFLLPATQI 
ELRDTGQYGFLLPATQI 

elrdtgqygfllpatqi 
elrdtgqygfllpatqi 
elrdtgqygfllpatqi 
elrdtg|ygfllpa|qi 

ELRDTGgYGFLLPA|QI 
|ELRDTg|yGFLLPA| QI 

elrdtgIygfllpaIqi I 



1 



iptaqetwmalrtimehtlnhpy 
iptaqetwmalrtimehtlnhpy 
I ptaqetwmalrt imehtlnhpy 
iptaqetwmalrtimehtlnhpy 
i ptaqetwmalrt imehtlnhpy 
i ptaqetwmalrt imehglnhpy 
iptaqetwmalrtimehtlnhpy 

I PTaJSeTW jALfflT IMEHTLNHPY 
I PTa]|eTwSalHt IMEHTLNHPY 
I PTAQETwj^LWT IMEHTLNHPY 

i ptaqetw|S^timehtlni!31y 



Table 97P lists the domain description from DOMAIN analysis results against NOV97. 
This indicates that the NOV97 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 97P. Domain Analysis of NOV97 


enliSmartlsmart00631, Zn pept, Zn pept domain SEQ ID NO: 887 








CD-Length = 286 residues, 100.0% aligned 






Score = 


301 bits (770), Expect = 6e-83 




NOV97 : 


122 


YHTLEEVYSWIDNFVMEHSDIVSKIQIGNSFENQSILVLKFSTGGSR- -HPAIWIDTGIH 


179 






YH+ EE+ +W+ + D+V + IG S E + I VLK S G R PA+WID GIH 




Sbj Ct : 


1 


YHSYEEIEAWLKKLAARYPDLVRLVSIGKSVEGRPIWVLKISNGPGRDGKPAVWIDAGIH 


60 


NOV97 : 


180 


SREWITHATGIWTANKIVSDYGKDRVLTDILNAMDIFIELVTNPDGFAFTHSMNRLWRKN 


239 






+REWI AT ++ N+++ +YG D +T +L+ D +1 V NPDG+ +TH+ +RLWRKN 




Sbjct 


61 


AREWIGPATALYLINQLLENYGSDPRVTKLLDKTDWYIVPVLNPDGYEYTHTSDRLWRKN 


120 


NOV 9 7 


240 


KSIRPGIFCIGVDLNRNWKSGFGGJVTGSWSTITPCSETYHGPSPQSEPEVAAIVNFITAHGNF 


299 






+S G C GVDLNRN+ +G G++SNPCSETY GPSP SEPE A+ +F+ ++ 




Sbjct 


121 


RSPNSGSNCRGVDLNRNFPFHWGETGASSNPCSETYAGPSPFSEPETKAVRDFLRSNRKI 


180 


NOV 9 7 


300 


KAL I S IHS YSQMLMYPYGRLLEPV - SNQRELYDLAKDAVEALiYKVHG - I EY I FGS I STTL 


357 






K I +HSYSQ+++YPYG + N +L ++AK +AL VHG Y +G + L 




Sbj Ct 


: 181 


KLYIDLHSYSQLILYPYGYTKNDLPPNVEDLPEVAKALADALASVHGGTRYTYGISNGAL 


240 


NOV 9 7 


: 358 


YVASG I TVDWAYD - SGI KYAFS FELRDTGQYGFLLPATQI I PTAQE 4 02 








Y ASG + DWAY G+ ++++ ELRD G+YGFLLP +QIIPT E 




Sbjct 


: 241 


YPASGGSDDWAYGTLGVPFSYTLELRDKGRYGFLLPPSQIIPTGWE 286 





Carboxypeptidase A (EC 3.4.2.1 ) is a pancreatic exopeptidase. Three different forms 

of human pancreatic procarboxypeptidase A have been isolated. The Al and A2 (600688) 

forms are monomeric proteins with different biochemical properties. Honey et al. (1984, 1986) 

found that an 8.6-kb human DNA fragment (detected by means of a rat cDNA probe for CPA) 

cosegregated with chromosome 7. The assignment was narrowed by demonstration of absence 
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of the human DNA fragment in cells with a deletion of 7q22-qter. By studying mouse-hamster 
hybrid cells, Honey et al. (1986) assigned the CPA gene to mouse chromosome 6. Trypsin 
(276000) is also on human 7q22-qter and on mouse 6. Stewart et al. (1990) concluded from 
multipoint linkage analysis with established chromosome 7 markers that the most likely 

5 location of carboxypeptidase is 7q3 1 -qter. It lies distal to cystic fibrosis at a distance of 
approximately 12 cM. 

NOV97 is predicted to be expressed in at least the following tissues: pancreas. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, public EST sources, literature 

10 sources, and/or RACE sources. Further expression data for NOV97 is provided in Example 2. 

The NOV97 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, Alzheimer ? s disease, stroke, tuberous sclerosis, 
hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 

15 Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 

disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 
immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 
conditions. NOV97 nucleic acids encoding the carboxypeptidase-like protein of the invention, 

20 or fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. 

The novel nucleic acid of the invention encoding a carboxypeptidase Al-like protein 
includes the nucleic acid whose sequence is provided in Table 97A, 97C, 97E, 97G, 971, or 
97K or a fragment thereof. The invention also includes a mutant or variant nucleic acid any of 

25 whose bases may be changed from the corresponding base shown in Table 97 A, 97C, 97E, 
97G, 971, or 97K while still encoding a protein that maintains its carboxypeptidase Al-like 
activities and physiological functions, or a fragment of such a nucleic acid. The invention 
further includes nucleic acids whose sequences are complementary to the sequences of Table 
97A, 97C, 97E, 97G, 971, or 97K, including nucleic acid fragments that are complementary to 

30 any of the nucleic acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 




10 



15 



in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 20% of the bases 
may be so changed. 

The novel protein of the invention includes the carboxypeptidase Al-like protein 
whose sequence is provided in Table 97B, 97D, 97F, 97H, 97J, or 97L. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residue shown in Table 97B, 97D, 97F, 97H, 97J, or 97L while still encoding a 
protein that maintains its carboxypeptidase Al-like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 25% of the amino 
acid residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



NOV98 

The disclosed NOV98 (alternatively referred to herein as CG56939-01) includes the 
20 5583 nucleotide sequence (SEQ ID NO:319) shown in Table 98A. A NOV98 ORF begins 

with a Kozak consensus ATG initiation codon at nucleotides 1-3 and ends with a TGA codon 
at nucleotides 4630-4632. The disclosed NOV98 maps to human chromosome 1. 



Table 98A. NOV98 Nucleotide Sequence (SEQ ID 
NO:319) 



GCCGCGTGCGTCCTGCCCGGAGCCGGCGGGACATGCCCGGAGCGCGCGCTGGAGCGGCGC 
GAGGAGGAGGCGAACGTGGTGCTCACCGGGACGGTGGAGGAGATCCTCAACGTGGACCCG 
GTGCAGCACACGTACTCCTGCAAGGTTCGGGTCTGGCGGTACTTGAAGGGCAAAGACCTG 
GTGGCCCGGGAGAGCCTGCTGGACGGCGGCAACAAGGTGGTGATCAGCGGCTTTGGAGAC 
CCCCTCATCTGTGACAACCAGGTGTCCACTGGGGACACCAGGATCTTCTTTGTGAACCCT 
GCACCCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGATGCTCAACTCCAGCCTCATG 
CGGATCACCCTGCGGAACCTGGAGGAGGTGGAGTTCTGTGTGGAAGATAAACCCGGGACC 
CACTTCACTCCAGTGCCTCCGACGCCTCCTGATGCGTGCCGGGGAATGCTGTGCGGCTTC 
GGCGCCGTGTGCGAGCCCAACGCGGAGGGGCCGGGCCGGGCGTCCTGCGTCTGCAAGAAG 
AGCCCGTGCCCCAGCGTGGTGGCGCCTGTGTGTGGGTCGGACGCCTCCACCTACAGCAAC 
GAATGCGAGCTGCAGCGGGCGCAGTGCAGCCAGCAGCGCCGCATCCGCCTGCTCAGCCGC 
GGGCCGTGCGGCTCGCGGGACCCCTGCTCCAACGTGACCTGCAGCTTCGGCAGCACCTGT 
GCGCGCTCGGCCGACGGGCTGACGGCCTCGTGCCTGTGCCCCGCGACCTGCCGTGGCGCC 
CCCGAGGGGACCGTCTGCGGCAGCGACGGCGCCGACTACCCCGGCGAGTGCCAGCTCCTG 
CGCCGCGCCTGCGCCCGCCAGGAGAATGTCTTCAAGAAGTTCGACGGCCCTTGTGACCCC 
TGTCAGGGCGCCCTCCCTGACCCGAGCCGCAGCTGCCGTGTGAACCCGCGCACGCGGCGC 
CCTGAGATGCGCCTACGGCCCGAGAGCTGCCCTGCCCGGCAGGCGCCAGTGTGTGGGGAC 
GACGGAGTCACCTACGAAAACGACTGTGTCATGGGCCGATCGGGGGCCGCCCGGGGTCTC 
CTCCTGCAGAAAGTGCGCTCCGGCCAGTGCCAGGGTCGAGACCAGTGCCCGGAGCCCTGC 
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CGGTTCAATGCCGTGTGCCTGTCCCGCCGTGGCCGTCCCCGCTGCTCCTGCGACCGCGTC 

ACCTGTGACGGGGCCTACAGGCCCGTGTGTGCCCAGGACGGGCGCACGTATGACAGTGAT 

TGCTGGCGGCAGCAGGCTGAGTGCCGGCAGCAGCGTGCCATCCCCAGCAAGCACCAGGGC 

CCGTGTGACCAGGCCCCGTCCCCATGCCTCGGGGTGCAGTGTGCATTTGGGGCGACGTGT 

GCTGTGAAGAACGGGCAGGCAGCGTGTGAATGCCTGCAGGCGTGCTCGAGCCTCTACGAT 

CCTGTGTGCGGCAGCGACGGCGTCACATACGGCAGCGCGTGCGAGCTGGAGGCCACGGCC 

TGTACCCTCGGGCGGGAGATCCAGGTGGCGCGCAAAGGACCCTGTGACCGCTGCGGGCAG 

TGCCGCTTTGGAGCCCTGTGCGAGGCCGAGACCGGGCGCTGCGTGTGCCCCTCTGAATGC 

GTGGCTTTGGCCCAGCCCGTGTGTGGCTCCGACGGGCACACGTACCCCAGCGAGTGCATG 

CTGCACGTGCACGCCTGCACACACCAGATCAGCCTGCACGTGGCCTCAGCTGGACCCTGC 

GAGACCTGTGGAGATGCCGTGTGTGCTTTTGGGGCTGTGTGCTCCGCAGGGCAGTGTGTG 

TGT C C C CGGTGTG AGC AC C C C C CG C CCGGC CCCGTGTGTGGC AGCGAC GGTGT C AC CT AC 

GGCAGTGCCTGCGAGCTACGGGAAGCCGCCTGCCTCCAGCAGACACAGATCGAGGAGGCC 

CGGGCAGGGCCGTGCGAGCAGGCCGAGTGCGGTTCCGGAGGCTCTGGCTCTGGGGAGGAC 

GGTGACTGTGAGCAGGAGCTGTGCCGGCAGCGCGGTGGCATCTGGGACGAGGACTCGGAG 

GACGGGCCGTGTGTCTGTGACTTCAGCTGCCAGAGTGTCCCAGGCAGCCCGGTGTGCGGC 

TCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCACAG 

CGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAGGCCCCACCTTCGCCCCGCTGCCG 

C C TGTGGC C C C CT T AC ACTGT GCC C AG AC GCC CT ACGG CTGCT GC C AGG AC AAT AT C AC C 

GCAGCCCGGGGCGTGGGCCTGGCTGGCTGCCCCAGTGCCTGCCAGTGCAACCCCCATGGC 

TCTTACGGCGGCACCTGTGACCCAGCCACAGGCCAGTGCTCCTGCCGCCCAGGTGTGGGG 

GGCCTCAGGTGTGACCGCTGTGAGCCTGGCTTCTGGAACTTTCGAGGCATCGTCACCGAT 

GGCCGGAGTGGCTGTACACCCTGCAGCTGTGATCCCCAAGGCGCCGTGCGGGATGACTGT 

GAGCAGATGACGGGGCTGTGCTCGTGTAAGCCCGGGGTGGCTGGACCCAAGTGTGGGCAG 

TGTCCAGACGGCCGTGCCCTGGGCCCCGCGGGCTGTGAAGCTGACGCTTCTGCGCCTGCG 

ACCTGTGCGGAGATGCGCTGTGAGTTCGGTGCGCGGTGCGTGGAGGAGTCTGGCTCAGCC 

CACTGTGTCTGCCCGATGCTCACCTGTCCAGAGGCCAACGCTACCAAGGTCTGTGGGTCA 

GATGGAGTCACATACGGCAACGAGTGTCAGCTGAAGACCATCGCCTGCCGACGGTGTCAC 

CTACGCCAGGGCCTGCAAATCTCTATCCAGAGCCTGGGCCCGTGCCAGGAGGCTGTTGCT 

CCCAGCACTCACCCGACATCTGCCTCCGTGACTGTGACCACCCCAGGGCTCCTCCTGAGC 

CAGGCACTGCCGGCCCCCCCCGGCGCCCTCCCCCTGGCTCCCAGCAGTACCGCACACAGC 

CAGACCACCCCTCCGCCCTCATCGCGACCTCGGACCACTGCCAGCGTCCCCAGGACCACC 

GTGTGGCCCGTGCTGACGGTGCCCCCCACGGCACCCTCCCCTGCACCCAGCCTGGTGGCG 

TCCGCCTTTGGTGAATCTGGCAGCACTGATGGAAGCAGCGATGAGGAACTGAGCGGGGAC 

CAGGAGGCCAGTGGGGGTGGCTCTGGGGGGCCCGAGCCCTTGGAGGGCAGCAGCGTGGCC 

ACCCCTGGGCCACCTGTCGAGAGGGCTTCCTGCTACAACCCCTGCCATGGGGCGGCGCCC 

TGCCGTGTGCTGCCCGAGGGTGGTGCTCAGTGCGAGTGCCCCCTGGGGCGTGAGGGCACC 

TTCTGCCAGACAGCCTCGGGGCAGGACGGCTCTGGGCCCTTCCTGGCTGACTTCAACGGC 

TTCTCCCACCTGGAGCTGAGAGGCCTGCACACCTTTGCACGGGACCTGGGGGAGAAGATG 

GCGCTGGAGGTCGTGTTCCTGGCACGAGGCCCCAGCGGCCTCCTGCTCTACAACGGGCAG 

AAG ACGG ACGG C AAGGGGGACTT CGTGT C GCTGGC ACTGCGGG AC CG C CGC CT GGAGTT C 

CGCTACGACCTGGGCAAGGGGGCAGCGGTCATCAGGAGCAGGGAGCCAGTCACCCTGGGA 

GCCTGGACCAGGGTCTCACTGGAGCGAAACGGCCGCAAGGGTGCCCTGCGTGTGGGCGAC 

GGCCCCCGTGTGTTGGGGGAGTCCCCGGTTCCGCACACCGTCCTCAACCTGAAGGAGCCG 

CTCTACGTAGGGGGCGCTCCCGACTTCAGCAAGCTGGCCCGTGCTGCTGCCGTGTCCTCT 

GGCTTCGACGGTGCCATCCAGCTGGTCTCCCTCGGAGGCCGCCAGCTGCTGACCCCGGAG 

CACGTGCTGCGGCAGGTGGACGTCACGTCCTTTGCAGGTCACCCCTGCACCCGGGCCTCA 

GGCCACCCCTGCCTCAATGGGGCCTCCTGCGTCCCGAGGGAGGCTGCCTATGTGTGCCTG 

TGTCCCGGGGGATTCTCAGGACCGCACTGCGAGAAGGGGCTGGTGGAGAAGTCAGCGGGG 

GACGTGGATACCTTGGCCTTTGACGGGCGGACCTTTGTCGAGTACCTCAACGCTGTGACC 

GAGAGCGAGAAGGCACTGCAGAGCAACCACTTTGAACTGAGCCTGCGCACTGAGGCCACG 

CAGGGGCTGGTGCTCTGGAGTGGCAAGGCCACGGAGCGGGCAGACTATGTGGCACTGGCC 

ATTGTGGACGGGCACCTGCAACTGAGCTACAACCTGGGCTCCCAGCCCGTGGTGCTGCGT 

TCCACCGTGCCCGTCAACACCAACCGCTGGTTGCGGGTCGTGGCACATAGGGAGCAGAGG 

GAAGGTTCCCTGCAGGTGGGCAATGAGGCCCCTGTGACCGGCTCCTCCCCGCTGGGCGCC 

ACGCAGCTGGACACTGATGGAGCCCTGTGGCTTGGGGGCCTGCCGGAGCTGCCCGTGGGC 

CCAGCACTGCCCAAGGCCTACGGCACAGGCTTTGTGGGCTGCTTGCGGGATGTGGTGGTG 

GGCCGGCACCCGCTGCACCTGCTGGAGGACGCCGTCACCAAGCCAGAGCTGCGGCCCTGC 

CCCACCCCATGAGCTGGCACCAGAGCCCCGCGCCCGCTGTAATTATTTTCTATTTTTGTA 

AACTTGTTGCTTTTTGATATGATTTTCTTGCCTGAGTGTTGGCCGGAGGGACTGCTGGCC 

CGGCCTCCCTTCCGTCCAGGCAGCCGTGCTGCAGACAGACCTAGTGCTGAGGGATGGACA 

GGCGAGGTGGCAGCGTGGAGGGCTCGGCGTGGATGGCAGCCTCAGGACACACACCCCTGC 

CTCAAGGTGCTGAGCCCCCGCCTTGCACTGCGCCTGCCCCACGGTGTCCCCGCCGGGAAG 

CAGCCCCGGCTCCTGAATCACCCTCGCTCCGTCAGGCGGGACTCGTGTCCCAAAAAGGAA 

GGGGCTGCTGAGGTCTGATGGGGCCCTTCCTCCGGGTGACCCCACAGGGCCTTTCCAAGC 

CCCTATTTGAGCTGCTCCTTCCTGTGTGTGCTCTGGACCCTGCCTCGGCCTCCTGCGCCA 

ATACTGTGACTTCCAAACAATGTTACTGCTGGGCACAGCTCTGCGTTGCTCCCGTGCTGC 

CTGCGCCAGCCCCAGGCTGCTGAGGAGCAGAGGCCAGACCAGGGCCGATCTGGGTGTCCT 

GACCCTCAGCTGGCCCTGCCCAGCCACCCTGGACATGACCGTATCCCTCTGCCACACCCC 
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AGG C C CTG CG AGGGGC T AT CG AG AGGAGC T CACTGTGGG ATGGGGTTG AC CT CTGC CGC C 
TGCCTGGGTATCTGGGCCTGGCCATGGCTGTGTTCTTCATGTGTTGATTTTATTTGACCC 
CTGGAGTGGTGGGTCTCATCTTTCCCATCTCGCCTGAGAGCGGCTGAGGGCTGCCTCACT 
GCAAATCCTCCCCACAGCGTCAGTGAAAGTCGTCCTTGTCTCAGAATGACCAGGGGCCAG 
CCAGTGTCTGACCAAGGTCAAGGGGCAGGTGCAGAGGTGGCAGGGATGGCTCGGAAGCCA 
GAA 



A NOV98 polypeptide (SEQ ID NO:320) encoded by SEQ ID NO:319 is 1 543 amino 
acids in length and is presented using the one-letter amino acid code in Table 98B. The Psort 
profile for NOV98 predicts that this sequence has no signal peptide and is likely to be 
localized to the cytoplasm with a certainty of 0.4500. In alternative embodiments, a NOV98 
polypeptide is located to microbodies with a certainty of 0.3000. 



Table 98B. NOV98 Polypeptide Sequence (SEQ ID 
NO:320) 

AAC VI* PG AGGT C P ERALE RRE E E ANWLTGT VE E I LNVD P VQHT Y S C KVR VWR Y L KGKDL 
VARE S LL DGGNK W I S G FGD P L I CDNQ VS TGDTR I F F VN PAP P YL W P AHKNE LM LNS S LM 
RITLRNLEEVEFCVEDKPGTHFTPVPPTPPDACRGMLCGFGAVCEPNAEGPGRASCVCKK 
SPCPSWAPVCGSDASTYSNECELQRAQCSQQRRIRLLSRGPCGSRDPCSNVTCSFGSTC 
ARSADGLTASCLCPATCRGAPEGTVCGSDGADYPGECQLLRRACARQENVFKKFDGPCDP 
CQGALPDPSRSCRVNPRTRRPEMRLRPESCPARQAPVCGDDGVTYENDCVMGRSGAARGL 
LLQKVRSGQCQGRDQCPEPCRFNAVCLSRRGRPRCSCDRVTCDGAYRPVCAQDGRTYDSD 
C WRQQ AE C RQ QRA I P S KHQG P CD QAP S P C LG VQ C AFGAT C AVKNGQ AACE CLQ AC S S L YD 
PVCGSDGVTYGSACELEATACTLGREIQVARKGPCDRCGQCRFGALCEAETGRCVCPSEC 
VALAQPVCGSDGHTYPSECMLHVHACTHQISLHVASAGPCETCGDAVCAFGAVCSAGQCV 
CPRCEHPPPGPVCGSDGVTYGSACELREAACLQQTQIEEARAGPCEQAECGSGGSGSGED 
GDCEQELCRQRGGIWDEDSEDGPCVCDFSCQSVPGSPVCGSDGVTYSTECELKKARCESQ 
RGLYVAAQGACRGPTFAPLPPVAPLHCAQTPYGCCQDNITAARGVGLAGCPSACQCNPHG 
SYGGTCDPATGQCSCRPGVGGLRCDRCEPGFWNFRGIVTDGRSGCTPCSCDPQGAVRDDC 
EQMTGLCSCKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCEFGARCVEESGSA 
HCVCPMLTCPEANATKVCGSDGVTYGNECQL.KTI ACRRCHLRQGLQI S IQSLGPCQEAVA 
PSTHPTSASVTVTTPGIiLLSQALPAPPGALPLAPSSTAHSQTTPPPSSRPRTTASVPRTT 
WPVLTVPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGPEPLEGSSVA 
TPGPPVERASCYNPCHGAAPCRVLPEGGAQCECPLGREGTFCQTASGQDGSGPFLADFNG 
FSHLELRGLHTFARDLGEKMALEWFLARGPSGLLLYNGQKTDGKGDFVSLALRDRRLEF 
R YDLG KG AAV I RS RE P VTLGAWTR VS LERNGRKG ALR VGDG P R VL GE S P V P HT VLNL KE P 
L YVGGAPDFS KLARAAAVSSGFDGAI QLVSLGGRQLLTPEHVLRQVDVTS FAGHPCTRAS 
GHPCLNGASCVPREAAYVCLCPGGFSGPHCEKGLVEKSAGDVDTLAFDGRTFVEYLNAVT 
ESEKAL.QSNHFELSLRTEATQGLVLWSGKATERADYVALAIVDGHLQLSYNLGSQPWLR 
S T VP VNTNRW L RWAHREQRE G S LQ VGNE AP VTG S S P LGATQLDTDG ALW LGGL P E L P VG 
PALPKAYGTGFVGCLRDVWGRHPLHLLEDAVTKPELRPCPTP 



A BLAST analysis of NOV98 was run against the proprietary PatP GENE SEQ Protein 
Patent database. It was found, for example, that the amino acid sequence of NOV98 had high 
homology to other proteins as shown in Table 98C. 



Table 98C. BLASTX results from PatP database for NOV98 



Sequences producing High-scoring Segment Pairs: 



Smallest 
Sum 

High Probability- 
Score P(N) 



patp : AAW26609 Human agrin - Homo sapiens, 4 92 aa . 
patp : AAB93 754 Human protein sequence 



2349 
2179 



2 .2e-246 
1 .5e-225 



775 




patp 


AAY73993 


Human 


prostate tumor EST fragment 


2177 


2 


5e- 


225 


patp 


AAB31889 


Amino 


acid sequence of a human protein 


380 


1 


2e- 


62 


patp 


AAU16 93 8 


Human 


novel secreted protein 


551 


1 


2e- 


51 



In a search of sequence databases, it was found, for example, that the nucleic acid 
sequence of this invention has 3158 of 3416 bases (92%) identical to a gbiGENBANK- 
ID: AF01 6903 |acc:AF0 16903.1 mRNA from Homo sapiens (agrin precursor mRNA). The full 
5 amino acid sequence of the protein of the invention was found to have 1092 of 1 1 14 amino 
acid residues (98%) identical to, and 1093 of 1 1 14 amino acid residues (98%) similar to, the 
2026 amino acid residue ptnr:SPTREMBL-ACC:O00468 protein from Homo sapiens 
(Human) (AGRIN PRECURSOR). NOV98 also has homology to the other proteins shown in 
the BLASTP data in Table 98D. 
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Table 98D. NOV98 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 

(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi | 2988422 | gb 
| AAC39776 . 1 | 
(AF016903) 


agrin precursor 
[Homo sapiens} 


2026 


1088/1097 
(99) 


1088/1097 
(99) 


0 . 0 


gi | 399021 | sp | 
P253 04 | AGRI_R 
AT 


Agrin precursor 


1959 


772/958 
(80) 


825/958 
(85) 


0 . 0 


gi|202800|gb| 
AAA40703 . 1 | 
(M64780) 


agrin [Rattus 
norvegicus] 


1940 


772/958 
(80) 


825/958 
(83) 


0.0 


gi | 202799 | gb | 
AAA40702 . 1 | 
(M64780) 


agrin [Rattus 
norvegicus] 


1937 


769/958 
(80) 


822/958 
(85) 


0 . 0 


gi | 399020 | sp | 
P316 96 | AGRI_C 
HICK 


Agrin precursor 


1169 


637/1017 
(62) 


758/1017 
(73) 


0 . 0 



This BLASTP data is displayed graphically in the ClustalW in Table 98E. A multiple 
sequence alignment is given, with the NOV98 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
15 Table 98D. 



Table 98E. ClustalW Alignment of NOV98 


NOV9 8 


(SEQ 


ID NO: 320) 


gi |2988422 | 


(SEQ 


ID NO:789) 


gi | 399021 | 


(SEQ 


ID NO: 790) 


gi |202800| 


(SEQ 


ID NO:79l) 


gi | 202799 | 


(SEQ 


ID NO: 792) 


gi | 399020 j 


(SEQ 


ID NO: 793) 






10 20 30 40 50 
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NOV9 8 

gi [2988422 | 
gi j 399021 | 
gi | 202800 | 
gi | 202799 | 
gi |399020| 



NOV 9 8 

gi |2988422 | 
gi | 399021 
gi |202800 
gi | 202799 
gi | 399020 



NOV9 8 

gi | 2988422 | 
gi|39902l| 
gi j 202800 j 
gi | 202799 | 
gi|399020| 



NOV9 8 

gi | 2988422 | 
gi | 399021 | 
gi | 202800 j 
gi | 202799 j 
gi | 399020 | 



NOV9 8 

gi | 2988422 | 
gi j 399021 1 
gi |202800| 
gi [202799 | 
gi |399020| 



NOV9 8 

gi | 2988422 | 
gi j 399021 | 
gi |202800 | 
gi j 202799 j 
gi|399020| 



NOV9 8 

gi [2988422 | 
gi j 399021 1 
gi | 202800 | 
gi j 202799 j 
gi | 399020 | 



AACVLPGAGGTC 
AACVLPGAGGTC 




JeeManwltgtvee I LNVDPVQHTYSCKVR 
eMaNVVLTGTVEE I LNVDPVQHTYSCKVR 
PR@E 



PRQE- 

IprJe- 



60 

....I L- 

VW RYL KgKDlj V 

wrylkHkd£\| 

pHasKv! 

pBasS 



70 



100 




80 90 

|....|....|....|....|....|....|....| 

Jeslldggnkwisgfgdplicdnqvstgdtriffvnp 
islldggnkwi sgfgdplicdnqvstgdtri ffvnp 




120 



130 



140 



150 



160 



IritiIrnleeVe 
IlhritLrnleeve 

}FA§ %L 

&FA?I 1|L 

'^GFAV &L 

,MR ITjiRNLEEVEHCVEEHRKLLAD 

170 



200 



PffiPPTPPDgCRGMLCGFGAVCEPj 
pRpPTPPDHcRGMLCGFGAVCEPi 

pSpptppdScrgmlcgfgavceps 
pHpptppdRcrgmlcgfgavceps 
uHpptppdRcrgmlcgfgavceps 
pptpSdScrgmlcgfgavceS, 



QT 



210 



220 



I 



dastysnecelqraqcgqqrrirll 
dastysnecelqraqc§qqrrirll 
dastysnecelqraqcnqqrrirll 
dastysnecelqraqcnqqrrirllj 

DASTYSNECELQRAQCNQQRRIRLL^ 

dBstysnecelSSaqcnqqrri! 




250 



Ri 

r| 



260 



270 



280 



300 



NOV9 8 






gi | 2988422 | 






gi j 399021 1 




5»H 


gi|202800| 






gi | 202799 j 






gi |399020| 








310 



320 




I 



360 



crvnprtr 
crvnprtr 
crvnprtr 
crvnprtr 
crvnprtr 
crvnprtr 



370 




350 



^1 


CGDDGV 




CGDDGV 


3i 


CGDDGV 


|i 


CGDDGV 


Si 


CGDDGV 




CGDDGV 



380 
• - I - . 



390 



400 
^1 



TYENDCVMgRgGAgRGLLLQKVRSGQCQgRDQCPEraCgFNgVCLSRRGRP 

tyendcvmBrBgaBrglllqkvrsgqcqBrdqcpeHcSfnBvclsrrgrp 
tyendcvm1rHgaBrglllqkvrsgqcqHrdqcpeBc|fnSvclsrrgrp 
tyendcvmBrHgaBrglllqkvrsgqcqHrdqcpeHcSfnwclsrr 
tyendcvmHrHgaHrglllqkvrsgqcqtodqc^^ 

TY^^CVM§R{|GAgRGL!^QKVRSGQCQ^D|ci^^c|FNSvCL[SRHB 
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NOV9 8 

gi|2988422| 
gi j 399021 | 
gi |202800 | 
gi | 202799 j 
gi |399020| 



NOV9 8 

gi |2988422 | 
gi|39902l| 
gi j 202800 j 
gi | 202799 | 
gi|399020| 



NOV9 8 

gi | 2988422 | 
gi j 399021 | 
gi | 202800 j 
gi | 202799 | 
gi | 399020 | 



NOV9 8 

gi | 2988422 | 
gi|39902l| 
gi | 202800 | 
gi j 202799 j 
gi j 399020 j 



NOV9 8 

gi | 2988422 | 
gi|39902l| 
gi | 202800 j 
gi | 202799 j 
gi | 399020 j 



410 



cscdrvtcdgi 
cscdrvtcdg 
cscdrvtcdg 
cscdrv tcd g 

cscdrv^h g 

cscdrBtcdg 



450 



NOV9 8 

gi|2988422| 
gi|39902l| 
gi | 202800 j 
gi | 202799 | 
gi | 399020 j 



NOV9 8 

gi | 2988422 | 
gi|39902l| 




610 



620 



640 
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Ill 
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! 

Q 
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* 
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? ■ 
»le 





1 



HKS 



NOV9 8 


GPCE 




gi | 2988422 | 


GPCE 


h 


gi j 399021 1 


GPCE 


py 


gi | 202800 | 


GPCE 


|a 


gi |202799| 


GPCE 


5a 


gi | 399020 j 


GPCE 


D- 




TAAlGVGLAGCPSgcfflCNPHGSYgGTCDPATGQ 
TAAgGVGLAGCPSacfflcNPHGSYBGTCDPATGQ 
TAA§GVGIiAGCPSTcScNPHGSY§GTCDPATGQ 
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gi | 202800 | 
gi | 202799 j 
gi j 399020 j 



NOV9 8 

gi | 2988422 | 
gi |39902l| 
gi j 202800 | 
gi |202799( 
gi|399020| 



NOV98 

gi | 2988422 | 
gi | 399021 1 
gi (202800 | 
gi | 202799 j 
gi | 399020 | 



NOV98 

gi | 2988422 | 
gi (399021 1 
gi (202800 | 
gi j 202799 | 
gi [399020 j 




960 



NOV98 


Q 




L 




IS 


SQSI 


gi | 2988422 | 


Q 




L 




IS 


UQSI 


gi | 399021 | 


Q 




L 




IS 


Sqsi 


gi|202800| 


Q 




L 




IS 


Hqsi 


gi|202799| 


Q 




L 




IS 


Hqsi 


gi|399020| 


Q 


GQL 


ll 





970 
I 



980 



990 



1000 



STH 


PTSAS 


STH 


PTSAS 




PTSAS 


± 


PTSAS 




PTSAS 




rMPP^PLPTLPLDKL! 



1010 



NOV98 


gi 


2988422 


gi 


399021 | 


gi 


202800 | 


gi 


202799 | 


gi 


399020 j 




1080 



NOV98 

gi | 2988422 | 
gi|39902l| 
gi j 202800 j 
gi | 202799 j 
gi|399020| 



NOV 9 8 

gi | 2988422 | 
gi|39902l| 
gi | 202800 | 
gi j 202799) 
gi|399020| 




1090 



1100 



J 



I 



ELS GDwE AS GGGSGGQE Pi 

elsgdBeasgggsgglep 
elsgdSeasgggsgglep 
elsgd|easgggsgglep 
elsgdIeasgggsgglep 



1130 



1140 

• ♦ I — I 



1150 

-•I 



EGSNCPATKj 
EGSNCPATK 1 
EGSNCPATK 
EGSNCPATK 
EGSNCPATK 



1160 



1170 



1180 



1190 
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gvegqelfytpemadpkselfgetarsiestlddlfrnsdvkkdfgsvrl 
gvegqelfytpemadpkselfgetarsiestlddlfrnsdvkkdfSsvrl 
gvegqelfytpemadpkselfgetarsiestlddlfrnsdvkkdfBsvrl 
gvegqelfytpemadpkselfgetarsiestlddlfrnsdvkkdfEsvrl 

gVEGQELFYTPEMADPKSELFGETARSIEsBLD|LFRNSDVKSjDFjSs|Rg 



NOV9 8 

gi | 2988422 | 
gi |39902l| 
gi | 202800 j 
gi j 202799 | 
gi | 399020 | 



NOV98 

gi | 2988422 | 
gi | 399021 1 
gi | 202800 j 
gi|202799| 
gi |399020| 



NOV9 8 

gi | 2988422 | 
gi |399021| 
gi j 202800 j 
gi |202799 j 
gi|399020| 



NOV 9 8 

gi | 2988422 | 
gi |399021 1 
gi | 202800 j 
gi |202799| 
gi|399020| 



NOV9 8 

gi 
gi 
gi 
gi 
gi 



2988422 | 
39902l| 
202800 j 
202799 | 
399020 | 



NOV9 8 

gi | 2988422 | 
gi|39902l| 
gi j 202800 j 
gi |202799| 
gi |399020| 



NOV9 8 


gi 


2988422 


gi 


399021 | 


gi 


202800 | 


gi 


202799 j 


gi 


399020 j 



NOV9 8 

gi |2988422 | 
gi j 399021 1 
gi | 202800 | 
gi j 202799 | 
gi | 399020 | 




1310 
..|... 



1320 
..|... 



1330 
..|... 



1340 
..|... 



1350 
••I 




1410 



1420 



1430 



1440 



1450 




1460 
..|... 



1470 



1480 



1490 



1500 




1510 



1520 



1530 



1540 



1550 
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1560 
..|... 



1570 
..|... 



1580 
..|... 



1590 



NOV 9 8 

gi | 2988422 | 
gi j 399021 | 
gi | 202800 | 
gi |202799| 
gi | 399020 | 



NOV98 

gi | 2988422 | 
gi | 399021 1 
gi j 202800 | 
gi | 202799 | 
gi j 399020 j 



NOV98 

gi | 2988422 | 
gi | 399021 | 
gi j 202800 j 
gi|202799| 
gi|399020| 



NOV9 8 

gi | 2988422 | 
gi | 399021 | 
gi | 202800 j 
gi|202799| 
gi |399020| 



NOV 9 8 

gi | 2988422 | 
gi|39902l| 
gi j 202800 j 
gi |202799| 
gi|399020| 



NOV9 8 

gi | 2988422 | 
gi j 399021 1 
gi j 202800 j 
gi | 202799 j 
gi |399020| 



NOV 9 8 

gi | 2988422 | 
gi | 399021 1 
gi | 202800 j 
gi|202799| 
gi|399020| 



NOV98 

gi | 2988422 | 
gi | 399021 1 



1600 
-I 



grlG| 
BBvr 




1800 



1910 

..I... 



1920 
. . I - . . 



1930 
. . I . . . 



1940 



1950 



SELTNE I PAPETLDS RAL F 



SEKALQSNHFELSLRTEATQGLVLWgGKAjjE 
SEKALQSNHFELSLRTEATQGLVLwBGKAjjE 
SEKALQSNHFELSLRTEATQGLVLWgGKAgE 
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sekalqsnhfelslrteatqglvlwHgkaSe 
sekalosnhfelslrteatqglvlwHgk aB e 

SEKALQSNHFELSE^TEATQGLBLW§GKgjE 



SLQVGNEAPVTGSSPLGATQLDTDGALWLGGLg| 
SLQVGNE APVTGS S PLGATQLDTDGALWLGGLgyjj 

slqvgneapvtgssplgatqldtdgalwlgglg| 
slqvgneapvtgs s plgatqldtdgalwlggl5b 
slqvgneapvtgssplgatqldtdgalwlgglSj 
slqvgneapvtgs s plgatqldtdgalwlggjmil 



gi | 202800 | 
gi |202799| 
gi|399020| 



NOV98 

gi | 2988422 | 
gi |39902l| 
gi j 202800 j 
gi|202799| 
gi|399020 j 



NOV9 8 

gi | 2988422 | 
gi j 399021 | 
gi j 202800 | 
gi j 202799 j 
gi|399020| 



NOV 9 8 

gi | 2988422 | 
gi|39902l| 
gi | 202800 j 
gi | 202799 | 
gi|399020| 



Table 98F lists the domain description from DOMAIN analysis results against NOV98. 
This indicates that the NOV98 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 98F. Domain Analysis of NOV98 


enl!PfamlDfam00054, laminin G, Laminin G domain SEQ ID NO: 888 








CD-Length = 134 residues, 99.3% aligned 






Score = 


152 bits (385), Expect = le-37 




NOV 9 8 


: 1396 


RTEATQGLVLWSGKATERADYVAXjAIVDGHLQLSYNLGSQPVVIjRSTVPVNTNRWLRVVA 


1455 






RT GL+L+ G T+R D++AL + DG L++SY+LGS P V+RS +N +W RV 




Sbj Ct 


: 2 


RTTE P S GLL L YGGTNTDR - D FL ALE LRDGRLE V S YDLGSGPAWRS GDRLNDGK WHR VE L 


60 


NOV 9 8 


: 1456 


HREQREGSLQVGNEAPVTGSSPLGATQ LDTDGALWLGGLPELPVGPALPKAYGTGF 


1511 






R R+G+L V E V G SP G LD D L++GGLPE L A T F 




Sbjct 


: 61 


ERNGRKGTLSVDGEESVDGESPSGPDVPHENLDLDTPLYVGGLPE-LSVKRLLAAISTSF 


119 


NOV98 


: 1512 


VGCLRDVWGRHPLH 152 6 








GC+RDV+V PL 




Sbjct 


: 120 


KGCIRDVIVNGKPLD 134 





Synapses are essential relay stations for the transmission of information between 
neurones and other cells. An ordered and tightly regulated formation of these structures is 
crucial for the functioning of the nervous system. The synapse is also involved in perception, 
10 learning and memory. Understanding the sequence of steps that is involved in establishing 
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{ 




synapses during development might also help to understand mechanisms that cause changes in 
synapses during learning and memory. 

For practical reasons, most of the current knowledge of synapse development is 
derived from studies of the vertebrate neuromuscular junction. Upon arrival of a motor axon at 
5 the muscle fiber, signals released from its growth cone initiate the formation of a synapse. This 
process consists of two stages: arrest of axon growth at the target area and differentiation of 
pre- and postsynaptic cells at the site of nerve-muscle contact. 

Studies of regenerating neuromuscular junctions in vertebrates have revealed that 
important signals for the formation of this synapse are located in the synaptic basal lamina, 
10 and attempts to identify these signals have led to the isolation of agrin and other components. 
The induction of the intensively studied synapse between nerve and muscle is initiated by the 
binding of neuron-specific isoforms of the basal membrane protein agrin to receptors on the 
surface of myotubes. Agrin activates a receptor complex that includes the muscle-specific 
kinase and most likely additional, yet to be identified, components. Receptor activation leads 
1 5 to the aggregation of acetylcholine receptors (AChR) and other proteins of the postsynaptic 

apparatus. This activation process has unique features which distinguish it from other receptor 
tyrosine kinases. In particular, the autophosphorylation of the kinase domain, which usually 
induces the recruitment of adaptor and signalling molecules, is not sufficient for AChR 
aggregation. Apparently, interactions of the extracellular domain with unknown components 
20 are also required for this process. 

Agrin binds to a second protein complex on the muscle surface known as the 
dystrophin-associated glycoprotein complex. This binding forms one end of a molecular link 
between the extracellular matrix and the cytoskeleton. 

While many components of the machinery triggering postsynaptic differentiation have 
25 now been identified, the picture of the molecular pathway causing the redistribution of 

synaptic proteins is still incomplete. Recent advances implicate proteins such as dystroglycan, 
MuSK, and rapsyn in the transduction of agrin signals. Additional functions of agrin have 
been discovered, including the upregulation of gene transcription in myonuclei and the control 
of presynaptic differentiation. 
30 Agrin therefore appears to play a unique role in controlling synaptic differentiation on 

both sides of the neuromuscular junction. 

NOV98 is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
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kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus, bone, cerebral medulla/ cerebral white matter, cervix, colon, epidermis, 
foreskin, hair follicles, liver, lung, lymphoid tissue, ovary, parathyroid gland, parietal lobe, 

5 retina, skin, vein, whole organism. This information was derived by determining the tissue 
sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, public EST sources, literature sources, and/or RACE sources. Further 
expression data for NOV98 is provided in Example 2. 

The NOV98 nucleic acids and proteins are useful in potential therapeutic applications 

10 implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 

15 thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 

immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 
conditions. NOV98 nucleic acids encoding the agrin-like protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. 

20 The novel nucleic acid of the invention encoding a agrin-like protein includes the 

nucleic acid whose sequence is provided in Table 98B, or a fragment thereof. The invention 
also includes a mutant or variant nucleic acid any of whose bases may be changed from the 
corresponding base shown in Table 98B while still encoding a protein that maintains its agrin- 
like activities and physiological functions, or a fragment of such a nucleic acid. The invention 

25 further includes nucleic acids whose sequences are complementary to the sequence of Table 
98B, including nucleic acid fragments that are complementary to any of the nucleic acids just 
described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
30 include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 

phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
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In the mutant or variant nucleic acids, and their complements, up to about 8% of the bases may 
be so changed. 

The novel protein of the invention includes the agrin-like protein whose sequence is 
provided in Table 98B. The invention also includes a mutant or variant protein any of whose 
residues may be changed from the correspondinkresidue shown in Table 98B while still 
encoding a protein that maintains its agrin-like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 2% df the amino acid 
residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies" section 
below. 



15 NOV99 

The disclosed NOV99 (alternatively referred to herein as CG570 10-01) includes the 
1513 nucleotide sequence (SEQ ID NO:32 1 ) shown in Table 99A. A NOV99 ORF begins 
with a Kozak consensus ATG initiation codon at nucleotides 396-398 and ends with a TGA 
codon at nucleotides 1410-1412. The disclosed NOV99 maps to human chromosome 
20 14q32.33. 



Table 99 A. NOV99 Nucleotide Sequence (SEQ ID 
NO:321) 



GGAGCCCCCGCCCTGGGATTCCCAGGTGTTTTCATTTGGTGATCAGCACTGAACACAGAA 
GAGTCATGACGGAGTTTGGGCTGAGCTGGGTTTTCCTTGTTGCTATTTTTAAAGGTGTCC 
AGTGTGAGGTGCAGCTGGTGGAGTCTGGGGGAGACTTGGTCCAGCCTGGGGGGTCCCTGA 
GACTCTCCTGTGCAGCCTCTGGATTCACCTTCAGTAGTTATGCTATGCACTGGGTCCGCC 
AGGCTCCAGGGAAGGGACTGAAATATGTTTCAGGTATTAGTAGTAATGGGCGTAGAACAT 
ATTATGCAAATTCTGTGAAGGGCAGATTCACCATCTCCAGAGACAATTCCAAGAACACGT 
TGTATCTTCAAATGGGCAGCCTGAGAGCTGAGGACATGGCTGTGTATTACTGTGTGTCCG 
GGGGAATCTATGATAGTAGTGGTCCCTTTGACTACTGGGGCCAGGGAACCCTGGTCACCG 
TCTCCTCAGCATCCCCGACCAGCCCCAAGGTCTTCCCGCTGAGCCTCTGCAGCACCCAGC 
CAGATGGGAACGTGGTCATCGCCTGCCTGGTCCAGGGCTTCTTCCCCCAGGAGCCACTCA 
GTGTGACCTGGAGCGAAAGCGGACAGGGCGTGACCGCCAGAAACTTCCCACCCAGCCAGG 
ATGCCTCCGGGGACCTGTACACCACGAGCAGCCAGCTGACCCTGCCGGCCACACAGTGCC 
TAGCCGGCAAGTCCGTGACATGCCACGTGAAGCACTACACGAATCCCAGCCAGGATGTGA 
CTGTGCCCTGCCCAGTTCCCTCAACTCCACCTACCCCATCTCCCTCAACTCCACCTACCC 
CATCTCCCTCATGCTGCCACCCCCGACTGTCACTGCACCGACCGGCCCTCGAGGACCTGC 
TCTTAGGTTCAGAAGCGAACCTCACGTGCACACTGACCGGCCTGAGAGATGCCTCAGGTG 
TCACCTTCACCTGGACGCCCTCAAGTGGGAAGAGCGCTGTTCAAGGACCACCTGAGCGTG 
ACCTCTGTGGCTGCTACAGCGTGTCCAGTGTCCTGCCTGGCTGTGCCCAGCCATGGAACC 
ATGGGGAGACCTTCACCTGCACTGCTGCCCACCCCGAGTTGAAGACCCCACTAGTTCGCT 
GGCTGCAGGGGTCACAGGAGCTGCCCCGCGAGAAGTACCTGACTTGGGCATCCCGGCAGG 
AGCCCAGCCAGGGCACCACCACCTTCGCTGTGACCAGCATACTGCGCGTGGCAGCCGAGG 
ACTGGAAGAAGGGGGACACCTTCTCCTGCATGGTGGGCCACGAGGCCCTGCCGCTGGCCT 
TCACACAGAAGACCATCGACCGCTTGGCGGGTAAACCCACCCATGTCAATGTGTCTGTTG 
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TCATGGTGGAGGTGGACGGCACCTGCTACTGAGCCGCCCGCCTGTCCCCACCCCTGAATA 
AACTCCATGCTCCCCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA - 



A NOV99 polypeptide (SEQ ID NO:322) encoded by SEQ ID NO:321 is 338 amino 
acids in length and is presented using the one-letter amino acid code in Table 99B. The Psort 
profile for NOV99 predicts that this sequence has no signal peptide and is likely to be 
localized to the cytoplasm with a certainty of 0.4500. In alternative embodiments, a NOV99 
polypeptide is located to microbodies with a certainty of 0.1315. 



Table 99B. NOV99 Polypeptide Sequence (SEQ ID 
NO:322) 



MAVYYCVSGGIYDSSGPFDYWGQGTL.VTVSSASPTSPKVFPLSLCSTQPDGNWIACLVQ 
GFFPQEPLSVTWSESGQGVTARNFPPSQDASGDLYTTSSQLTLPATQCLAGKSVTCHVKH 
YTNPSQDVTVPCPVPSTPPTPSPSTPPTPSPSCCHPRLSLHRPAIiEDLLLGSEANLTCTL 
TGLRDASGVTFTWTPSSGKSAVQGPPERDLCGCYSVSSVLPGCAQPWNHGETFTCTAAHP 
ELKTPLVRWLQGSQELPREKYLTWASRQEPSQGTTTFAVTSILRVAAEDWKKGDTFSCMV 
GHEAL PLAFTQKTIDRLAGKPTHVNVSWMVEVDGTCY 



A BLAST analysis of NOV99 was run against the proprietary PatP GENESEQ Protein 
10 Patent database. It was found, for example, that the amino acid sequence of NOV99 had high 
homology to other proteins as shown in Table 99C. 



Table 99C. BLASTX results from PatP database for NOV99 



Sequences producing High- scoring Segment Pairs: 

patp:AAY884 8 3 Cancer suppressor gene product 
patp: AAB82 914 Human immune response protein HIRP3 
patp:AAM93283 Human polypeptide, 

patp :AAY44 723 Human immune system molecule, ISMO-4 
patp:AAY963Q4 Human IGFAM-16 immunoglobulin - Homo sapiens 



Smallest 
Sum 

High Probability 
Score P (N) 

1335 9.8e-187 

1268 1.2e-179 

1266 1.9e-179 

1262 5.1e-179 

1254 3.6e-178 



In a search of sequence databases, it was found, for example, that the nucleic acid 
15 sequence of this invention has 1 122 of 1 133 bases (99%) identical to a gbiGENBANK- 

ID:AF067420|acc:AF067420.1 mRNA from Homo sapiens (SNC73 protein (SNC73) mRNA). 
The full amino acid sequence of the protein of the invention was found to have 244 of 253 
amino acid residues (96%) identical to, and 247 of 253 amino acid residues (97%) similar to, 
the 384 amino acid residue ptnr:SPTREMBL-ACC:Q9UP60 protein from Homo sapiens 
20 (Human) (SNC73 PROTEIN)(Fig. 3B). In addition to smaller changes, the sequence of this 
invention lacks 46 internal amino acids, when compared to ptnr:SPTREMBL-ACC:Q9UP60 
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protein from Homo sapiens (Human) (SNC73 PROTEIN). NOV99 also has homology to the 
other proteins shown in the BLASTP data in Table 99D. 



Table 99D. NOV99 BLASTP results 


Gene Index / 
Identifier 


Protein / Organism 


Length 
(aa) 


Identity 

(%) 


Positive 
(%) 


Expect 


gi | 229537 |prf 
| | 752400A 


IgA H 

[ Homo s api en s ] 


475 


292/384 
(76) 


325/384 
(84) 


e-168 


gi | 229585 |prf 
| | 763134A 


IgAl Bur 
[Homo sapiens] 


686 


298/383 
(77) 


323/383 
(83) 


e-165 


gi | 223099 |prf 
| | 0506249A 


IgA-alphal Bur 
[Homo sapiens] 


472 


294/383 
(76) 


323/383 
(83) 


e-165 


gi|3201900|gb 
| AAC19365 . 1 | 
(AF067420) 


SNC73 protein 
[Homo sapiens] 


384 


333/384 
(86) 


336/384 
(86) 


e-155 


gi | 14042015 |d 
bj |BAB55072 . 1 
| (AK027379) 


unnamed protein product 
[Homo sapiens] 


494 


321/387 
(82) 


325/387 
(83) 


e-150 



5 This BLASTP data is displayed graphically in the ClustalW in Table 99E. A multiple 

sequence alignment is given, with the NOV99 protein being shown on line 1 in a ClustalW 
analysis comparing the protein of the invention with the related protein sequences shown in 
Table 99D. 



Table 99E. ClustalW Alignment of NOV99 



NOV9 9 (SEQ ID NO: 3 22) 

gi|229537| (SEQ ID NO:794) 

gi|229585| (SEQ ID NO:795) 

gi|223099| (SEQ ID NO:796) 

gi|3201900| (SEQ ID NO:797) 

gi|l4042015| (SEQ ID NO:798) 



40 50 



NOV9 9 
gi | 229537 | 
gi | 229585 j 
gi | 223099 | 
gi | 3201900 | 
gi j 14042015 | 



NOV9 9 
gi|229537| 
gi j 229585 j 
gi j 223099 j 
gi j 3201900 | 
gi j 14042015 | 



NOV9 9 



ESALTZPRSVSGSPGHSVTISCIGTSSNVGDYKYVSWYZZHPGKAPKLII 



60 70 80 90 100 

|....|....|...-|....|....|....|.-.-|....|....| 



YEVSSRPSGVPDRFSGSKSGBTASLTISGLQAEDEABYYCCSYIGSYVFG 



110 120 130 140 150 
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gi | 229537 | 
gi | 229585 | 
gi j 223099 j 
gi | 3201900 | 
gi | 14042015 | 



TGTKVLVIGZPKANPTVTLFPPSSZZLZABKATLVCLISBFYPGAVTVAW 



NOV9 9 
gi | 229537 | 
gi | 229585 j 
gi |223099 j 
gi | 3201900 | 
gi | 14042015 | 



NOV9 9 

gi | 229537 | 

gi|229585| 

gi j 223099 | 

gi|3201900| 

gi j 14042015 | 



NOV9 9 

gi | 229537 | 

gi | 229585 | 

gi | 223099 j 

gi j 3201900 | 

gi|l4042015| 



NOV9 9 
gi | 229537 | 
gi |229585 | 
gi j 223099 j 
gi j 3201900 | 
gi | 14042015 | 



NOV9 9 
gi | 229537 | 
gi | 229585 j 
gi j 223099 j 
gi | 3201900 | 
gi | 14042015 | 



NOV9 9 
gi|229537| 
gi j 229585 j 
gi |223099| 
gi | 3201900 | 
gi j 14042015 | 



NOV9 9 
gi | 229537 | 
gi |229585| 
gi |223099| 
gi | 3201900 | 
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Table 99F lists the domain description from DOMAIN analysis results against NOV99. 
This indicates that the NOV99 sequence has properties similar to those of other proteins 
known to contain this domain. 



Table 99F. Domain Analysis of NO V99 



gnllSmartlsmart00407 , IGcl, Immunoglobulin C-Type SEQ ID NO: 889 

CD-Length = 75 residues, 96.0% aligned 
Score - 46.6 bits (109), Expect = 2e-06 



NOV99: 53 WIACLVQGFFPQEPLSVTWSESGQGVT- - ARNFPPSQDASGDLYTTSSQLTLPATQCLA ' 110 

+ CLV GF+P + +VTW ++GQ VT + P +D G Y SS LT+ A+ + 
Sbjct : 2 ATLVCLVTGFYP-PDITVTWLKNGQEVTSGVKTTDPLKDKDG-TYFLSSYLTVSASTWES 59 

NOV99: 111 GKSVTCHVKHYTNP 124 
G TC V H 

Sbjct: 6 0 GDVYTCQVTHEGLT 7 3 
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SNC73 was identified by subtractive hybridization between normal mucosa and 
colorectal cancer tissue as a gene which is down-regulated in colorectal cancer. It is highly 
homologous to the constant region of immunoglobulin alpha-1 chain. In higher vertebrates 
there are five classes of antibodies, IgA, IgD, IgE, IgG, and IgM, each with its own class of 
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heavy chain - alpha, delta, epsilon, gamma, and mu, respectively. IgA molecules have alpha 
chains, IgG molecules have gamma chains, and so on. In addition, there are a number of 
subclasses of IgG and IgA immunoglobulins; for example, there are four human IgG 
subclasses (IgGl, IgG2, IgG3, and IgG4) having gammal, gamma2, gamma3, and gamma4 
heavy chains, respectively. 

The various heavy chains impart a distinctive conformation to the hinge and tail 
regions of antibodies and give each class (and subclass) characteristic properties of its own. 
IgA is the principal class of antibody in secretions (saliva, tears, milk, and respiratory and 
intestinal secretions). It is transported through secretory epithelial cells from the extracellular 
fluid into the secreted fluid by the Poly Ig receptor, another type of Fc receptor that is unique 
to secretory epithelia. IgA serves both to defend against local infection and to prevent access 
of foreign antigens to the general immunologic system. This function is in accord with the 
potential role of SNC73 in colorectal cancer. 

NOV99 is predicted to be expressed in at least the following tissues: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 
kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, 
salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, 
trachea and uterus. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
public EST sources, literature sources, and/or RACE sources. Further expression data for 
NOV99 is provided in Example 2. 

The NOV99 nucleic acids and proteins are useful in potential therapeutic applications 
implicated in various pathological disorders described further herein, for example, Von 
Hippel-Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, 
hypercalceimia, Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch- 
Nyhan syndrome, multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral 
disorders, addiction, anxiety, pain, neuroprotection, hemophilia, hypercoagulation, idiopathic 
thrombocytopenic purpura, immunodeficiencies, hemophilia, hypercoagulation, 
immunodeficiencies, graft vesus host disease as well as other diseases, disorders and 
conditions. NOV99 nucleic acids encoding the SNC73-like protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. 
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The novel nucleic acid of the invention encoding a SNC73-like protein includes the 
nucleic acid whose sequence is provided in Table 99A, or a fragment thereof. The invention 
also includes a mutant or variant nucleic acid any of whose bases may be changed from the 
corresponding base shown in Table 99A while still encoding a protein that maintains its 
SNC73-like activities and physiological functions, or a fragment of such a nucleic acid. The 
invention further includes nucleic acids whose sequences are complementary to the sequence 
of Table 99 A, including nucleic acid fragments that are complementary to any of the nucleic 
acids just described. 

The invention additionally includes nucleic acids or nucleic acid fragments, or 
complements thereto, whose structures include chemical modifications. Such modifications 
include, by way of non-limiting example, modified bases, and nucleic acids whose sugar 
phosphate backbones are modified or derivatized. These modifications are carried out at least 
in part to enhance the chemical stability of the modified nucleic acid, such that they may be 
used, for example, as antisense binding nucleic acids in therapeutic applications in a subject. 
In the mutant or variant nucleic acids, and their complements, up to about 1% of the bases may 
be so changed. 

The novel protein of the invention includes the SNC73-like protein whose sequence is 
provided in Table 99B. The invention also includes a mutant or variant protein any of whose 
residues may be changed from the corresponding residue shown in Table 99B while still 
encoding a protein that maintains its SNC73-like activities and physiological functions, or a 
functional fragment thereof. In the mutant or variant protein, up to about 4% of the amino acid 
residues may be so changed. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the substances of the invention for use in therapeutic or diagnostic 
methods. These antibodies may be generated according to methods known in the art, using 
prediction from hydrophobicity charts, as described in the "Anti-NOVX Antibodies'' section 
below. 



NOVX Nucleic Acids and Polypeptides 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
NOVX polypeptides or biologically active portions thereof Also included in the invention are 
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nucleic acid fragments sufficient for use as hybridization probes to identify NOVX-encoding 
nucleic acids (e.g., NOVX mRNAs) and fragments for use as PCR primers for the 
amplification and/or mutation of NOVX nucleic acid molecules. As used herein, the term 
"nucleic acid molecule" is intended to include DNA molecules (e.g., cDNA or genomic 
DNA), RNA molecules (e.g., mRNA), analogs of the DNA or RNA generated using 
nucleotide analogs, and derivatives, fragments and homologs thereof. The nucleic acid 
molecule may be single-stranded or double-stranded, but preferably is comprised double- 
stranded DNA. 

An NOVX nucleic acid can encode a mature NOVX polypeptide. As used herein, a 
"mature" form of a polypeptide or protein disclosed in the present invention is the product of a 
naturally occurring polypeptide or precursor form or proprotein. The naturally occurring 
polypeptide, precursor or proprotein includes, by way of nonlimiting example, the full-length 
gene product, encoded by the corresponding gene. Alternatively, it may be defined as the 
polypeptide, precursor or proprotein encoded by an ORF described herein. The product 
"mature" form arises, again by way of nonlimiting example, as a result of one or more 
naturally occurring processing steps as they may take place within the cell, or host cell, in 
which the gene product arises. Examples of such processing steps leading to a "mature" form 
of a polypeptide or protein include the cleavage of the N-terminal methionine residue encoded 
by the initiation codon of an ORF, or the proteolytic cleavage of a signal peptide or leader 
sequence. Thus a mature form arising from a precursor polypeptide or protein that has 
residues 1 to N, where residue 1 is the N-terminal methionine, would have residues 2 through 
N remaining after removal of the N-terminal methionine. Alternatively, a mature form arising 
from a precursor polypeptide or protein having residues 1 to N, in which an N-terminal signal 
sequence from residue 1 to residue M is cleaved, would have the residues from residue M+l to 
residue N remaining. Further as used herein, a "mature" form of a polypeptide or protein may 
arise from a step of post-translational modification other than a proteolytic cleavage event. 
Such additional processes include, by way of non-limiting example, glycosylation, 
myristoylation or phosphorylation. In general, a mature polypeptide or protein may result 
from the operation of only one of these processes, or a combination of any of them. 

The term "probes", as utilized herein, refers to nucleic acid sequences of variable 
length, preferably between at least about 10 nucleotides (nt), 100 nt, or as many as 
approximately, e.g., 6,000 nt, depending upon the specific use. Probes are used in the 
detection of identical, similar, or complementary nucleic acid sequences. Longer length 
probes are generally obtained from a natural or recombinant source, are highly specific, and 
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much slower to hybridize than shorter-length oligomer probes. Probes may be single- or 
double-stranded and designed to have specificity in PCR, membrane-based hybridization 
technologies, or ELISA-like technologies. 

The term "isolated" nucleic acid molecule, as utilized herein, is one, which is separated 
5 from other nucleic acid molecules which are present in the natural source of the nucleic acid. 
Preferably, an "isolated" nucleic acid is free of sequences which naturally flank the nucleic 
acid (i.e., sequences located at the 5'- and 3'-termini of the nucleic acid) in the genomic DNA 
of the organism from which the nucleic acid is derived. For example, in various embodiments, 
the isolated NOVX nucleic acid molecules can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 

10 kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in 
genomic DNA of the cell/tissue from which the nucleic acid is derived (e.g., brain, heart, liver, 
spleen, etc.). Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can 
be substantially free of other cellular material or culture medium when produced by 
recombinant techniques, or of chemical precursors or other chemicals when chemically 

15 synthesized. 

A nucleic acid molecule of the invention, e.g., a nucleic acid molecule having the 
nucleotide sequence SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162, or a 
complement of this aforementioned nucleotide sequence, can be isolated using standard 
molecular biology techniques and the sequence information provided herein. Using all or a 

20 portion of the nucleic acid sequence of SEQ ID NOS:2n-l, wherein n is an integer between 1 
and 162 as a hybridization probe, NOVX molecules can be isolated using standard 
hybridization and cloning techniques (e.g., as described in Sambrook, et al., (eds.), 
Molecular Cloning: A Laboratory Manual 2 nd Ed., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, 1989; and Ausubel, et al., (eds.), CURRENT PROTOCOLS IN 

25 Molecular Biology, John Wiley & Sons, New York, NY, 1993.) 

A nucleic acid of the invention can be amplified using cDNA, mRNA or alternatively, 
genomic DNA, as a template and appropriate oligonucleotide primers according to standard 
PCR amplification techniques. The nucleic acid so amplified can be cloned into an 
appropriate vector and characterized by DNA sequence analysis. Furthermore, 

30 oligonucleotides corresponding to NOVX nucleotide sequences can be prepared by standard 
synthetic techniques, e.g., using an automated DNA synthesizer. 

As used herein, the term "oligonucleotide" refers to a series of linked nucleotide 
residues, which oligonucleotide has a sufficient number of nucleotide bases to be used in a 
PCR reaction. A short oligonucleotide sequence may be based on, or designed from, a 
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genomic or cDNA sequence and is used to amplify, confirm, or reveal the presence of an 
identical, similar or complementary DNA or RNA in a particular cell or tissue. 
Oligonucleotides comprise portions of a nucleic acid sequence having about 10 nt, 50 nt, or 
100 nt in length, preferably about 15 nt to 30 nt in length. In one embodiment of the 
5 invention, an oligonucleotide comprising a nucleic acid molecule less than 100 nt in length 
would further comprise at least 6 contiguous nucleotides SEQ ID NOS:2n-l, wherein n is an 
integer between 1 and 162, or a complement thereof. Oligonucleotides may be chemically 
synthesized and may also be used as probes. 

In another embodiment, an isolated nucleic acid molecule of the invention comprises a 

10 nucleic acid molecule that is a complement of the nucleotide sequence shown in SEQ ID 

NOS:2n-l, wherein n is an integer between 1 and 162, or a portion of this nucleotide sequence 
(e.g., a fragment that can be used as a probe or primer or a fragment encoding a biologically- 
active portion of an NOVX polypeptide). A nucleic acid molecule that is complementary to 
the nucleotide sequence shown SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162 

15 is one that is sufficiently complementary to the nucleotide sequence shown SEQ ID NOS:2n- 
1, wherein n is an integer between 1 and 162 that it can hydrogen bond with little or no 
mismatches to the nucleotide sequence shown SEQ ID NOS:2n-l, wherein n is an integer 
between 1 and 162, thereby forming a stable duplex. 

As used herein, the term "complementary" refers to Watson-Crick or Hoogsteen base 

20 pairing between nucleotides units of a nucleic acid molecule, and the term "binding" means 
the physical or chemical interaction between two polypeptides or compounds or associated 
polypeptides or compounds or combinations thereof. Binding includes ionic, non-ionic, van 
der Waals, hydrophobic interactions, and the like. A physical interaction can be either direct 
or indirect. Indirect interactions may be through or due to the effects of another polypeptide or 

25 compound. Direct binding refers to interactions that do not take place through, or due to, the 
effect of another polypeptide or compound, but instead are without other substantial chemical 
intermediates. 

Fragments provided herein are defined as sequences of at least 6 (contiguous) nucleic 
acids or at least 4 (contiguous) amino acids, a length sufficient to allow for specific 
30 hybridization in the case of nucleic acids or for specific recognition of an epitope in the case of 
amino acids, respectively, and are at most some portion less than a full length sequence. 
Fragments may be derived from any contiguous portion of a nucleic acid or amino acid 
sequence of choice. Derivatives are nucleic acid sequences or amino acid sequences formed 
from the native compounds either directly or by modification or partial substitution. Analogs 




are nucleic acid sequences or amino acid sequences that have a structure similar to, but not 
identical to, the native compound but differs from it in respect to certain components or side 
chains. Analogs may be synthetic or from a different evolutionary origin and may have a 
similar or opposite metabolic activity compared to wild type. Homologs are nucleic acid 
5 sequences or amino acid sequences of a particular gene that are derived from different species. 

Derivatives and analogs may be full length or other than full length, if the derivative or 
analog contains a modified nucleic acid or amino acid, as described below. Derivatives or 
analogs of the nucleic acids or proteins of the invention include, but are not limited to, 
molecules comprising regions that are substantially homologous to the nucleic acids or 

10 proteins of the invention, in various embodiments, by at least about 70%, 80%, or 95% 

identity (with a preferred identity of 80-95%) over a nucleic acid or amino acid sequence of 
identical size or when compared to an aligned sequence in which the alignment is done by a 
computer homology program known in the art, or whose encoding nucleic acid is capable of 
hybridizing to the complement of a sequence encoding the aforementioned proteins under 

15 stringent, moderately stringent, or low stringent conditions. See e.g. Ausubel, et aL, CURRENT 
Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 1993, and below. 

A "homologous nucleic acid sequence" or "homologous amino acid sequence," or 
variations thereof, refer to sequences characterized by a homology at the nucleotide level or 
amino acid level as discussed above. Homologous nucleotide sequences encode those 

20 sequences coding for isoforms of NOVX polypeptides. Isoforms can be expressed in different 
tissues of the same organism as a result of, for example, alternative splicing of RNA. 
Alternatively, isoforms can be encoded by different genes. In the invention, homologous 
nucleotide sequences include nucleotide sequences encoding for an NOVX polypeptide of 
species other than humans, including, but not limited to: vertebrates, and thus can include, e.g., 

25 frog, mouse, rat, rabbit, dog, cat cow, horse, and other organisms. Homologous nucleotide 
sequences also include, but are not limited to, naturally occurring allelic variations and 
mutations of the nucleotide sequences set forth herein. A homologous nucleotide sequence 
does not, however, include the exact nucleotide sequence encoding human NOVX protein. 
Homologous nucleic acid sequences include those nucleic acid sequences that encode 

30 conservative amino acid substitutions (see below) in SEQ ID NOS:2n-l, wherein n is an 
integer between 1 and 162, as well as a polypeptide possessing NOVX biological activity. 
Various biological activities of the NOVX proteins are described below. 

An NOVX polypeptide is encoded by the open reading frame ("ORF") of an NOVX 
nucleic acid. An ORF corresponds to a nucleotide sequence that could potentially be translated 
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into a polypeptide. A stretch of nucleic acids comprising an ORF is uninterrupted by a stop 
codon. An ORF that represents the coding sequence for a full protein begins with an ATG 
"start" codon and terminates with one of the three "stop" codons, namely, TAA, TAG, or 
TGA. For the purposes of this invention, an ORF may be any part of a coding sequence, with 
5 or without a start codon, a stop codon, or both. For an ORF to be considered as a good 

candidate for coding for a bona fide cellular protein, a minimum size requirement is often set, 
e.g., a stretch of DNA that would encode a protein of 50 amino acids or more. 

The nucleotide sequences determined from the cloning of the human NOVX genes 
allows for the generation of probes and primers designed for use in identifying and/or cloning 
10 NOVX homologues in other cell types, e.g. from other tissues, as well as NOVX homologues 
from other vertebrates. The probe/primer typically comprises substantially purified 
oligonucleotide. The oligonucleotide typically comprises a region of nucleotide sequence that 
hybridizes under stringent conditions to at least about 12, 25, 50, 100, 150, 200, 250, 300, 350 
or 400 consecutive sense strand nucleotide sequence SEQ ID NOS:2n-l, wherein n is an 
1 5 integer between 1 and 1 62; or an anti-sense strand nucleotide sequence of SEQ ID NOS:2n-l , 
wherein n is an integer between 1 and 162; or of a naturally occurring mutant of SEQ ID 
NOS:2n-l, wherein n is an integer between 1 and 162. 

Probes based on the human NOVX nucleotide sequences can be used to detect 
transcripts or genomic sequences encoding the same or homologous proteins. In various 
20 embodiments, the probe further comprises a label group attached thereto, e.g. the label group 
can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such 
probes can be used as a part of a diagnostic test kit for identifying cells or tissues which mis- 
express an NOVX protein, such as by measuring a level of an NOVX-encoding nucleic acid in 
a sample of cells from a subject e.g., detecting NOVX mRNA levels or determining whether a 
25 genomic NOVX gene has been mutated or deleted. 

"A polypeptide having a biologically-active portion of an NOVX polypeptide" refers 
to polypeptides exhibiting activity similar, but not necessarily identical to, an activity of a 
polypeptide of the invention, including mature forms, as measured in a particular biological 
assay, with or without dose dependency. A nucleic acid fragment encoding a "biologically- 
30 active portion of NOVX" can be prepared by isolating a portion SEQ ID NOS:2n-l, wherein n 
is an integer between 1 and 162, that encodes a polypeptide having an NOVX biological 
activity (the biological activities of the NOVX proteins are described below), expressing the 
encoded portion of NOVX protein (e.g., by recombinant expression in vitro) and assessing the 
activity of the encoded portion of NOVX. 
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NOVX Nucleic Acid and Polypeptide Variants 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequences shown in SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162 
due to degeneracy of the genetic code and thus encode the same NOVX proteins as that 
5 encoded by the nucleotide sequences shown in SEQ ID NOS:2n-l, wherein n is an integer 

between 1 and 162. In another embodiment, an isolated nucleic acid molecule of the invention 
has a nucleotide sequence encoding a protein having an amino acid sequence shown in SEQ 
IDNOS:2n, wherein ri is an integer between 1 and 162. 

In addition to the human NOVX nucleotide sequences shown in SEQ ID NOS:2n-l, 
10 wherein n is an integer between 1 and 162, it will be appreciated by those skilled in the art that 
DNA sequence polymorphisms that lead to changes in the amino acid sequences of the NOVX 
polypeptides may exist within a population (e.g., the human population). Such genetic 
polymorphism in the NOVX genes may exist among individuals within a population due to 
natural allelic variation. As used herein, the terms "gene" and "recombinant gene" refer to 
1 5 nucleic acid molecules comprising an open reading frame (ORF) encoding an NOVX protein, 
preferably a vertebrate NOVX protein. Such natural allelic variations can typically result in 
1-5% variance in the nucleotide sequence of the NOVX genes. Any and all such nucleotide 
variations and resulting amino acid polymorphisms in the NOVX polypeptides, which are the 
result of natural allelic variation and that do not alter the functional activity of the NOVX 
20 polypeptides, are intended to be within the scope of the invention. 

Moreover, nucleic acid molecules encoding NOVX proteins from other species, and 
thus that have a nucleotide sequence that differs from the human SEQ ID NOS:2n-l, wherein 
n is an integer between 1 and 162 are intended to be within the scope of the invention. 
Nucleic acid molecules corresponding to natural allelic variants and homologues of the NOVX 
25 cDNAs of the invention can be isolated based on their homology to the human NOVX nucleic 
acids disclosed herein using the human cDNAs, or a portion thereof, as a hybridization probe 
according to standard hybridization techniques under stringent hybridization conditions. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 6 nucleotides in length and hybridizes under stringent conditions to the 
30 nucleic acid molecule comprising the nucleotide sequence of SEQ ID NOS:2n-l, wherein n is 
an integer between 1 and 162. In another embodiment, the nucleic acid is at least 10, 25, 50, 
100, 250, 500, 750, 1000, 1 500, or 2000 or more nucleotides in length. In yet another 
embodiment, an isolated nucleic acid molecule of the invention hybridizes to the coding 
region. As used herein, the term "hybridizes under stringent conditions" is intended to 
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describe conditions for hybridization and washing under which nucleotide sequences at least 
60% homologous to each other typically remain hybridized to each other. 

Homologs {i.e., nucleic acids encoding NOVX proteins derived from species other 
than human) or other related sequences {e.g., paralogs) can be obtained by low, moderate or 
high stringency hybridization with all or a portion of the particular human sequence as a probe 
using methods well known in the art for nucleic acid hybridization and cloning. 

As used herein, the phrase "stringent hybridization conditions" refers to conditions 
under which a probe, primer or oligonucleotide will hybridize to its target sequence, but to no 
other sequences. Stringent conditions are sequence-dependent and will be different in 
different circumstances. Longer sequences hybridize specifically at higher temperatures than 
shorter sequences. Generally, stringent conditions are selected to be about 5 °C lower than the 
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The 
Tm is the temperature (under defined ionic strength, pH and nucleic acid concentration) at 
which 50% of the probes complementary to the target sequence hybridize to the target 
sequence at equilibrium. Since the target sequences are generally present at excess, at Tm, 
50% of the probes are occupied at equilibrium. Typically, stringent conditions will be those in 
which the salt concentration is less than about 1 .0 M sodium ion, typically about 0.01 to 1 .0 M 
sodium ion (or other salts) at 

pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes, primers or 
oligonucleotides {e.g., 10 nt to 50 nt) and at least about 60°C for longer probes, primers and 
oligonucleotides. Stringent conditions may also be achieved with the addition of destabilizing 
agents, such as formamide. 

Stringent conditions are known to those skilled in the art and can be found in Ausubel, 
et aL, (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. 
(1989), 6.3.1-6.3.6. Preferably, the conditions are such that sequences at least about 65%, 
70%, 75%, 85%, 90%, 95%, 98%, or 99% homologous to each other typically remain 
hybridized to each other. A non-limiting example of stringent hybridization conditions are 
hybridization in a high salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM 
EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 mg/ml denatured salmon sperm DNA 
at 65°C, followed by one or more washes in 0.2X SSC, 0.01% BSA at 50°C. An isolated 
nucleic acid molecule of the invention that hybridizes under stringent conditions to the 
sequences SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162, corresponds to a 
naturally-occurring nucleic acid molecule. As used herein, a "naturally-occurring" nucleic 
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acid molecule refers to an RNA or DNA molecule having a nucleotide sequence that occurs in 
nature (e.g., encodes a natural protein). 

In a second embodiment, a nucleic acid sequence that is hybridizable to the nucleic 
acid molecule comprising the nucleotide sequence of SEQ ID NOS:2n-l, wherein n is an 
5 integer between 1 and 162, or fragments, analogs or derivatives thereof, under conditions of 
moderate stringency is provided. A non-limiting example of moderate stringency 
hybridization conditions are hybridization in 6X SSC, 5X Denhardt's solution, 0.5% SDS and 
100 mg/ml denatured salmon sperm DNA at 55°C, followed by one or more washes in 
IX SSC, 0.1% SDS at 37°C. Other conditions of moderate stringency that may be used are 

10 well-known within the art. See, e.g., Ausubel, et al. (eds.), 1993, Current Protocols in 
Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1990; Gene Transfer and 
Expression, A Laboratory Manual, Stockton Press, NY. 

In a third embodiment, a nucleic acid that is hybridizable to the nucleic acid molecule 
comprising the nucleotide sequences SEQ ID NOS:2n-l, wherein n is an integer between 1 

15 and 162, or fragments, analogs or derivatives thereof, under conditions of low stringency, is 
provided. A non-limiting example of low stringency hybridization conditions are 
hybridization in 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% 
PVP, 0.02% Ficoll, 0.2% BSA, 100 mg/ml denatured salmon sperm DNA, 10% (wt/vol) 
dextran sulfate at 40°C, followed by one or more washes in 2X SSC, 25 mM Tris-HCl (pH 

20 7.4), 5 mM EDTA, and 0.1% SDS at 50°C. Other conditions of low stringency that may be 
used are well known in the art (e.g., as employed for cross-species hybridizations). See, e.g., 
Ausubel, et al. (eds.), 1993, Current Protocols in Molecular Biology, John Wiley & 
Sons, NY, and Kriegler, 1990, Gene Transfer and Expression, A Laboratory Manual, 
Stockton Press, NY; Shilo and Weinberg, 1981. Proc Natl Acad Sci USA 78: 6789-6792. 

25 

Conservative Mutations 

In addition to naturally-occurring allelic variants of NOVX sequences that may exist in 
the population, the skilled artisan will further appreciate that changes can be introduced by 
mutation into the nucleotide sequences SEQ ID NOS:2n-l, wherein n is an integer between 1 
30 and 162, thereby leading to changes in the amino acid sequences of the encoded NOVX 
proteins, without altering the functional ability of said NOVX proteins. For example, 
nucleotide substitutions leading to amino acid substitutions at "non-essential" amino acid 
residues can be made in the sequence SEQ ID NOS:2n, wherein n is an integer between 1 and 
162. A "non-essential" amino acid residue is a residue that can be altered from the wild-type 
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sequences of the NOVX proteins without altering their biological activity, whereas an 
"essential" amino acid residue is required for such biological activity. For example, amino 
acid residues that are conserved among the NOVX proteins of the invention are predicted to be 
particularly non-amenable to alteration. Amino acids for which conservative substitutions can 
5 be made are well-known within the art. 

Another aspect of the invention pertains to nucleic acid molecules encoding NOVX 
proteins that contain changes in amino acid residues that are not essential for activity. Such 
NOVX proteins differ in amino acid sequence from SEQ ID NOS:2n-l, wherein n is an 
integer between 1 and 1 62 yet retain biological activity. In one embodiment, the isolated 

10 nucleic acid molecule comprises a nucleotide sequence encoding a protein, wherein the protein 
comprises an amino acid sequence at least about 45% homologous to the amino acid 
sequences SEQ ID NOS:2n, wherein n is an integer between 1 and 162. Preferably, the 
protein encoded by the nucleic acid molecule is at least about 60% homologous to SEQ ID 
NOS:2n, wherein n is an integer between 1 and 162; more preferably at least about 70% 

15 homologous SEQ ID NOS:2n, wherein n is an integer between 1 and 162; still more preferably 
at least about 80% homologous to SEQ ID NOS:2n, wherein n is an integer between 1 and 
162; even more preferably at least about 90% homologous to SEQ ID NOS:2n, wherein n is an 
integer between 1 and 162; and most preferably at least about 95% homologous to SEQ ID 
NOS:2n, wherein n is an integer between 1 and 162. 

20 An isolated nucleic acid molecule encoding an NOVX protein homologous to the 

protein of SEQ ID NOS:2n, wherein n is an integer between 1 and 162 can be created by 
introducing one or more nucleotide substitutions, additions or deletions into the nucleotide 
sequence of SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162, such that one or 
more amino acid substitutions, additions or deletions are introduced into the encoded protein. 

25 Mutations can be introduced into SEQ ID NOS:2n-l, wherein n is an integer between 1 

and 162 by standard techniques, such as site-directed mutagenesis and PCR-mediated 
mutagenesis. Preferably, conservative amino acid substitutions are made at one or more 
predicted, non-essential amino acid residues. A "conservative amino acid substitution" is one 
in which the amino acid residue is replaced with an amino acid residue having a similar side 

30 chain. Families of amino acid residues having similar side chains have been defined within 
the art. These families include amino acids with basic side chains (e.g., lysine, arginine, 
histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains 
(e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side 
chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, 
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tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side 
chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted non-essential 
amino acid residue in the NOVX protein is replaced with another amino acid residue from the 
same side chain family. Alternatively, in another embodiment, mutations can be introduced 
5 randomly along all or part of an NOVX coding sequence, such as by saturation mutagenesis, 
and the resultant mutants can be screened for NOVX biological activity to identify mutants 
that retain activity. Following mutagenesis SEQ ID NOS:2n-l, wherein n is an integer 
between 1 and 162, the encoded protein can be expressed by any recombinant technology 
known in the art and the activity of the protein can be determined. 

10 The relatedness of amino acid families may also be determined based on side chain 

interactions. Substituted amino acids may be fully conserved "strong" residues or fully 
conserved "weak" residues. The "strong" group of conserved amino acid residues may be any 
one of the following groups: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, FYW, 
wherein the single letter amino acid codes are grouped by those amino acids that may be 

15 substituted for each other. Likewise, the "weak" group of conserved residues may be any one 
of the following: CSA, ATV, SAG, STNK, STPA, SGND, SNDEQK, NDEQHK, NEQHRK, 
VLIM, HFY, wherein the letters within each group represent the single letter amino acid code. 

In one embodiment, a mutant NOVX protein can be assayed for (/) the ability to form 
protein:protein interactions with other NOVX proteins, other cell-surface proteins, or 

20 biologically-active portions thereof, (//) complex formation between a mutant NOVX protein 
and an NOVX ligand; or (Hi) the ability of a mutant NOVX protein to bind to an intracellular 
target protein or biologically-active portion thereof; (e.g. avidin proteins). 

In yet another embodiment, a mutant NOVX protein can be assayed for the ability to 
regulate a specific biological function (e.g., regulation of insulin release). 

25 Antisense Nucleic Acids 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules 
that are hybridizable to or complementary to the nucleic acid molecule comprising the 
nucleotide sequence of SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162, or 
fragments, analogs or derivatives thereof. An "antisense" nucleic acid comprises a nucleotide 
30 sequence that is complementary to a "sense" nucleic acid encoding a protein (e.g., 

complementary to the coding strand of a double-stranded cDNA molecule or complementary 
to an mRNA sequence). In specific aspects, antisense nucleic acid molecules are provided that 
comprise a sequence complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides 
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or an entire NOVX coding strand, or to only a portion thereof. Nucleic acid molecules 
encoding fragments, homologs, derivatives and analogs of an NOVX protein of SEQ ID 
NOS:2n, wherein n is an integer between 1 and 162, or antisense nucleic acids complementary 
to an NOVX nucleic acid sequence of SEQ ID NOS:2n-l, wherein n is an integer between 1 
5 and 162, are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence encoding an NOVX protein. The term 
"coding region" refers to the region of the nucleotide sequence comprising codons which are 
translated into amino acid residues. In another embodiment, the antisense nucleic acid 

1 0 molecule is antisense to a "noncoding region" of the coding strand of a nucleotide sequence 

encoding the NOVX protein. The term "noncoding region" refers to 5' and 3 1 sequences which 
flank the coding region that are not translated into amino acids (i.e., also referred to as 5' and 
3 f untranslated regions). 

Given the coding strand sequences encoding the NOVX protein disclosed herein, 

15 antisense nucleic acids of the invention can be designed according to the rules of Watson and 
Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be complementary 
to the entire coding region of NOVX mRNA, but more preferably is an oligonucleotide that is 
antisense to only a portion of the coding or noncoding region of NOVX mRNA. For example, 
the antisense oligonucleotide can be complementary to the region surrounding the translation 

20 start site of NOVX mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 
20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention 
can be constructed using chemical synthesis or enzymatic ligation reactions using procedures 
known in the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) 
can be chemically synthesized using naturally-occurring nucleotides or variously modified 

25 nucleotides designed to increase the biological stability of the molecules or to increase the 
physical stability of the duplex formed between the antisense and sense nucleic acids (e.g., 
phosphorothioate derivatives and acridine substituted nucleotides can be used). 

Examples of modified nucleotides that can be used to generate the antisense nucleic 
acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, 

30 xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 
inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
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beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 
urac il -5 -oxy acetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 
5 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 
antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the 
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 
described further in the following subsection). 

10 The antisense nucleic acid molecules of the invention are typically administered to a 

subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding an NOVX protein to thereby inhibit expression of the protein (e.g., by 
inhibiting transcription and/or translation). The hybridization can be by conventional 
nucleotide complementarity to form a stable duplex, or, for example, in the case of an 

15 antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in 
the major groove of the double helix. An example of a route of administration of antisense 
nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and then administered 
systemically. For example, for systemic administration, antisense molecules can be modified 

20 such that they specifically bind to receptors or antigens expressed on a selected cell surface 

(e.g., by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell 
surface receptors or antigens). The antisense nucleic acid molecules can also be delivered to 
cells using the vectors described herein. To achieve sufficient nucleic acid molecules, vector 
constructs in which the antisense nucleic acid molecule is placed under the control of a strong 

25 pol II or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
a-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual p-units/the 
strands run parallel to each other. See, e.g., Gaultier, et al., 1987. Nucl. Acids Res. 15: 

30 6625-6641 . The antisense nucleic acid molecule can also comprise a 

2'-o-methyIribonucleotide (See, e.g., Inoue, et ah 1987. Nucl. Acids Res. 15: 6131-6148) or a 
chimeric RNA-DNA analogue (See, e.g., Inoue, et al., 1987. FEBS Lett. 215: 327-330. 
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Ribozymes and PNA Moieties 

Nucleic acid modifications include, by way of non-limiting example, modified bases, 
and nucleic acids whose sugar phosphate backbones are modified or derivatized. These 
modifications are carried out at least in part to enhance the chemical stability of the modified 
5 nucleic acid, such that they may be used, for example, as antisense binding nucleic acids in 
therapeutic applications in a subject. 

In one embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of 
cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 

10 complementary region. Thus, ribozymes (e.g., hammerhead ribozymes as described in 

Haselhoff and Gerlach 1988. Nature 334: 585-591) can be used to catalytically cleave NOVX 
mRNA transcripts to thereby inhibit translation of NOVX mRNA. A ribozyme having 
specificity for an NOVX-encoding nucleic acid can be designed based upon the nucleotide 
sequence of an NOVX cDNA disclosed herein (/.£?., SEQ ID NOS:2n-l, wherein n is an 

15 integer between 1 and 162). For example, a derivative of a Tetrahymena L-19 IVS RNA can 
. be constructed in which the nucleotide sequence of the active site is complementary to the 
nucleotide sequence to be cleaved in an NOVX-encoding mRNA. See, e.g., U.S. Patent 
4,987,071 to Cech, et ah and U.S. Patent 5,1 16,742 to Cech, et al. NOVX mRNA can also be 
used to select a catalytic RNA having a specific ribonuclease activity from a pool of RNA 

20 molecules. See, e.g., Bartel et aL, (1993) Science 261:141 1-1418. 

Alternatively, NOVX gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the NOVX nucleic acid (e.g., the NOVX 
promoter and/or enhancers) to form triple helical structures that prevent transcription of the 
NOVX gene in target cells. See, e.g., Helene, 1991. Anticancer Drug Des. 6: 569-84; Helene, 

25 etal. 1992. Ann. N.Y. Acad. Sci. 660: 27-36; Maher, 1992. Bioassays 14: 807-15. 

In various embodiments, the NOVX nucleic acids can be modified at the base moiety, 
sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or solubility 
of the molecule. For example, the deoxyribose phosphate backbone of the nucleic acids can 
be modified to generate peptide nucleic acids. See, e.g., Hyrup, et al., 1996. Bioorg Med 

30 Chem 4: 5-23. As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic 

acid mimics (e.g., DNA mimics) in which the deoxyribose phosphate backbone is replaced by 
a pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using 
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standard solid phase peptide synthesis protocols as described in Hyrup, et al, 1996. supra; 
Perry-O'Keefe, et a!., 1996. Proc. Natl. Acad. Sci. USA 93: 14670-14675. 

PNAs of NOVX can be used in therapeutic and diagnostic applications. For example, 
PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene 
5 expression by, e.g., inducing transcription or translation arrest or inhibiting replication. PNAs 
of NOVX can also be used, for example, in the analysis of single base pair mutations in a gene 
(e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in combination 
with other enzymes, e.g., Si nucleases (See, Hyrup, et al., \996.supra); or as probes or primers 
for DNA sequence and hybridization (See, Hyrup, et al, 1996, supra; Perry-O'Keefe, et al., 
10 1996. supra). 

In another embodiment, PNAs of NOVX can be modified, e.g., to enhance their 
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 

0 

formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
delivery known in the art. For example, PNA-DNA chimeras of NOVX can be generated that 

1 5 may combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes (e.g., Cleavage signal- 1 protein H and DNA polymerases) to interact 
with the DNA portion while the PNA portion would provide high binding affinity and 
specificity. PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in 
terms of base stacking, number of bonds between the nucleobases, and orientation (see, Hyrup, 

20 et al., 1996. supra). The synthesis of PNA-DNA chimeras can be performed as described in 

Hyrup, et al, 1996. supra and Finn, et ah, 1996. Nucl Acids Res 24: 3357-3363. For example, 
a DNA chain can be synthesized on a solid support using standard phosphoramidite coupling 
chemistry, and modified nucleoside analogs, e.g., 

5 , _(4- me thoxytrityl)amino-5 , -deoxy-thymidine phosphoramidite, can be used between the 
25 PNA and the 5' end of DNA. See, e.g., Mag, et al, 1989. Nucl Acid Res 17: 5973-5988. PNA 
monomers are then coupled in a stepwise manner to produce a chimeric molecule with a 5* 
PNA segment and a 3' DNA segment. See, e.g., Finn, et al, 1996. supra. Alternatively, 
chimeric molecules can be synthesized with a 5 ? DNA segment and a 3 ? PNA segment. See, 
e.g., Petersen, et al, 1975. Bioorg. Med. Chem. Lett. 5: 1 1 19-1 1 124. 
30 In other embodiments, the oligonucleotide may include other appended groups such as 

peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across 
the cell membrane (see, e.g., Letsinger, et al., 1989. Proc. Natl. Acad. Sci. U.S.A. 86: 
6553-6556; Lemaitre, et al., 1987. Proc. Natl. Acad. Sci. 84: 648-652; PCT Publication No. 
WO88/09810) or the blood-brain barrier (see, e.g., PCT Publication No. WO 89/10134). In 
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addition, oligonucleotides can be modified with hybridization triggered cleavage agents (see, 
e.g., Krol, et a!., 1988. BioTechniques 6:958-976) or intercalating agents (see, e.g., Zon, 1988. 
Pharm. Res. 5: 539-549). To this end, the oligonucleotide may be conjugated to another 
molecule, e.g., a peptide, a hybridization triggered cross-linking agent, a transport agent, a 
5 hybridization-triggered cleavage agent, and the like. 

NOVX Polypeptides 

A polypeptide according to the invention includes a polypeptide including the amino 
acid sequence of NOVX polypeptides whose sequences are provided in SEQ ID NOS:2n, 
wherein n is an integer between 1 and 162. The invention also includes a mutant or variant 

10 protein any of whose residues may be changed from the corresponding residues shown in SEQ 
ID NOS:2n, wherein n is an integer between 1 and 162 while still encoding a protein that 
maintains its NOVX activities and physiological functions, or a functional fragment thereof. 

In general, an NOVX variant that preserves NOVX-like function includes any variant 
in which residues at a particular position in the sequence have been substituted by other amino 

15 acids, and further include the possibility of inserting an additional residue or residues between 
two residues of the parent protein as well as the possibility of deleting one or more residues 
from the parent sequence. Any amino acid substitution, insertion, or deletion is encompassed 
by the invention. In favorable circumstances, the substitution is a conservative substitution as 
defined above. 

20 One aspect of the invention pertains to isolated NOVX proteins, and biologically- 

active portions thereof, or derivatives, fragments, analogs or homologs thereof. Also provided 
are polypeptide fragments suitable for use as immunogens to raise anti-NOVX antibodies. In 
one embodiment, native NOVX proteins can be isolated from cells or tissue sources by an 
appropriate purification scheme using standard protein purification techniques. In another 

25 embodiment, NOVX proteins are produced by recombinant DNA techniques. Alternative to 
recombinant expression, an NOVX protein or polypeptide can be synthesized chemically 
using standard peptide synthesis techniques. 

An "isolated" or "purified" polypeptide or protein or biologically-active portion thereof 
is substantially free of cellular material or other contaminating proteins from the cell or tissue 

30 source from which the NOVX protein is derived, or substantially free from chemical 

precursors or other chemicals when chemically synthesized. The language "substantially free 
of cellular material" includes preparations of NOVX proteins in which the protein is separated 
from cellular components of the cells from which it is isolated or recombinantly-produced. In 
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one embodiment, the language "substantially free of cellular material" includes preparations of 
NOVX proteins having less than about 30% (by dry weight) of non-NOVX proteins (also 
referred to herein as a "contaminating protein"), more preferably less than about 20% of 
non-NOVX proteins, still more preferably less than about 10% of non-NOVX proteins, and 
5 most preferably less than about 5% of non-NOVX proteins. When the NOVX protein or 
biologically-active portion thereof is recombinantly-produced, it is also preferably 
substantially free of culture medium, i.e., culture medium represents less than about 20%, 
more preferably less than about 10%, and most preferably less than about 5% of the volume of 
the NOVX protein preparation. 

10 The language "substantially free of chemical precursors or other chemicals" includes 

preparations of NOVX proteins in which the protein is separated from chemical precursors or 
other chemicals that are involved in the synthesis of the protein. In one embodiment, the 
language "substantially free of chemical precursors or other chemicals" includes preparations 
of NOVX proteins having less than about 30% (by dry weight) of chemical precursors or 

15 non-NOVX chemicals, more preferably less than about 20% chemical precursors or 

non-NOVX chemicals, still more preferably less than about 10% chemical precursors or 
non-NOVX chemicals, and most preferably less than about 5% chemical precursors or 
non-NOVX chemicals. 

Biologically-active portions of NOVX proteins include peptides comprising amino 

20 acid sequences sufficiently homologous to or derived from the amino acid sequences of the 
NOVX proteins {e.g., the amino acid sequence shown in SEQ ID NOS:2n, wherein n is an 
integer between 1 and 162) that include fewer amino acids than the full-length NOVX 
proteins, and exhibit at least one activity of an NOVX protein. Typically, biologically-active 
portions comprise a domain or motif with at least one activity of the NOVX protein. A 

25 biologically-active portion of an NOVX protein can be a polypeptide which is, for example, 
10, 25, 50, 100 or more amino acid residues in length. 

Moreover, other biologically-active portions, in which other regions of the protein are 
deleted, can be prepared by recombinant techniques and evaluated for one or more of the 
functional activities of a native NOVX protein. 

30 In an embodiment, the NOVX protein has an amino acid sequence shown SEQ ID 

NOS:2n, wherein n is an integer between 1 and 162. In other embodiments, the NOVX 
protein is substantially homologous to SEQ ID NOS:2n, wherein n is an integer between 1 and 
162, and retains the functional activity of the protein of SEQ ID NOS:2n, wherein n is an 
integer between 1 and 162, yet differs in amino acid sequence due to natural allelic variation 
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or mutagenesis, as described in detail, below. Accordingly, in another embodiment, the 
NOVX protein is a protein that comprises an amino acid sequence at least about 45% 
homologous to the amino acid sequence SEQ ID NOS:2n, wherein n is an integer between 1 
and 162, and retains the functional activity of the NOVX proteins of SEQ ID NOS:2n, wherein 
5 n is an integer between 1 and 162. 

Determining Homology Between Two or More Sequences 

To determine the percent homology of two amino acid sequences or of two nucleic 
acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced 
10 in the sequence of a first amino acid or nucleic acid sequence for optimal alignment with a 
second amino or nucleic acid sequence). The amino acid residues or nucleotides at 
corresponding amino acid positions or nucleotide positions are then compared. When a 
position in the first sequence is occupied by the same amino acid residue or nucleotide as the 
corresponding position in the second sequence, then the molecules are homologous at that 

15 position (i.e., as used herein amino acid or nucleic acid "homology" is equivalent to amino 
acid or nucleic acid "identity"). 

The nucleic acid sequence homology may be determined as the degree of identity 
between two sequences. The homology may be determined using computer programs known 
in the art, such as GAP software provided in the GCG program package. See, Needleman and 

20 Wunsch, 1970. J Mol Biol 48: 443-453. Using GCG GAP software with the following settings 
for nucleic acid sequence comparison: GAP creation penalty of 5.0 and GAP extension 
penalty of 0.3, the coding region of the analogous nucleic acid sequences referred to above 
exhibits a degree of identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 
99%o, with the CDS (encoding) part of the DNA sequence shown in SEQ ID NOS:2n-l, 

25 wherein n is an integer between 1 and 162. 

The term "sequence identity" refers to the degree to which two polynucleotide or 
polypeptide sequences are identical on a residue-by-residue basis over a particular region of 
comparison. The term "percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over that region of comparison, determining the number of 

30 positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case of 

nucleic acids) occurs in both sequences to yield the number of matched positions, dividing the 
number of matched positions by the total number of positions in the region of comparison (i.e., 
the window size), and multiplying the result by 100 to yield the percentage of sequence 
identity. The term "substantial identity" as used herein denotes a characteristic of a 
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polynucleotide sequence, wherein the polynucleotide comprises a sequence that has at least 80 
percent sequence identity, preferably at least 85 percent identity and often 90 to 95 percent 
sequence identity, more usually at least 99 percent sequence identity as compared to a 
reference sequence over a comparison region. 

5 

Chimeric and Fusion Proteins 

The invention also provides NOVX chimeric or fusion proteins. As used herein, an 
NOVX "chimeric protein" or "fusion protein" comprises an NOVX polypeptide operatively- 
linked to a non-NOVX polypeptide. An "NOVX polypeptide" refers to a polypeptide having 

10 an amino acid sequence corresponding to an NOVX protein SEQ ID NOS:2n, wherein n is an 
integer between 1 and 162, whereas a "non-NOVX polypeptide" refers to a polypeptide having 
an amino acid sequence corresponding to a protein that is not substantially homologous to the 
NOVX protein, e.g., a protein that is different from the NOVX protein and that is derived from 
the same or a different organism. Within an NOVX fusion protein the NOVX polypeptide can 

1 5 correspond to all or a portion of an NOVX protein. In one embodiment, an NOVX fusion 
protein comprises at least one biologically-active portion of an NOVX protein. In another 
embodiment, an NOVX fusion protein comprises at least two biologically-active portions of 
an NOVX protein. In yet another embodiment, an NOVX fusion protein comprises at least 
three biologically-active portions of an NOVX protein. Within the fusion protein, the term 

20 "operatively-linked" is intended to indicate that the NOVX polypeptide and the non-NOVX 

polypeptide are fused in-frame with one another. The non-NOVX polypeptide can be fused to 
the N-terminus or C-terminus of the NOVX polypeptide. 

In one embodiment, the fusion protein is a GST-NOVX fusion protein in which the 
NOVX sequences are fused to the C-terminus of the GST (glutathione S-transferase) 

25 sequences. Such fusion proteins can facilitate the purification of recombinant NOVX 
polypeptides. 

In another embodiment, the fusion protein is an NOVX protein containing a 
heterologous signal sequence at its N-terminus. In certain host cells (e.g., mammalian host 
cells), expression and/or secretion of NOVX can be increased through use of a heterologous 
30 signal sequence. 

In yet another embodiment, the fusion protein is an NOVX-immunoglobulin fusion 
protein in which the NOVX sequences are fused to sequences derived from a member of the 
immunoglobulin protein family. The NOVX-immunoglobulin fusion proteins of the invention 
can be incorporated into pharmaceutical compositions and administered to a subject to inhibit 
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an interaction between an NOVX ligand and an NOVX protein on the surface of a cell, to 
thereby suppress NOVX-mediated signal transduction in vivo. The NOVX-immunoglobulin 
fusion proteins can be used to affect the bioavailability of an NOVX cognate ligand. 
Inhibition of the NOVX ligand/NOVX interaction may be useful therapeutically for both the 
treatment of proliferative and differentiative disorders, as well as modulating (e.g. promoting 
or inhibiting) cell survival. Moreover, the NOVX-immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce anti-NOVX antibodies in a subject, to purify 
NOVX ligands, and in screening assays to identify molecules that inhibit the interaction of 
NOVX with an NOVX ligand. 

An NOVX chimeric or fusion protein of the invention can be produced by standard 
recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 
techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, restriction 
enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, 
alkaline phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In 
another embodiment, the fusion gene can be synthesized by conventional techniques including 
automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can be 
carried out using anchor primers that give rise to complementary overhangs between two 
consecutive gene fragments that can subsequently be annealed and reamplified to generate a 
chimeric gene sequence (see, e.g., Ausubel, et al. (eds.) Current Protocols in Molecular 
Biology, John Wiley & Sons, 1992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety {e.g., a GST polypeptide). An NOVX-encoding 
nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked 
in-frame to the NOVX protein. 

NOVX Agonists and Antagonists 

The invention also pertains to variants of the NOVX proteins that function as either 
NOVX agonists (i.e., mimetics) or as NOVX antagonists. Variants of the NOVX protein can 
be generated by mutagenesis (e.g., discrete point mutation or truncation of the NOVX protein). 
An agonist of the NOVX protein can retain substantially the same, or a subset of, the 
biological activities of the naturally occurring form of the NOVX protein. An antagonist of 
the NOVX protein can inhibit one or more of the activities of the naturally occurring form of 
the NOVX protein by, for example, competitively binding to a downstream or upstream 
member of a cellular signaling cascade which includes the NOVX protein. Thus, specific 
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biological effects can be elicited by treatment with a variant of limited function. In one 
embodiment, treatment of a subject with a variant having a subset of the biological activities 
of the naturally occurring form of the protein has fewer side effects in a subject relative to 
treatment with the naturally occurring form of the NOVX proteins. 
5 Variants of the NOVX proteins that function as either NOVX agonists (i.e., mimetics) 

or as NOVX antagonists can be identified by screening combinatorial libraries of mutants 
(e.g., truncation mutants) of the NOVX proteins for NOVX protein agonist or antagonist 
activity. In one embodiment, a variegated library of NOVX variants is generated by 
combinatorial mutagenesis at the nucleic acid level and is encoded by a variegated gene 

10 library. A variegated library of NOVX variants can be produced by, for example, 

enzymatically ligating a mixture of synthetic oligonucleotides into gene sequences such that a 
degenerate set of potential NOVX sequences is expressible as individual polypeptides, or 
alternatively, as a set of larger fusion proteins (e.g., for phage display) containing the set of 
NOVX sequences therein. There are a variety of methods which can be used to produce 

15 libraries of potential NOVX variants from a degenerate oligonucleotide sequence. Chemical 
synthesis of a degenerate gene sequence can be performed in an automatic DNA synthesizer, 
and the synthetic gene then ligated into an appropriate expression vector. Use of a degenerate 
set of genes allows for the provision, in one mixture, of all of the sequences encoding the 
desired set of potential NOVX sequences. Methods for synthesizing degenerate 

20 oligonucleotides are well-known within the art. See, e.g., Narang, 1983. Tetrahedron 39: 3; 
Itakura, et al., 1984. Annu. Rev. Biochem. 53: 323; Itakura, et ah, 1984. Science 198: 1056; 
Ike, et al, 1983. Nucl. Acids Res. 1 1 : 477. 

Polypeptide Libraries 

25 In addition, libraries of fragments of the NOVX protein coding sequences can be used 

to generate a variegated population of NOVX fragments for screening and subsequent 
selection of variants of an NOVX protein. In one embodiment, a library of coding sequence 
fragments can be generated by treating a double stranded PCR fragment of an NOVX coding 
sequence with a nuclease under conditions wherein nicking occurs only about once per 

30 molecule, denaturing the double stranded DNA, renaturing the DNA to form double-stranded 
DNA that can include sense/antisense pairs from different nicked products, removing single 
stranded portions from reformed duplexes by treatment with Si nuclease, and ligating the 
resulting fragment library into an expression vector. By this method, expression libraries can 
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be derived which encodes N-terminal and internal fragments of various sizes of the NOVX 
proteins. 

Various techniques are known in the art for screening gene products of combinatorial 
libraries made by point mutations or truncation, and for screening cDNA libraries for gene 
5 products having a selected property. Such techniques are adaptable for rapid screening of the 
gene libraries generated by the combinatorial mutagenesis of NOVX proteins. The most 
widely used techniques, which are amenable to high throughput analysis, for screening large 
gene libraries typically include cloning the gene library into replicable expression vectors, 
transforming appropriate cells with the resulting library of vectors, and expressing the 

10 combinatorial genes under conditions in which detection of a desired activity facilitates 

isolation of the vector encoding the gene whose product was detected. Recursive ensemble 
mutagenesis (REM), a new technique that enhances the frequency of functional mutants in the 
libraries, can be used in combination with the screening assays to identify NOVX variants. 
See, e.g., Arkin and Yourvan, 1992. Proc. Natl. Acad. Set USA 89: 781 1-7815; Delgrave, et 

1 5 a/., 1 993. Protein Engineering 6:327-33 1 . 

NOVX Antibodies 

The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that 
contain an antigen binding site that specifically binds (immunoreacts with) an antigen. Such 

20 antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, F ab , 
F ab > and F (ab ')2 fragments, and an F ab expression library. In general, antibody molecules 
obtained from humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ 
from one another by the nature of the heavy chain present in the molecule. Certain classes 
have subclasses as well, such as IgGi, IgG 2 , and others. Furthermore, in humans, the light 

25 chain may be a kappa chain or a lambda chain. Reference herein to antibodies includes a 
reference to all such classes, subclasses and types of human antibody species. 

An isolated protein of the invention intended to serve as an antigen, or a portion or 
fragment thereof, can be used as an immunogen to generate antibodies that 
immunospecifically bind the antigen, using standard techniques for polyclonal and monoclonal 
30 antibody preparation. The full-length protein can be used or, alternatively, the invention 
provides antigenic peptide fragments of the antigen for use as immunogens. An antigenic 
peptide fragment comprises at least 6 amino acid residues of the am ino acid sequence of the 
full length protein, such as an amino acid sequence shown in SEQ ID NOs: 2n, wherein n is an 
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integer between 1 and 162, and encompasses an epitope thereof such that an antibody raised 
against the peptide forms a specific immune complex with the full length protein or with any 
fragment that contains the epitope. Preferably, the antigenic peptide comprises at least 10 
amino acid residues, or at least 15 amino acid residues, or at least 20 amino acid residues, or at 
5 least 30 amino acid residues. Preferred epitopes encompassed by the antigenic peptide are 
regions of the protein that are located on its surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of NOVX that is located on the surface of the protein, e.g., a 
hydrophilic region. A hydrophobic ity analysis of the human NOVX protein sequence will 

10 indicate which regions of a NOVX polypeptide are particularly hydrophilic and, therefore, are 
likely to encode surface residues useful for targeting antibody production. As a means for 
targeting antibody production, hydropathy plots showing regions of hydrophilic ity and 
hydrophobicity may be generated by any method well known in the art, including, for 
example, the Kyte Doolittle or the Hopp Woods methods, either with or without Fourier 

15 transformation. See, e.g., Hopp and Woods, 1981, Proc. Nat. Acad. Sci. USA 78: 3824-3828; 
Kyte and Doolittle 1982, J. Mol. Biol. 157: 105-142, each incorporated herein by reference in 
their entirety. Antibodies that are specific for one or more domains within an antigenic protein, or 
derivatives, fragments, analogs or homologs thereof, are also provided herein. 

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 

20 thereof, may be utilized as an immunogen in the generation of antibodies that 
immunospecifically bind these protein components. 

Various procedures known within the art may be used for the production of polyclonal 
or monoclonal antibodies directed against a protein of the invention, or against derivatives, 
fragments, analogs homologs or orthologs thereof (see, for example, Antibodies: A Laboratory 

25 Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, Cold Spring 

Harbor, NY, incorporated herein by reference). Some of these antibodies are discussed below. 
Polyclonal Antibodies 

For the production of polyclonal antibodies, various suitable host animals (e.g., rabbit, 
goat, mouse or other mammal) may be immunized by one or more injections with the native 
30 protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 

immunogenic preparation can contain, for example, the naturally occurring immunogenic 
protein, a chemically synthesized polypeptide representing the immunogenic protein, or a 
recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated 
to a second protein known to be immunogenic in the mammal being immunized. Examples of 
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such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum 
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can further 
include an adjuvant. Various adjuvants used to increase the immunological response include, 
but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., aluminum 
5 hydroxide), surface active substances (e.g., lysolecithin, pluronic polyols, polyanions, 
peptides, oil emulsions, dinitrophenol, etc.), adjuvants usable in humans such as Bacille 
Calmette-Guerin and Corynebacterium parvum, or similar immunostimulatory agents. 
Additional examples of adjuvants which can be employed include MPL-TDM adjuvant 
(monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 

10 The polyclonal antibody molecules directed against the immunogenic protein can be 

isolated from the mammal (e.g., from the blood) and further purified by well known 
techniques, such as affinity chromatography using protein A or protein G, which provide 
primarily the IgG fraction of immune serum. Subsequently, or alternatively, the specific 
antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be 

1 5 immobilized on a column to purify the immune specific antibody by immunoaffinity 

chromatography. Purification of immunoglobulins is discussed, for example, by D. Wilkinson 
(The Scientist, published by The Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 
2000), pp. 25-28). 

20 Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as 
used herein, refers to a population of antibody molecules that contain only one molecular 
species of antibody molecule consisting of a unique light chain gene product and a unique 
heavy chain gene product. In particular, the complementarity determining regions (CDRs) of 
25 the monoclonal antibody are identical in all the molecules of the population. MAbs thus 
contain an antigen binding site capable of immunoreacting with a particular epitope of the 
antigen characterized by a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature , 256:495 (1975). In a hybridoma method, a mouse, 
30 hamster, or other appropriate host animal, is typically immunized with an immunizing agent to 
elicit lymphocytes that produce or are capable of producing antibodies that will specifically 
bind to the immunizing agent. Alternatively, the lymphocytes can be immunized in vitro. 
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The immunizing agent will typically include the protein antigen, a fragment thereof or 
a fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of 
human origin are desired, or spleen cells or lymph node cells are used if non-human 
mammalian sources are desired. The lymphocytes are then fused with an immortalized cell 
5 line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell 
[Goding, Monoclonal Antibodies: Principles and Practice , Academic Press, (1986) pp. 59- 
103]. Immortalized cell lines are usually transformed mammalian cells, particularly myeloma 
cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are 
employed. The hybridoma cells can be cultured in a suitable culture medium that preferably 

10 contains one or more substances that inhibit the growth or survival of the unfused, 

immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine 
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas 
typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which 
substances prevent the growth of HGPRT-deficient cells. 

15 Preferred immortalized cell lines are those that fuse efficiently, support stable high 

level expression of antibody by the selected antibody-producing cells, and are sensitive to a 
medium such as HAT medium. More preferred immortalized cell lines are murine myeloma 
lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, San 
Diego, California and the American Type Culture Collection, Manassas, Virginia. Human 

20 myeloma and mouse-human heteromyeloma cell lines also have been described for the 

production of human monoclonal antibodies [Kozbor, J. Immunol., 133 :3001 (1984); Brodeur 
et al., Monoclonal Antibody Production Techniques and Applications , Marcel Dekker, Inc., 
New York, (1987) pp. 51-63]. 

The culture medium in which the hybridoma cells are cultured can then be assayed for 

25 the presence of monoclonal antibodies directed against the antigen. Preferably, the binding 
specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 
enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in 
the art. The binding affinity of the monoclonal antibody can, for example, be determined by 

30 the Scatchard analysis of Munson and Pollard, Anal. Biochem. , 107:220 (1980). It is an 
objective, especially important in therapeutic applications of monoclonal antibodies, to 
identify antibodies having a high degree of specificity and a high binding affinity for the target 
antigen. 
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After the desired hybridoma cells are identified, the clones can be subcloned by 
limiting dilution procedures and grown by standard methods (Goding,1986). Suitable culture 
media for this purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMT- 
1640 medium. Alternatively, the hybridoma cells can be grown in vivo as ascites in a 
mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from 
the culture medium or ascites fluid by conventional immunoglobulin purification procedures 
such as, for example, protein A-Sepharose, hydroxy lapatite chromatography, gel 
electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such as 
those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of 
the invention can be readily isolated and sequenced using conventional procedures (e.g., by 
using oligonucleotide probes that are capable of binding specifically to genes encoding the 
heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as a 
preferred source of such DNA. Once isolated, the DNA can be placed into expression vectors, 
which are then transfected into host cells such as simian COS cells, Chinese hamster ovary 
(CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to 
obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA also 
can be modified, for example, by substituting the coding sequence for human heavy and light 
chain constant domains in place of the homologous murine sequences (U.S. Patent No. 
4,816,567; Morrison, Nature 368 , 812-13 (1994)) or by covalently joining to the 
immunoglobulin coding sequence all or part of the coding sequence for a non-immunoglobulin 
polypeptide. Such a non-immunoglobulin polypeptide can be substituted for the constant 
domains of an antibody of the invention, or can be substituted for the variable domains of one 
antigen-combining site of an antibody of the invention to create a chimeric bivalent antibody. 

Humanized Antibodies 

The antibodies directed against the protein antigens of the invention can further 
comprise humanized antibodies or human antibodies. These antibodies are suitable for 
administration to humans without engendering an immune response by the human against the 
administered immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab') 2 or other antigen- 
binding subsequences of antibodies) that are principally comprised of the sequence of a human 
immunoglobulin, and contain minimal sequence derived from a non-human immunoglobulin. 
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Humanization can be performed following the method of Winter and co-workers (Jones et al., 
Nature , 321:522-525 (1986); Riechmann et a!., Nature , 332:323-327 (1988); Verhoeyen et al., 
Science , 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. (See also U.S. Patent No. 5,225,539.) In some 
5 instances, Fv framework residues of the human immunoglobulin are replaced by 

corresponding non-human residues. Humanized antibodies can also comprise residues which 
are found neither in the recipient antibody nor in the imported CDR or framework sequences. 
In general, the humanized antibody will comprise substantially all of at least one, and typically 
two, variable domains, in which all or substantially all of the CDR regions correspond to those 
10 of a non-human immunoglobulin and all or substantially all of the framework regions are 

those of a human immunoglobulin consensus sequence. The humanized antibody optimally 
also will comprise at least a portion of an immunoglobulin constant region (Fc), typically that 
of a human immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. 
Struct. Biol., 2:593-596 (1992)). 

15 

Human Antibodies 

Fully human antibodies essentially relate to antibody molecules in which the entire 
sequence of both the light chain and the heavy chain, including the CDRs, arise from human 
genes. Such antibodies are termed "human antibodies", or "fully human antibodies" herein. 

20 Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell 

hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma 
technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: MONOCLONAL 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human monoclonal 
antibodies may be utilized in the practice of the present invention and may be produced by 

25 using human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80: 2026-2030) or 
by transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et al., 1985 In: 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, 
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol. , 227 :381 (1991); 
30 Marks et al., J. Mol. Biol. , 222:581 (1991)). Similarly, human antibodies can be made by 
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 
humans in all respects, including gene rearrangement, assembly, and antibody repertoire. This 
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approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 
5,625,126; 5,633,425; 5,661,016, and in Marks etal. ( Bio/Technology 10, 779-783 (1992)); 
Lonberg et al. ( Nature 368 856-859 (1994)); Morrison ( Nature 368, 812-13 (1994)); Fishwild 
et al,( Nature Biotechnology 14, 845-51 (1996)); Neuberger ( Nature Biotechnology 14, 826 
5 (1996)); and Lonberg and Huszar ( Intern. Rev. Immunol. 13 65-93 (1995)). 

Human antibodies may additionally be produced using transgenic nonhuman animals 
which are modified so as to produce fully human antibodies rather than the animal's 
endogenous antibodies in response to challenge by an antigen. (See PCT publication 
WO94/02602). The endogenous genes encoding the^heavy and light immunoglobulin chains in 

10 the nonhuman host have been incapacitated, and active loci encoding human heavy and light 
chain immunoglobulins are inserted into the host's genome. The human genes are 
incorporated, for example, using yeast artificial chromosomes containing the requisite human 
DNA segments. An animal which provides all the desired modifications is then obtained as 
progeny by crossbreeding intermediate transgenic animals containing fewer than the full 

15 complement of the modifications. The preferred embodiment of such a nonhuman animal is a 
mouse, and is termed the Xenomouse™ as disclosed in PCT publications WO 96/33735 and 
WO 96/34096. This animal produces B cells which secrete fully human immunoglobulins. 
The antibodies can be obtained directly from the animal after immunization with an 
immunogen of interest, as, for example, a preparation of a polyclonal antibody, or alternatively 

20 from immortalized B cells derived from the animal, such as hybridomas producing 

monoclonal antibodies. Additionally, the genes encoding the immunoglobulins with human 
variable regions can be recovered and expressed to obtain the antibodies directly, or can be 
further modified to obtain analogs of antibodies such as, for example, single chain Fv 
molecules. 

25 An example of a method of producing a nonhuman host, exemplified as a mouse, 

lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent 
No. 5,939,598. It can be obtained by a method including deleting the J segment genes from at 
least one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of 
the locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy chain 

30 locus, the deletion being effected by a targeting vector containing a gene encoding a selectable 
marker; and producing from the embryonic stem cell a transgenic mouse whose somatic and 
germ cells contain the gene encoding the selectable marker. 

A method for producing an antibody of interest, such as a human antibody, is disclosed 
in U.S. Patent No. 5,916,771. It includes introducing an expression vector that contains a 
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nucleotide sequence encoding a heavy chain into one mammalian host cell in culture, 
introducing an expression vector containing a nucleotide sequence encoding a light chain into 
another mammalian host cell, and fusing the two cells to form a hybrid cell. The hybrid cell 
expresses an antibody containing the heavy chain and the light chain. 
5 In a further improvement on this procedure, a method for identifying a clinically 

relevant epitope on an immunogen, and a correlative method for selecting an antibody that 
binds immunospecifically to the relevant epitope with high affinity, are disclosed in PCT 
publication WO 99/53049. 

10 F a b Fragments and Single Chain Antibodies 

According to the invention, techniques can be adapted for the production of 
single-chain antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent 
No. 4,946,778). In addition, methods can be adapted for the construction of F ab expression 
libraries (see e.g., Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and effective 

15 identification of monoclonal F ab fragments with the desired specificity for a protein or 

derivatives, fragments, analogs or homologs thereof. Antibody fragments that contain the 
idiotypes to a protein antigen may be produced by techniques known in the art including, but 
not limited to: (i) an F (ab ' )2 fragment produced by pepsin digestion of an antibody molecule; (ii) 
an F ab fragment generated by reducing the disulfide bridges of an F (ab ') 2 fragment; (iii) an F ab 

20 fragment generated by the treatment of the antibody molecule with papain and a reducing 
agent and (iv) F v fragments. 

Bispecific Antibodies 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that 
25 have binding specificities for at least two different antigens. In the present case, one of the 

binding specificities is for an antigenic protein of the invention. The second binding target is 
any other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 
30 immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and Cuello, Nature, 305 :537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce 
a potential mixture of ten different antibody molecules, of which only one has the correct 
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bispecific structure. The purification of the correct molecule is usually accomplished by 
affinity chromatography steps. Similar procedures are disclosed in WO 93/08829, published 
13 May 1993, and in Traunecker et ah, EMBO J. , 10:3655-3659 (1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen 
5 combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part 
of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant 
region (CHI) containing the site necessary for light-chain binding present in at least one of the 
fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the 

10 immunoglobulin light chain, are inserted into separate expression vectors, and are co- 
transfected into a suitable host organism. For further details of generating bispecific 
antibodies see, for example, Suresh et al., Methods in Enzymology , 121 :210 (1986). 

According to another approach described in WO 96/2701 1, the interface between a pair 
of antibody molecules can be engineered to maximize the percentage of heterodimers which 

15 are recovered from recombinant cell culture. The preferred interface comprises at least a part 
of the CH3 region of an antibody constant domain. In this method, one or more small amino 
acid side chains from the interface of the first antibody molecule are replaced with larger side 
chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the 
large side chain(s) are created on the interface of the second antibody molecule by replacing 

20 large amino acid side chains with smaller ones (e.g. alanine or threonine). This provides a 

mechanism for increasing the yield of the heterodimer over other unwanted end-products such 
as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments 
(e.g. F(ab')2 bispecific antibodies). Techniques for generating bispecific antibodies from 

25 antibody fragments have been described in the literature. For example, bispecific antibodies 
can be prepared using chemical linkage. Brennan et al., Science 229:81 (1985) describe a 
procedure wherein intact antibodies are proteolytically cleaved to generate F(ab')2 fragments. 
These fragments are reduced in the presence of the dithiol complexing agent sodium arsenite 
to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' 

30 fragments generated are then converted to thionitrobenzoate (TNB) derivatives. One of the 
Fab'-TNB derivatives is then reconverted to the Fab' -thiol by reduction with 
mercaptoethylamine and is mixed with an equimolar amount of the other Fab'-TNB derivative 
to form the bispecific antibody. The bispecific antibodies produced can be used as agents for 
the selective immobilization of enzymes. 
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Additionally, Fab' fragments can be directly recovered from E. coli and chemically 
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) 
describe the production of a fully humanized bispecific antibody F(ab')2 molecule. Each Fab' 
fragment was separately secreted from E. coli and subjected to directed chemical coupling in 
5 vitro to form the bispecific antibody. The bispecific antibody thus formed was able to bind to 
cells overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic 
activity of human cytotoxic lymphocytes against human breast tumor targets. 

Various techniques for making and isolating bispecific antibody fragments directly 
from recombinant cell culture have also been described. For example, bispecific antibodies 
10 have been produced using leucine zippers. Kostelny et al., J. Immunol. 148(5):1547-1553 
(1992). The leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' 
portions of two different antibodies by gene fusion. The antibody homodimers were reduced 

at the hinge region to form monomers and then re-oxidized to form the antibody heterodimers. 

•> 

This method can also be utilized for the production of antibody homodimers. The "diabody" 

1 5 technology described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has 
provided an alternative mechanism for making bispecific antibody fragments. The fragments 
comprise a heavy-chain variable domain (V H ) connected to a light-chain variable domain (V L ) 
by a linker which is too short to allow pairing between the two domains on the same chain. 
Accordingly, the V H and V L domains of one fragment are forced to pair with the 

20 complementary V L and V H domains of another fragment, thereby forming two antigen-binding 
sites. Another strategy for making bispecific antibody fragments by the use of single-chain Fv 
(sFv) dimers has also been reported. See, Gruber et al., J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific 
antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1991). 

25 Exemplary bispecific antibodies can bind to two different epitopes, at least one of 

which originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm 
of an immunoglobulin molecule can be combined with an arm which binds to a triggering 
molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or 
Fc receptors for IgG (FcyR), such as FcyRI (CD64), FcyRII (CD32) and FcyRIII (CD 16) so as 

30 to focus cellular defense mechanisms to the cell expressing the particular antigen. Bispecific 
antibodies can also be used to direct cytotoxic agents to cells which express a particular 
antigen. These antibodies possess an antigen-binding arm and an arm which binds a cytotoxic 
agent or a radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another 



821 




bispecific antibody of interest binds the protein antigen described herein and further binds 
tissue factor (TF). 

Heterocon jugate Antibodies 

5 Heteroconjugate antibodies are also within the scope of the present invention. 

Heteroconjugate antibodies are composed of two covalently joined antibodies. Such 
antibodies have, for example, been proposed to target immune system cells to unwanted cells 
(U.S. Patent No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 
92/200373; EP 03089). It is contemplated that the antibodies can be prepared in vitro using 
10 known methods in synthetic protein chemistry, including those involving crosslinking agents. 
For example, immunotoxins can be constructed using a disulfide exchange reaction or by 
forming a thioether bond. Examples of suitable reagents for this purpose include iminothiolate 
and methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 
4,676,980. 

15 

Effector Function Engineering 

It can be desirable to modify the antibody of the invention with respect to effector 
function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For 
example, cysteine residue(s) can be introduced into the Fc region, thereby allowing interchain 

20 disulfide bond formation in this region. The homodimeric antibody thus generated can have 
improved internalization capability and/or increased complement-mediated cell killing and 
antibody-dependent cellular cytotoxicity (ADCC). See Caron et al., J. Exp Med ., 176 : 1191- 
1 195 (1992) and Shopes, J. Immunol ., 148 : 2918-2922 (1992). Homodimeric antibodies with 
enhanced anti-tumor activity can also be prepared using heterobifunctional cross-linkers as 

25 described in Wolff et al. Cancer Research , 53: 2560-2565 (1993). Alternatively, an antibody 
can be engineered that has dual Fc regions and can thereby have enhanced complement lysis 
and ADCC capabilities. See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989). 

Immunoconjugates 

30 The invention also pertains to immunoconjugates comprising an antibody conjugated 

to a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active 

toxin of bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive 

isotope (i.e., a radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have 

35 been described above. Enzymatically active toxins and fragments thereof that can be used 
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include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain 
(from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, 
Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPII, and 
PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, 
5 gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of 
radionuclides are available for the production of radioconjugated antibodies. Examples 



Conjugates of the antibody and cytotoxic agent are made using a variety of 
bifunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate 

10 (SPDP), iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl 
adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as 
glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis- 
diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates 
(such as tolyene 2,6-diisocyanate), and bis-active fluorine compounds (such as 1,5-difIuoro- 

15 2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as described in 
Vitetta et al.. Science , 238 : 1098(1987). Carbon- 14-labe led l-isothiocyanatobenzyl-3- 
methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for 
conjugation of radionucleotide to the antibody. See W094/1 1026. 



20 streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is 
administered to the patient, followed by removal of unbound conjugate from the circulation 
using a clearing agent and then administration of a "ligand" (e.g., avidin) that is in turn 
conjugated to a cytotoxic agent. 

25 Immunoliposomes 

The antibodies disclosed herein can also be formulated as immunoliposomes. 
Liposomes containing the antibody are prepared by methods known in the art, such as 
described in Epstein et ah, Proc. Natl. Acad. Sci. USA . 82: 3688 (1985); Hwang et ah, Proc. 
Natl Acad. Sci. USA . 77: 4030 (1980); and U.S. Pat. Nos. 4,485,045 and 4,544,545. 
30 Liposomes with enhanced circulation time are disclosed in U.S. Patent No. 5,013,556. 

Particularly useful liposomes can be generated by the reverse-phase evaporation 
method with a lipid composition comprising phosphatidylcholine, cholesterol, and PEG- 
derivatized phosphatidylethanolamine (PEG-PE). Liposomes are extruded through filters of 
defined pore size to yield liposomes with the desired diameter. Fab' fragments of the antibody 



include 212 Bi, 131 I, 131 ln, 90 Y, and 186 Re. 



In another embodiment, the antibody can be conjugated to a "receptor" (such 
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of the present invention can be conjugated to the liposomes as described in Martin et al J. 
Biol. Cheiru 257 : 286-288 (1982) via a disulfide-interchange reaction. A chemotherapeutic 
agent (such as Doxorubicin) is optionally contained within the liposome. See Gabizon et al., J. 
National Cancer Inst., 81(19): 1484 (1989). 

5 

Diagnostic Applications of Antibodies Directed Against the Proteins of the Invention 

Antibodies directed against a protein of the invention may be used in methods known 
within the art relating to the localization and/or quantitation of the protein (e.g., for use in 
measuring levels of the protein within appropriate physiological samples, for use in diagnostic 
10 methods, for use in imaging the protein, and the like). In a given embodiment, antibodies 

against the proteins, or derivatives, fragments, analogs or homologs thereof, that contain the 
antigen binding domain, are utilized as pharmacologically-active compounds (see below). 

An antibody specific for a protein of the invention can be used to isolate the protein by 
standard techniques, such as immunoaffinity chromatography or immunoprecipitation. Such 

1 5 an antibody can facilitate the purification of the natural protein antigen from cells and of 

recombinantly produced antigen expressed in host cells. Moreover, such an antibody can be 
used to detect the antigenic protein (e.g., in a cellular lysate or cell supernatant) in order to 
evaluate the abundance and pattern of expression of the antigenic protein. Antibodies directed 
against the protein can be used diagnostically to monitor protein levels in tissue as part of a 

20 clinical testing procedure, e.g., to, for example, determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling (i.e., physically linking) the antibody to a 
detectable substance. Examples of detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, luminescent materials, bioluminescent materials, and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline 

25 phosphatase, P-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group 
complexes include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent 
materials include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a 
luminescent material includes luminol; examples of bioluminescent materials include 

30 luciferase, luciferin, and aequorin, and examples of suitable radioactive material include 125 I, 
131 I, 35 Sor 3 H. 

Antibody Therapeutics 



1 Ol.fi !?" iT.f 41 -S IP ». O i 1 Oil" 




824 



Antibodies of the invention, including polyclonal, monoclonal, humanized and fully 
human antibodies, may used as therapeutic agents. Such agents will generally be employed to 
treat or prevent a disease or pathology in a subject. An antibody preparation, preferably one 
having high specificity and high affinity for its target antigen, is administered to the subject 
and will generally have an effect due to its binding with the target. Such an effect may be one 
of two kinds, depending on the specific nature of the interaction between the given antibody 
molecule and the target antigen in question. In the first instance, administration of the 
antibody may abrogate or inhibit the binding of the target with an endogenous ligand to which 
it naturally binds. In this case, the antibody binds to the target and masks a binding site of the 
naturally occurring ligand, wherein the ligand serves as an effector molecule. Thus the 
receptor mediates a signal transduction pathway for which ligand is responsible. 

Alternatively, the effect may be one in which the antibody elicits a physiological result 
by virtue of binding to an effector binding site on the target molecule. In this case the target, a 
receptor having an endogenous ligand which may be absent or defective in the disease or 
pathology, binds the antibody as a surrogate effector ligand, initiating a receptor-based signal 
transduction event by the receptor. 

A therapeutically effective amount of an antibody of the invention relates generally to 
the amount needed to achieve a therapeutic objective. As noted above, this may be a binding 
interaction between the antibody and its target antigen that, in certain cases, interferes with the 
functioning of the target, and in other cases, promotes a physiological response. The amount 
required to be administered will furthermore depend on the binding affinity of the antibody for 
its specific antigen, and will also depend on the rate at which an administered antibody is 
depleted from the free volume other subject to which it is administered. Common ranges for 
therapeutically effective dosing of an antibody or antibody fragment of the invention may be, 
by way of nonlimiting example, from about 0.1 mg/kg body weight to about 50 mg/kg body 
weight. Common dosing frequencies may range, for example, from twice daily to once a 
week. 

Pharmaceutical Compositions of Antibodies 

Antibodies specifically binding a protein of the invention, as well as other molecules 
identified by the screening assays disclosed herein, can be administered for the treatment of 
various disorders in the form of pharmaceutical compositions. Principles and considerations 
involved in preparing such compositions, as well as guidance in the choice of components are 
provided, for example, in Remington : The Science And Practice Of Pharmacy 19th ed. 
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(Alfonso R. Gennaro, et al., editors) Mack Pub. Co., Easton, Pa. : 1995; Drug Absorption 
Enhancement : Concepts, Possibilities, Limitations, And Trends, Harwood Academic 
Publishers, Langhorne, Pa., 1994; and Peptide And Protein Drug Delivery (Advances In 
Parenteral Sciences, Vol. 4), 1991, M. Dekker, New York. 
5 If the antigenic protein is intracellular and whole antibodies are used as inhibitors, 

internalizing antibodies are preferred. However, liposomes can also be used to deliver the 
antibody, or an antibody fragment, into cells. Where antibody fragments are used, the smallest 
inhibitory fragment that specifically binds to the binding domain of the target protein is 
preferred. For example, based upon the variable-region sequences of an antibody, peptide 

10 molecules can be designed that retain the ability to bind the target protein sequence. Such 
peptides can be synthesized chemically and/or produced by recombinant DNA technology. 
See, e.g., Marasco et al., Proc. Natl. Acad. Sci. USA, 90: 7889-7893 (1993). The formulation 
herein can also contain more than one active compound as necessary for the particular 
indication being treated, preferably those with complementary activities that do not adversely 

15 affect each other. Alternatively, or in addition, the composition can comprise an agent that 
enhances its function, such as, for example, a cytotoxic agent, cytokine, chemotherapeutic 
agent, or growth-inhibitory agent. Such molecules are suitably present in combination in 
amounts that are effective for the purpose intended. 

The active ingredients can also be entrapped in microcapsules prepared, for example, 

20 by coacervation techniques or by interfacial polymerization, for example, 

hydroxymethylcellulose or gelatin-microcapsules and poly-(methylmethacrylate) 
microcapsules, respectively, in colloidal drug delivery systems (for example, liposomes, 
albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in 
macroemulsions. 

25 The formulations to be used for in vivo administration must be sterile. This is readily 

accomplished by filtration through sterile filtration membranes. 

Sustained-release preparations can be prepared. Suitable examples of sustained- 
release preparations include semipermeable matrices of solid hydrophobic polymers 
containing the antibody, which matrices are in the form of shaped articles, e.g., films, or 

30 microcapsules. Examples of sustained-release matrices include polyesters, hydrogels (for 
example, poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)), polylactides (U.S. Pat. 
No. 3,773,919), copolymers of L-glutamic acid and y ethyl-L-glutamate, non-degradable 
ethylene-vinyl acetate, degradable lactic acid-glycolic acid copolymers such as the LUPRON 
DEPOT ™ (injectable microspheres composed of lactic acid-glycolic acid copolymer and 
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leuprolide acetate), and poly~D-(-)-3-hydroxybutyric acid. While polymers such as ethylene- 
vinyl acetate and lactic acid-glycolic acid enable release of molecules for over 100 days, 
certain hydrogels release proteins for shorter time periods. 

5 ELISA Assay 

An agent for detecting an analyte protein is an antibody capable of binding to an 
analyte protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, 
or more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., F a b or F( a b>2) 
can be used. The term "labeled", with regard to the probe or antibody, is intended to 

10 encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) a 
detectable substance to the probe or antibody, as well as indirect labeling of the probe or 
antibody by reactivity with another reagent that is directly labeled. Examples of indirect 
labeling include detection of a primary antibody using a fluorescently-labeled secondary 
antibody and end-labeling of a DNA probe with biotin such that it can be detected with 

15 fluorescently-labeled streptavidin. The term "biological sample" is intended to include tissues, 
cells and biological fluids isolated from a subject, as well as tissues, cells and fluids present 
within a subject. Included within the usage of the term "biological sample", therefore, is 
blood and a fraction or component of blood including blood serum, blood plasma, or lymph. 
That is, the detection method of the invention can be used to detect an analyte mRNA, protein, 

20 or genomic DNA in a biological sample in vitro as well as in vivo. For example, in vitro 
techniques for detection of an analyte mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of an analyte protein include enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations, and 
immunofluorescence. In vitro techniques for detection of an analyte genomic DNA include 

25 Southern hybridizations. Procedures for conducting immunoassays are described, for example 
in "ELISA: Theory and Practice: Methods in Molecular Biology", Vol. 42, J. R. Crowther 
(Ed.) Human Press, Totowa, NJ, 1995; "Immunoassay", E. Diamandis and T. Christopoulus, 
Academic Press, Inc., San Diego, CA, 1996; and "Practice and Thory of Enzyme 
Immunoassays", P. Tijssen, Elsevier Science Publishers, Amsterdam, 1985. Furthermore, in 

30 vivo techniques for detection of an analyte protein include introducing into a subject a labeled 
anti-an analyte protein antibody. For example, the antibody can be labeled with a radioactive 
marker whose presence and location in a subject can be detected by standard imaging 
techniques. 
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NOVX Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding an NOVX protein, or derivatives, fragments, analogs or 
homologs thereof. As used herein, the term "vector" refers to a nucleic acid molecule capable 
5 of transporting another nucleic acid to which it has been linked. One type of vector is a 

"plasmid", which refers to a circular double stranded DNA loop into which additional DNA 
segments can be ligated. Another type of vector is a viral vector, wherein additional DNA 
segments can be ligated into the viral genome. Certain vectors are capable of autonomous 
replication in a host cell into which they are introduced (e.g., bacterial vectors having a 

10 bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., 
non-episomal mammalian vectors) are integrated into the genome of a host cell upon 
introduction into the host cell, and thereby are replicated along with the host genome. 
Moreover, certain vectors are capable of directing the expression of genes to which they are 
operatively-linked. Such vectors are referred to herein as "expression vectors". In general, 

15 expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. 
In the present specification, "plasmid" and "vector" can be used interchangeably as the 
plasmid is the most commonly used form of vector. However, the invention is intended to 
include such other forms of expression vectors, such as viral vectors (e.g., replication defective 
retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent functions. 

20 The recombinant expression vectors of the invention comprise a nucleic acid of the 

invention in a form suitable for expression of the nucleic acid in a host cell, which means that 
the recombinant expression vectors include one or more regulatory sequences, selected on the 
basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid 
sequence to be expressed. Within a recombinant expression vector, "operably-linked" is 

25 intended to mean that the nucleotide sequence of interest is linked to the regulatory 

sequence(s) in a manner that allows for expression of the nucleotide sequence (e.g., in an in 
vitro transcription/translation system or in a host cell when the vector is introduced into the 
host cell). 

The term "regulatory sequence" is intended to includes promoters, enhancers and other 
30 expression control elements (e.g., polyadenylation signals). Such regulatory sequences are 
described, for example, in Goeddel, Gene Expression Technology: Methods in 
ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990). Regulatory sequences include 
those that direct constitutive expression of a nucleotide sequence in many types of host cell 
and those that direct expression of the nucleotide sequence only in certain host cells (e.g., 
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tissue-specific regulatory sequences). It will be appreciated by those skilled in the art that the 
design of the expression vector can depend on such factors as the choice of the host cell to be 
transformed, the level of expression of protein desired, etc. The expression vectors of the 
invention can be introduced into host cells to thereby produce proteins or peptides, including 
5 fusion proteins or peptides, encoded by nucleic acids as described herein (e.g., NOVX 
proteins, mutant forms of NOVX proteins, fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed for expression of 
NOVX proteins in prokaryotic or eukaryotic cells. For example, NOVX proteins can be 
expressed in bacterial cells such as Escherichia coli, insect cells (using baculovirus expression 

10 vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, 
Gene Expression Technology: Methods in Enzymology 185, Academic Press, San 
Diego, Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in Escherichia coli with 

15 vectors containing constitutive or inducible promoters directing the expression of either fusion 
or non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded 
therein, usually to the amino terminus of the recombinant protein. Such fusion vectors 
typically serve three purposes: (/) to increase expression of recombinant protein; (//) to 
increase the solubility of the recombinant protein; and (///) to aid in the purification of the 

20 recombinant protein by acting as a ligand in affinity purification. Often, in fusion expression 
vectors, a proteolytic cleavage site is introduced at the junction of the fusion moiety and the 
recombinant protein to enable separation of the recombinant protein from the fusion moiety 
subsequent to purification of the fusion protein. Such enzymes, and their cognate recognition 
sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression vectors 

25 include pGEX (Pharmacia Biotech Inc; Smith and Johnson, 1988. Gene 67: 31-40), pMAL 
(New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, Piscataway, N.J.) that fuse 
glutathione S-transferase (GST), maltose E binding protein, or protein A, respectively, to the 
target recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 

30 (Amrann et aL, (1988) Gene 69:301-315) and pET 1 Id (Studier et al., Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990) 
60-89). 

One strategy to maximize recombinant protein expression in E. coli is to express the 
protein in a host bacteria with an impaired capacity to proteolytically cleave the recombinant 
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protein. See, e.g., Gottesman, Gene Expression Technology: Methods in Enzymology 
1 85, Academic Press, San Diego, Calif. (1990) 1 19-128. Another strategy is to alter the 
nucleic acid sequence of the nucleic acid to be inserted into an expression vector so that the 
individual codons for each amino acid are those preferentially utilized in E. coli (see, e.g., 
5 Wada, et aL, 1992. Nucl. Acids Res. 20: 21 1 1-21 18). Such alteration of nucleic acid 
sequences of the invention can be carried out by standard DNA synthesis techniques. 

In another embodiment, the NOVX expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast Saccharomyces cerivisae include pYepSec 1 
(Baldari, et aL, 1987. EMBO J. 6: 229-234), pMFa (Kurjan and Herskowitz, 1982. Cell 30: 

10 933-943), pJRY88 (Schultz et aL, 1987. Gene 54: 1 13-123), pYES2 (Invitrogen Corporation, 
San Diego, Calif.), and picZ (InVitrogen Corp, San Diego, Calif.). 

Alternatively, NOVX can be expressed in insect cells using baculovirus expression 
vectors. Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., 
SF9 cells) include the pAc series (Smith, et aL, 1983. Mol. Cell. Biol. 3: 2156-2165) and the 

15 pVL series (Lucklow and Summers, 1989. Virology 170: 31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in mammalian 
cells using a mammalian expression vector. Examples of mammalian expression vectors 
include pCDM8 (Seed, 1987. Nature 329: 840) and pMT2PC (Kaufman, et aL, 1987. EMBO 
J. 6: 187-195). When used in mammalian cells, the expression vector's control functions are 

20 often provided by viral regulatory elements. For example, commonly used promoters are 

derived from polyoma, adenovirus 2, cytomegalovirus, and simian virus 40. For other suitable 
expression systems for both prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 17 of 
Sambrook, et aL, Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring 
Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989. 

25 In another embodiment, the recombinant mammalian expression vector is capable of 

directing expression of the nucleic acid preferentially in a particular cell type (e.g., 
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific 
regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific 
promoters include the albumin promoter (liver-specific; Pinkert, et aL, 1987. Genes Dev. 1 : 

30 268-277), lymphoid-specific promoters (Calame and Eaton, 1988. Adv. Immunol. 43: 

235-275), in particular promoters of T cell receptors (Winoto and Baltimore, 1989. EMBO J. 
8: 729-733) and immunoglobulins (Banerji, et aL, 1983. Cell 33: 729-740; Queen and 
Baltimore, 1983. Cell 33: 741-748), neuron-specific promoters (e.g., the neurofilament 
promoter; Byrne and Ruddle, 1989. Proc. Natl. Acad. Sci. USA 86: 5473-5477), 
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pancreas-specific promoters (Edlund, et al., 1985. Science 230: 912-916), and mammary 
gland-specific promoters (e.g., milk whey promoter; U.S. Pat. No. 4,873,316 and European 
Application Publication No. 264,166). Developmentally-regulated promoters are also 
encompassed, e.g., the murine hox promoters (Kessel and Gruss, 1990. Science 249: 374-379) 
5 and the ct-fetoprotein promoter (Campes and Tilghman, 1989. Genes Dev. 3: 537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. That 
is, the DNA molecule is operatively-linked to a regulatory sequence in a manner that allows 
for expression (by transcription of the DNA molecule) of an RNA molecule that is antisense to 

10 NOVX mRNA. Regulatory sequences operatively linked to a nucleic acid cloned in the 

antisense orientation can be chosen that direct the continuous expression of the antisense RNA 
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen that direct constitutive, tissue specific or cell type specific expression 
of antisense RNA. The antisense expression vector can be in the form of a recombinant 

1 5 plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the activity of which can be determined by the 
cell type into which the vector is introduced. For a discussion of the regulation of gene 
expression using antisense genes see, e.g., Weintraub, et aL, "Antisense RNA as a molecular 
tool for genetic analysis," Reviews-Trends in Genetics, Vol. 1(1) 1986. 

20 Another aspect of the invention pertains to host cells into which a recombinant 

expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such terms refer 
not only to the particular subject cell but also to the progeny or potential progeny of such a 
cell. Because certain modifications may occur in succeeding generations due to either 

25 mutation or environmental influences, such progeny may not, in fact, be identical to the parent 
cell, but are still included within the scope of the term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, NOVX protein can 
be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as 
Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to 

30 those skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional 
transformation or transfection techniques. As used herein, the terms "transformation" and 
"transfection" are intended to refer to a variety of art-recognized techniques for introducing 
foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium 
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chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or 
electroporation. Suitable methods for transforming or transfecting host cells can be found in 
Sambrook, et al. (Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring 
Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989), 
5 and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may integrate 
the foreign DNA into their genome. In order to identify and select these integrants, a gene that 
encodes a selectable marker {e.g., resistance to antibiotics) is generally introduced into the 

1 0 host cells along with the gene of interest. Various selectable markers include those that confer 
resistance to drugs, such as G418, hygromycin and methotrexate. Nucleic acid encoding a 
selectable marker can be introduced into a host cell on the same vector as that encoding 
NOVX or can be introduced on a separate vector. Cells stably transfected with the introduced 
nucleic acid can be identified by drug selection (e.g., cells that have incorporated the 

15 selectable marker gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can 
be used to produce (i.e., express) NOVX protein. Accordingly, the invention further provides 
methods for producing NOVX protein using the host cells of the invention. In one 
embodiment, the method comprises culturing the host cell of invention (into which a 

20 recombinant expression vector encoding NOVX protein has been introduced) in a suitable 
medium such that NOVX protein is produced. In another embodiment, the method further 
comprises isolating NOVX protein from the medium or the host cell. 

Transgenic NOVX Animals 

The host cells of the invention can also be used to produce non-human transgenic 
25 animals. For example, in one embodiment, a host cell of the invention is a fertilized oocyte or 
an embryonic stem cell into which NOVX protein-coding sequences have been introduced. 
Such host cells can then be used to create non-human transgenic animals in which exogenous 
NOVX sequences have been introduced into their genome or homologous recombinant 
animals in which endogenous NOVX sequences have been altered. Such animals are useful 
30 for studying the function and/or activity of NOVX protein and for identifying and/or 

evaluating modulators of NOVX protein activity. As used herein, a "transgenic animal" is a 
non-human animal, preferably a mammal, more preferably a rodent such as a rat or mouse, in 
which one or more of the cells of the animal includes a transgene. Other examples of 
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transgenic animals include non-human primates, sheep, dogs, cows, goats, chickens, 
amphibians, etc. A transgene is exogenous DNA that is integrated into the genome of a cell 
from which a transgenic animal develops and that remains in the genome of the mature 
animal, thereby directing the expression of an encoded gene product in one or more cell types 
5 or tissues of the transgenic animal. As used herein, a "homologous recombinant animal" is a 
non-human animal, preferably a mammal, more preferably a mouse, in which an endogenous 
NOVX gene has been altered by homologous recombination between the endogenous gene 
and an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic cell 

r 

of the animal, prior to development of the animal. 

10 A transgenic animal of the invention can be created by introducing NOVX-encoding 

nucleic acid into the male pronuclei of a fertilized oocyte (e.g., by microinjection, retroviral 
infection) and allowing the oocyte to develop in a pseudopregnant female foster animal. The 
human NOVX cDNA sequences SEQ ID NOS:2n-l, wherein n is an integer between 1 and 
162 can be introduced as a transgene into the genome of a non-human animal. Alternatively, a 

1 5 non-human homologue of the human NOVX gene, such as a mouse NOVX gene, can be 

isolated based on hybridization to the human NOVX cDNA (described further supra) and used 
as a transgene. Intronic sequences and polyadenylation signals can also be included in the 
transgene to increase the efficiency of expression of the transgene. A tissue-specific 
regulatory sequence(s) can be operably-linked to the NOVX transgene to direct expression of 

20 NOVX protein to particular cells. Methods for generating transgenic animals via embryo 
manipulation and microinjection, particularly animals such as mice, have become 
conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866; 
4,870,009; and 4,873,191; and Hogan, 1986. In: MANIPULATING THE MOUSE Embryo, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. Similar methods are used for 

25 production of other transgenic animals. A transgenic founder animal can be identified based 
upon the presence of the NOVX transgene in its genome and/or expression of NOVX mRNA 
in tissues or cells of the animals. A transgenic founder animal can then be used to breed 
additional animals carrying the transgene. Moreover, transgenic animals carrying a transgene- 
encoding NOVX protein can further be bred to other transgenic animals carrying other 

30 transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains at 
least a portion of an NOVX gene into which a deletion, addition or substitution has been 
introduced to thereby alter, e.g., functionally disrupt, the NOVX gene. The NOVX gene can 
be a human gene (e.g., the cDNA of SEQ ID NOS:2n-l, wherein n is an integer between 1 and 
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162), but more preferably, is a non-human homologue of a human NOVX gene. For example, 
a mouse homologue of human NOVX gene of SEQ ID NOS:2n-l, wherein n is an integer 
between 1 and 162 can be used to construct a homologous recombination vector suitable for 
altering an endogenous NOVX gene in the mouse genome. In one embodiment, the vector is 
5 designed such that, upon homologous recombination, the endogenous NOVX gene is 

functionally disrupted {i.e., no longer encodes a functional protein; also referred to as a "knock 
out" vector). 

Alternatively, the vector can be designed such that, upon homologous recombination, 
the endogenous NOVX gene is mutated or otherwise altered but still encodes functional 

10 protein (e.g., the upstream regulatory region can be altered to thereby alter the expression of 
the endogenous NOVX protein). In the homologous recombination vector, the altered portion 
of the NOVX gene is flanked at its 5'- and 3'-termini by additional nucleic acid of the NOVX 
gene to allow for homologous recombination to occur between the exogenous NOVX gene 
carried by the vector and an endogenous NOVX gene in an embryonic stem cell. The 

15 additional flanking NOVX nucleic acid is of sufficient length for successful homologous 

recombination with the endogenous gene. Typically, several kilobases of flanking DNA (both 
at the 5'- and 3*-termini) are included in the vector. See, e.g., Thomas, et oL, 1987. Cell 5 1 : 
503 for a description of homologous recombination vectors. The vector is ten introduced into 
an embryonic stem cell line (e.g., by electroporation) and cells in which the introduced NOVX 

20 gene has homologously-recombined with the endogenous NOVX gene are selected. See, e.g., 
Li, etal, 1992. Cell 69: 915. 

The selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to 
form aggregation chimeras. See, e.g., Bradley, 1987. In: Teratocarcinomas and 
Embryonic Stem Cells: A Practical Approach, Robertson, ed. IRL, Oxford, pp. 1 13-152. 

25 A chimeric embryo can then be implanted into a suitable pseudopregnant female foster animal 
and the embryo brought to term. Progeny harboring the homologously-recombined DNA in 
their germ cells can be used to breed animals in which all cells of the animal contain the 
homologously-recombined DNA by germline transmission of the transgene. Methods for 
constructing homologous recombination vectors and homologous recombinant animals are 

30 described further in Bradley, 1991. Curr. Opin. Biotechnol. 2: 823-829; PCT International 
Publication Nos.: WO 90/1 1354; WO 91/01 140; WO 92/0968; and WO 93/04169. 

In another embodiment, transgenic non-humans animals can be produced that contain 
selected systems that allow for regulated expression of the transgene. One example of such a 
system is the cre/loxP recombinase system of bacteriophage PI. For a description of the 
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cre/loxP recombinase system, See, e.g., Lakso, et al., 1992. Proc. Natl. Acad. Sci. USA 89: 
6232-6236. Another example of a recombinase system. is the FLP recombinase system of 
Saccharomyces cerevisiae. See, O'Gorman, et ah, 1991 . Science 251:1351-1355. If a cre/loxP 
recombinase system is used to regulate expression of the transgene, animals containing 
5 transgenes encoding both the Cre recombinase and a selected protein are required. Such 
animals can be provided through the construction of "double" transgenic animals, e.g., by 
mating two transgenic animals, one containing a transgene encoding a selected protein and the 
other containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 

10 according to the methods described in Wilmut, et aL, 1997. Nature 385: 810-813. In brief, a 
cell {e.g., a somatic cell) from the transgenic animal can be isolated and induced to exit the 
growth cycle and enter G 0 phase. The quiescent cell can then be fused, e.g., through the use of 
electrical pulses, to an enucleated oocyte from an animal of the same species from which the 
quiescent cell is isolated. The reconstructed oocyte is then cultured such that it develops to 

15 morula or blastocyte and then transferred to pseudopregnant female foster animal. The 

offspring borne of this female foster animal will be a clone of the animal from which the cell 
{e.g., the somatic cell) is isolated. 

Pharmaceutical Compositions 

The NOVX nucleic acid molecules, NOVX proteins, and anti-NOVX antibodies (also 
20 referred to herein as "active compounds") of the invention, and derivatives, fragments, analogs 
and homologs thereof, can be incorporated into pharmaceutical compositions suitable for 
administration. Such compositions typically comprise the nucleic acid molecule, protein, or 
antibody and a pharmaceutical ly acceptable carrier. As used herein, "pharmaceutically 
acceptable carrier" is intended to include any and all solvents, dispersion media, coatings, 
25 antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, 

compatible with pharmaceutical administration. Suitable carriers are described in the most 
recent edition of Remington's Pharmaceutical Sciences, a standard reference text in the field, 
which is incorporated herein by reference. Preferred examples of such carriers or diluents 
include, but are not limited to, water, saline, finger's solutions, dextrose solution, and 5% 
30 human serum albumin. Liposomes and non-aqueous vehicles such as fixed oils may also be 

used. The use of such media and agents for pharmaceutically active substances is well known 
in the art. Except insofar as any conventional media or agent is incompatible with the active 
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compound, use thereof in the compositions is contemplated. Supplementary active 
compounds can also be incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible with its 
intended route of administration. Examples of routes of administration include parenteral, 
5 e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal (i.e., topical), 
transmucosal, and rectal administration. Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the following components: a sterile 
diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, 
propylene glycol or other synthetic solvents; antibacterial agents such as benzyl alcohol or 

10 methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such 
as ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates, 
and agents for the adjustment of tonicity such as sodium chloride or dextrose. The pH can be 
adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in ampoules, disposable syringes or multiple dose vials made of 

1 5 glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersion. For intravenous administration, 
suitable carriers include physiological saline, bacteriostatic water, Cremophor EL™ (BASF, 

20 Parsippany, N.J.) or phosphate buffered saline (PBS). In all cases, the composition must be 
sterile and should be fluid to the extent that easy syringeability exists. It must be stable under 
the conditions of manufacture and storage and must be preserved against the contaminating 
action of microorganisms such as bacteria and fungi. The carrier can be a solvent or 
dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, 

25 propylene glycol, and liquid polyethylene glycol, and the like), and suitable mixtures thereof. 
The proper fluidity can be maintained, for example, by the use of a coating such as lecithin, by 
the maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic 

30 acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, 
for example, sugars, polyalcohols such as manitol, sorbitol, sodium chloride in the 
composition. Prolonged absorption of the injectable compositions can be brought about by 
including in the composition an agent which delays absorption, for example, aluminum 
monostearate and gelatin. 
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Sterile injectable solutions can be prepared by incorporating the active compound (e.g., 
an NOVX protein or anti-NOVX antibody) in the required amount in an appropriate solvent 
with one or a combination of ingredients enumerated above, as required, followed by filtered 
sterilization. Generally, dispersions are prepared by incorporating the active compound into a 
5 sterile vehicle that contains a basic dispersion medium and the required other ingredients from 
those enumerated above. In the case of sterile powders for the preparation of sterile injectable 
solutions, methods of preparation are vacuum drying and freeze-drying that yields a powder of 
the active ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof. 

10 Oral compositions generally include an inert diluent or an edible carrier. They can be 

enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic 
administration, the active compound can be incorporated with excipients and used in the form 
of tablets, troches, or capsules. Oral compositions can also be prepared using a fluid carrier 
for use as a mouthwash, wherein the compound in the fluid carrier is applied orally and 

15 swished and expectorated or swallowed. Pharmaceutical ly compatible binding agents, and/or 
adjuvant materials can be included as part of the composition. The tablets, pills, capsules, 
troches and the like can contain any of the following ingredients, or compounds of a similar 
nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient 
such as starch or lactose, a disintegrating agent such as alginic acid, Primogel, or corn starch; a 

20 lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable propellant, e.g., 

25 a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, and 
include, for example, for transmucosal administration, detergents, bile salts, and fusidic acid 

30 derivatives. Transmucosal administration can be accomplished through the use of nasal sprays 
or suppositories. For transdermal administration, the active compounds are formulated into 
ointments, salves, gels, or creams as generally known in the art. 
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The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention enemas 
for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will protect 
5 the compound against rapid elimination from the body, such as a controlled release 

formulation, including implants and microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, 
polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation of 
such formulations will be apparent to those skilled in the art. The materials can also be 
10 obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal 

suspensions (including liposomes targeted to infected cells with monoclonal antibodies to viral 
antigens) can also be used as pharmaceutical ly acceptable carriers. These can be prepared 
according to methods known to those skilled in the art, for example, as described in U.S. 
Patent No. 4,522,811. 

15 It is especially advantageous to formulate oral or parenteral compositions in dosage 

unit form for ease of administration and uniformity of dosage. Dosage unit form as used 
herein refers to physically discrete units suited as unitary dosages for the subject to be treated; 
each unit containing a predetermined quantity of active compound calculated to produce the 
desired therapeutic effect in association with the required pharmaceutical carrier. The 

20 specification for the dosage unit forms of the invention are dictated by and directly dependent 
on the unique characteristics of the active compound and the particular therapeutic effect to be 
achieved, and the limitations inherent in the art of compounding such an active compound for 
the treatment of individuals. 

The nucleic acid molecules of the invention can be inserted into vectors and used as 

25 gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for example, 
intravenous injection, local administration (see, e.g., U.S. Patent No. 5,328,470) or by 
stereotactic injection (see, e.g., Chen, et aL, 1994. Proc. Natl. Acad. Sci. USA 91: 3054-3057). 
The pharmaceutical preparation of the gene therapy vector can include the gene therapy vector 
in an acceptable diluent, or can comprise a slow release matrix in which the gene delivery 

30 vehicle is imbedded. Alternatively, where the complete gene delivery vector can be produced 
intact from recombinant cells, e.g., retroviral vectors, the pharmaceutical preparation can 
include one or more cells that produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack, or dispenser 
together with instructions for administration. 
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Screening and Detection Methods 

The isolated nucleic acid molecules of the invention can be used to express NOVX 
protein (e.g., via a recombinant expression vector in a host cell in gene therapy applications), 
to detect NOVX mRNA (e.g., in a biological sample) or a genetic lesion in an NOVX gene, 
5 and to modulate NOVX activity, as described further, below. In addition, the NOVX proteins 
can be used to screen drugs or compounds that modulate the NOVX protein activity or 
expression as well as to treat disorders characterized by insufficient or excessive production of 
NOVX protein or production of NOVX protein forms that have decreased or aberrant activity 
compared to NOVX wild-type protein (e.g. ; diabetes (regulates insulin release); obesity (binds 

10 and transport lipids); metabolic disturbances associated with obesity, the metabolic syndrome 
X as well as anorexia and wasting disorders associated with chronic diseases and various 
cancers, and infectious disease(possesses anti-microbial activity) and the various 
dyslipidemias. In addition, the anti-NOVX antibodies of the invention can be used to detect 
and isolate NOVX proteins and modulate NOVX activity. In yet a further aspect, the invention 

15 can be used in methods to influence appetite, absorption of nutrients and the disposition of 
metabolic substrates in both a positive and negative fashion. 

The invention further pertains to novel agents identified by the screening assays 
described herein and uses thereof for treatments as described, supra. 

20 Screening Assays 

The invention provides a method (also referred to herein as a "screening assay") for 
identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides, 
peptidomimetics, small molecules or other drugs) that bind to NOVX proteins or have a 
stimulatory or inhibitory effect on, e.g., NOVX protein expression or NOVX protein activity. 

25 The invention also includes compounds identified in the screening assays described herein. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of the membrane-bound form of an NOVX 
protein or polypeptide or biologically-active portion thereof. The test compounds of the 
invention can be obtained using any of the numerous approaches in combinatorial library 

30 methods known in the art, including: biological libraries; spatially addressable parallel solid 
phase or solution phase libraries; synthetic library methods requiring deconvolution; the 
"one-bead one-compound" library method; and synthetic library methods using affinity 
chromatography selection. The biological library approach is limited to peptide libraries, 
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while the other four approaches are applicable to peptide, non-peptide oligomer or small 
molecule libraries of compounds. See, e.g., Lam, 1997 Anticancer Drug Design 12: 145. 

A "small molecule" as used herein, is meant to refer to a composition that has a 
molecular weight of less than about 5 kD and most preferably less than about 4 kD. Small 
5 molecules can be, e.g., nucleic acids, peptides, polypeptides, peptidomimetics, carbohydrates, 
lipids or other organic or inorganic molecules. Libraries of chemical and/or biological 
mixtures, such as fungal, bacterial, or algal extracts, are known in the art and can be screened 
with any of the assays of the invention. 

Examples of methods for the synthesis of molecular libraries can be found in the art, 
10 for example in: DeWitt, et aL, 1993. Proc. Natl. Acad. Sci. U.S.A. 90: 6909; Erb, et aL, 1994. 
Proc. Natl. Acad. Sci. U.S.A. 91:1 1422; Zuckermann, et aL, 1994. J. Med. Chem. 37: 2678; 
Cho, et aL, 1993. Science 261: 1303; Carrell, et aL, 1994. Angew. Chem. Int. Ed. Engl. 33: 
2059; Carell, et aL, 1994. Angew. Chem. Int. Ed. Engl. 33: 2061 ; and Gallop, et aL, 1994. J. 
Med. Chem. 37: 1233. 

15 Libraries of compounds may be presented in solution {e.g., Houghten, 1992. 

Biotechniques 13: 412-421), or on beads (Lam, 1991. Nature 354: 82-84), on chips (Fodor, 
1993. Nature 364: 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), spores (Ladner, 
U.S. Patent 5,233,409), plasmids (Cull, et aL, 1992. Proc. Natl. Acad. Sci. USA 89: 
1865-1869) or on phage (Scott and Smith, 1990. Science 249: 386-390; Devlin, 1990. Science 

20 249: 404-406; Cwirla, et aL, 1990. Proc. Natl. Acad. Sci. U.S.A. 87: 6378-6382; Felici, 1991. 
J. Mol. Biol. 222: 301-310; Ladner, U.S. Patent No. 5,233,409.). 

In one embodiment, an assay is a cell-based assay in which a cell which expresses a 
membrane-bound form of NOVX protein, or a biologically-active portion thereof, on the cell 
surface is contacted with a test compound and the ability of the test compound to bind to an 

25 NOVX protein determined. The cell, for example, can of mammalian origin or a yeast cell. 
Determining the ability of the test compound to bind to the NOVX protein can be 
accomplished, for example, by coupling the test compound with a radioisotope or enzymatic 
label such that binding of the test compound to the NOVX protein or biologically-active 
portion thereof can be determined by detecting the labeled compound in a complex. For 

30 example, test compounds can be labeled with 125 I, 35 S, 14 C, or 3 H, either directly or indirectly, 
and the radioisotope detected by direct counting of radioemission or by scintillation counting. 
Alternatively, test compounds can be enzymatically-labeled with, for example, horseradish 
peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label detected by 
determination of conversion of an appropriate substrate to product. In one embodiment, the 
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assay comprises contacting a cell which expresses a membrane-bound form of NOVX protein, 
or a biologically-active portion thereof, on the cell surface with a known compound which 
binds NOVX to form an assay mixture, contacting the assay mixture with a test compound, 
and determining the ability of the test compound to interact with an NOVX protein, wherein 
5 determining the ability of the test compound to interact with an NOVX protein comprises 
determining the ability of the test compound to preferentially bind to NOVX protein or a 
biologically-active portion thereof as compared to the known compound. 

In another embodiment, an assay is a cell-based assay comprising contacting a cell 
expressing a membrane-bound form of NOVX protein, or a biologically-active portion thereof, 

10 on the cell surface with a test compound and determining the ability of the test compound to 
modulate (e.g., stimulate or inhibit) the activity of the NOVX protein or biologically-active 
portion thereof. Determining the ability of the test compound to modulate the activity of 
NOVX or a biologically-active portion thereof can be accomplished, for example, by 
determining the ability of the NOVX protein to bind to or interact with an NOVX target 

15 molecule. As used herein, a "target molecule" is a molecule with which an NOVX protein 
binds or interacts in nature, for example, a molecule on the surface of a cell which expresses 
an NOVX interacting protein, a molecule on the surface of a second cell, a molecule in the 
extracellular milieu, a molecule associated with the internal surface of a cell membrane or a 
cytoplasmic molecule. An NOVX target molecule can be a non-NOVX molecule or an 

20 NOVX protein or polypeptide of the invention. In one embodiment, an NOVX target 

molecule is a component of a signal transduction pathway that facilitates transduction of an 
extracellular signal (e.g. a signal generated by binding of a compound to a membrane-bound 
NOVX molecule) through the cell membrane and into the cell. The target, for example, can be 
a second intercellular protein that has catalytic activity or a protein that facilitates the 

25 association of downstream signaling molecules with NOVX. 

Determining the ability of the NOVX protein to bind to or interact with an NOVX 
target molecule can be accomplished by one of the methods described above for determining 
direct binding. In one embodiment, determining the ability of the NOVX protein to bind to or 
interact with an NOVX target molecule can be accomplished by determining the activity of the 

30 target molecule. For example, the activity of the target molecule can be determined by 
detecting induction of a cellular second messenger of the target (i.e. intracellular Ca 2+ , 
diacylglycerol, IP 3 , etc.), detecting catalytic/enzymatic activity of the target an appropriate 
substrate, detecting the induction of a reporter gene (comprising an NOVX-responsive 
regulatory element operatively linked to a nucleic acid encoding a detectable marker, e.g., 
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luciferase), or detecting a cellular response, for example, cell survival, cellular differentiation, 
or cell proliferation. 

In yet another embodiment, an assay of the invention is a cell-free assay comprising 
contacting an NOVX protein or biologically-active portion thereof with a test compound and 
5 determining the ability of the test compound to bind to the NOVX protein or biologically- 
active portion thereof. Binding of the test compound to the NOVX protein can be determined 
either directly or indirectly as described above. In one such embodiment, the assay comprises 
contacting the NOVX protein or biologically-active portion thereof with a known compound 
which binds NOVX to form an assay mixture, contacting the assay mixture with a test 

10 compound, and determining the ability of the test compound to interact with an NOVX 
protein, wherein determining the ability of the test compound to interact with an NOVX 
protein comprises determining the ability of the test compound to preferentially bind to NOVX 
or biologically-active portion thereof as compared to the known compound. 

In still another embodiment, an assay is a cell-free assay comprising contacting NOVX 

1 5 protein or biologically-active portion thereof with a test compound and determining the ability 
of the test compound to modulate (e.g. stimulate or inhibit) the activity of the NOVX protein 
or biologically-active portion thereof. Determining the ability of the test compound to 
modulate the activity of NOVX can be accomplished, for example, by determining the ability 
of the NOVX protein to bind to an NOVX target molecule by one of the methods described 

20 above for determining direct binding. In an alternative embodiment, determining the ability of 
the test compound to modulate the activity of NOVX protein can be accomplished by 
determining the ability of the NOVX protein further modulate an NOVX target molecule. For 
example, the catalytic/enzymatic activity of the target molecule on an appropriate substrate 
can be determined as described, supra. 

25 In yet another embodiment, the cell-free assay comprises contacting the NOVX protein 

or biologically-active portion thereof with a known compound which binds NOVX protein to 
form an assay mixture, contacting the assay mixture with a test compound, and determining 
the ability of the test compound to interact with an NOVX protein, wherein determining the 
ability of the test compound to interact with an NOVX protein comprises determining the 

30 ability of the NOVX protein to preferentially bind to or modulate the activity of an NOVX 
target molecule. 

The cell-free assays of the invention are amenable to use of both the soluble form or 
the membrane-bound form of NOVX protein. In the case of cell-free assays comprising the 
membrane-bound form of NOVX protein, it may be desirable to utilize a solubilizing agent 
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such that the membrane-bound form of NOVX protein is maintained in solution. Examples of 
such solubilizing agents include non-ionic detergents such as n-octylglucoside, 
n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, 
decanoyl-N-methylglucamide, Triton® X-100, Triton® X-l 14, Thesit®, 
5 Isotridecypoly(ethylene glycol ether) m N-dodecyl--N,N-dimethyl-3-ammonio-l -propane 
sulfonate, 3-(3-cholamidopropyl) dimethylamminiol-1 -propane sulfonate (CHAPS), or 
3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy-l -propane sulfonate (CHAPSO). 

In more than one embodiment of the above assay methods of the invention, it may be 
desirable to immobilize either NOVX protein or its target molecule to facilitate separation of 

10 complexed from uncomplexed forms of one or both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to NOVX protein, or interaction of 
NOVX protein with a target molecule in the presence and absence of a candidate compound, 
can be accomplished in any vessel suitable for containing the reactants. Examples of such 
vessels include microtiter plates, test tubes, and micro-centrifuge tubes. In one embodiment, a 

1 5 fusion protein can be provided that adds a domain that allows one or both of the proteins to be 
bound to a matrix. For example, GST-NOVX fusion proteins or GST-target fusion proteins 
can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or 
glutathione derivatized microtiter plates, that are then combined with the test compound or the 
test compound and either the non-adsorbed target protein or NOVX protein, and the mixture is 

20 incubated under conditions conducive to complex formation (e.g., at physiological conditions 
for salt and pH). Following incubation, the beads or microtiter plate wells are washed to 
remove any unbound components, the matrix immobilized in the case of beads, complex 
determined either directly or indirectly, for example, as described, supra. Alternatively, the 
complexes can be dissociated from the matrix, and the level of NOVX protein binding or 

25 activity determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either the NOVX protein or its target 
molecule can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated 
NOVX protein or target molecules can be prepared from biotin-NHS 

30 (N-hydroxy-succinimide) using techniques well-known within the art (e.g., biotinylation kit, 
Pierce Chemicals, Rockford, 111.), and immobilized in the wells of streptavidin-coated 96 well 
plates (Pierce Chemical). Alternatively, antibodies reactive with NOVX protein or target 
molecules, but which do not interfere with binding of the NOVX protein to its target molecule, 
can be derivatized to the wells of the plate, and unbound target or NOVX protein trapped in 




the wells by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of complexes 
using antibodies reactive with the NOVX protein or target molecule, as well as enzyme-linked 
assays that rely on detecting an enzymatic activity associated with the NOVX protein or target 
5 molecule. 

In another embodiment, modulators of NOVX protein expression are identified in a 
method wherein a cell is contacted with a candidate compound and the expression of NOVX 
mRNA or protein in the cell is determined. The level of expression of NOVX mRNA or 
protein in the presence of the candidate compound is compared to the level of expression of 

10 NOVX mRNA or protein in the absence of the candidate compound. The candidate 

compound can then be identified as a modulator of NOVX mRNA or protein expression based 
upon this comparison. For example, when expression of NOVX mRNA or protein is greater 
(i.e., statistically significantly greater) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as a stimulator of NOVX mRNA or protein 

15 expression. Alternatively, when expression of NOVX mRNA or protein is less (statistically 

significantly less) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as an inhibitor of NOVX mRNA or protein expression. The level of 
NOVX mRNA or protein expression in the ceils can be determined by methods described 
herein for detecting NOVX mRNA or protein. 

20 In yet another aspect of the invention, the NOVX proteins can be used as "bait 

proteins" in a two-hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,317; 
Zervos, et al., 1993. Cell 72: 223-232; Madura, et al, 1993. J. Biol Chem. 268: 12046-12054; 
Bartel, et aL, 1993. Biotechniques 14: 920-924; Iwabuchi, et al., 1993. Oncogene 8: 
1693-1696; and Brent WO 94/10300), to identify other proteins that bind to or interact with 

25 NOVX ("NOVX-binding proteins" or "NOVX-bp") and modulate NOVX activity. Such 
NOVX-binding proteins are also likely to be involved in the propagation of signals by the 
NOVX proteins as, for example, upstream or downstream elements of the NOVX pathway. 

The two-hybrid system is based on the modular nature of most transcription factors, 
which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes 

30 two different DNA constructs. In one construct, the gene that codes for NOVX is fused to a 

gene encoding the DNA binding domain of a known transcription factor (e.g., GAL-4). In the 
other construct, a DNA sequence, from a library of DNA sequences, that encodes an 
unidentified protein ("prey" or "sample") is fused to a gene that codes for the activation 
domain of the known transcription factor. If the "bait" and the "prey" proteins are able to 
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interact, /« v/Vo, forming an NOVX-dependent complex, the DNA-binding and activation 
domains of the transcription factor are brought into close proximity. This proximity allows 
transcription of a reporter gene (e.g., LacZ) that is operably linked to a transcriptional 
regulatory site responsive to the transcription factor. Expression of the reporter gene can be 
5 detected and cell colonies containing the functional transcription factor can be isolated and 
used to obtain the cloned gene that encodes the protein which interacts with NOVX. 

The invention further pertains to novel agents identified by the aforementioned 
screening assays and uses thereof for treatments as described herein. 

Detection Assays 

10 Portions or fragments of the cDNA sequences identified herein (and the corresponding 

complete gene sequences) can be used in numerous ways as polynucleotide reagents. By way 
of example, and not of limitation, these sequences can be used to: (/*) map their respective 
genes on a chromosome; and, thus, locate gene regions associated with genetic disease; (//) 
identify an individual from a minute biological sample (tissue typing); and (Jit) aid in forensic 

15 identification of a biological sample. Some of these applications are described in the 
subsections, below. 

Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
20 sequence can be used to map the location of the gene on a chromosome. This process is called 
chromosome mapping. Accordingly, portions or fragments of the NOVX sequences, SEQ ID 
NOS:2n-l, wherein n is an integer between 1 and 162, or fragments or derivatives thereof, can 
be used to map the location of the NOVX genes, respectively, on a chromosome. The 
mapping of the NOVX sequences to chromosomes is an important first step in correlating 
25 these sequences with genes associated with disease. 

Briefly, NOVX genes can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp in length) from the NOVX sequences. Computer analysis of the NOVX, 
sequences can be used to rapidly select primers that do not span more than one exon in the 
genomic DNA, thus complicating the amplification process. These primers can then be used 
30 for PCR screening of somatic cell hybrids containing individual human chromosomes. Only 
those hybrids containing the human gene corresponding to the NOVX sequences will yield an 
amplified fragment. 
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Somatic cell hybrids are prepared by fusing somatic cells from different mammals 
(e.g., human and mouse cells). As hybrids of human and mouse cells grow and divide, they 
gradually lose human chromosomes in random order, but retain the mouse chromosomes. By 
using media in which mouse cells cannot grow, because they lack a particular enzyme, but in 
5 which human cells can, the one human chromosome that contains the gene encoding the 

needed enzyme will be retained. By using various media, panels of hybrid cell lines can be 
established. Each cell line in a panel contains either a single human chromosome or a small 
number of human chromosomes, and a full set of mouse chromosomes, allowing easy 
mapping of individual genes to specific human chromosomes. See, e.g., D'Eustachio, et al., 
10 1983. Science 220: 919-924. Somatic cell hybrids containing only fragments of human 

chromosomes can also be produced by using human chromosomes with translocations and 
deletions. 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a particular 
sequence to a particular chromosome. Three or more sequences can be assigned per day using 

15 a single thermal cycler. Using the NOVX sequences to design oligonucleotide primers, sub- 
localization can be achieved with panels of fragments from specific chromosomes. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can further be used to provide a precise chromosomal location in one 
step. Chromosome spreads can be made using cells whose division has been blocked in 

20 metaphase by a chemical like colcemid that disrupts the mitotic spindle. The chromosomes 
can be treated briefly with trypsin, and then stained with Giemsa. A pattern of light and dark 
bands develops on each chromosome, so that the chromosomes can be identified individually. 
The FISH technique can be used with a DNA sequence as short as 500 or 600 bases. 
However, clones larger than 1,000 bases have a higher likelihood of binding to a unique 

25 chromosomal location with sufficient signal intensity for simple detection. Preferably 1,000 
bases, and more preferably 2,000 bases, will suffice to get good results at a reasonable amount 
of time. For a review of this technique, see, Verma, et al., Human Chromosomes: A 
Manual of Basic Techniques (Pergamon Press, New York 1988). 

Reagents for chromosome mapping can be used individually to mark a single 

30 chromosome or a single site on that chromosome, or panels of reagents can be used for 

marking multiple sites and/or multiple chromosomes. Reagents corresponding to noncoding 
regions of the genes actually are preferred for mapping purposes. Coding sequences are more 
likely to be conserved within gene families, thus increasing the chance of cross hybridizations 
during chromosomal mapping. 
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Once a sequence has been mapped to a precise chromosomal location, the physical 
position of the sequence on the chromosome can be correlated with genetic map data. Such 
data are found, e.g., in McKusick, Mendelian Inheritance in Man, available on-line 
through Johns Hopkins University Welch Medical Library). The relationship between genes 
5 and disease, mapped to the same chromosomal region, can then be identified through linkage 
analysis (co-inheritance of physically adjacent genes), described in, e.g., Egeland, et aL, 1987. 
Nature, 325: 783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with the NOVX gene, can be determined. If a mutation is 
10 observed in some or all of the affected individuals but not in any unaffected individuals, then 
the mutation is likely to be the causative agent of the particular disease. Comparison of 
affected and unaffected individuals generally involves first looking for structural alterations in 
the chromosomes, such as deletions or translocations that are visible from chromosome 
spreads or detectable using PCR based on that DNA sequence. Ultimately, complete 
1 5 sequencing of genes from several individuals can be performed to confirm the presence of a 
mutation and to distinguish mutations from polymorphisms. 

Tissue Typing 

The NOVX sequences of the invention can also be used to identify individuals from 
20 minute biological samples. In this technique, an individual's genomic DNA is digested with 
one or more restriction enzymes, and probed on a Southern blot to yield unique bands for 
identification. The sequences of the invention are useful as additional DNA markers for RFLP 
("restriction fragment length polymorphisms," described in U.S. Patent No. 5,272,057). 

Furthermore, the sequences of the invention can be used to provide an alternative 
25 technique that determines the actual base-by-base DNA sequence of selected portions of an 

individual's genome. Thus, the NOVX sequences described herein can be used to prepare two 
PCR primers from the 5'- and 3'-termini of the sequences. These primers can then be used to 
amplify an individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from individuals, prepared in this manner, 
30 can provide unique individual identifications, as each individual will have a unique set of such 
DNA sequences due to allelic differences. The sequences of the invention can be used to 
obtain such identification sequences from individuals and from tissue. The NOVX sequences 
of the invention uniquely represent portions of the human genome. Allelic variation occurs to 
some degree in the coding regions of these sequences, and to a greater degree in the noncoding 
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regions. It is estimated that allelic variation between individual humans occurs with a 
frequency of about once per each 500 bases. Much of the allelic variation is due to single 
nucleotide polymorphisms (SNPs), which include restriction fragment length polymorphisms 
(RFLPs). 

5 Each of the sequences described herein can, to some degree, be used as a standard 

against which DNA from an individual can be compared for identification purposes. Because 
greater numbers of polymorphisms occur in the noncoding regions, fewer sequences are 
necessary to differentiate individuals. The noncoding sequences can comfortably provide 
positive individual identification with a panel of perhaps 10 to 1,000 primers that each yield a 
10 noncoding amplified sequence of 100 bases. If predicted coding sequences, such as those in 
SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162 are used, a more appropriate 
number of primers for positive individual identification would be 500-2,000. 

Predictive Medicine 

15 The invention also pertains to the field of predictive medicine in which diagnostic 

assays, prognostic assays, pharmacogenomics, and monitoring clinical trials are used for 
prognostic (predictive) purposes to thereby treat an individual prophylactically. Accordingly, 
one aspect of the invention relates to diagnostic assays for determining NOVX protein and/or 
nucleic acid expression as well as NOVX activity, in the context of a biological sample (e.g., 

20 blood, serum, cells, tissue) to thereby determine whether an individual is afflicted with a 
disease or disorder, or is at risk of developing a disorder, associated with aberrant NOVX 
expression or activity. The disorders include metabolic disorders, diabetes, obesity, infectious 
disease, anorexia, cancer-associated cachexia, cancer, neurodegenerative disorders, 
Alzheimer's Disease, Parkinson's Disorder, immune disorders, and hematopoietic disorders, 

25 and the various dyslipidemias, metabolic disturbances associated with obesity, the metabolic 
syndrome X and wasting disorders associated with chronic diseases and various cancers. The 
invention also provides for prognostic (or predictive) assays for determining whether an 
individual is at risk of developing a disorder associated with NOVX protein, nucleic acid 
expression or activity. For example, mutations in an NOVX gene can be assayed in a 

30 biological sample. Such assays can be used for prognostic or predictive purpose to thereby 
prophylactically treat an individual prior to the onset of a disorder characterized by or 
associated with NOVX protein, nucleic acid expression, or biological activity. 

Another aspect of the invention provides methods for determining NOVX protein, 
nucleic acid expression or activity in an individual to thereby select appropriate therapeutic or 
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prophylactic agents for that individual (referred to herein as "pharmacogenomics"). 
Pharmacogenomics allows for the selection of agents (e.g., drugs) for therapeutic or 
prophylactic treatment of an individual based on the genotype of the individual (e.g., the 
genotype of the individual examined to determine the ability of the individual to respond to a 
5 particular agent.) 

Yet another aspect of the invention pertains to monitoring the influence of agents (e.g., 
drugs, compounds) on the expression or activity of NOVX in clinical trials. 

These and other agents are described in further detail in the following sections. 

10 Diagnostic Assays 

An exemplary method for detecting the presence or absence of NOVX in a biological 
sample involves obtaining a biological sample from a test subject and contacting the biological 
sample with a compound or an agent capable of detecting NOVX protein or nucleic acid (e.g., 
mRNA, genomic DNA) that encodes NOVX protein such that the presence of NOVX is 

15 detected in the biological sample. An agent for detecting NOVX mRNA or genomic DNA is a 
labeled nucleic acid probe capable of hybridizing to NOVX mRNA or genomic DNA. The 
nucleic acid probe can be, for example, a full-length NOVX nucleic acid, such as the nucleic 
acid of SEQ ID NOS:2n-l, wherein n is an integer between 1 and 162, or a portion thereof, 
such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and 

20 sufficient to specifically hybridize under stringent conditions to NOVX mRNA or genomic 
DNA. Other suitable probes for use in the diagnostic assays of the invention are described 
herein. 

An agent for detecting NOVX protein is an antibody capable of binding to NOVX 
protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, or more 

25 preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab')2) can be 
used. The term "labeled", with regard to the probe or antibody, is intended to encompass 
direct labeling of the probe or antibody by coupling (i.e., physically linking) a detectable 
substance to the probe or antibody, as well as indirect labeling of the probe or antibody by 
reactivity with another reagent that is directly labeled. Examples of indirect labeling include 

30 detection of a primary antibody using a fluorescently-labeled secondary antibody and 

end-labeling of a DNA probe with biotin such that it can be detected with fluorescently- 
labeled streptavidin. The term "biological sample" is intended to include tissues, cells and 
biological fluids isolated from a subject, as well as tissues, cells and fluids present within a 
subject. That is, the detection method of the invention can be used to detect NOVX mRNA, 
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protein, or genomic DNA in a biological sample in vitro as well as in vivo. For example, in 
vitro techniques for detection of NOVX mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of NOVX protein include enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations, and 
5 immunofluorescence. In vitro techniques for detection of NOVX genomic DNA include 
Southern hybridizations. Furthermore, in vivo techniques for detection of NOVX protein 
include introducing into a subject a labeled anti-NOVX antibody. For example, the antibody 
can be labeled with a radioactive marker whose presence and location in a subject can be 
detected by standard imaging techniques. 

10 In one embodiment, the biological sample contains protein molecules from the test 

subject. Alternatively, the biological sample can contain mRNA molecules from the test 
subject or genomic DNA molecules from the test subject. A preferred biological sample is a 
peripheral blood leukocyte sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control biological 

1 5 sample from a control subject, contacting the control sample with a compound or agent 
capable of detecting NOVX protein, mRNA, or genomic DNA, such that the presence of 
NOVX protein, mRNA or genomic DNA is detected in the biological sample, and comparing 
the presence of NOVX protein, mRNA or genomic DNA in the control sample with the 
presence of NOVX protein, mRNA or genomic DNA in the test sample. 

20 The invention also encompasses kits for detecting the presence of NOVX in a 

biological sample. For example, the kit can comprise: a labeled compound or agent capable of 
detecting NOVX protein or mRNA in a biological sample; means for determining the amount 
of NOVX in the sample; and means for comparing the amount of NOVX in the sample with a 
standard. The compound or agent can be packaged in a suitable container. The kit can further 

25 comprise instructions for using the kit to detect NOVX protein or nucleic acid. 

Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant NOVX 
30 expression or activity. For example, the assays described herein, such as the preceding 

diagnostic assays or the following assays, can be utilized to identify a subject having or at risk 
of developing a disorder associated with NOVX protein, nucleic acid expression or activity. 
Alternatively, the prognostic assays can be utilized to identify a subject having or at risk for 
developing a disease or disorder. Thus, the invention provides a method for identifying a 
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disease or disorder associated with aberrant NOVX expression or activity in which a test 
sample is obtained from a subject and NOVX protein or nucleic acid (e.g., mRNA, genomic 
DNA) is detected, wherein the presence of NOVX protein or nucleic acid is diagnostic for a 
subject having or at risk of developing a disease or disorder associated with aberrant NOVX 
5 expression or activity. As used herein, a "test sample" refers to a biological sample obtained 
from a subject of interest. For example, a test sample can be a biological fluid (e.g., serum), 
cell sample, or tissue. 

Furthermore, the prognostic assays described herein can be used to determine whether 
a subject can be administered an agent (e.g., an agonist, antagonist, peptidomimetic, protein, 

10 peptide, nucleic acid, small molecule, or other drug candidate) to treat a disease or disorder 
associated with aberrant NOVX expression or activity. For example, such methods can be 
used to determine whether a subject can be effectively treated with an agent for a disorder. 
Thus, the invention provides methods for determining whether a subject can be effectively 
treated with an agent for a disorder associated with aberrant NOVX expression or activity in 

15 which a test sample is obtained and NOVX protein or nucleic acid is detected (e.g., wherein 
the presence of NOVX protein or nucleic acid is diagnostic for a subject that can be 
administered the agent to treat a disorder associated with aberrant NOVX expression or 
activity). 

The methods of the invention can also be used to detect genetic lesions in an NOVX 
20 gene, thereby determining if a subject with the lesioned gene is at risk for a disorder 

characterized by aberrant cell proliferation and/or differentiation. In various embodiments, the 
methods include detecting, in a sample of cells from the subject, the presence or absence of a 
genetic lesion characterized by at least one of an alteration affecting the integrity of a gene 
encoding an NOVX-protein, or the misexpression of the NOVX gene. For example, such 
25 genetic lesions can be detected by ascertaining the existence of at least one of: (/') a deletion of 
one or more nucleotides from an NOVX gene; (if) an addition of one or more nucleotides to an 
NOVX gene; (Hi) a substitution of one or more nucleotides of an NOVX gene, (iv) a 
chromosomal rearrangement of an NOVX gene; (v) an alteration in the level of a messenger 
RNA transcript of an NOVX gene, (v/) aberrant modification of an NOVX gene, such as of the 
30 methylation pattern of the genomic DNA, (v/7) the presence of a non-wild-type splicing pattern 
of a messenger RNA transcript of an NOVX gene, (viii) a non-wild-type level of an NOVX 
protein, (zjc) allelic loss of an NOVX gene, and (x) inappropriate post-translational 
modification of an NOVX protein. As described herein, there are a large number of assay 
techniques known in the art which can be used for detecting lesions in an NOVX gene. A 
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preferred biological sample is a peripheral blood leukocyte sample isolated by conventional 
means from a subject. However, any biological sample containing nucleated cells may be 
used, including, for example, buccal mucosal cells. 

In certain embodiments, detection of the lesion involves the use of a probe/primer in a 
5 polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and 4,683,202), such - 
as anchor PCR or RACE PCR, or, alternatively, in a ligation chain reaction (LCR) (see, e.g., 
Landegran, et al., 1988. Science 241: 1077-1080; and Nakazawa, et al, 1994. Proc. Natl 
Acad. Sci. USA 91 : 360-364), the latter of which can be particularly useful for detecting point 
mutations in the NOVX-gene (see, Abravaya, et al., 1995. Nucl. Acids Res. 23: 675-682). 

10 This method can include the steps of collecting a sample of cells from a patient, isolating 
nucleic acid (e.g., genomic, mRNA or both) from the cells of the sample, contacting the 
nucleic acid sample with one or more primers that specifically hybridize to an NOVX gene 
under conditions such that hybridization and amplification of the NOVX gene (if present) 
occurs, and detecting the presence or absence of an amplification product, or detecting the size 

15 of the amplification product and comparing the length to a control sample. It is anticipated 
that PCR and/or LCR may be desirable to use as a preliminary amplification step in 
conjunction with any of the techniques used for detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication (see, 
Guatelli, et al., 1990. Proc. Natl. Acad. Sci. USA 87: 1874-1878), transcriptional amplification 

20 system (see, Kwoh, et al, 1989. Proc. Natl. Acad. Sci. USA 86: 1 173-1 177); QP Replicase 
(see, Lizardi, et al, 1988. BioTechnology 6: 1 197), or any other nucleic acid amplification 
method, followed by the detection of the amplified molecules using techniques well known to 
those of skill in the art. These detection schemes are especially useful for the detection of 
nucleic acid molecules if such molecules are present in very low numbers. 

25 In an alternative embodiment, mutations in an NOVX gene from a sample cell can be 

identified by alterations in restriction enzyme cleavage patterns. For example, sample and 
control DNA is isolated, amplified (optionally), digested with one or more restriction 
endonucleases, and fragment length sizes are determined by gel electrophoresis and compared. 
Differences in fragment length sizes between sample and control DNA indicates mutations in 

30 the sample DNA. Moreover, the use of sequence specific ribozymes (see, e.g., U.S. Patent 

No. 5,493,53 1) can be used to score for the presence of specific mutations by development or 
loss of a ribozyme cleavage site. 

In other embodiments, genetic mutations in NOVX can be identified by hybridizing a 
sample and control nucleic acids, e.g., DNA or RNA, to high-density arrays containing 
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hundreds or thousands of oligonucleotides probes. See, e.g., Cronin, et aL, 1996. Human 
Mutation 7: 244-255; Kozal, et aL, 1996. Nat. Med 2: 753-759. For example, genetic 
mutations in NOVX can be identified in two dimensional arrays containing light-generated 
DNA probes as described in Cronin, et al., supra. Briefly, a first hybridization array of probes 
5 can be used to scan through long stretches of DNA in a sample and control to identify base 
changes between the sequences by making linear arrays of sequential overlapping probes. 
This step allows the identification of point mutations. This is followed by a second 
hybridization array that allows the characterization of specific mutations by using smaller, 
specialized probe arrays complementary to all variants or mutations detected. Each mutation 

10 array is composed of parallel probe sets, one complementary to the wild-type gene and the 
other complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in the art 
can be used to directly sequence the NOVX gene and detect mutations by comparing the 
sequence of the sample NOVX with the corresponding wild-type (control) sequence. 

15 Examples of sequencing reactions include those based on techniques developed by Maxim and 
Gilbert, 1977. Proc. Natl. Acad. Sci. USA 74: 560 or Sanger, 1977. Proc. Natl Acad. Sci. USA 
74: 5463. It is also contemplated that any of a variety of automated sequencing procedures 
can be utilized when performing the diagnostic assays {see, e.g., Naeve, et aL, 1995. 
Biotechniques 19: 448), including sequencing by mass spectrometry (see, e.g., PCT 

20 International Publication No. WO 94/1 6 1 0 1 ; Cohen, et aL, 1 996. Adv. Chromatography 36: 
127-162; and Griffin, et aL, 1993. Appl. Biochem. Biotechnol. 38: 147-159). 

Other methods for detecting mutations in the NOVX gene include methods in which 
protection from cleavage agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA heteroduplexes. See, e.g., Myers, et aL, 1985. Science 230: 1242. In general, the 

25 art technique of "mismatch cleavage" starts by providing heteroduplexes of formed by 

hybridizing (labeled) RNA or DNA containing the wild-type NOVX sequence with potentially 
mutant RNA or DNA obtained from a tissue sample. The double-stranded duplexes are 
treated with an agent that cleaves single-stranded regions of the duplex such as which will 
exist due to basepair mismatches between the control and sample strands. For instance, 

30 RNA/DNA duplexes can be treated with Cleavage signal-1 protein and DNA/DNA hybrids 
treated with Si nuclease to enzymatically digesting the mismatched regions. In other 
embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with hydroxylamine or 
osmium tetroxide and with piperidine in order to digest mismatched regions. After digestion 
of the mismatched regions, the resulting material is then separated by size on denaturing 
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polyacrylamide gels to determine the site of mutation. See, e.g., Cotton, et al., 1988. Proc. 
Natl Acad. Sci. USA 85: 4397; Saleeba, et al, 1992. Methods Enzymol 217: 286-295. In an 
embodiment, the control DNA or RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or more 
5 proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA 

mismatch repair" enzymes) in defined systems for detecting and mapping point mutations in 
NOVX cDNAs obtained from samples of cells. For example, the mutY enzyme of E. coli 
cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells cleaves T 
at G/T mismatches. See, e.g., Hsu, et al, 1994. Carcinogenesis 15: 1657-1662. According to 

10 an exemplary embodiment, a probe based on an NOVX sequence, e.g., a wild-type NOVX 
sequence, is hybridized to a cDNA or other DNA product from a test cell(s). The duplex is 
treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be 
detected from electrophoresis protocols or the like. See, e.g., U.S. Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic mobility will be used to identify 

15 mutations in NOVX genes. For example, single strand conformation polymorphism (SSCP) 
may be used to detect differences in electrophoretic mobility between mutant and wild type 
nucleic acids. See, e.g., Orita, et aL, 1989. Proc. Natl. Acad. Sci. USA: 86: 2766; Cotton, 
1993. Mutat. Res. 285: 125-144; Hayashi, 1992. Genet. Anal. Tech. Appl 9: 73-79. 
Single-stranded DNA fragments of sample and control NOVX nucleic acids will be denatured 

20 and allowed to renature. The secondary structure of single-stranded nucleic acids varies 

according to sequence, the resulting alteration in electrophoretic mobility enables the detection 
of even a single base change. The DNA fragments may be labeled or detected with labeled 
probes. The sensitivity of the assay may be enhanced by using RNA (rather than DNA), in 
which the secondary structure is more sensitive to a change in sequence. In one embodiment, 

25 the subject method utilizes heteroduplex analysis to separate double stranded heteroduplex 
molecules on the basis of changes in electrophoretic mobility. See, e.g., Keen, et al. 9 1991. 
Trends Genet. 7: 5. 

In yet another embodiment, the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing gradient 
30 gel electrophoresis (DGGE). See, e.g., Myers, et al., 1985. Nature 313: 495. When DGGE is 
used as the method of analysis, DNA will be modified to insure that it does not completely 
denature, for example by adding a GC clamp of approximately 40 bp of high-melting GC-rich 
DNA by PCR. In a further embodiment, a temperature gradient is used in place of a 
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denaturing gradient to identify differences in the mobility of control and sample DNA. See, 
e.g., Rosenbaum and Reissner, 1987. Biophys. Chem. 265: 12753. 

Examples of other techniques for detecting point mutations include, but are not limited 
to, selective oligonucleotide hybridization, selective amplification, or selective primer 
5 extension. For example, oligonucleotide primers may be prepared in which the known 

mutation is placed centrally and then hybridized to target DNA under conditions that permit 
hybridization only if a perfect match is found. See, e.g., Saiki, et aL, 1986. Nature 324: 163; 
Saiki, et aL 7 1989. *Proc. Natl Acad. Sci. USA 86: 6230. Such allele specific oligonucleotides 
are hybridized to PCR amplified target DNA or a number of different mutations when the 
10 oligonucleotides are attached to the hybridizing membrane and hybridized with labeled target 
DNA. 

Alternatively, allele specific amplification technology that depends on selective PCR 
amplification may be used in conjunction with the instant invention. Oligonucleotides used as 
primers for specific amplification may carry the mutation of interest in the center of the 

15 molecule (so that amplification depends on differential hybridization; see, e.g., Gibbs, et aL, 
1989. Nucl. Acids Res. 17: 2437-2448) or at the extreme 3'-terminus of one primer where, 
under appropriate conditions, mismatch can prevent, or reduce polymerase extension {see, e.g., 
Prossner, 1993. Tibtech. 1 1 : 238). In addition it may be desirable to introduce a novel 
restriction site in the region of the mutation to create cleavage-based detection. See, e.g., 

20 Gasparini, et al, 1 992. Mol. Cell Probes 6: 1 . It is anticipated that in certain embodiments 
amplification may also be performed using Taq ligase for amplification. See, e.g., Barany, 
1991. Proc. Natl. Acad. Set USA 88: 189. In such cases, ligation will occur only if there is a 
perfect match at the 3'-terminus of the 5' sequence, making it possible to detect the presence of 
a known mutation at a specific site by looking for the presence or absence of amplification. 

25 The methods described herein may be performed, for example, by utilizing 

pre-packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
described herein, which may be conveniently used, e.g., in clinical settings to diagnose 
patients exhibiting symptoms or family history of a disease or illness involving an NOVX 
gene. 

30 Furthermore, any cell type or tissue, preferably peripheral blood leukocytes, in which 

NOVX is expressed may be utilized in the prognostic assays described herein. However, any 
biological sample containing nucleated cells may be used, including, for example, buccal 
mucosal cells. 
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Pharmacogenomics 

Agents, or modulators that have a stimulatory or inhibitory effect on NOVX activity 
(e.g., NOVX gene expression), as identified by a screening assay described herein can be 
5 administered to individuals to treat (prophylactically or therapeutically) disorders (The 

disorders include metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer- 
associated cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's 
Disorder, immune disorders, and hematopoietic disorders, and the various dyslipidemias, 
metabolic disturbances associated with obesity, the metabolic syndrome X and wasting 

10 disorders associated with chronic diseases and various cancers.) In conjunction with such 
treatment, the pharmacogenomics (i.e., the study of the relationship between an individual's 
genotype and that individual's response to a foreign compound or drug) of the individual may 
be considered. Differences in metabolism of therapeutics can lead to severe toxicity or 
therapeutic failure by altering the relation between dose and blood concentration of the 

15 pharmacologically active drug. Thus, the pharmacogenomics of the individual permits the 

selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments based on a 
consideration of the individual's genotype. Such pharmacogenomics can further be used to 
determine appropriate dosages and therapeutic regimens. Accordingly, the activity of NOVX 
protein, expression of NOVX nucleic acid, or mutation content of NOVX genes in an 

20 individual can be determined to thereby select appropriate agent(s) for therapeutic or 
prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected persons. See 
e.g., Eichelbaum, 1996. Clin. Exp. Pharmacol. Physiol., 23: 983-985; Linder, 1997. Clin. 

25 Chem., 43: 254-266. In general, two types of pharmacogenetic conditions can be 

differentiated. Genetic conditions transmitted as a single factor altering the way drugs act on 
the body (altered drug action) or genetic conditions transmitted as single factors altering the 
way the body acts on drugs (altered drug metabolism). These pharmacogenetic conditions can 
occur either as rare defects or as polymorphisms. For example, glucose-6-phosphate 

30 dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the main 
clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a major 
determinant of both the intensity and duration of drug action. The discovery of genetic 
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polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 2) and 
cytochrome NEUROPEPTIDE Y/PEPTIDE YY RECEPTOR enzymes CYP2D6 and 
CYP2C19) has provided an explanation as to why some patients do not obtain the expected 
drug effects or show exaggerated drug response and serious toxicity after taking the standard 
5 J and safe dose of a drug. These polymorphisms are expressed in two phenotypes in the 

population, the extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of 
PM is different among different populations. For example, the gene coding for CYP2D6 is 
highly polymorphic and several mutations have been identified in PM, which all lead to the 
absence of functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite 

1 0 frequently experience exaggerated drug response and side effects when they receive standard 
doses. If a metabolite is the active therapeutic moiety, PM show no therapeutic response, as 
demonstrated for the analgesic effect of codeine mediated by its CYP2D6-formed metabolite 
morphine. At the other extreme are the so called ultra-rapid metabolizers who do not respond 
to standard doses. Recently, the molecular basis of ultra-rapid metabolism has been identified 

15 to be due to CYP2D6 gene amplification. 

Thus, the activity of NOVX protein, expression of NOVX nucleic acid, or mutation 
content of NOVX genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping of polymorphic alleles encoding 

20 drug-metabolizing enzymes to the identification of an individual's drug responsiveness 

phenotype. This knowledge, when applied to dosing or drug selection, can avoid adverse 
reactions or therapeutic failure and thus enhance therapeutic or prophylactic efficiency when 
treating a subject with an NOVX modulator, such as a modulator identified by one of the 
exemplary screening assays described herein. 

25 

Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression or 
activity of NOVX (e.g., the ability to modulate aberrant cell proliferation and/or 
differentiation) can be applied not only in basic drug screening, but also in clinical trials. For 
30 example, the effectiveness of an agent determined by a screening assay as described herein to 
increase NOVX gene expression, protein levels, or upregulate NOVX activity, can be 
monitored in clinical trails of subjects exhibiting decreased NOVX gene expression, protein 
levels, or downregulated NOVX activity. Alternatively, the effectiveness of an agent 
determined by a screening assay to decrease NOVX gene expression, protein levels, or 
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downregulate NOVX activity, can be monitored in clinical trails of subjects exhibiting 
increased NOVX gene expression, protein levels, or upregulated NOVX activity. In such 
clinical trials, the expression or activity of NOVX and, preferably, other genes that have been 
implicated in, for example, a cellular proliferation or immune disorder can be used as a "read 
5 out" or markers of the immune responsiveness of a particular cell. 

By way of example, and not of limitation, genes, including NOVX, that are modulated 
in cells by treatment with an agent (e.g., compound, drug or small molecule) that modulates 
NOVX activity (e.g., identified in a screening assay as described herein) can be identified. 
Thus, to study the effect of agents on cellular proliferation disorders, for example, in a clinical 

10 trial, cells can be isolated and RNA prepared and analyzed for the levels of expression of 

NOVX and other genes implicated in the disorder. The levels of gene expression (i.e., a gene 
expression pattern) can be quantified by Northern blot analysis or RT-PCR, as described 
herein, or alternatively by measuring the amount of protein produced, by one of the methods 
as described herein, or by measuring the levels of activity of NOVX or other genes. In this 

15 manner, the gene expression pattern can serve as a marker, indicative of the physiological 

response of the cells to the agent. Accordingly, this response state may be determined before, 
and at various points during, treatment of the individual with the agent. 

In one embodiment, the invention provides a method for monitoring the effectiveness 
of treatment of a subject with an agent (e.g., an agonist, antagonist, protein, peptide, 

20 peptidomimetic, nucleic acid, small molecule, or other drug candidate identified by the 

screening assays described herein) comprising the steps of (/) obtaining a pre-administration 
sample from a subject prior to administration of the agent; (//) detecting the level of expression 
of an NOVX protein, mRNA, or genomic DNA in the preadministration sample; (Hi) obtaining 
one or more post-administration samples from the subject; (/v) detecting the level of 

25 expression or activity of the NOVX protein, mRNA, or genomic DNA in the 

post-administration samples; (v) comparing the level of expression or activity of the NOVX 
protein, mRNA, or genomic DNA in the pre-administration sample with the NOVX protein, 
mRNA, or genomic DNA in the post administration sample or samples; and (vi) altering the 
administration of the agent to the subject accordingly. For example, increased administration 

30 of the agent may be desirable to increase the expression or activity of NOVX to higher levels 
than detected, i.e., to increase the effectiveness of the agent. Alternatively, decreased 
administration of the agent may be desirable to decrease expression or activity of NOVX to 
lower levels than detected, i.e., to decrease the effectiveness of the agent. 
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Methods of Treatment 

The invention provides for both prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant 
NOVX expression or activity. The disorders include cardiomyopathy, atherosclerosis, 
5 hypertension, congenital heart defects, aortic stenosis, atrial septal defect (ASD), 

atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis, subaortic stenosis, 
ventricular septal defect (VSD), valve diseases, tuberous sclerosis, scleroderma, obesity, 
transplantation, adreno leukodystrophy, congenital adrenal hyperplasia, prostate cancer, 
neoplasm; adenocarcinoma, lymphoma, uterus cancer, fertility, hemophilia, hypercoagulation, 
10 idiopathic thrombocytopenic purpura, immunodeficiencies, graft versus host disease, AIDS, 
bronchial asthma, Crohn's disease; multiple sclerosis, treatment of Albright Hereditary 
Ostoeodystrophy, and other diseases, disorders and conditions of the like. 
These methods of treatment will be discussed more fully, below. 

15 Disease and Disorders 

Diseases and disorders that are characterized by increased (relative to a subject not 
suffering from the disease or disorder) levels or biological activity may be treated with 
Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that antagonize 
activity may be administered in a therapeutic or prophylactic manner. Therapeutics that may 

20 be utilized include, but are not limited to: (/) an aforementioned peptide, or analogs, 

derivatives, fragments or homologs thereof; (/*/) antibodies to an aforementioned peptide; (///) 
nucleic acids encoding an aforementioned peptide; (zv) administration of antisense nucleic acid 
and nucleic acids that are "dysfunctional" (i.e., due to a heterologous insertion within the 
coding sequences of coding sequences to an aforementioned peptide) that are utilized to 

25 "knockout" endogenous function of an aforementioned peptide by homologous recombination 
(see, e.g., Capecchi, 1989. Science 244: 1288-1292); or (v) modulators ( i.e., inhibitors, 
agonists and antagonists, including additional peptide mimetic of the invention or antibodies 
specific to a peptide of the invention) that alter the interaction between an aforementioned 
peptide and its binding partner. 

30 Diseases and disorders that are characterized by decreased (relative to a subject not 

suffering from the disease or disorder) levels or biological activity may be treated with 
Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that upregulate activity 
may be administered in a therapeutic or prophylactic manner. Therapeutics that may be 
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utilized include, but are not limited to, an aforementioned peptide, or analogs, derivatives, 
fragments or homologs thereof; or an agonist that increases bioavailability. 

Increased or decreased levels can be readily detected by quantifying peptide and/or 
RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and assaying it in vitro for 
5 RNA or peptide levels, structure and/or activity of the expressed peptides (or mRNAs of an 
aforementioned peptide). Methods that are well-known within the art include, but are not 
limited to, immunoassays (e.g., by Western blot analysis, immunoprecipitation followed by 
sodium dodecyl sulfate (SDS) polyacrylamide gel electrophoresis, immunocytochemistry, etc.) 
and/or hybridization assays to detect expression of mRNAs (e.g., Northern assays, dot blots, in 
10 situ hybridization, and the like). 

Prophylactic Methods 

In one aspect, the invention provides a method for preventing, in a subject, a disease or 
condition associated with an aberrant NOVX expression or activity, by administering to the 

1 5 subject an agent that modulates NOVX expression or at least one NOVX activity. Subjects at 
risk for a disease that is caused or contributed to by aberrant NOVX expression or activity can 
be identified by, for example, any or a combination of diagnostic or prognostic assays as 
described herein. Administration of a prophylactic agent can occur prior to the manifestation 
of symptoms characteristic of the NOVX aberrancy, such that a disease or disorder is 

20 prevented or, alternatively, delayed in its progression. Depending upon the type of NOVX 

aberrancy, for example, an NOVX agonist or NOVX antagonist agent can be used for treating 
the subject. The appropriate agent can be determined based on screening assays described 
herein. The prophylactic methods of the invention are further discussed in the following 
subsections. 

25 

Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating NOVX expression 
or activity for therapeutic purposes. The modulatory method of the invention involves 
contacting a cell with an agent that modulates one or more of the activities of NOVX protein 
30 activity associated with the cell. An agent that modulates NOVX protein activity can be an 
agent as described herein, such as a nucleic acid or a protein, a naturally-occurring cognate 
ligand of an NOVX protein, a peptide, an NOVX peptidomimetic, or other small molecule. In 
one embodiment, the agent stimulates one or more NOVX protein activity. Examples of such 
stimulatory agents include active NOVX protein and a nucleic acid molecule encoding NOVX 
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that has been introduced into the cell. In another embodiment, the agent inhibits one or more 
NOVX protein activity. Examples of such inhibitory agents include antisense NOVX nucleic 
acid molecules and anti-NOVX antibodies. These modulatory methods can be performed in 
vitro {e.g., by culturing the cell with the agent) or, alternatively, in vivo {e.g., by administering 
5 the agent to a subject). As such, the invention provides methods of treating an individual 
afflicted with a disease or disorder characterized by aberrant expression or activity of an 
NOVX protein or nucleic acid molecule. In one embodiment, the method involves 
administering an agent {e.g., an agent identified by a screening assay described herein), or 
combination of agents that modulates {e.g., up-regulates or down-regulates) NOVX expression 
10 or activity. In another embodiment, the method involves administering an NOVX protein or 
nucleic acid molecule as therapy to compensate for reduced or aberrant NOVX expression or 
activity. 

Stimulation of NOVX activity is desirable in s/Ywations in which NOVX is abnormally 
downregulated and/or in which increased NOVX activity is likely to have a beneficial effect. 
15 One example of such a situation is where a subject has a disorder characterized by aberrant 

cell proliferation and/or differentiation {e.g., cancer or immune associated disorders). Another 
example of such a situation is where the subject has a gestational disease {e.g., preclampsia). 

Determination of the Biological Effect of the Therapeutic 

In various embodiments of the invention, suitable in vitro or in vivo assays are 
20 performed to determine the effect of a specific Therapeutic and whether its administration is 
indicated for treatment of the affected tissue. 

In various specific embodiments, in vitro assays may be performed with representative 
cells of the type(s) involved in the patient's disorder, to determine if a given Therapeutic exerts 
the desired effect upon the cell type(s). Compounds for use in therapy may be tested in 
25 suitable animal model systems including, but not limited to rats, mice, chicken, cows, 
monkeys, rabbits, and the like, prior to testing in human subjects. Similarly, for in vivo 
testing, any of the animal model system known in the art may be used prior to administration 
to human subjects. 

Prophylactic and Therapeutic Uses of the Compositions of the Invention 

30 The NOVX nucleic acids and proteins of the invention are useful in potential 

prophylactic and therapeutic applications implicated in a variety of disorders including, but not 
limited to: metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer- 
associated cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, 
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immune disorders, hematopoietic disorders, and the various dyslipidemias, metabolic 
disturbances associated with obesity, the metabolic syndrome X and wasting disorders 
associated with chronic diseases and various cancers. 

As an example, a cDNA encoding the NOVX protein of the invention may be useful in 
5 gene therapy, and the protein may be useful when administered to a subject in need thereof. 
By way of non-limiting example, the compositions of the invention will have efficacy for 
treatment of patients suffering from: metabolic disorders, diabetes, obesity, infectious disease, 
anorexia, cancer-associated cachexia, cancer, neurodegenerative disorders, Alzheimer's 
Disease, Parkinson's Disorder, immune disorders, hematopoietic disorders, and the various 
10 dyslipidemias. 

Both the novel nucleic acid encoding the NOVX protein, and the NOVX protein of the 
invention, or fragments thereof, may also be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. A further use could 
be as an anti-bacterial molecule (i.e., some peptides have been found to possess anti-bacterial 
15 properties). These materials are further useful in the generation of antibodies, which 

immunospecifically-bind to the novel substances of the invention for use in therapeutic or 
diagnostic methods. 

The invention will be further described in the following examples, which do not limit 
the scope of the invention described in the claims. 

20 

Examples 

Example 1. Identification of NOVX clones 

The novel NOVX target sequences identified in the present invention were subjected to 
the exon linking process to confirm the sequence. PCR primers were designed by starting at 

25 the most upstream sequence available, for*he forward primer, and at the most downstream 

>- 

sequence available for the reverse primer. Table 100A shows the sequences of the PCR 
primers used for obtaining different clones. In each case, the sequence was examined, walking 
inward from the respective termini toward the coding sequence, until a suitable sequence that 
is either unique or highly selective was encountered, or, in the case of the reverse primer, until 
30 the stop codon was reached. Such primers were designed based on in silico predictions for the 
full length cDNA, part (one or more exons) of the DNA or protein sequence of the target 
sequence, or by translated homology of the predicted exons to closely related human 
sequences from other species. These primers were then employed in PCR amplification based 
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on the following pool of human cDNAs: adrenal gland, bone marrow, brain - amygdala, brain 
- cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, brain -whole, 
fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, mammary 
gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, small 
5 intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, uterus. Usually the resulting 
amplicons were gel purified, cloned and sequenced to high redundancy. The PCR product 
derived from exon linking was cloned into the pCR2.1 vector from Invitrogen. The resulting 
bacterial clone has an insert covering the entire open reading frame cloned into the pCR2.1 
vector. Table 100B shows a list of these bacterial clones. The resulting sequences from all 
10 clones were assembled with themselves, with other fragments in CuraGen Corporation's 
database and with public ESTs. Fragments and ESTs were included as components for an 
assembly when the extent of their identity with another component of the assembly was at 
least 95% over 50 bp. In addition, sequence traces were evaluated manually and edited for 
corrections if appropriate. These procedures provide the sequence reported herein. 

15 Table 100A. PCR Primers for Exon Linking 



NOVX 
Clone 


Primer 1 (5' - 3' ) 


SEQ 

ID 

NO 


Primer 2 (5' - 3' ) 


SEQ 

ID ' 
NO 


NOVld 


TGACTATGGCCTGGAGTTTCCGT 


890 


TT ATT C C AG AG AT C C TTGGC AGAAGC 


891 ; 


NOV3C 


ACCCTGGCCCAGCTCTGAGT 


892 


G AGCT C AGACGT ACT CTCGGG C AG 


893 


NOV4C 


ACCTGTCGCAATGGCTTTAATCTTTAG 


894 


ATCTCTGGCCTTGTCGGAGTCTTAC 


895 


NOV5a 


GTTTCCCCACCCCGCAGA 


896 


GAGAGGATCAAAGAACCAGACAGGAG 


897 


NOV5b 


ACGGCGATGACCCCCCAG 


898 


GGGGGAGGGGCTCAAACAAGA 


899 


NOV5C 


ACGGCGATGACCCCCCAG 


900 


GGGGG AGGGG CT C AAAC AAGA 


901 


NOV5d 


ACGGCGATGACCCCCCAG 


902 


GGGGGAGGGGCTCAAACAAGA 


903 


NOV5e 


ACGGCGATGACCCCCCAG 


904 


GGGGGAGGGGCTCAAACAAGA 


905 


NOV6 


GTCGAACCGGGGGACCTG 


906 


TGGGGAAGGTGCTCAGCCC 


907 


NOV7a 


CAGCCAAACCCACCTCCACCAT 


908 


TTTGGCTGGCTTATATAGCGAGCTCCT 


909 


NOV7b 


CATTGCCAATTCTAAATCCATCATTTG 


910 


TC TT C T AAAG CAC AAAT AAC AC C T C C A 


911 


NOV7C 


ATGGCCGAGGCCGCGGAG 


912 


AGCTTCATTTCATTCTTTTTGCAACATCTG 


913 


NOV8 


GGGGACCATGGGGAGCGATC 


914 


CGAGAGGTTTTCTAGGCAGATTTGGAGC 


915 


NOVIBb 


ATGGCAGCAGAAAACCATTCTTTT 


916 


TCTGTTCATAAGAATGTTCTTTTCCCTAGC 


917 


NO VI 8b 


AGATGGATGG AAC C AATGGC AG 


918 


AGTGGCCCTGGATGGAAGTGA 


919 


NOV22C 


CACTCCAACAGTTAATGCTTCCCTTG 


920 


GATGGTCTCGATCTCCTGACCTCTT 


921 


NOV24b 


ATGGCCGAGGCCGCGGAG 


922 


AGCTTCATTTCATTCTTTTTGCAACATCTG 


923 


NOV26b 


GTTCCTGCTGTCTGGACTTTTTCTGT 


924 


TTTTTGGAGAAAGCTGCAAAAGTTTAT 


925 


NOV27b 


CT C AGTC C CT CGGG CT C AT AC CT A 


926 


GGACTCAGAGCTCCTGCCTTTCTGT 


927 


NOV2 7C 


CTCAGTCCCTCGGGCTCATACCTA 


928 


CTCAGAGCTCCTGCCTTTCTGTCC 


929 


NOV2 9b 


GT AGC C AC AAG AC CGGGT C CG 


930 


C C C TGGC CTC TTGGAACTGCT TG AT 


931 


NOV32 


CTTGGAGGCTGCAGGTCCTGGAC 


932 


AGGAGCATCCTTCATCCCACTAGAGGT 


933 


NOV33 


GCTGGATTGGGTGATCTCTCAGAGC 


934 


CTCTTACTCCTGGCAAGCCCTGC 


935 


NOV3 6b 


ATGCAAGAAACAATTTTTTTTTTTGAGA 
CG 


936 


TTTT AC C CAT T C ACC AGT TT AAGG AC G 


937 


NOV3 8 


CAATGACCTCCTCATTGCTTCTGG 


938 


CAAAGCCCCAGGTCCTCTTGCTAG 


939 


NOV3 9a 


CCATCCGAGGCTCCTGAACC 


940 


C AAAGCC CC AGGT CC T CT TGC 


941 


NOV3 9b 


GTTCT TG AAC C AGGGC C ATT C AC 


942 


CCTGGGAAACTTCATCTTGGTCTCTT 


943 


NOV42b 


ATGGACGGTGAGGCAGTCC 


944 


CCACACCCTGGCCCATG 


945 


NOV42C 


ATGGACGGTGAGGCAGTCC 


946 


CCACACCCTGGCCCATG 


947 


NOV42d 


CTGGAGGATGAAGGAAGCAGAGATG 


948 


CCAGAGAACAAGCAGAGCTCAGAGG 


949 


NOV4 6b 


GTTTCTGAGCATGGATCCAACCA 


950 


AAAGGGCAGAGGCTCTTCCTCAC 


951 


NOV4 8b 


GCTACCCTCTCTGCTGGCTACCTAAC 


952 


TTGATTTTCACCACCTCCATTTGTTCT 


953 


NOV50b 


AAGATGTCGCAGCCCAAGAAAAG 


954 


TGCTTTGGGAGGTAGCTGGGA 


955 
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NOV51 


CTGGCCTAATG7AATGTCTCTGAGC 

l 


956 


TGATCATGGAGGAATAATCTAATATGCCTTA 
G 


957 


NOV5 2 


TCCCTGATGTCCAGCTCTGGCT 


958 


ATAGACTAACTGCACCCACAGGCCTCT 


959 


NOV56b 


ATGGCGAAGATTGAGAAAAACGCT 


960 


CT CT C AG AT C T C C AGG CAGAAGT TC AG 


961 


NOV5 8b 


AGTCTTGCCTTCTTTTGAGCCTAAGTC 


962 


CACATTCAACATATCTGAGGCTGTGG 


963 


NOV6 0a 


CG AAT TGGCT TC CG AGTGAAAAT C 


964 


TTATTTAAAGGTCAAGGCCTCGAAGTG 


965 


NOV61 


CAGCTGTGCCCTCATCCTTGTGCCTGCT 
ACGTCC 


966 


CTCGAGCTTGGTCACTGTGATTCCCACCGTG 
ATATGGTCTGCC 


967 


NOV63 


GG AT C C AG CT AC C CG ATC TGGTGGCTG A 
CGGGCAGC 


968 


GTCGACGCCCTTGCAGGTGTAGACCTCCTCA 
CG 


969 


NOV6 8 


GGCACGCTCCCTCTGGCT 


9 7 0 


TTCACTGTGGGGGCCTGG 


971 


INUVb yr) 




9 72 


GTG 


973 


NOV7 0a 


TATGCTGTCTATGCTGAATCTTCTAATC 
TTGTCT 


974 


TTTAAGAATGTTGAATATTGGCC CCCAC 


975 


NOW 6b 


ACCATTACATCATCGTGGCAAATTAAA 


976 


GAAGTCACAAGTGTCTTTCTTCTCAGGA 


977 


NOV 8 lb 


GTCATGCGCTGCCCCAAGT 


978 


CCAATGAGAGTCAGCACTGGAGC 


979 


NOV8 7b 


TCTCTCATGGCCCCCAAAGAC 


980 


AGT C AGTG CGG CGGG AAG A 


981 


NOV96a 


GGTGCTCTCAGCGTTCTTCCAGTC 


982 


CTAGTGCTTCTGTTACAAGGTCCTGGG 


983 


NOV96d 


AGCTGGATTGACAACTTTGTAATGGAG 


984 


CTCAGTCGTGCTGCTAGTAGGGGT 


985 


NOV9 6e 


TGAAGCTCACCAGGAGGAAGAAG 


986 


CTCAGTCGTGCTGCTAGTAGGGGT 


987 


NOV9 6f 


TCCCATGACCTGCCACTTCC 


988 


CGCTACCTGCAGCCGCA 


989 


NOV9 8 


CCGGCCCCGTGTGTGGCA 


990 


GGGGCTCTGGTGCCAGCTCATG 


991 


NOV9 9 


CAGCCAAACCCACCTCCACCAT 


992 


TTTGGCTGGCTTATATAGCGAGCTCCT 


993 



Physical clone: Exons were predicted by homology and the intron/exon boundaries 
were determined using standard genetic rules. Exons were further selected and refined by 
means of similarity determination using multiple BLAST (for example, tBlastN, BlastX, and 
5 BlastN) searches, and, in some instances, GeneScan and Grail. Expressed sequences from both 
public and proprietary databases were also added when available to further define and 
complete the gene sequence. The DNA sequence was then manually corrected for apparent 
inconsistencies thereby obtaining the sequences encoding the full-length protein. 



Table 100B. Physical Clones for PCR products 



NOVX Clone 


Bacterial Clone 


NO vi a 


Genomic clone: AC080137 


NOVlb 


Genomic clone: GMAC036188 


NOVld 


Physical clone: GMAC080137 A. 698589. M14 


NOV2 


Physical clone: AC023194 


NOV3a 


Genomic clone: AL136383 


NOV3C 


Physical clone: GMAL136383 A. 698589. C6 


NOV4a 


Genomic clone: AC04 6164 


NOV4C 


Physical clone 


GMAC046164 A. 698589. A2 


NOV5C 


Physical clone 


155643499 


NOV5d 


Physical clone 


155643499 


NOV5e 


Physical clone 


127939: : 153729589 . 698590 . P9 


NOV7a 


Physical clone 


153634912 


NOV7b 


Physical clone 


127 994 : : 13 88 963 06 . 698587 . E16 


NOV7c 


Physical clone 


127557: : CG564550 - 0 1 . 698587 . F19 


NOV9a 


Physical clone 
AI307658 


152568459, AC005961.1, AC068256.2, AI308124, 


NOV10 


Physical clone 


12 6218 : :CG55 964-01 . 6 98 509 .G10 


NOV13b 


Physical clone 


126388: : CG56 02 1 - 0 1 . 6 9853 9 . G15 


NOV15b 


Physical clone 


126 694 : : CG56 065 - 01 . 6 98561 . C17 


NOV16a 


Physical clone 


126696: : CG56 067 - 01 . 6 98561 . All 


NOV16b 


Physical clone 


128769 : :GMAC084434 A. 698655 . G14 


NOV17b 


Physical clone 


128198 : :GMba430il5 A. 698589. F10 


NOV17d 


Physical clone 


105889: :sggc draft ba430il5 20000823 . 698368 . G10 


NOV18b 


Physical clone 


128205 : :GMba430il5 D. 698590. E4 
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Physical clone: 128207 : :GMba430il5 E. 698590. C4 


NOV? 1 A 


Physical clone: 153485226, 145329583 




Physical Clone: 50119 :: 86375666 . 244185 . F7 


1M v d 


Genomic clone: AC011522.6 




Physical clone 


16 1710 74 7. 698893. K3 


v z j a 


Physical clone 


140117553 146712128 146712112 148412737 


MOV 9 7 r- 


Physical clone 


: 140117553 


HWUZ1U 


Physical clone 


127557: :CG564550-01. 698591. Nil 


NOV2 5 


Physical clone 


151537975 and 128978463 


lMUVidba 


Physical clone 


: 127998335 


JNUVZoD 


Physical clone 


127561: :CG56461-01. 698589. G7 


i\IOV<d / a 


Physical clone 


111787393, HSAJ9617 


INO V .£ / Xj 


Physical clone 


: 111787393 EXT .698587. Kl 8 


INO v Z 1 C 


Physical clone 


167695055 170842341 170842333 


NO V ^ 8 


Physical clone 


AC004832 


ino v.^ y a 


Physical clone 


AC004832 


wuvz yr> 


Physical clone 


112824: :COR100399281 . 698230 . B12 


TyT/^T T<-) Q _ 

NOV2 y c 


Physical clone 


AC004832 


INO V 3 U 


Genomic clone : AC004 83 2 


NOV 3 1 


Genomic clone: 94329210 


NOV3 3 


Physical Clone: 153778095, 138978176, 146713055, 105100551, 
153777948 


NOV3 4 


Physical clone 


125858 : :GMAC026083 E. 698508. J15 


NOV3 5 


Physical clone 


114740: :AC011711 . 698329 . 110 


NOV3 6a 


Physical clone 


152568436, AL132780 


NOV3 6D 


Physical clone 


152568436 134511756 


NOV 3 7 


Physical clone 


: 151557368, 138195002, 152762569, 152768078 


WUVJ o 


Physical clone 


107207 : 


AC0617 0 7 . 6 98 315 . F14 


novj 9a 


Physical clone 


127119: 


AC0617 07 . 698564. H2 0 


NOV3 9JD 


Physical clone 


: 128110: 


ADENOSINE A3 RECEPTOR . 6 9 86 57 . H22 . 


NOV4 0 


Physical clone 


AC068471, AC068471, AV655524, T67857 


NO V4 1 a 


Physical clone 


AC007278, AW242630.1 xn01f05.Xl 


MAT T n "i -i 


Physical clone 


AC007395, 153103275, 153103263, 153103260 




Physical clone 
152189065 


153103275 153103263 153103260 153103539 153103266 


1NU V t i C 


Physical clone: 54701683_EXT . 698433 . J23 


V4ZU 


Physical clone: AC0073 95_A. 698587 . M17 




Genomic clone: AC021773 


Mn\74 A 

Vi\J V *± f± 


Physical clone: AC023654, 78743598 


NOV4 5 


Genomic clone: AC023 078 


JNU V4 b a 


Genomic clone: AC023 07 8 




Physical clone: 128292 : :AC023078 A. 698657. G13 




Genomic clone: AC02 3 6 54 




Physical clone: 151667972 




Genomic clone: AF152363 


KfCWTA ft a 


Genomic clone: AC012510.5 


TsTf^\7/l Qh 

i>iKJ v ft oxj 


Physical clone: 128669 : :AC012510 5 final . 6 98656 . J20 


NOV4 9 


Genomic clone: AC011492 


NOV5 0a 


Physical clone 


153 778754, 122656699 


NOV5 0b 


Physical clone 


AK001421 A. 698657. J10 


NOV51 


Physical clone 


126131: :CG55922-01. 698509. 09 


NOV52 


Physical clone 
152186815, 152: 


153623113, 152186811, 148441423, 148441418, 
209564, 126066491, 129293170, 126630256, 124459512 


NOV5 3 


Physical clone: 151222559 


NOV54a 


Physical clone: 153512063 


NOV55 


Genomic clone: AL13 8816 . 12 , AL158192.12 


NOV56a 


Genomic clone: AC019100.4 


NOV57 


Physical clone: 50222151, 150222148 


NOV54b 


Physical clone: 164698940 


NOV56b 


Physical clone: 5504 8 : : nh0443k08 A. 698002 . M9 



Example 2. Quantitative expression analysis of clones in various cells and tissues 
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The quantitative expression of various clones was assessed using microtiter plates 
containing RNA samples from a variety of normal and pathology-derived cells, cell lines and 
tissues using real time quantitative PCR (RTQ PCR). RTQ PCR was performed on an Applied 
Biosystems ABI PRISM® 7700 or an ABI PRISM® 7900 HT Sequence Detection System. 
5 Various collections of samples are assembled on the plates, and referred to as Panel 1 

(containing normal tissues and cancer cell lines), Panel 2 (containing samples derived from 
tissues from normal and cancer sources), Panel 3 (containing cancer cell lines), Panel 4 
(containing cells and cell lines from normal tissues and cells related to inflammatory 
conditions), Panel 5D/5I (containing human tissues and cell lines with an emphasis on 

10 metabolic diseases), AI_comprehensive_panel (containing normal tissue and samples from 
autoimmune diseases), Panel CNSD.01 (containing central nervous system samples from 
normal and diseased brains) and CNS_neurodegeneration_panel (containing samples from 
normal and Alzheimer's diseased brains). 

RNA integrity from all samples is controlled for quality by visual assessment of 

15 agarose gel electropherograms using 28S and 18S ribosomal RNA staining intensity ratio as a 
guide (2:1 to 2.5:1 28s: 18s) and the absence of low molecular weight RNAs that would be 
indicative of degradation products. Samples are controlled against genomic DNA 
contamination by RTQ PCR reactions run in the absence of reverse transcriptase using probe 
and primer sets designed to amplify across the span of a single exon. 

20 First, the RNA samples were normalized to reference nucleic acids such as 

constitutively expressed genes (for example, P-actin and GAPDH). Normalized RNA (5 ul) 
was converted to cDNA and analyzed by RTQ-PCR using One Step RT-PCR Master Mix 
Reagents (Applied Biosystems; Catalog No. 4309169) and gene-specific primers according to 
the manufacturer's instructions. 

25 In other cases, non-normalized RNA samples were converted to single strand cDNA 

(sscDNA) using Superscript II (Invitrogen Corporation; Catalog No. 18064-147) and random 
hexamers according to the manufacturer's instructions. Reactions containing up to 10 \ig of 
total RNA were performed in a volume of 20 |il and incubated for 60 minutes at 42°C. This 
reaction can be scaled up to 50 \ig of total RNA in a final volume of 100 uT sscDNA samples 

30 are then normalized to reference nucleic acids as described previously, using IX TaqMan® 
Universal Master mix (Applied Biosystems; catalog No. 4324020), following the 
manufacturer's instructions. 

Probes and primers were designed for each assay according to Applied Biosystems 
Primer Express Software package (version I for Apple Computer's Macintosh Power PC) or a 
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similar algorithm using the target sequence as input. Default settings were used for reaction 
conditions and the following parameters were set before selecting primers: primer 
concentration = 250 nM, primer melting temperature (Tm) range = 58°-60°C, primer optimal 
Tm = 59°C, maximum primer difference = 2°C, probe does not have 5*G, probe Tm must be 
5 10°C greater than primer Tm, amplicon size 75bp to lOObp. The probes and primers selected 
(see below) were synthesized by Synthegen (Houston, TX, USA). Probes were double purified 
by HPLC to remove uncoupled dye and evaluated by mass spectroscopy to verify coupling of 
reporter and quencher dyes to the 5' and 3' ends of the probe, respectively. Their final 
concentrations were: forward and reverse primers, 900nM each, and probe, 200nM. 

10 PCR conditions: When working with RNA samples, normalized RNA from each tissue 

and each cell line was spotted in each well of either a 96 well or a 384-well PCR plate 
(Applied Biosystems). PCR cocktails included either a single gene specific probe and primers 
set, or two multiplexed probe and primers sets (a set specific for the target clone and another 
gene-specific set multiplexed with the target probe). PCR reactions were set up using 

15 TaqMan® One-Step RT-PCR Master Mix (Applied Biosystems, Catalog No. 43 1 3803) 

following manufacturer's instructions. Reverse transcription was performed at 48°C for 30 
minutes followed by amplification/PCR cycles as follows: 95°C 10 min, then 40 cycles of 
95°C for 15 seconds, 60°C for 1 minute. Results were recorded as CT values (cycle at which a 
given sample crosses a threshold level of fluorescence) using a log scale, with the difference in 

20 RNA concentration between a given sample and the sample with the lowest CT value being 
represented as 2 to the power of delta CT. The percent relative expression is then obtained by 
taking the reciprocal of this RNA difference and multiplying by 100. 

When working with sscDNA samples, normalized sscDNA was used as described 
previously for RNA samples. PCR reactions containing one or two sets of probe and primers 

25 were set up as described previously, using IX TaqMan® Universal Master mix (Applied 
Biosystems; catalog No. 4324020), following the manufacturer's instructions. PCR 
amplification was performed as follows: 95°C 10 min, then 40 cycles of 95°C for 15 seconds, 
60°C for 1 minute. Results were analyzed and processed as described previously. 
Panels 1, 1.1, 1.2, and 1.3D 

30 The plates for Panels 1, 1 .1, 1 .2 and 1 .3D include 2 control wells (genomic DNA 

control and chemistry control) and 94 wells containing cDNA from various samples. The 
samples in these panels are broken into 2 classes: samples derived from cultured cell lines and 
samples derived from primary normal tissues. The cell lines are derived from cancers of the 
following types: lung cancer, breast cancer, melanoma, colon cancer, prostate cancer, CNS 
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cancer, squamous cell carcinoma, ovarian cancer, liver cancer, renal cancer, gastric cancer and 
pancreatic cancer. Cell lines used in these panels are widely available through the American 
Type Culture Collection (ATCC), a repository for cultured cell lines, and were cultured using 
the conditions recommended by the ATCC. The normal tissues found on these panels are 
5 comprised of samples derived from all major organ systems from single adult individuals or 
fetuses. These samples are derived from the following organs: adult skeletal muscle, fetal 
skeletal muscle, adult heart, fetal heart, adult kidney, fetal kidney, adult liver, fetal liver, adult 
lung, fetal lung, various regions of the brain, the spleen, bone marrow, lymph node, pancreas, 
salivary gland, pituitary gland, adrenal gland, spinal cord, thymus, stomach, small intestine, 
10 colon, bladder, trachea, breast, ovary, uterus, placenta, prostate, testis and adipose. 

In the results for Panels 1, 1.1, 1.2 and 1.3D, the following abbreviations are used: 

ca. = carcinoma, 
* = established from metastasis, 
met = metastasis, 
1 5 s cell var = small cell variant, 

non-s = non-sm = non-small, 
squam = squamous, 
pi. eff = pi effusion = pleural effusion, 
glio = glioma, 
20 astro = astrocytoma, and 

neuro = neuroblastoma. 
General_screening_panel_vl.4 

The plates for Panel 1 .4 include 2 control wells (genomic DNA control and chemistry 
control) and 94 wells containing cDNA from various samples. The samples in Panel 1 .4 are 

25 broken into 2 classes: samples derived from cultured cell lines and samples derived from 

primary normal tissues. The cell lines are derived from cancers of the following types: lung 
cancer, breast cancer, melanoma, colon cancer, prostate cancer, CNS cancer, squamous cell 
carcinoma, ovarian cancer, liver cancer, renal cancer, gastric cancer and pancreatic cancer. 
Cell lines used in Panel 1.4 are widely available through the American Type Culture 

30 Collection (ATCC), a repository for cultured cell lines, and were cultured using the conditions 
recommended by the ATCC. The normal tissues found on Panel 1.4 are comprised of pools of 
samples derived from all major organ systems from 2 to 5 different adult individuals or 
fetuses. These samples are derived from the following organs: adult skeletal muscle, fetal 
skeletal muscle, adult heart, fetal heart, adult kidney, fetal kidney, adult liver, fetal liver, adult 



868 



lung, fetal lung, various regions of the brain, the spleen, bone marrow, lymph node, pancreas, 
salivary gland, pituitary gland, adrenal gland, spinal cord, thymus, stomach, small intestine, 
colon, bladder, trachea, breast, ovary, uterus, placenta, prostate, testis and adipose. 
Abbreviations are as described for Panels 1, 1.1, 1.2, and 1.3D. 
5 Panels 2D and 2.2 

The plates for Panels 2D and 2.2 generally include 2 control wells and 94 test samples 
composed of RNA or cDNA isolated from human tissue procured by surgeons working in 
close cooperation with the National Cancer Institute's Cooperative Human Tissue Network 
(CHTN) or the National Disease Research Initiative (NDRI). The tissues are derived from 

10 human malignancies and in cases where indicated many malignant tissues have "matched 
margins" obtained from noncancerous tissue just adjacent to the tumor. These are termed 
normal adjacent tissues and are denoted "NAT" in the results below. The tumor tissue and the 
"matched margins" are evaluated by two independent pathologists (the surgical pathologists 
and again by a pathologist at NDRI or CHTN). This analysis provides a gross 

1 5 histopathological assessment of tumor differentiation grade. Moreover, most samples include 
the original surgical pathology report that provides information regarding the clinical stage of 
the patient. These matched margins are taken from the tissue surrounding (i.e. immediately 
proximal) to the zone of surgery (designated "NAT", for normal adjacent tissue, in Table RR). 
In addition, RNA and cDNA samples were obtained from various human tissues derived from 

20 autopsies performed on elderly people or sudden death victims (accidents, etc.). These tissues 
were ascertained to be free of disease and were purchased from various commercial sources 
such as Clontech (Palo Alto, CA), Research Genetics, and Invitrogen. 
Panel 3D 

The plates of Panel 3D are comprised of 94 cDNA samples and two control samples. 

25 Specifically, 92 of these samples are derived from cultured human cancer cell lines, 2 samples 
of human primary cerebellar tissue and 2 controls. The human cell lines are generally obtained 
from ATCC (American Type Culture Collection), NCI or the German tumor cell bank and fall 
into the following tissue groups: Squamous cell carcinoma of the tongue, breast cancer, 
prostate cancer, melanoma, epidermoid carcinoma, sarcomas, bladder carcinomas, pancreatic 

30 cancers, kidney cancers, leukemias/lymphomas, ovarian/uterine/cervical, gastric, colon, lung 
and CNS cancer cell lines. In addition, there are two independent samples of cerebellum. 
These cells are all cultured under standard recommended conditions and RNA extracted using 
the standard procedures. The cell lines in panel 3D and 1 .3D are of the most common cell lines 
used in the scientific literature. 
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Panels 4D, 4R, and 4.1D 

Panel 4 includes samples on a 96 well plate (2 control wells, 94 test samples) 
composed of RNA (Panel 4R) or cDNA (Panels 4D/4.1D) isolated from various human cell 
lines or tissues related to inflammatory conditions. Total RNA from control normal tissues 
5 such as colon and lung (Stratagene, La Jolla, CA) and thymus and kidney (Clontech) was 

employed. Total RNA from liver tissue from cirrhosis patients and kidney from lupus patients 
was obtained from BioChain (Biochain Institute, Inc., Hayward, CA). Intestinal tissue for 
RNA preparation from patients diagnosed as having Crohn's disease and ulcerative colitis was 
obtained from the National Disease Research Interchange (NDRI) (Philadelphia, PA). 

10 Astrocytes, lung fibroblasts, dermal fibroblasts, coronary artery smooth muscle cells, 

small airway epithelium, bronchial epithelium, microvascular dermal endothelial cells, 
microvascular lung endothelial cells, human pulmonary aortic endothelial cells, human 
umbilical vein endothelial cells were all purchased from Clonetics (Walkersville, MD) and 
grown in the media supplied for these cell types by Clonetics. These primary cell types were 

1 5 activated with various cytokines or combinations of cytokines for 6 and/or 12-14 hours, as 
indicated. The following cytokines were used; IL-1 beta at approximately l-5ng/ml, TNF 
alpha at approximately 5-10ng/ml, IFN gamma at approximately 20-50ng/ml, IL-4 at 
approximately 5-10ng/ml, IL-9 at approximately 5-10ng/ml, IL-13 at approximately 5- 
lOng/ml. Endothelial cells were sometimes starved for various times by culture in the basal 

20 media from Clonetics with 0.1% serum. 

Mononuclear cells were prepared from blood of employees at CuraGen Corporation, 
using Ficoll. LAK cells were prepared from these cells by culture in DMEM 5% FCS 
(Hyclone), lOO^iM non essential amino acids (Gibco/Life Technologies, Rockville, MD), 
ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x1 0~ 5 M (Gibco), and lOmM Hepes 

25 (Gibco) and Interleukin 2 for 4-6 days. Cells were then either activated with 10-20ng/ml PMA 
and l-2|ig/ml ionomycin, IL-12 at 5-10ng/ml, IFN gamma at 20-50ng/mI and IL-1 8 at 5- 
lOng/ml for 6 hours. In some cases, mononuclear cells were cultured for 4-5 days in DMEM 
5% FCS (Hyclone), 100|iM non essential amino acids (Gibco), ImM sodium pyruvate 
(Gibco), mercaptoethanol 5.5xlO" 5 M (Gibco), and lOmM Hepes (Gibco) with PHA 

30 (phytohemagglutinin) or PWM (pokeweed mitogen) at approximately 5u.g/ml. Samples were 
taken at 24, 48 and 72 hours for RNA preparation. MLR (mixed lymphocyte reaction) samples 
were obtained by taking blood from two donors, isolating the mononuclear cells using Ficoll 
and mixing the isolated mononuclear cells 1:1 at a final concentration of approximately 
2xl0 6 cells/ml in DMEM 5% FCS (Hyclone), lOOuM non essential amino acids (Gibco), ImM 



870 




sodium pyruvate (Gibco), mercaptoethanol (5.5x1 0" 5 M) (Gibco), and 10mM Hepes (Gibco). 
The MLR was cultured and samples taken at various time points ranging from 1- 7 days for 
RNA preparation. 

Monocytes were isolated from mononuclear cells using CD 14 Miltenyi Beads, +ve VS 
5 selection columns and a Vario Magnet according to the manufacturer's instructions. 

Monocytes were differentiated into dendritic cells by culture in DMEM 5% fetal calf serum 
(FCS) (Hyclone, Logan, UT), lOOuM non essential amino acids (Gibco), ImM sodium 
pyruvate (Gibco), mercaptoethanol 5.5xlO" 5 M (Gibco), and lOmM Hepes (Gibco), 50ng/ml 
GMCSF and 5ng/ml IL-4 for 5-7 days. Macrophages were prepared by culture of monocytes 

10 for 5-7 days in DMEM 5% FCS (Hyclone), lOOuM non essential amino acids (Gibco), ImM 
sodium pyruvate (Gibco), mercaptoethanol 5.5x1 0" 5 M (Gibco), lOmM Hepes (Gibco) and 
10% AB Human Serum or MCSF at approximately 50ng/ml. Monocytes, macrophages and 
dendritic cells were stimulated for 6 and 12-14 hours with lipopolysaccharide (LPS) at 
lOOng/ml. Dendritic cells were also stimulated with anti-CD40 monoclonal antibody 

15 (Pharmingen) at 10fig/ml for 6 and 12-14 hours. 

CD4 lymphocytes, CD8 lymphocytes and NK cells were also isolated from 
mononuclear cells using CD4, CD8 and CD56 Miltenyi beads, positive VS selection columns 
and a Vario Magnet according to the manufacturer's instructions. CD45RA and CD45RO CD4 
lymphocytes were isolated by depleting mononuclear cells of CD8, CD56, CD 14 and CD 19 

20 cells using CD8, CD56, CD14 and CD19 Miltenyi beads and positive selection. CD45RO 

beads were then used to isolate the CD45RO CD4 lymphocytes with the remaining cells being 
CD45RA CD4 lymphocytes. CD45RA CD4, CD45RO CD4 and CD8 lymphocytes were 
placed in DMEM 5% FCS (Hyclone), lOOuM non essential amino acids (Gibco), ImM 
sodium pyruvate (Gibco), mercaptoethanol 5.5x1 0~ 5 M (Gibco), and lOmM Hepes (Gibco) and 

25 plated at 10 6 cells/ml onto Falcon 6 well tissue culture plates that had been coated overnight 

with 0.5fig/ml anti-CD28 (Pharmingen) and 3ug/ml anti-CD3 (OKT3, ATCC) in PBS. After 6 
and 24 hours, the cells were harvested for RNA preparation. To prepare chronically activated 
CD8 lymphocytes, we activated the isolated CD8 lymphocytes for 4 days on anti-CD28 and 
anti-CD3 coated plates and then harvested the cells and expanded them in DMEM 5% FCS 

30 (Hyclone), 1 00 uM non essential amino acids (Gibco), ImM sodium pyruvate (Gibco), 

mercaptoethanol 5.5x1 0" 5 M (Gibco), and lOmM Hepes (Gibco) and IL-2. The expanded CD8 
cells were then activated again with plate bound anti-CD3 and anti-CD28 for 4 days and 
expanded as before. RNA was isolated 6 and 24 hours after the second activation and after 4 
days of the second expansion culture. The isolated NK cells were cultured in DMEM 5% FCS 
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(Hyclone), lOO^iM non essential amino acids (Gibco), ImM sodium pyruvate (Gibco), 
mercaptoethanol 5.5xl0~ 5 M (Gibco), and lOmM Hepes (Gibco) and IL-2 for 4-6 days before 
RNA was prepared. 

To obtain B cells, tonsils were procured from NDRI. The tonsil was cut up with sterile 
5 dissecting scissors and then passed through a sieve. Tonsil cells were then spun down and 
resupended at 10 6 cells/ml in DMEM 5% FCS (Hyclone), lOOuM non essential amino acids 
(Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xlO" 5 M (Gibco), and lOmM 
Hepes (Gibco). To activate the cells, we used PWM at 5|ig/ml or anti-CD40 (Pharmingen) at 
approximately 10|ig/ml and IL-4 at 5-10ng/ml. Cells were harvested for RNA preparation at 

1 0 24,48 and 72 hours. 

To prepare the primary and secondary Thl/Th2 and Trl cells, six-well Falcon plates 
were coated overnight with lOug/ml anti-CD28 (Pharmingen) and 2ug/ml OKT3 (ATCC), and 
then washed twice with PBS. Umbilical cord blood CD4 lymphocytes (Poietic Systems, 
German Town, MD) were cultured at 10 5 -10 6 cel ls/ml in DMEM 5% FCS (Hyclone), lOOuM 

15 non essential amino acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x10" 
5 M (Gibco), lOmM Hepes (Gibco) and IL-2 (4ng/ml). IL-12 (5ng/ml) and anti-IL4 (lug/ml) 
were used to direct to Thl, while IL-4 (5ng/ml) and anti-IFN gamma (l^g/ml) were used to 
direct to Th2 and IL-10 at 5ng/ml was used to direct to Trl . After 4-5 days, the activated Thl, 
Th2 and Trl lymphocytes were washed once in DMEM and expanded for 4-7 days in DMEM 

20 • 5% FCS (Hyclone), lOOuM non essential amino acids (Gibco), ImM sodium pyruvate 
(Gibco), mercaptoethanol 5.5xl0~ 5 M (Gibco), lOmM Hepes (Gibco) and IL-2 (lng/ml). 
Following this, the activated Thl, Th2 and Trl lymphocytes were re-stimulated for 5 days 
with anti-CD28/OKT3 and cytokines as described above, but with the addition of anti-CD95L 
(1 jig/ml) to prevent apoptosis. After 4-5 days, the Thl, Th2 and Trl lymphocytes were 

25 washed and then expanded again with IL-2 for 4-7 days. Activated Thl and Th2 lymphocytes 
were maintained in this way for a maximum of three cycles. RNA was prepared from primary 
and secondary Thl, Th2 and Trl after 6 and 24 hours following the second and third 
activations with plate bound anti-CD3 and anti-CD28 mAbs and 4 days into the second and 
third expansion cultures in Interleukin 2. 

30 The following leukocyte cells lines were obtained from the ATCC: Ramos, EOL-1, 

KU-812. EOL cells were further differentiated by culture in O.lmM dbcAMP at 5xl0 5 cells/ml 
for 8 days, changing the media every 3 days and adjusting the cell concentration to 
5x1 0 5 cells/ml. For the culture of these cells, we used DMEM or RPMI (as recommended by 
the ATCC), with the addition of 5% FCS (Hyclone), lOO^M non essential amino acids 
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(Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x1 0" 5 M (Gibco), lOmM Hepes 
(Gibco). RNA was either prepared from resting cells or cells activated with PMA at lOng/ml 
and ionomycin at l^ig/ml for 6 and 14 hours. Keratinocyte line CCD106 and an airway 
epithelial tumor line NCI-H292 were also obtained from the ATCC. Both were cultured in 
5 DMEM 5% FCS (Hyclone), lOOuM non essential amino acids (Gibco), ImM sodium pyruvate 
(Gibco), mercaptoethanol 5.5x1 0" 5 M (Gibco), and lOmM Hepes (Gibco). CCD1 106 cells were 
activated for 6 and 14 hours with approximately 5 ng/ml TNF alpha and lng/ml IL-1 beta, 
while NCI-H292 cells were activated for 6 and 14 hours with the following cytokines: 5ng/ml 
IL-4, 5ng/ml IL-9, 5ng/ml IL-1 3 and 25ng/ml IFN gamma. 

10 For these cell lines and blood cells, RNA was prepared by lysing approximately 

10 7 cells/ml using Trizol (Gibco BRL). Briefly, 1/10 volume of bromochloropropane 
(Molecular Research Corporation) was added to the RNA sample, vortexed and after 10 
minutes at room temperature, the tubes were spun at 14,000 rpm in a Sorvall SS34 rotor. The 
aqueous phase was removed and placed in a 15ml Falcon Tube. An equal volume of 

1 5 isopropanol was added and left at -20°C overnight. The precipitated RNA was spun down at 
9,000 rpm for 15 min in a Sorvall SS34 rotor and washed in 70% ethanol. The pellet was 
redissolved in 300|il of RNAse-free water and 35\xl buffer (Promega) 5|il DTT, 7ul RNAsin 
and 8p.l DNAse were added. The tube was incubated at 37°C for 30 minutes to remove 
contaminating genomic DNA, extracted once with phenol chloroform and re-precipitated with 

20 1/10 volume of 3M sodium acetate and 2 volumes of 100% ethanol. The RNA was spun down 
and placed in RNAse free water. RNA was stored at -80°C. 
AI comprehensive panel_vl.O 

The plates for Alcomprehensive panel vl.O include two control wells and 89 test 

samples comprised of cDNA isolated from surgical and postmortem human tissues obtained 
25 from the Backus Hospital and Clinomics (Frederick, MD). Total RNA was extracted from 

tissue samples from the Backus Hospital in the Facility at CuraGen. Total RNA from other 

tissues was obtained from Clinomics. 

Joint tissues including synovial fluid, synovium, bone and cartilage were obtained from 

patients undergoing total knee or hip replacement surgery at the Backus Hospital. Tissue 
30 samples were immediately snap frozen in liquid nitrogen to ensure that isolated RNA was of 

optimal quality and not degraded. Additional samples of osteoarthritis and rheumatoid arthritis 

joint tissues were obtained from Clinomics. Normal control tissues were supplied by 

Clinomics and were obtained during autopsy of trauma victims. 
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Surgical specimens of psoriatic tissues and adjacent matched tissues were provided as 
total RNA by Clinomics. Two male and two female patients were selected between the ages of 
25 and 47. None of the patients were taking prescription drugs at the time samples were 
isolated. 

Surgical specimens of diseased colon from patients with ulcerative colitis and Crohns 
disease and adjacent matched tissues were obtained from Clinomics. Bowel tissue from three 
female and three male Crohn's patients between the ages of 41-69 were used. Two patients 
were not on prescription medication while the others were taking dexamethasone, 
phenobarbital, or tylenol. Ulcerative colitis tissue was from three male and four female 
patients. Four of the patients were taking lebvid and two were on phenobarbital. 

Total RNA from post mortem lung tissue from trauma victims with no disease or with 
emphysema, asthma or COPD was purchased from Clinomics. Emphysema patients ranged in 
age from 40-70 and all were smokers, this age range was chosen to focus on patients with 
cigarette-linked emphysema and to avoid those patients with alpha- lanti -trypsin deficiencies. 
Asthma patients ranged in age from 36-75, and excluded smokers to prevent those patients that 
could also have COPD. COPD patients ranged in age from 35-80 and included both smokers 
and non-smokers. Most patients were taking corticosteroids, and bronchodilators. 

In the labels employed to identify tissues in the Alcomprehensive panel vl.O panel, 
the following abbreviations are used: 
AI = Autoimmunity 

Syn = Synovial 

Normal = No apparent disease 

Rep22 /Rep20 = individual patients 

RA = Rheumatoid arthritis 

Backus = From Backus Hospital 

OA = Osteoarthritis 

(SS) (BA) (MF) = Individual patients 

Adj = Adjacent tissue 

Match control = adjacent tissues 

-M — Male 

-F = Female 

COPD = Chronic obstructive pulmonary disease 
Panels 5D and 51 
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The plates for Panel 5D and 51 include two control wells and a variety of cDNAs 
isolated from human tissues and cell lines with an emphasis on metabolic diseases. Metabolic 
tissues were obtained from patients enrolled in the Gestational Diabetes study. Cells were 
obtained during different stages in the differentiation of adipocytes from human mesenchymal 
5 stem cells. Human pancreatic islets were also obtained. 

In the Gestational Diabetes study subjects are young (18-40 years), otherwise healthy 
women with and without gestational diabetes undergoing routine (elective) Caesarean section. 
After delivery of the infant, when the surgical incisions were being repaired/closed, the 
obstetrician removed a small sample (<1 cc) of the exposed metabolic tissues during the 

10 closure of each surgical level. The biopsy material was rinsed in sterile saline, blotted and fast 
frozen within 5 minutes from the time of removal. The tissue was then flash frozen in liquid 
nitrogen and stored, individually, in sterile screw-top tubes and kept on dry ice for shipment to 
or to be picked up by CuraGen. The metabolic tissues of interest include uterine wall (smooth 
muscle), visceral adipose, skeletal muscle (rectus) and subcutaneous adipose. Patient 

1 5 descriptions are as follows: 



Patient 2 Diabetic Hispanic, overweight, not on insulin 

Patient 7-9 Nondiabetic Caucasian and obese (BMI>30) 

Patient 10 Diabetic Hispanic, overweight, on insulin 

20 Patient 1 1 Nondiabetic African American and overweight 

Patient 12 Diabetic Hispanic on insulin 



Adipocyte differentiation was induced in donor progenitor cells obtained from Osirus 
(a division of Clonetics/BioWhittaker) in triplicate, except for Donor 3U which had only two 
25 replicates. Scientists at Clonetics isolated, grew and differentiated human mesenchymal stem 
cells (HuMSCs) for CuraGen based on the published protocol found in Mark F. Pittenger, et 
al., Multilineage Potential of Adult Human Mesenchymal Stem Cells Science Apr 2 1999: 
143-147. Clonetics provided Trizol lysates or frozen pellets suitable for mRNA isolation and 
ds cDNA production. A general description of each donor is as follows: 
30 Donor 2 and 3 U: Mesenchymal Stem cells, Undifferentiated Adipose 

Donor 2 and 3 AM: Adipose, AdiposeMidway Differentiated 

Donor 2 and 3 AD: Adipose, Adipose Differentiated 

Human cell lines were generally obtained from ATCC (American Type Culture 
Collection), NCI or the German tumor cell bank and fall into the following tissue groups: 
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kidney proximal convoluted tubule, uterine smooth muscle cells, small intestine, liver HepG2 
cancer cells, heart primary stromal cells, and adrenal cortical adenoma cells. These cells are all 
cultured under standard recommended conditions and RNA extracted using the standard 
procedures. All samples were processed at CuraGen to produce single stranded cDNA. 
5 Panel 51 contains all samples previously described with the addition of pancreatic islets 

from a 58 year old female patient obtained from the Diabetes Research Institute at the 
University of Miami School of Medicine. Islet tissue was processed to total RNA at an outside 
source and delivered to CuraGen for addition to panel 51. 

In the labels employed to identify tissues in the 5D and 51 panels, the following 
10 abbreviations are used: 

GO Adipose = Greater Omentum Adipose 
SK = Skeletal Muscle 
UT = Uterus 
PL = Placenta 
1 5 AD = Adipose Differentiated 

AM = Adipose Midway Differentiated 
U = Undifferentiated Stem Cells 
Panel CNSD.01 

The plates for Panel CNSD.01 include two control wells and 94 test samples 
20 comprised of cDNA isolated from postmortem human brain tissue obtained from the Harvard 
Brain Tissue Resource Center. Brains are removed from calvaria of donors between 4 and 24 
hours after death, sectioned by neuroanatomists, and frozen at -80°C in liquid nitrogen vapor. 
All brains are sectioned and examined by neuropathologists to confirm diagnoses with clear 
associated neuropathology. 
25 Disease diagnoses are taken from patient records. The panel contains two brains from 

each of the following diagnoses: Alzheimer's disease, Parkinson's disease, Huntington's 
disease, Progressive Supernuclear Palsy, Depression, and "Normal controls". Within each of 
these brains, the following regions are represented: cingulate gyrus, temporal pole, globus 
palladus, substantia nigra, Brodman Area 4 (primary motor strip), Brodman Area 7 (parietal 
30 cortex), Brodman Area 9 (prefrontal cortex), and Brodman area 17 (occipital cortex). Not all 
brain regions are represented in all cases; e.g., Huntington's disease is characterized in part by 
neurodegeneration in the globus palladus, thus this region is impossible to obtain from 
confirmed Huntington's cases. Likewise Parkinson's disease is characterized by degeneration 
of the substantia nigra making this region more difficult to obtain. Normal control brains were 
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examined for neuropathology and found to be free of any pathology consistent with 
neurodegeneration . 

In the labels employed to identify tissues in the CNS panel, the following abbreviations 
are used: 

5 PSP = Progressive supranuclear palsy 

Sub Nigra = Substantia nigra 
Glob Palladus= Globus palladus 
Temp Pole = Temporal pole 
Cing Gyr = Cingulate gyrus 
1 0 B A 4 = Brodman Area 4 

Panel CNS NeurodegenerationJVl.O 

The plates for Panel CNS NeurodegenerationJVl.O include two control wells and 47 
test samples comprised of cDNA isolated from postmortem human brain tissue obtained from 
the Harvard Brain Tissue Resource Center (McLean Hospital) and the Human Brain and 

15 Spinal Fluid Resource Center (VA Greater Los Angeles Healthcare System). Brains are 
removed from calvaria of donors between 4 and 24 hours after death, sectioned by 
neuroanatomists, and frozen at -80°C in liquid nitrogen vapor. All brains are sectioned and 
examined by neuropathologists to confirm diagnoses with clear associated neuropathology. 

Disease diagnoses are taken from patient records. The panel contains six brains from 

20 Alzheimer's disease (AD) patients, and eight brains from "Normal controls" who showed no 
evidence of dementia prior to death. The eight normal control brains are divided into two 
categories: Controls with no dementia and no Alzheimer's like pathology (Controls) and 
controls with no dementia but evidence of severe Alzheimer's like pathology, (specifically 
senile plaque load rated as level 3 on a scale of 0-3; 0 = no evidence of plaques, 3 = severe AD 

25 senile plaque load). Within each of these brains, the following regions are represented: 

hippocampus, temporal cortex (Brodman Area 21), parietal cortex (Brodman area 7), and 
occipital cortex (Brodman area 17). These regions were chosen to encompass all levels of 
neurodegeneration in AD. The hippocampus is a region of early and severe neuronal loss in 
AD; the temporal cortex is known to show neurodegeneration in AD after the hippocampus; 

30 the parietal cortex shows moderate neuronal death in the late stages of the disease; the 

occipital cortex is spared in AD and therefore acts as a "control" region within AD patients. 
Not all brain regions are represented in all cases. 

In the labels employed to identify tissues in the CNS_Neurodegeneration_Vl .0 panel, 
the following abbreviations are used: 

877 




AD = Alzheimer's disease brain; patient was demented and showed AD-like 
pathology upon autopsy 

Control = Control brains; patient not demented, showing no neuropathology 
Control (Path) = Control brains; pateint not demented but showing sever AD-like 
5 pathology 

SupTemporal Ctx = Superior Temporal Cortex 
Inf Temporal Ctx = Inferior Temporal Cortex 

NOV9a and NOV9b 

Expression of gene NOV9a and variant NOV9b was assessed using the primer-probe 
10 sets Ag2930, Ag4297 and Ag573, described in Tables AA, AB and AC. Results of the RTQ- 
PCR runs are shown in Tables AD, AE, AF, AG, and AH. Please note that the probe and 
primer set Ag4297 do not match the NOV9b variant. 



Table AA. Probe Name Ag2930 



Primers 


Sequences 


^ 1 Start 
Length j position 


SEQ ID NO: 


Forward 


5 * -attggattttccaactccatct-3 1 


22 j 1213 


994 


Probe 


TET-5 ' -tcccattgtctatgcatttatgaatga- 
3 ' -TAMRA 


27 j 1239 


995 


Reverse 


5 1 -tgcaataacaaactgcagacaa-3 * 


22 j 1285 


996 



1 5 Table AB. Probe Name Ag4297 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward; 


5 ' -tctacaccaccttcatccttgt-3 ' 


22 


1007 


997 


Probe 


TET-5 1 -ctgcctcttatggagaagaaacgagctg- 
3 * -TAMRA 


28 


1042 


998 


Reverse 


5 ■ -caccactgtcaccatcataatg-3 * 


22 


1071 


999 



Table AC. Probe Name Ag573 



Primers; 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward; 


5 ' - tgtccagtctaccgctgttgtg-3 ■ 


22 


738 


1000 


Probe 


TET-5'- i 
agaaatcctcactatgacctgcattgctgtg-3 ' - 1 
TAMRA 


31 


762 


1001 


Reverse 


5 ' -cacaagtccctggtgccttt-3 1 


20 


794 


1002 



Table AD. Panel 1.1 

Tissue Name \ Rel. Exp.(%) j Rel. Exp.(%) ] Tissue Name ] Rel. Exp.(%) ) Rel. Expj(%) 

878 



• 





Ag573, Run 
109566077 


Ag573, Run 
111156622 




Ag573, Run 
109566077 


Ag573, Run 
111156622 


Adrenal gland 


0.0 


0.8 


Renal ca. UO- 
31 


0.1 


6.0 


Bladder 


0.0 


0.0 


Renal ca. RXF 

393 


0.0 


0.0 


Brain 

(amygdala) 




0 <s 


I i vpr 

L/I V I 


o.o 


0.0 


Brain 

(cerebellum) 


n 1 


0 1 
yj. i 


bl Vtl yLK^LCXlJ ' 


0 0 

VJ.K? 


0.0 


Brain 

(hippocampus) 


0.0 


53 


Liver ca. 

(hepatoblast) 

HepG2 


0.0 


0.0 


Brain (substantia 
nigra) 


in 7 


1 1 7 


idling 




0 0 


Brain (thalamus) 


0.1 


4.9 


Lung (fetal) 


0.0 


1.5 


Cerebral Cortex 


9.3 


1 1.2 


Lung ca. (non- 
s.cell) HOP-62 


4.5 


0.0 


Brain (fetal) 


0.0 


0.8 


Lung ca. (large! 
cell)NCI-H460 


0.0 


0.0 


Brain (whole) 


1.3 


6.0 


Lung ca. (non- 
s.cell) NCI- 
H23 


0.0 


0.0 


glio/astro U-118- 
MG 


0.0 


0.0 


Lung ca. (non- 
s.cl) NCI- 
H522 


2.2 


0.0 


astrocytoma SF- 
539 


0.0 


1.3 


Lung ca. (non- 
sm. cell) A549 


0.0 


u.o 


astrocytoma 
SNB-75 


0.0 


0.0 


Lung ca. (s.cell 
var.) SHP-77 


0.0 


0.0 


astrocytoma 
SW1783 


0.0 


0.0 


Lung ca. 
(small cell) 
LX-1 


0.0 


0.0 


glioma U251 


0.0 


0.0 


Lung ca. 
(small cell) 
NCI-H69 


43.5 


47.0 


glioma SF-295 


1.3 


0.0 


Lung ca. 
(squam.) SW 
900 


0.0 


0.0 


glioma SNB-19 


0.0 


0.5 


Lung ca. 
(squam.) NCI- 
H596 


100.0 


98.6 


glio/astro U87- 
MG 


0.0 


1.5 


Lymph node 


0.0 


0.0 


neuro*; met SK- 
N-AS 


1 2.6 


0.1 


Spleen 


0.0 


0.0 


Mammary gland 


| 0.0 


3.8 


Thymus 


0.0 


0.0 



879 



1 ff'iip'7 li? in ii ;•■ 



JJ if. h i: I i! 1 




\ 



t - , 

Breast ca. BT- 
549 ' 


0.0 


0.3 | 


Ovary 


0.0 


0.6 


Breast ca. MDA- 
N 


28.3 


39.8 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Breast ca.* 
(pl.ef) T47D 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


0.2 


1.8 


Breast ca.* 
(pl.ef) MCF-7 




0.0 


Ovarian ca. 
OVCAR-4 


0.0 


0.0 


Breast ca.* i 
(pl.ef) MDA- 
MB-231 


0.0 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


0.1 


Small intestine 


0.0 


0.0 


Ovarian ca. 
OVCAR-8 


2.5 


16.3 


Colorectal 


0.0 


0.0 


Ovarian ca.* 
(ascites) SK- 
OV-3 


0.8 


4.6 


Colon ca. HT29 


0.0 


0.0 


Pancreas 


2.5 


0.0 


Colon ca. CaCo- 
2 


0.0 


0.0 


Pancreatic ca. 
CAPAN 2 


0.0 


0.0 


Colon ca. HCT- 
15 


0.0 


0.0 

> 


Pituitary gland 


1.5 


2.7 


Colon ca. HCT- 
116 


0 0 

V/.V/ 




Placenta 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.0 


O'.O 


Prostate 


0.0 


0.0 


Colon ca. 
SW480 


0.0 


0.0 


Prostate ca.* 
(bone met) PC- 
3 


O.l 


U.U 


Colon ca.* 
met) 


1 9 


0 0 

V/.V/ 


Salivarv (Hand 


0.1 


0.0 


oioiiiacii 


0 0 

VJ.yJ 


0 0 

V/.V/ 


1 1 uvllvu 

^ — , 


0.0 


0.0 


Gastric ca. (liver 


0.1 


0.0 


Spinal cord 


0.0 


2.4 


Heart 


3.8 


19.6 


Testis 


0.0 


2.0 


Skeletal muscle 
(Fetal) 


0.0 


0.3 


Thyroid 


0.0 


2.3 


Skeletal muscle 


U.U 


U.U 


uterus 


U.U 


0 0 


Endothelial cells 


0.0 


0.0 


Melanoma 
M14 


0.0 


0.5 


Heart (Fetal) 


11.0 


31.0 


Melanoma 
LOX IMVI 


0.0 


0.0 


Kidney 


1.3 


12.2 


Melanoma 
UACC-62 


0.0 


0.0 


Kidney (fetal) 


0.0 


3.9 


Melanoma SK- 
MEL-28 


0.0 


0.4 
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Renal ca. 786-0 


0.0 


5.4 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Renal ca. A498 


10.5 


16.4 


Melanoma 
Hs688(A).T 


0.0 


0.2 


Renal ca. ACHN 


85.3 


100.0 


Melanoma* 
(met) 

Hs688(B).T 


0.0 


0.0 


Renal ca. TK-10 


60.7 


59.0 







Table AE. Panel 1.3D 



Tissue Name 


Rel Fxn (%\ 
Ag2930, Run 
158090377 


Rel Exd (%\ 
Ag2930, Run! 
165701939 


Tissue Name 


Rel. Exp.(%) 
Ag2930, Run 
158090377 


Rel. Exp.(%) 
Ag2930, Run 
165701939 


Liver 

adenocarcinoma 


0.0 


A A 
0.0 


Kidney (letal) 


in i 
IU. 1 


u.u 


Pancreas 


0.0 


0.0 


Renal ca. 786- 
0 


2.3 


7.8 


Pancreatic ca. 
CAPAN 2 


0.0 


0.0 


Renal ca. 
A498 


61.6 


44.8 


Adrenal gland 


0.0 


0.0 


Renal ca. RXF 
393 


0.0 


0.0 


Thyroid 


1.6 


0.0 


Renal ca. 
ACHN 


16.8 


100.0 


Salivary gland 


0.0 


0.0 


Renal ca. UO- 
31 


10.1 


15.0 


Pituitary gland 


3.1 


0.0 


Renal ca. TK- 
10 


22.5 


52.5 


Brain (fetal) 


9.7 


3.6 


Liver 


0.0 


0.0 


Brain (whole) 


11.0 


24.3 


Liver (fetal) 


1.3 


0.0 


Brain (amygdala) 


23.2 


9.3 


Liver ca. 

(hepatoblast) 

HepG2 


0.0 


0.0 


Brain (cerebellum) 


0.0 


0.0 


Lung 


0.0 


0.0 


Brain 

(hippocampus) 


100.0 


3.4 


Lung (fetal) 


0.0 


15.0 


Brain (substantia 
nigra) 


2.7 


19.5 


Lung ca. 
(small cell) 
LX-1 


0.0 


0.0 


Brain (thalamus) 


23.3 


6.8 


Lung ca. 
(small cell) 
NCI-H69 


39.2 


16.0 


Cerebral Cortex 


33.4 


0.0 


Lung ca. 
(s.cell var.) 
SHP-77 


0.0 


0.0 
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Spinal cord 


6.4 


7.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


0.0 


gUO/aSiro U0/-1VIO 


z.o 


A Q 


Lung ca. (non- 
sm. cell) A549 


0 0 


VJ.yj 


glio/astroU-118- | 


0.0 


0.0 


Lung ca. (non- 1 
s.cell) NCI- : 
H23 


0.0 


0.0 


astrocytoma 
SW1783 


0.0 


0.0 


Lung ca. (non- ] 
s.cell) HOP-62 


0.0 


0.0 


neuro*; met SK-N-j 


4.2 


0.0 


Lung ca. (non- 
s.cl) NCI- 
H522 


0.0 


0.0 

, 


astrocytoma SF- 

539 


5.0 


0.0 


Lung ca. 
(squam.) SW j 
900 


0.0 


0.0 


astrocytoma SNB- 
75 


0.0 


0.0 


Lung ca. 
(squam.) NCI- 
H596 


8.4 


6.7 


glioma SNB- 19 


2.1 


0.0 


Mammary 
gland 


18.3 


0.0 


glioma U25 1 


0.0 


0.0 


Breast ca.* 
(pl.ef) MCF-7 


1.4 


4.6 


glioma SF-295 


0.0 


0.0 


Breast ca.* 
(pl.ef) MDA- 
MB-23 1 


0.0 


0.0 


Heart (fetal) 


18.4 


0.0 


Breast ca.* 
(pl.ef) T47D 


0.0 


0.0 


Heart 


2.4 


4.1 


Breast ca. BT- 
549 


10.7 


5.8 


Skeletal muscle 
(fetal) 


1.5 


0.0 


Breast ca. 
MDA-N 


35.1 


4.7 


Skeletal muscle 


0.0 


0.0 


Ovary 


0.0 


0.0 


Bone marrow 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


0.0 


Thymus 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


0.0 


Spleen 


1.8 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


0.0 


Lymph node 


0.0 


11.6 


Ovarian ca. 
OVCAR-8 


12.9 


4.9 


Colorectal 


0.0 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach 


1.3 


3.3 


Ovarian ca.* 
(ascites) SK- 
OV-3 


2.9 


6.5 


Small intestine 


1.7 


0.0 


Uterus 


0.0 


0.0 
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Colon ca. SW480 


0.0 


0.0 


Placenta 


0.0 


0.0 


Colon ca.* 

SW620(SW480 

met) 


0.0 


0.0 


Prostate 


0.0 


0.0 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met)PC- 
3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Testis 


2.4 


0.0 


Colon ca. CaCo-2 


0.0 


0.0 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


0.0 


Melanoma* 
(met) 

Hs688(B).T 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.0 


0.0 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 
met) NCI-N87 


0.0 


0.0 


Melanoma 
M14 


0.0 


0.0 


Bladder 


0.0 


0.0 


Melanoma 
LOX IMVI 


0.0 


0.0 


Trachea 


0.0 


0.0 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Kidney 


0.0 


8.5 


Adipose 


1.3 


0.0 



Table AF. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2930, Run 
158090382 


Tissue Name 


Rel. Exp.(%) 
Ag2930, Run 
158090382 


Normal Colon 


1.2 


Kidney Margin 
8120608 


0.6 


CC Well to Mod Diff 
(OD03866) 


0.4 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


0.3 


Kidney Margin 
8120614 


0.0 


CC Gr.2 rectosigmoid 
(OD03868) 


0.0 


Kidney Cancer 
9010320 


3.2 


CC Margin (OD03868) 


0.0 


Kidney Margin 
9010321 


1.7 


CC Mod Diff (ODO3920) 


0.0 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 064011 


0.0 


CC Gr.2 ascend colon 
(OD03921) 


0.0 


Normal Thyroid 


0.5 


CC Margin (OD03921) 


2.2 


Thyroid Cancer 
064010 


0.3 
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CC from Partial 
Hepatectomy (ODO4309) 
Mets 


0.0 


Thyroid Cancer 
A302152 


0.2 


Liver Marein (ODO4309V 


0.0 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


0.0 


Lung Margin (OD04451- 
02) 


0.0 


Breast Cancer 
(OD04566) 


0.0 


Normal Prostate 6546-1 


0.0 


Breast Cancer 
(OD04590-01) 


0.4 


Prostate Cancer 
(OD04410) 


0.4 


Breast Cancer Mets 
(OD04590-03) 


0.0 


Prostate Margin 


0.0 


Breast Cancer 

Metastasis 

(OD04655-05) 


0.0 


Prostate Cancer 
(OD04720-01) 


1 0 


Rreast Cancer 064006 


0.0 


Prostate Margin 
(OD04720-02) 


0.0 


Breast Cancer 1 024 


0.2 


Normal Lung 061010 


0.2 


Breast Cancer 
9100266 


3.5 


Lung Met to Muscle 


0.5 


Breast Margin 
Qi 0076S 


0.3 


\A 1 i c r* 1 r» \/f n ro i n 
IVlUloL/IC IVldltilll 

(OD04286) 


0.0 


I )1 vu.M V * dl 1 k_ \_ 1 

A209073 


1.0 


T uncr 1\./T pi 1 i on 5i n t f^r^nff^r 

L/U 1 1 ^ 1V1C11 l^lld.1 IL \^ClllV/d 

(OD03126) 


0.0 


RrpnQt lVTarcnn 

A2090734 


0.0 




0.4 


Normal F ivpr 

1 lV/lllllll LilVwl 


0.0 


Lung Cancer (OD04404) 


1.9 


Liver Cancer 064003 


0.2 


Lung Margin (OD04404) 


0.0 


Liver Cancer 1025 


u.o 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1 026 




Lung Margin (OD04565) 


0.0 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237- 
01) 


0.0 


Liver Tissue 6004-N 


0.0 


Lung Margin (OD04237- 

02) ^ 


0.0 


Liver Cancer 6005-T 


0.0 


Ocular Mel Met to Liver I 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Margin (ODO4310) 


0.0 


Normal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


0.2 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


0.0 


Bladder Cancer 
A302173 


5.0 


Normal Kidney 


1.6 


Bladder Cancer 
(OD047 18-01) 


0.0 
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Kidney Ca, Nuclear grade 
2 (0004338) 


0.0 


Bladder Normal 
Adjacent (OD04718- 
03) 


0.0 


Kidney Margin 
(OD04338) 


2.7 


Normal Ovary 


0.9 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


45.1 


Ovarian Cancer 
064008 


0.1 


Kidney Margin 
(OD04339) 


2.4 


Ovarian Cancer 
(OD04768-07) 


0.0 


Kidney Ca, Clear cell 
type (OD04340) 


100.0 


Ovary Margin 
(OD04768-08) 


0.0 


Kidney Margin 
(OD04340) 


3.1 


Normal Stomach 


0.0 


Kidney Ca, Nuclear grade 
3 (OD04348) 


0.0 


Gastric Cancer 
9060358 


0.0 


Kidney Margin 
(OD04348) 


1.3 


Stomach Margin 
9060359 


0.0 


Kidney Cancer 
(OD04622-01) 


20.9 


Gastric Cancer 
9060395 


0.0 


Kidney Margin 
(OD04622-03) 


0.3 


Stomach Margin 
9060394 


0.4 


Kidney Cancer 
(OD04450-01) 


33.4 


Gastric Cancer 
9060397 


0.0 


Kidney Margin 
(OD04450-03) 


2.4 


Stomach Margin 
9060396 


0.0 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


0.0 



Table AG. Panel 3D 



Tissue Name 


Rel. 

Exp.(%) Ag2930, 
Run 162374504 


Tissue Name 


Rel. 
Exp.(%) 
Ag2930, Run 
162374504 


Daoy- 
Medulloblastoma 


1.9 


Ca Ski- Cervical 
epidermoid carcinoma 
(metastasis) 


0.0 


TE671- 
Medulloblastoma 


0.0 


ES-2- Ovarian clear 
cell carcinoma 


0.0 


D283 Med- 
Medulloblastoma 


2.0 


Ramos- Stimulated 
with PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.0 


Ramos- Stimulated 
with PMA/ionomycin 14h 


0.0 


XF-498-CNS 


0.0 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.0 



885 



SNB-78- Glioma 


0.0 


Raji- Burkitt's 
lymphoma 


0.0 


SF-268- 
Glioblastoma 


0.0 


Daudi- Burkitt's 
lymphoma 


0.0 


T98G- 
Glioblastoma 


0.0 


U266- B-cell 
plasmacytoma 


0.0 


SK-N-SH- 
Neuroblastoma 
(metastasis) 


0.0 


CA46- Burkitt's 
lymphoma 


0.0 


SF-295- 
Glioblastoma 


0.0 


RL- non-Hodgkin's 
B-cell lymphoma 


0.0 


Cerebellum 


2.7 


JMl-pre-B-cell 
lymphoma 


0.0 


Cerebellum 


7.3 


Jurkat- T cell 
leukemia 


0.0 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


0.0 


TF-1- 
P rvth rol pi i Icem i a 

l j l j LI 1 1 Ulv UI\vl 1 1 1 CI 


0.0 


DMS-114-Small 
cell lung cancer 


0 0 


HUT 78- T-cell 
lymphoma 


0.0 


DMS-79- Small 
cell lung cancer 


0 0 


U937- Histiocytic 
lymphoma 


0.0 


NCI-H146- Small 
cell lung cancer 


0.0 


KU-812- 
Myelogenous leukemia 


0.0 


NCI-H526- Small 
cell lung cancer 


0.0 


769-P- Clear cell 
renal carcinoma 


39.0 


NCI-N417- Small 
cell lung cancer 


100.0 


Caki-2- Clear cell 
renal carcinoma 


4.6 


NCI-H82- Small 
cell lung cancer 


ft ft 
U.U 


SW 839- Clear cell 
renal carcinoma 


1 U.U 


NCI-H157- 
Squamous cell lung 
cancer (metastasis) 


0.0 


G401- Wilms' tumor 


0.0 


NCI-H1155- 
Large cell lung cancer 


0.0 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


0.0 


NCI-H1299- 
Large cell lung cancer 


0.0 


CAPAN-1- 
Pancreatic adenocarcinoma 
(liver metastasis) 


0.0 


NCI-H727- Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


5.0 


NCI-UMC-11- 
Lung carcinoid 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


0.0 


LX-1- Small cell 
lung cancer 


0.0 


HPAC- Pancreatic 
adenocarcinoma 


0.0 


Colo-205- Colon 

cancer 


0.0 


MIA PaCa-2- 
Pancreatic carcinoma 


0.0 
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KM 12- Colon 

cancer 


0.0 


CFPAC-1- 
Pancreatic ductal 
adenocarcinoma 


0.0 


KM20L2- Colon 

cancer 


0.0 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


0.0 


NCI-H716- Colon! 

cancer 


0.0 


T24- Bladder 
carcinma (transitional cell) 


0.0 


SW-48- Colon 
adenocarcinoma 


0.0 


5637- Bladder 
carcinoma 


0.0 


SW1 116- Colon 
adenocarcinoma 


0.0 


HT-1 197- Bladder j 
carcinoma 


0.0 


LS 174T- Colon 
adenocarcinoma 


0.0 


UM-UC-3- Bladder 
carcinma (transitional cell) 


0.0 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- 
Rhabdomyosarcoma 


0.0 


SW-480- Colon 
adenocarcinoma 


0.0 


HT-1080- 
Fibrosarcoma 


0.0 


NCI-SNU-5- 
Gastric carcinoma 


0.0 


MG-63- 
Osteosarcoma 


0.0 


KATO III- Gastric 
carcinoma 


0.0 


SK-LMS-1- 
Leiomyosarcoma (vulva) 


0.0 


NCI-SNU-16- 

Gastric* CflTCAT\C\TY\Z\ 

v. J tlO L 1 1 w Will vlllUlllvl 


0.0 


SJRH30- 
Rhabdomyosarcoma (met to 
bone marrow) 


0.0 


NCI-SNU-1- 
Gastric carcinoma 


0.0 


A431- Epidermoid 
carcinoma 


0.0 


RF-1- Gastric 
adenocarcinoma 


0.0 


WM266-4- 
Melanoma 


0.0 


RF-48- Gastric 
adenocarcinoma 


0.0 


DU 145- Prostate 
carcinoma (brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


8.7 


MDA-MB-468- 
Breast adenocarcinoma 


0.0 


NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous 
cell carcinoma of tongue 


0.0 


OVCAR-5- 
Ovarian carcinoma 


0.0 


SCC-9- Squamous 
cell carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 


SCC-15- Squamous 
cell carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous 
cell carcinoma of tongue 


0.0 



Table AH. Panel 4D 





Rel. Exp.(%) 




Rel. Exp.(%) 


Tissue Name 


Ag2930, Run 


Tissue Name 


Ag2930, Run 




158090383 




158090383 



-i run "J* ™ f n i «»" ... n i t. .1 n 
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Secondary Th 1 act 


5.9 


HUVEC IL-lbeta 


0.0 


Secondary Thz act 




hlUVLC IrJN gamma 


u.u 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Th 1 rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondarv Trl rest 


0.0 


Lung Microvascular EC 
none 


9.9 


Primarv Th1 act 

J. 11111 C4-1 J X 1 1 1 C*Vx V 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primarv Th'? net 


0.0 


Microvascular Dermal EC 
none 


15.2 


Primarv TV1 act 

1 1 1 1 1 1 £.11 Jf 111 CiV-/ L 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Pt*i tvi n t^v/ TVi 1 rf=**;t 
rilllldlji' llll icm 


0 0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-8 12 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 

_ — 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl anti- 
CD95CH11 


0.0 


CCD1 106 (Keratinocytes) 
none 


22.1 


LAK cells rest 


0.0 


CCD 1 1 (Jo (Keratinocytes) 
TNFalnha + IL-lbeta 


8.2 


T AK cells 11 -2 


0.0 


Liver cirrhosis 

A — J A T v A \S 1 A A 1 IVO A O 


0.0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


10.7 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 
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Two Way MLR 7 day ] 


0.0 


HPALC INF alpha + 1L-1 
beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 

^ 1-2 - ~ -""WY,, 2 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha; 
+ IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


R Ivmnhopvfp^ PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
± ivlav ionomycin 


0.0 


Dermal fibroblast 
rrnin7n tt _ i Kptc» 


0.0 


Dendritic cells none 


0.0 


T~^<r* t*m o 1 finrAnlcict TTh^M 

i^eriiidi iiuroDicioi iriN 
gamma 


0.0 


Dendritic cells LPS 


7.9 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


6.3 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


0.0 


Macrophages LPS 


0.0 


Thymus 


100.0 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag2930 Expression of the NOV9A gene is 
low/undetectable in all samples on this panel. (CTs>35). (Data not shown.) 



General_screeningpanel_vl.5 Summary: Ag4297 Expression of the NOV9A gene 
is low/undetectable in all samples on this panel. (CTs>35). (Data not shown.) 

.5 Panel 1.1 Summary: Ag573 Two experiments with the same probe and primer set 

produce results that are in excellent agreement, with highest expression of the NOV9A gene, a 
putative neuropeptide Y receptor, in lung cancer and renal cancer cell lines (CTs=23-26). 
Significant expression is also seen in a cluster of breast cancer cell lines. Thus, expression of 
this gene could be used to differentiate between these samples and other samples on this panel 

1 0 and as a marker to detect the presence of these cancers. Neuropeptide Y, which controls 
vasoconstriction and feeding behavior, is expressed in breast cancer (see ref. below). 
Furthermore, peptide receptors in human tumors represent clinically relevant targets for both 
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cancer diagnosis and treatment. Therefore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of breast, lung and renal cancers. 

This molecule, which encodes a neuropeptide Y receptor homolog, is also expressed in 
the brain. Neuropeptide Y and its receptors have been implicated in feeding behavior, learning 
5 and memory, and seizure. This gene would therefore be an excellent small molecule target for 
the treatment of epilepsy or any seizure disorder. 

Among tissues with metabolic function, this gene has low-to-moderate levels of 
expression in adrenal, heart, fetal skeletal muscle and pancreas. This gene product is highly 
expressed in fetal and adult heart. Since neuropeptide Y and its receptor are associated with 

10 appetite regulation, this gene product may be a small molecule target for the treatment of 

metabolic and endocrine disease, including obesity and Types 1 and 2 diabetes. In addition, 
the expression in heart and the suggested role of neuropeptide Y in vasoconstriction, 
cardiovascular signaling, and development of the heart suggest that this gene product may be 
useful in treating disorders that affect the heart. 

1 5 References: 

Reubi JC, Gugger M, Waser B, Schaer JC. Y(l)-mediated effect of neuropeptide Y in 
cancer: breast carcinomas as targets. Cancer Res 2001 Jun 1 ;61(1 1):4636-41 

Overexpression of selected peptide receptors in human tumors has been shown to 
represent clinically relevant targets for cancer diagnosis and therapy. Neuropeptide Y (NPY) is 

20 a peptide neurotransmitter mediating feeding behavior and vasoconstriction. It has never been 
shown to be involved in human cancer. We show here, using in vitro receptor 
autoradiography, a NPY receptor incidence of 85% in primary human breast carcinomas (n = 
95) and of 100% in lymph node metastases of receptor-positive primaries (n = 27), 
predominantly as Y(l) subtype, whereas non-neoplastic human breast expressed Y(2) 

25 preferentially. Y(l) mRNA was detected in Y(l)-expressing tumors by in situ hybridization, 
whereas Y(2) mRNA was found in normal breast tissue. The strong predominance of Y(l) in 
breast carcinomas compared with Y(2) in normal breast suggests that neoplastic 
transformation can switch the NPY receptor expression from Y(2) to Y(l) subtype. Moreover, 
in Y(l)-expressing human SK-N-MC tumor cells, an NPY-induced, dose-dependent inhibition 

30 of tumor cell growth of >40% was observed, suggesting a functional role of NPY in cancer, 
mediated by Y(l). NPY should therefore be added to the list of small regulatory peptides 
related to cancer. The high incidence of Y(l) in in situ, invasive, and metastatic breast cancers 
allows for the possibility to target them for diagnosis and therapy with NPY analogues. 
PMID: 11389101 
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Furtinger S, Pirker S, Czech T, Baumgartner C, Ransmayr G, Sperk G. Plasticity of Yl 
and Y2 receptors and neuropeptide Y fibers in patients with temporal lobe epilepsy. J Neurosci 
2001 Aug 1;21(15):5804-12 

Marked expression of neuropeptide Y (NPY) and its Y2 receptors in hippocampal 
5 mossy fibers has been reported in animal models of epilepsy. Because NPY can suppress 

glutamate release by activating presynaptic Y2 receptors, these changes have been proposed as 
an endogenous protective mechanism. Therefore, we investigated whether similar changes in 
the NPY system may also take place in human epilepsy. We investigated Yl and Y2 receptor 
binding and NPY immunoreactivity in hippocampal specimens that were obtained at surgery 

10 from patients with temporal lobe epilepsy and in autopsy controls. Significant increases in Y2 
receptor binding (by 43-48%) were observed in the dentate hilus, sectors CA1 to CA3, and 
subiculum of specimens with, but not in those without, hippocampal sclerosis. On the other 
hand, Yl receptor binding was significantly reduced (by 62%) in the dentate molecular layer 
of sclerotic specimens. In the same patients, the total lengths of NPY immunoreactive (NPY- 

1 5 IR) fibers was markedly increased (by 1 1 5-958%) in the dentate molecular layer and hilus, in 
the stratum lucidum of CA3, and throughout sectors CA1 to CA3 and the subiculum, as 
compared with autopsies. In nonsclerotic specimens, increases in lengths of NPY-IR fibers 
were more moderate and statistically not significant. NPY mRNA was increased threefold in 
hilar interneurons of sclerotic and nonsclerotic specimens. It is suggested that abundant 

20 sprouting of NPY fibers, concomitant upregulation of Y2 receptors, and downregulation of Yl 
receptors in the hippocampus of patients with Ammon's horn sclerosis may be endogenous 
anticonvulsant mechanisms. 
PMID: 11466452 

Rahmouni K, Haynes WG. Leptin signaling pathways in the central nervous system: 
25 interactions between neuropeptide Y and melanocortins. Bioessays. 2001 Dec;23(12):1095-9. 

No other hormone has drawn more attention than leptin in recent studies on the control 
of appetite, body weight and obesity. This hormone is produced by adipose tissue and enters 
the brain via a saturable specific transport mechanism. Leptin acts in the hypothalamus to 
modulate food intake and heat production as well as several other neuroendocrine pathways. 
30 The mechanisms through which leptin exerts its central nervous effects are now better 
understood. Proopiomelanocortin- and neuropeptide Y-containing neurons in the 
hypothalamus have emerged as potent candidate mediators of leptin action. These two 
neuropeptides have been shown to exert opposing effects using different pathways. Recently, 
Cowley et al. (2001) described a new circuit in the regulation of neuronal activity by leptin 
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with an interaction between these two pathways. These data add complexity to the 
mechanisms by which leptin achieves its effects in the central nervous system, but they also 
offer potential mechanisms to explain the phenomenon of leptin resistance observed in 
obesity. Copyright 2001 John Wiley & Sons, Inc. 
5 PMID: 11746228 

Michalkiewicz M, Michalkiewicz T, Kreulen DL, McDougall SJ. 
Increased blood pressure responses in neuropeptide Y transgenic rats. Am J Physiol 
Regul Integr Comp Physiol 2001 Aug;281(2):R4 17-26 

Considering the coexistence of neuropeptide Y (NPY) and norepinephrine in 

10 perivascular sympathetic nerves and the known vasoconstrictor cooperation of NPY with 
norepinephrine, we investigated the involvement of NPY in long-term control of 
cardiovascular functions using NPY transgenic (NPY-tg) rats. These rats were developed by 
injection of the rat (Sprague-Dawley) pronuclei with a 14.5-kb clone of the rat structural NPY 
gene. When compared with nontransgenic littermates, NPY concentrations were significantly 

1 5 increased in a number of cardiovascular tissues of NPY-tg hemizygotes. Direct basal mean 
arterial pressure and heart rate were not changed, but calculated total vascular resistance was 
significantly increased in NPY-tg subjects. Arterial pressure increases, in response to 
norepinephrine injection, were greater in the NPY-tg rats. Also, the hypotension and 
bradycardia in response to hemorrhage were significantly reduced in NPY-tg subjects. These 

20 results indicate that NPY, when expressed in increased amounts, potentiates the pressor effects 
of norepinephrine and contributes to maintaining blood pressure during hemorrhage, but it 
does not alter resting blood pressure. These transgenic rats will facilitate studies of the role of 
NPY signaling in cardiovascular regulation, particularly regarding its functional cooperation 
with norepinephrine. 

25 PMID: 11448843 

Horackova M, Slavikova J, Byczko Z. Postnatal development of the rat intrinsic 
cardiac nervous system: a confocal laser scanning microscopy study in whole-mount atria. 
Tissue Cell 2000 Oct;32(5):377-88 

We used confocal laser scanning microscopy and fluorescent immunohistochemistry to 

30 study the developmental pattern and distribution of specific neuronal phenotypes within the 
intrinsic cardiac nervous system in whole-mount atrial preparations from newborn to 5 week 
old rats. Individual ganglia and neuronal cell bodies were localized by means of two general 
neuronal markers: protein gene product 9.5 (PGP) and microtubule-associated protein two 
(MAP). In rats < or =2 weeks old there were two main subpopulations of intrinsic neurons 
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located in the intraatrial septum and around the origin of the superior vena cava. The more 
abundant was a population of strongly tyrosine hydroxylase (TH) immunoreactive (IR) 
neurons (10-40 microm in diameter) most of which were also PGP-IR. The second, less 
numerous (approximately 60-70% than the TH-IR group) type of neurons exhibited ChAT-IR 
5 which colocalized with MAP-IR. Towards the end of the second postnatal week and during the 
third, the ganglia containing these neurons became more numerous and their localization also 
included tissues around the origins of the inferior vena cava and the pulmonary veins, as well 
as both atrial walls close to the AV junction. During the second and third postnatal weeks, 
when the extrinsic innervation of the adrenergic and cholinergic phenotypes largely increases, 

10 the intrinsic innervation also changed greatly, and around the 21st postnatal day it appeared to 
acquire mature characteristics. The TH-IR neurons changed their characteristics and formed 
two types of ganglia. The larger ganglia containing large cells (20-40 microm in diameter) 
expressed TH-IR mostly close to their inner body surface (approximately 80-90% of identified 
neurons). Most of these neurons also expressed neuropeptide Y (NPY)-IR, specifically around 

15 their nuclei. The second type of small strongly TH-IR neurons (approximately 10% of all 
identified neurons) were contained in smaller groups (20-50 cells) which were usually 
embedded into much larger ganglia (100-400 cells), containing large (20-50 microm) neurons. 
Unlike all other intrinsic neurons, these small TH-IR cells did not exhibit any PGP-IR or 
MAP-IR. The number of ChAT-IR neurons increased at this stage, reaching approximately 

20 90% of the neurons identified by the general neuronal markers. These neurons were 

surrounded by a rich network of cholinergic varicose nerve fibers, some of which were likely 
of an extrinsic origin. We have also identified relatively small ganglia expressing 
immunoreactivity to vasoactive intestinal polypeptide (VIP), and to substance P (SP). The 
presented data indicate that the phenotypes of intrinsic neurons in the rat heart change greatly 

25 during the first month of postnatal development. This may be at least partially related to the 
development and maturation of functional extrinsic nervous control of the heart. 
PMID: 11201277 

Panel 1.3D Summary: Ag2930 The expression of the NOV9a gene was assessed in 
two independent runs on this panel. Low but significant levels of expression are seen in kidney 
30 cancer cells and a lung cancer cell, consistent with Panel 1.1. Please see the previous panel for 
discussion of utility of this gene in cancer. 

The expression in this panel also confirms expression of this gene product in the brain. 
Please see Panel 1.1 for discussion of utility of this gene in the central nervous system. 
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Panel 2D Summary: Ag2930 The NOV9a gene is expressed at low but significant 
levels in kidney cancer samples in this panel but not in the adjacent normal tissue samples 
(CTs=30-32). This expression is consistent with results in the preceding panels. This suggests 
that expression of this gene can be used as a diagnostic marker for the presence of kidney 
5 cancer. Furthermore, therapeutic inhibition of the gene product could potentially be used in the 
treatment of kidney cancer. 

Panel 3D Summary: Ag2930 The NOV9a gene expression is restricted to NCI-N417, 
a small cell lung cancer cell line (CT=33.81). Expression of this.gene can therefore be used for 
the diagnosis and treatment of this cancer. 
10 Panel 4D Summary: Ag2930 Expression of the NOV9a gene is low/undetectable in 

all samples on this panel. (CTs>35). (Data not shown.) 

NOV4b 

Expression of gene NOV4b was assessed using the primer-probe set Ag2955, 
described in Table BA. Results of the RTQ-PCR runs are shown in Tables BB, BC and BD. 

1 5 Table BA. Probe Name Ag2955 



Primers; 


Sequences 


Length \ 


Start 
Position 


SEQ ID NO: 


Forward; 


5 ■ -caatgcaatctgttggactttt-3 * 


22 




1003 


Probe 


TET-5 • -tctctcctgggatggattttatccat-3 ' - 
TAMRA 


26 


59 


1004 


Reverse 


5 ' -gttcttccagtgtggcaaataa-3 1 


U. ! 


94 1 


1005 



Table BB. General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) Ag2955, 
Run 216860933 


Tissue Name 


Rel. Exp.(%) Ag2955, 
Run 216860933 


Adipose 


8.1 


Renal ca. TK-10 


8.0 


Melanoma* 
Hs688(A).T 


8.5 


Bladder 


18.9 


Melanoma* 
Hs688(B).T 


5.8 


Gastric ca. (liver met.) 
NCI-N87 


35.6 


Melanoma* M14 


1.1 


Gastric ca. KATO III 


2.5 


Melanoma* 
LOXIMVI 


31.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


3.1 


Colon ca. S W480 


4.4 


Squamous cell 
carcinoma SCC-4 


1.3 


Colon ca.* (SW480 
met) S W620 


1.7 


Testis Pool 


0.0 


Colon ca. HT29 


3.9 
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Prostate ca.* (bone 
met) PC-3 


36.9 


Colon ca. HCT-116 


5.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


5.9 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


10.5 


Colon ca. Colo-205 


3.3 


Ovarian ca. SK-OV- 

3 


47.3 


Colon ca SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


4.5 


Colon Pool 


1.6 


Ovarian ca. 
OVCAR-5 


6.5 


Small Intestine Pool 


2.4 


Ovarian ca. IGROV- 

i 
i 


21.0 


Stomach Pool 


4.3 


wvdiiciii ca. 
OVCAR-8 


3.1 


Bone Marrow Pool 


5.3 


Ovary 


4.3 


Fetal Heart 


2.9 






nedi i ruui 


D .O 


Breast ca. MDA- 


29.7 


Lymph Node Pool 


8.0 


Breast ca. BT 549 


3.0 


Fetal Skeletal Muscle 


1.6 


Breast ca. T47D 


6.3 


Skeletal Muscle Pool 


7.6 


Breast ca. MDA-N 


1.6 


Spleen Pool 


2.4 


Breast Pool 


0.7 


Thymus Pool 


1.2 


Trachea 


2.5 


CNS cancer (glio/astro) 
U87-MG 


2.0 


Lune 


1.5 


CNS cancer (glio/astro) 
U-118-MG 


1.2 


Fetal Lung 


9.6 


CNS cancer 
(neuro;met) SK-N-AS 


5.3 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF- 

'v 1 O 

j>J9 


2.6 


Lung ca. LX-1 


2.4 


CNb cancer (astro) 
SNB-75 


6.7 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


9.3 


Lung ca. SHP-77 


1.7 


CNS cancer (glio) SF- 
295 


62.9 


Lung ca. A549 


3.0 


Brain (Amygdala) Pool 


1.3 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


4.0 


Lung ca. NCI-H23 1 


59.0 


Brain (fetal) 


7.2 


Lung ca. NCI-H460 


2.1 


Brain (Hippocampus) 
Pool 


2.1 


Lung ca. HOP-62 


3.0 


Cerebral Cortex Pool 


3.0 
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Lung ca. NCI-H522 


65.1 


Brain (Substantia nigra) 
Pool 


0.6 


Liver \ 


0.0 


Brain (Thalamus) Pool 


2.6 


Fetal Liver 


3.1 


Brain (whole) 


0.0 


Liver ca. HepG2 


4.1 


Spinal Cord Pool 


0.0 


Kidney Pool 


4.4 


Adrenal Gland 


0.0 


Fetal Kidney 


23.2 


Pituitary gland Pool 


1.0 


Renal ca. 786-0 


2.5 


Salivary Gland 


0.0 


Renal ca. A498 


1.9 


Thyroid (female) 


1.7 


Renal ca. ACHN 


2.7 


Pancreatic ca. CAPAN2 


100.0 


Renal ca. UO-3 1 


1.7 


Pancreas Pool 


6.5 



Table BC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag2955, 
Run 167906363 


Tissue Name 


Rel. Exp.(%) Ag2955, 
Run 167906363 


Liver adenocarcinoma 


8.5 


Kidney (fetal) 


4.4 


Pancreas 


0.0 


Renal ca. 786-0 


3.2 


Pancreatic ca. CAP AN 

2 


64.6 


Renal ca. A498 


15.1 


Adrenal gland 


0.0 


Renal ca. RXF 393 


10.7 


Thyroid 


1.5 


Renal ca. ACHN 


6.5 


Salivary gland 


1.1 


Renal ca. UO-31 


2.3 


Pituitary gland 


0.0 


Renal ca. TK-10 


13.2 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


2.7 


Liver (fetal) 


0.0 


Brain (amygdala) 


12.1 


Liver ca. 

(hepatoblast) HepG2 


8.5 


Brain (cerebellum) 


1.0 


Lung 


6.4 


Brain (hippocampus) 


0.0 


Lung (fetal) 


2.9 


Brain (substantia nigra) 


6.2 


Lung ca. (small cell) 
LX-1 


5.5 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


3.9 


Lung ca. (s.cell var.) 
SHP-77 


4.1 


Spinal cord 


1.9 


Lung ca. (large 
cell)NCI-H460 


1.1 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


19.2 


glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


26.1 


astrocytoma SW1783 


7.6 


Lung ca. (non-s.cell) 
HOP-62 


12.3 
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neuro*; met SK-N-AS 


3.5 


Lung ca. (non-s.cl) 
NCI-H522 


46.3 


astrocytoma SF-539 


13.2 


Lung ca. (squam.) 
SW 900 


16.7 


astrocytoma SNB-75 


3.6 


Lung ca. (squam.) 
NC1-H596 


2.8 


glioma SNB-19 


— ----- - -16.5 - 


Mammary gland 


u.u 


glioma U251 


26.6 


Breast ca. (pl.et) 
MCF-7 


3.0 


glioma SF-295 


6.0 


Breast ca. (pl.et) 
MDA-MB-231 


7.1 


Heart (fetal) 


0.0 


Breast ca.* (pl.et) 
T47D 


11.5 


Heart 


1.1 


Breast ca. BT-549 


1.1 


Skeletal muscle (fetal) 


3.3 


Breast ca. MDA-N 


5.7 


Skeletal muscle 


0.0 


Ovary 


0.0 


Rnnp marrnw 

JJ^llt 1 1 1CII I v„» w 


0.0 


Ovarian ca. OVCAR- 
3 


21.2 




0.0 


Ovarian ca. OVCAR- 
4 


7.4 


Spleen 


0.0 


Ovarian ca. OVCAR- 
5 


14.3 


Lymph node 


0.0 


Ovarian ca. OVCAR- 
8 


4.7 


Colorectal 


1.4 


Ovarian ca. IGROV- 
1 


38.4 


» ~» — 

Stomach 


0.9 


Ovarian ca.* (ascites) 
SK-OV-3 


100.0 


Small intestine 


0.0 


Uterus 


U.U 


Colon ca. SW480 


6.8 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


7.6 


Prostate 


0.0 


Colon ca. HT29 


6.1 


Prostate ca.* (bone 
met)FC-3 


13.1 


Colon ca. HCT-116 


4.2 


Testis 


0.0 


Colon ca. CaCo-2 


19.3 


Melanoma 

rlSOoo^/\ ). I 


14.7 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


5.4 


Colon ca. HCC-2998 


3.6 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


20.6 


Melanoma M14 


0.0 


Bladder 


13.4 


Melanoma LOX 
IMVI 


25.5 


Trachea 


3.6 


Melanoma* (met) 


1.7 



897 








SK-MEL-5 




Kidney 


5.1 


Adipose 


13.9 



Table BD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2955, Run 
164306318 


Tissue Name 


Rel. Exp.(%) 
Ag2955, Run 
164306318 


Secondary Th 1 act 


0.0 


HUVEC IL-lbeta 


6.6 


Secondary Th2 act 


0.0 


HUVEC I FN gamma 


20.2 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


15.7 


Secondary Th 1 rest 


0.0 


HUVEC TNF alpha + IL4 


8.4 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


6.0 


Primary Th 1 act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


8.7 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


7.7 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


6.4 


Primary Th 1 rest 


0.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


9.5 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


5.3 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


19.9 


CD45RA CD4 
lymphocyte act 


13.5 


Coronery artery SMC rest 


21 .4 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


12.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


37.6 


Secondary CDS 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


29.3 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


25.7 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


100.0 


2ryThl/Th2/Trl anti- 
CD95 CHI 1 


0.0 


CCD1 106 (Keratinocytes) 
none 


4.2 


LAK cells rest 


0.0 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


1.3 


LAK cells TL-2 


0.0 


Liver cirrhosis 


7.5 


LAK cells IL-2+IL- 12 


0.0 


Lupus kidney 


5.0 
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LAK cells IL-2+IFN 


0.0 


NCI-H292 none 


31.9 


T AK celk TT -9+ II -1 8 


0 0 




S? 1 


LAK cells 

P1V4 A /innnmvfin 


0.0 


NCI-H292 IL-9 


39.5 


NK Celk TT -? re^t 

1 ^ IX. V^V^ 1 1 J 1L Z# 1 VOl 


0 0 


NCM-H9Q? IT -T ^ 


i f\ f\ 

1 KJ.KJ 


Two Wav MI R ^ Hav 

I WKJ VV ciy iVIJLvlV J tidy 




NIPI-l I7Q9 I KM aqmmn 




Two Way MLR 5 day 


0.0 


HPAEC none 


4.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


3.0 


PBMC rest 


0.0 


Lung fibroblast none 


4.9 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


5.5 


Lung fibroblast IL-4 


9.9 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


6.3 


Ramos (B cell) 
ionomycin 


0 0 

VJ .VJ 


T una fihroHlfi<it TT -1^ 


1 7 

Y , / 


R lvmnhocvte<s PW1VT 


0.0 


Lung fibroblast IFN 
gamma 


1 1 7 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


59.9 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1 070 TNF alpha 


80.7 


EOL-T dbcAMP 

P1V/T A / lotinmvpin 


0.0 


Dermal fibroblast 
rrnin7n n _1 h^t^ 


35.4 


Dendritic cells none 


10.6 


l~^ i~rvi o 1 "fi 1*mv\V"%1 ci o t I C~M 

L'eriiiai iiDruDiaM ifin 
gamma 


70.2 


Dendritic cells LPS 


4.5 


Dermal fibroblast IL-4 


72.7 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


4.2 


Monocytes rest 


7.2 


IBD Crohn's 


0.0 


Monocytes LPS 


2.4 


Colon 


6.7 


Macrophages rest 


3.3 


Lung 


22.4 


Macrophages LPS 


0.0 


Thymus 


23.2 


HUVEC none 


7.3 


Kidney 


0.0 


HUVEC starved 


19.6 







General_screening_panel_vl.4 Summary: Ag2955 Highest expression of the 

NOV4b gene is seen in a pancreatic cancer cell line (CT=32.6). Low but significant levels of 

expression are also seen in melanoma, lung, brain, ovarian, breast and prostate cancer cell 

5 lines. Thus, expression of this gene might be used as a diagnostic marker for the presence of 

these cancers. Furthermore, therapeutic inhibition of this gene product may be useful in the 

treatment of melanoma, lung, brain, ovarian, breast and prostate cancers. 
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Panel 1.3D Summary: Ag2955 Highest expression of the NOV4b gene is seen in an 
ovarian cancer cell line, SK-OV-3, (CT=33.1). Low but significant levels of expression are 
also seen in melanoma, lung, brain and pancreatic cancer cell lines. Thus, expression of this 
gene might be used as a diagnostic marker for the presence of these cancers. Furthermore, 
5 therapeutic inhibition of this gene product may be useful in the treatment of melanoma, lung, 
brain, and pancreatic cancers. 

Panel 4D Summary: Ag 2955 The NOV4b gene is expressed at low but significant 
levels in treated and untreated dermal fibroblasts and in the basophil cell line treated with 
PMA and ionomyciri. The latter mimics the condition that leads to the degranulation and 

10 release of various mediators which contribute to the symptomatology of allergic diseases. This 
transcript encodes a claudin 6 like protein, a member of the Claudin tight junction family. The 
expression of this transcript could potentially be used as a marker for activated basophils and 
dermal fibroblasts. Furthermore, modulation of the activity or expression of this putative 
protein by antibodies may reduce the symptoms of patients suffering from allergic diseases 

15 asthma, ulcerative colitis, atopic diseases such as contact dermatitis and eczema, or 
inflammatory skin diseases. 

NOV3b 

Expression of gene NOV3b was assessed using the primer-probe set Ag2957, 
described in Table CA. Results of the RTQ-PCR runs are shown in Tables CB, CC, CD and 
20 CE. 

Table CA. Probe Name Ag2957 



PrimersJ Sequences 


Length 


Start 
Position 


SEQ ID NO : 


Forwardjs ' - caggcctctgcactt tgac- 3 ' 


19 _ 


438 


1006 


Probe j TET ~ 5 ' -ctgtctcgtggtatgccaccctggt-3 ' - 
ro e jTAMRA 


25 


459 


1007 


Reversejs 1 -ccaaattctgggttgaagaact-3 1 


22 


492 


1008 



Table CB. General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) Ag2957, 
Run 216861284 


Tissue Name 


Rel. Exp.(%) Ag2957, 
Run 216861284 


Adipose 


1.8 


Renal ca. TK-10 


6.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.0 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 
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Melanoma* M14 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 


0.0 


Colon ca. SW480 


0.0 


Squamous cell 


0.0 


Colon ca.* (SW480 

IllL/l^ O VV UZ.U 


4.6 


Testis Pool 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


24.8 


Placenta 


41.5 


Colon cancer tissue 


1.4 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


0 0 


Colon ca Colo-205 


0.0 


Ovarian ca. SK-OV- 

3 


7 7 


Colon ca SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


Small Intestine Pool 


0.0 


Ovarian ca. IGROV- 1 

i 
i 


0.0 


Stomach Pool 


0.0 


\J\ (llldAl Ld. 

OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.0 


Fetal Heart 


0.0 


DICtlM Ud. iVl\^J^-/ 


0 0 


\A 'A r t rV\r 1 1 


0 0 


Breast ca. MDA- 


0.0 


Lymph Node Pool 


op 


Breast ca. BT 549 


1.7 


Fetal Skeletal Muscle 


8.0 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Breast Pool 


0.0 


Thymus Pool 


3.0 


Trachea 


2.8 


CNS cancer (glio/astro) 
U87-MG 


0.0 


Lung 


2.9 


CNS cancer (glio/astro) 
U-118-MG 


0.0 


Fetal Lung 


24.5 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.0 


Lung ca. LX-1 


0.9 


CNS cancer (astro) 
SNB-75 


0.0 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


0.0 
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Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF- 
295 


0.0 


T nno pfi A S4Q 

I—/ LI 1 1 CI. / V -J ^ 37 


0.0 


Rrain ( Amvfrrlala'l Pool 


0.0 


T una ra NCT-HS76 


6 0 


Rrain fofrphellum^ 


0.0 


I nnC F ra NPT-H?^ 


0 0 


Rrain iTetaH 


1.7 


Lung ca. NCI-H460 


0.0 


I J 1 Cl 1 1 1 ^1 1 1 IJ I J wV_<clI 1 1 VJ LI j j 

Pool 


4.7 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


0.0 


Lung ca. NCI-H522 


0.0 


Brain (Substantia nigra) 
Pool 


0.0 


Liver 


0.0 


Brain (Thalamus) Pool 


0.0 


Fetal Liver 


31.4 


Brain (whole) 


0.0 


Liver ca. HepG2 


7.6 


Spinal Cord Pool 


22.4 


Kidney Pool 


1.7 


Adrenal Gland 


0.0 


Fetal Kidney 


100.0 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


2.0 


Renal ca. UO-3 1 


0.0 


Pancreas Pool 


0.0 



Table CC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag2957, 
Run 167906390 


Tissue Name 


Rel. Exp.(%) Ag2957, 
Run 167906390 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


34.2 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


0.1 


Renal ca. A498 


0.0 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 i 


Salivary gland 


0.2 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


100.0 


Brain (whole) 


0.0 


Liver (fetal) 


2.6 


Brain (amygdala) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


2.7 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain (hippocampus) 


0.2 


Lung (fetal) 


0.8 


Brain (substantia nigra) 


0.3 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NC1-H69 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


• 1.5 
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Spinal cord 


„ jLung ca. (large 
jcell)NCI-H460 


0.0 


glio/astro U87-MG 


_ ft jLung ca. (non-sm. 
jcell) A549 


0.0 


elio/astro U-l 18-MG 


0.0 


Lung ca. (non-s.cell) 
NC1-H23 


0.3 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.2 


Lung ca. (squam.) 

CY17 OAM 


2.1 


astrocytoma SNB-75 


0.1 


Lung ca. (squam.) 

i j ra/. 

JNC1-Hj9o 


0.0 

. , 


glioma SNB-19 


0.0 


Mammary gland 


1.7 


glioma U251 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-23 1 


0.0 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) 
T47D 


0.0 


Heart 


0.6 


Breast ca. BT-549 


0.3 


Skeletal muscle (fetal) 


2.2 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


1.5 


Ovary 


0.0 


Rone marrow 

L/VJllv 1I1UI 1 WW 


0.0 


Ovarian ca. OVCAR- 
3 


0.0 


Thvmus 

1 11 Y 111 VI J 


2.3 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 
5 


0.0 


Lymph node 


0.1 


Ovarian ca. OVCAR- 
8 


0.0 


Colorectal 


0.0 


Ovarian ca. 1GKOV- 
1 


0.0 

, — 


Stomach 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


0.1 


Small intestine 


0.0 


Uterus 


u.u 


Colon ca. SW480 


0.0 


Placenta 


0.8 


Colon ca.* 
SW620(SW480 met) 


0.3 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


0.0 


Colon ca. CaCo-2 


2.2 


Melanoma 
Hs688(A).T 


0.0 
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Colon ca. 
iissucyyjiJKJ d ooo ) 


0.0 


Melanoma* (met) 
Hsoo6(t$ ). 1 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.1 


Trachea 


0.2 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


30.6 


Adipose 


0.3 



Table CD. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2957, Run 
170858345 


Tissue Name 


Rel. Exp.(%) 
Ag2957, Run 
170858345 


Normal Colon 


0.0 


Kidney Margin 
8120608 


68.8 


CC Well to Mod Diff 
(OD03866) 


1.0 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


0.0 


Kidney Margin 
8120614 


24.3 


CC Gr.2 rectosigmoid 
(OD03868) 


13.6 


Kidney Cancer 
9010320 


3.4 


CC Margin (OD03868) 


1.9 


Kidney Margin 
9010321 


85.9 


CC Mod Diff (ODO3920) 


0.0 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 064011 


1.6 


CC Gr.2 ascend colon 
(OD03921) 


0.0 


Normal Thyroid 


0.0 


CC Margin (OD03921) 


1.2 


Thyroid Cancer 
064010 


0.0 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


0.0 


Thyroid Cancer 
A302152 


0.0 


Liver Margin (ODO4309) 


0.0 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


3.3 


Lung Margin (OD04451- 
02) 


0.0 


Breast Cancer 
(OD04566) 


0.0 


Normal Prostate 6546-1 


1.3 


Breast Cancer 
(OD04590-01) 


3.0 


Prostate Cancer 
(OD04410) 


0.0 


Breast Cancer Mets 
(OD04590-03) 


0.0 


Prostate Margin 


2.6 


Breast Cancer I 


0.0 
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(OD04410) 




Metastasis 
(OD04655-05) 




Prostate Cancer 
(OD04720-01) 


0.0 


Breast Cancer 064006 

J— 9 _L \s K\ k-> K X-^Ullwwl \J\J T\J\J\J 


0.0 


Prostate Margin 
(OD04720-02) 


0.0 


Breast Cancer 1 024 


9.5 


Normal Lung 061010 


0.0 


Breast Cancer 
9100266 


3.3 


Lung Met to Muscle 
fOD04286^ 


0.0 


Breast Margin 
91 00765 


5.4 


Muscle Margin 
(OD04286) 


0.0 


J_> l taM V^-dllL/Cl 

A209073 


4.1 


Lunf? \4alipnant Panrer 

l-'UUg iflUllgllUllL V . i_ 1 1 1 ^ 1 

(OD03126) 


0.0 


LJ l I IVldl ££111 

A2090734 


4.9 


Lune Marein <'OD03126 , t 


0 0 


lN^IlllclI IjIVCI 


n n 

w.u 


Lung Cancer (OD04404) 


0.0 


Liver Cancer 064003 


0.0 


Lung Margin (OD04404) 


0.0 


Liver Cancer 1025 


0.0 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


0.0 


Lung Margin (OD04565) 


0.0 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237- 
01) 


0.0 


Liver Tissue 6004-N 


12.4 


Lung Margin (OD04237- 
02) 


0.0 


Liver Cancer 6005-T 


5.7 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Margin (OD043 10) 


1.4 


Normal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


0.0 


Bladder Cancer 
A302173 


0.0 


Normal Kidney 


85.3 


Bladder Cancer 
(OD04718-01) 


0.0 


Kidney Ca, Nuclear grade 
2 (OD04338) 


6.2 


Bladder Normal 
Adjacent (OD04718- 
03) 


0.0 


Kidney Margin 
(OD04338) 


32.3 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


13.5 


Ovarian Cancer 
064008 


0.0 


Kidney Margin 
(OD04339) 


92.0 


Ovarian Cancer 
(OD04768-07) 


28.7 


Kidney Ca, Clear cell 
type (OD04340) 


9.7 


Ovary Margin 
(OD04768-08) 


0.0 


Kidney Margin 
(OD04340) 


87.7 


Normal Stomach 


0.0 
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Kidney Ca, Nuclear grades 
3 (OD04348) 


u.u 


Gastric Cancer 
9060358 


0 0 


Kidney Margin 
(OD04348) 


i nn n 


Stomach Margin 
9060359 


0 0 


Kidney Cancer 
(OD04622-01) 


0.0 


Gastric Cancer 
9060395 


0.0 


Kidney Margin 
(OD04622-03) 


50.3 


Stomach Margin 
9060394 


0.0 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


0.0 


Kidney Margin 
(OD04450-03) 


58.6 


Stomach Margin 
9060396 


0.0 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


0.0 



Table CE. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2957, Run 


Tissue Name 


Rel. Exp.(%) 
Ag2957, Run 
164306319 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.8 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th 1 rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 
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j fin 7?ni n ... ff'ii -s i o; 



Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


zry Inl/Inz/Irl anti- 
CD95 CH11 


0.0 


CCD 1 106 (Keratmocytes) 
none 


0.0 


LAK cells rest 


0.0 


CCD1 106 (Keratmocytes) 

TMFalnl-m -1- TT _1hf»tfi 


0.0 


T AK celk II -? 


0 0 

u.u 


l^tl VCI CllTllOMts 


1 7 

1 . / 


T AK celk TT -9+TT -1? 


0 0 

u.u 


T limic Vinnp\/ 

.Lupub Kiuney 




LAK cells IL-2+IFN 

£^CU 1 1 1 1 Id 


0.0 


NCI-H292 none 


0.0 




u.u 


XTPT T49Q'? TT A 

IN l^- 1-1 IZ VZ 1 


U.U 


LAK cells 

P1V/T A /i /"Nn /~\m \/r~ in 
IT IV Lr\J HJI1LMI1 ytlll 


0.0 


NCI-H292 IL-9 


0.0 


NK Oik TT -? rp<2t 


u.u 


Xir^T 1-4909 TT 1 1 
INL^l-iiZ VZ 1 L- i J) 


u.u 


Two Wav MI R 1 Hav 
i wu way i v 1 1 > rv j uctjf 


ft ft 
u.u 


MPT T49Q9 TRW o^tnm^ 
rN^i-ixzvz if in gamma 


u.u 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


, , „ , 3 .i 

Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 INr alpha 


0.0 


b(_)L-l dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


2.3 


Macrophages rest 


0.0 


Lung 


0.0 
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Macrophages LPS 


0.0 (Thymus 


100.0 


HUVEC none 


0.0 jKidney 


7.6 


HUVEC starved 


o.o 1 





Generalscreeningpanel _vl.4 Summary: Ag2957 Expression of the NOV3b gene 
is restricted to placenta, fetal kidney and liver. Thus, expression of this gene could be used to 
differentiate between these samples and other samples on this panel. In addition, this gene 
5 shows no or very low expression in the cancer cell lines used in this panel. Thus, the absence 
of expression could potentially be used as a diagnostic marker for cancer. 

Panel 1.3D Summary: Ag2957 Expression of the NOV3b gene is restricted to kidney, 
spinal cord and liver. Thus, expression of this gene could be used to differentiate between 
these samples and other samples on this panel. This gene encodes a putative claudin. Claudins 

10 are components of tight junction strands. Thus, this specific pattern of expression may indicate 
that this gene product is involved in the formation of TJ strands in these tissues. 

Among the CNS regions on this panel, this tight junction protein is expressed only in 
the spinal cord and may be involved in the blood brain barrier in this region. This molecule 
may therefore be of utility in the treatment of spinal cord injury. Growth factors such as BDNF 

15 and NGF have been shown in animal models to enhance repair after spinal crush injury; 

however in the clinical condition it is hard to administer protein therapeutics due to the blood 
brain barrier. The selective downregulation of this molecule may therefore increase the 
amount of protein crossing the blood brain barrier in the spinal cord, while not hampering its 
function in the rest of the CNS. 

20 In addition, this gene shows no or very low expression in the cancer cell lines used in 

this panel. Thus, the absence of expression could potentially be used as a diagnostic marker for 
cancer. 

Panel 2D Summary: Ag2957 The NOV3b gene is consistently expressed in the 
normal kidney samples (CTs=32-33) but not in the adjacent kidney tumors. This result is in 
25 agreement with the expression in the previous panels. Thus, absence of expression of this gene 
could be used as a diagnostic marker for kidney cancer. Furthermore, therapeutic modulation 
of the function or expression of this gene may be a possible treatment for this cancer. 

Panel 4D Summary: Ag2957 The expression of the NOV3b transcript is restricted to 
the thymus (CT=32.1) but not in T cells. Thus, expression of this transcript could be used as a 
30 marker for this tissue. 



NOVla, NOVld, NOVlc, and NOVlb 
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Expression of gene NOV la and variants NOV Id, NOVlc and NOV lb was assessed 
using the primer-probe sets Ag2954 and Ag2956, described in Tables DA and DB. Results of 
the RTQ-PCR runs are shown in Tables DC and DD. 



Table DA. Probe Name Ag2954 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -cttcagccactaccctcctt-3 ' 


20 


391 


1009 


Probe 


TET-5 ' -ccatgccacaatccaagacttctgg-3 1 - 
TAMRA 


- 1 


428 


1010 


Reverse; 


5 ■ -atgtcagggatgctgtcatc-3 ■ 


20 


453 


1011 



Table DB. Probe Name A g 2956 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ■ -agtcagatggctccttcca-3 1 


i 


643 ..... i 


1012 


Probe 


TET-5 1 -cctcatgctaagacctaggaacctgg-3 1 
TAMRA 


26 


662 


1013 


Reverse! 


5 ' -ccagagatccttggcagaag-3 1 


20 j 


702 


1014 



Table DC. General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) J Rel. Exp.(%) 
Ag2954, Run Ag2956, Run 
216607767 | 216607768 


Tissue Name 


Rel. Exp.(%) 
Ag2954, Run 
216607767 


Rel. Exp.(%) 
Ag2956, Run 
216607768 


Adipose 


0.0 J 0.0 


Renal ca. TK-10 


58.6 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


0.0 


Bladder 


0.0 


0.0 


Melanoma* 
Hs688(B).T 


0.0 


0.0 


Gastric ca. (liver 
met.) NCI-N87 


51.8 


0.0 


Melanoma* 
M14 


25.0 


0.0 


Gastric ca. 
KATO III 


0.0 


0.0 


Melanoma* 
LOXIMVI 


0.0 


0.0 


Colon ca. SW- 
948 


0.0 


0.0 


Melanoma* 
SK-MEL-5 


0.0 


0.0 


Colon ca. SW480 


20.9 


0.0 


Squamous cell 

carcinoma 

SCC-4 


0.0 


0.0 


Colon ca.* 
(SW480 met) 
SW620 


0.0 


0.0 


Testis Pool 


24.3 


7.1 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met) 
PC-3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Prostate Pool 


0.0 


8.8 


Colon ca. CaCo-2 


0.0 


0.0 


Placenta 


0.0 


0.0 


Colon cancer 
tissue 


0.0 


0.0 
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Uterus Pool 


0.0 


0.0 


Colon ca. 
SW1116 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


62.4 


0.0 


Colon ca. Colo- 
205 


0.0 


0.0 


Ovarian ca. 
SK-OV-3 


0.0 


7.9 


Colon ca. SW-48 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


69.7 


0.0 


Colon Pool 


99.3 


0.0 


Ovarian ca. 
OVCAR-5 


58.2 


0.0 


Small Intestine 
Pool 


0.0 


10.1 


Ovarian ca. 
IGROV-1 


18.7 


0.0 


Stomach Pool 


25.9 


10.3 


Ovarian ca. 
OVCAR-8 


0.0 


0.0 


Bone Marrow 
Pool 


28.3 


0 0 


Ovary 


0.0 


0.0 


Fetal Heart 


0.0 


0.0 


Breast ca. 
MCF-7 


78.5 


0.0 


Heart Pool 


0.0 


0.0 


Breast ca. 

MDA-MB- 

231 


0.0 


0.0 


LvmDh Node 
Pool 


0.0 


0.0 


Breast ca. BT 
549 


0.0 


0.0 


Fetal Skeletal 
Muscle 


22 S 


1 7 


Breast ca. 
T47D 


25.5 


0.0 


Skeletal Muscle 
Pool 


62.0 


26.2 


Breast ca. 
MDA-N 


57.8 \ 


0.0 


Spleen Pool 


59.5 


0.0 


Breast Pool 


0.0 


0.0 


Thymus Pool 


23.8 


11.1 


Trachea 


0.0 


0.0 


CNS cancer 
(glio/astro) U87- 
MG 


0.0 


0.0 


Lung 


27.9 


0.0 


CNS cancer 
(glio/astro) U- 
118-MG 


0.0 


_ — __ — _ 
9.7 


Fetal Lung 


0.0 


0.0 


CNS cancer 
(neuro;met) SK- 
N-AS 


17.0 


0.0 


Lung ca. NCI- 

JN41 / 


0.0 


0.0 


CNS cancer 
(astro) SF-539 


0.0 


0.0 


Lung ca. LX- 

I 


0.0 


0.0 


CNS cancer 
(astro) SNB-75 


0.0 


0.0 


Lung ca. NCI- 
H146 


0.0 


0.0 


CNS cancer 
(glio) SNB-19 


0.0 


0.0 


Lung ca. 
SHP-77 


0.0 


0.0 


CNS cancer 
(glio) SF-295 


0.0 


0.0 


Lung ca. 
A549 


28.7 


0.0 


Brain 

(Amygdala) Pool 


0.0 


0.0 
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Lung ca. NCI- 
H526 


0.0 


6.0 


Brain 

(cerebellum) 


56.3 


3.3 


Lung ca. NCI- 
H23 


28.1 


11.0 


Brain (fetal) 


27.5 


100.0 


Lun£? ca NCI- 
H460 


100.0 


7.4 


Brain 

(Hippocampus) 
Pool 


0.0 


0.0 


Lung ca. 
HOP-62 


0.0 


0.0 


Cerebral Cortex 
Pool 


0.0 


0.0 


Lung ca. NCI- 
H522 


0.0 


0.0 


Brain (Substantia 
nigra) Pool 


26.8 


0.0 


Liver 


0.0 


0.0 


Brain (Thalamus) 
Pool 


0.0 


0.0 


Fetal Liver 


54.7 


0.0 


Brain (whole) 


58.2 


6.2 


Liver ca. 
HepG2 


0.0 


0.0 


Spinal Cord Pool 


0.0 


0.0 


Kidney Pool 


0.0 


8.0 


Adrenal Gland 


0.0 


0.0 


Fetal Kidney 


0.0 


0.0 


Pituitary gland 
Pool 


25.3 


5.4 


Renal ca. 786- 
0 


0.0 


0.0 


Salivary Gland 


0.0 


0.0 


Renal ca. 
A498 


0.0 


0.0 


Thyroid (female) 


0.0 


0.0 


Renal ca. 
ACHN 


0.0 


0.0 


Pancreatic ca. 
CAPAN2 


85.3 


0.0 


Renal ca. UO- 
31 


0.0 


0.0 


Pancreas Pool 


0.0 


0.0 



Table DD. Panel 4D 



Tissue Name 


Rel. 
Exp.(%) 
A g 2954, 
Run 
164329620 


Rel. 
Exp.(%) 
Ag2956, 
Run 
164401746 


Tissue Name 


Rel. 
Exp.(%) 
Ag2954, 
Run 
164329620 


Rel. 
Exp.(%) 
Ag2956, 

Run 
164401746 


Secondary Thl act 


0.0 


0.0 


HUVEC IL-lbeta 


48.3 


0.0 


Secondary Th2 act 


0.0 


0.0 


HUVEC IFN 
gamma 


0.0 


0.0 


Secondary Trl act 


20.0 


0.0 


HUVEC TNF 
alpha + IFN 
gamma 


0.0 


0.0 


Secondary Thl rest 


0.0 


0.0 


HUVEC TNF 
alpha + IL4 


0.0 


0.0 


Secondary Th2 rest 


o.o : 


0.0 


HUVEC IL-1 1 


0.0 


0.0 


Secondary Trl rest 


0.0 


0.0 


Lung 

Microvascular EC 


0.0 


0.0 
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none 






Primary Thl act 


0.0 


0.0 


Luna 

Microvascular EC 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


Primary Th2 act 


0.0 


0.0 


Microvascular 
Dermal EC none 


0.0 


0.0 


Primary Trl act 


0.0 


0.0 


Microsvasular 
Dermal EC 
TNFalpha + IL- 
Ibeta 


13.2 


0.0 


Primary Thl rest 


0.0 


0.0 


Bronchial 
epithelium 
TNFalpha + 
ILlbeta 


0.0 


0.0 


Primary Th2 rest 


0.0 


0.0 


Small airway 
epithelium none 


38.7 


0.0 


Primary Trl rest 


17.0 


0.0 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC rest 


0.0 


19.1 


CD45RO CD4 
lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


0.0 


0.0 


CD8 lymphocyte 
act 


A A 

o.u 


A A 

u.u 


Astrocytes rest 


A A 

0.1) 


A A 
U.U 


Secondary CD8 
lymphocyte rest 


0.0 


0.0 


Astrocytes 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


0.0 


KU-812 
(Basophil) rest 


0.0 


0.0 


CD4 lymphocyte 
none 


0.0 


0.0 


KU-812 

(Basophil) 

PMA/ionomycin 


0.0 


0.0 


2ry 

Thl/Th2/Trl_anti- 
CD95 CH11 


0.0 


0.0 


CCD 1106 

(Keratinocytes) 

none 


0.0 


0.0 


LAK cells rest 


0.0 


0.0 


CCD 1106 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


LAK cells IL-2 


0.0 


0.0 


Liver cirrhosis 


72.2 


100.0 


LAK cells 1L-2+IL- 
12 


0.0 


0.0 


Lupus kidney 


0.0 


0.0 


LAK cells IL- 


11.3 


0.0 


NCI-H292 none 


0.0 


0.0 
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too 



o 



y. il 



2+IFN gamma 












LAK cells IL-2+ 
IL-1 8 


0.0 ' 


0.0 


NC1-H292 IL-4 


14.5 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


0.0 


NCI-H292 IL-9 


0.0 


0.0 


NK Cells IL-2 rest 


0.0 


0.0 


NCI-H292 1L-13 


0.0 


0.0 


Two Way MLR 3 
day 


0.0 


0.0 


NCI-H292 I FN 
gamma 


0.0 


0.0 


Two Way MLR 5 
day 


0.0 


0.0 


HPAEC none 


0.0 


0.0 


Two Way MLR 7 
day 


24.7 


0.0 


HPAEC TNF 
alpha + IL-1 beta 


0.0 


0.0 


PBMC rest 


6.9 


0.0 


Lung fibroblast 
none 


100.0 


0.0 


PBMC PWM 


15.3 


0.0 


Lung fibroblast 
TNF alpha + IL-1 
beta 


0.0 


0.0 




0 0 




Lung fibroblast 
IL-4 




yj.yj 


Ramos (B cell) 
none 


0.0 


0.0 


Lung fibroblast 
IL-9 


9.7 


32.8 


Ramos (B cell) 
ionomycin 


10.7 


0.0 


Lung fibroblast 
IL-1 3 


0.0 


0.0 


B lymphocytes 
PWM 


13.0 


0.0 


Lung fibroblast 
IFN gamma 


0.0 


0.0 


B lymphocytes 

CD40L and IL-4 

. 


0.0 

.... . 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


0.0 


EOL-l dbcAMP 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


13.1 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


•0.0 


Dermal fibroblast 
CCD 1070 IL-1 
beta 


0.0 


0.0 


Dendritic cells 


0.0 


0.0 


Dermal fibroblast \ 


0.0 


0.0 


Dendritic cells LPS 


0.0 


0.0 


LyCIIIlal lll/HJUJabl 

IL-4 


34.2 


26.8 


Dendritic cells anti- 
CD40 


0.0 


0.0 


IBD Colitis 2 


34.9 


0.0 


Monocytes rest 


0.0 


0.0 


IBD Crohn's 


20.0 


0.0 


Monocytes LPS 


0.0 


0.0 


Colon 


0.0 


0.0 


Macrophages rest 


13.0 


0.0 


Lung 


0.0 


0.0 


Macrophages LPS 


24.7 


0.0 


Thymus 


12.2 


0.0 


HUVEC none 


0.0 


0.0 


Kidney 


0.0 


0.0 


HUVEC starved 


40.1 


0.0 
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CNS_neurodegeneration_vl.O Summary: Ag2954/Ag2956 Expression of the 
NOV1 1 gene is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

General_screeningpanelvl.4 Summary: Ag2954 The NOV la gene is expressed at 
a very low level or not at all in most of the cancer cell lines on this panel. Very low expression 
5 in cell lines from pancreatic, lung, breast and ovarian cancers suggests that it may be involved 
in these cancers. 

Ag2956 This gene is a member of the claudin family of proteins, and is only expressed 
in the fetal brain. It may be involved in the process of axonal growth or targeting and 
synaptogenesis (specifically in the development of tight junctions between neurons and other 
10 cell types). Therefore, this gene product may be of therapeutic benefit in the treatment of 
neuronal loss in clinical conditions such as head trauma or stroke where increased 
compensatory synaptogenesis is desireable. 

Panel 1.3D Summary: Ag2954/Ag2956 Expression of the NOV1 1 gene is 
low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 
15 Panel 4D Summary: Ag2954/Ag 2956 Two experiments with two different sets of 

primers show low but significant levels of expression of this transcript in liver cirrhosis, 
dermal and lung fibroblasts and endothelium. Thus, the NOV1 1 transcript may serve as a 
marker for these tissues and play a role in maintaining the integrity of these tissues. 

NOV2 

20 Expression of gene NOV2 was assessed using the primer-probe set Ag2958, described 

in Table EA. Results of the RTQ-PCR runs are shown in Table EB. 

Table EA. Probe Name Ag2958 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 • -gttgtcagggtagagcagaaga-3 ' 


22 


341 


1015 


Probe 


TET- 5 1 -ccaccaagaaaccttttgcaataaaa-3 * - 
TAMRA 


26 


363 


1016 


Reverse 


5 1 -taccttccctctctctggtttc-3 ' 


22 


395 


1017 



Table EB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag2958, 
Run 167809085 


Tissue Name 


Rel. Exp.(%) Ag2958, 
Run 167809085 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


0.4 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


0.0 
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Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 


Salivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pitiiitfirv (?1?inrl 


0 0 


xvenai ca. iiv-ju 


u.u 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.0 


Liver (fetal) 


0.0 


Brain (amygdala) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain (hippocampus) 


0.0 


Lung (fetal) 


0.0 


Brain f substantia niorai 


0.0 


Lung ca. (small cell) 
LX-1 


0 0 


Brain (thalamus^ 

U'UUl \ II 1 CI 1 til I 1 LI l1 J 


0.0 


Lung ca. (small cell) 
NCI-H69 


0 0 


(Cerebral Cortex 


0 0 


Lung ca. (s.cell var.) 
SHP-77 




Soinal cord 

UpillUl VV/IU 


0 6 


Lung ca. (large 
cell)NCI-H460 


0 0 

\J.\J 


&lio/astro U87-MG 


0 0 


Lung ca. (non-sm. 
cell) A549 


w.u 


elio/astro U-l 1 8-MG 


0 0 


Lung ca. (non-s.cell) 
NCI-H23 


KJ.VJ 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW900 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma oJNr$-iy 


0.0 


Mammary gland 


0.0 


glioma U251 


0.0 


Breast ca. (pl.er) 
MCF-7 


0.4 


glioma SF-295 


0.0 


Breast ca.* (pl et) 
MDA-MB-23 1 


0.0 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) 
T47D 


0.0 


Heart 


0.0 1 


Breast ca. BT-549 


0.0 


Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 
3 


2.0 


Thymus 


0.0 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 


0.0 



915 



Fun 







5 




Lymph node 


0.0 


Ovarian ca. OVCAR- 
8 


0.0 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


0.0 


Uterus 


0.0 i 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


0.0 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03 866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


0.0 


Melanoma Ml 4 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMV1 


0.0 


Trachea 


0.0 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


100.0 


Adipose 


0.0 



Panel 1.3D Summary: Ag2958 Expression of the NOV2 is restricted to the kidney 
(CT=31.8). In addition, this gene is expressed at higher levels in adult kidney when compared 
to expression in fetal kidney (CT value = 40). Thus, this gene product may be useful for the 
5 differentiation of adult and fetal kidney tissue. This highly specific expression pattern also 
suggests that this gene product may be a small molecule drug for the treatment of diseases of 
the kidney. 

Panel 4D Summary: Ag2958 Expression of the NOV2 gene is low/undetectable in all 
samples on this panel (CTs>35). (Data not shown.) 

10 NOV10 

Expression of gene NOV10, also known as CG55964-02, was assessed using the 
primer-probe set Ag2857, described in Table FA. 
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Table FA. Probe Name Ag2857 



Primers 


Sequences 


Length 


j Start 
Position 


SEQ ID NO: 


Forward 


5 1 -catttgccagagatttcttttg-3 ■ 


22 


j 300 


1018 


Probe 


TET-5 1 -caaatgtggcttattcactcattccagg- 
3 * -TAMRA 


28 


J336 


1019 


Reverse 


5 1 -agaaggatacccgattcaattg-3 ' 


22 


i 364 


1020 



CNSneurodegenerationvl.O Summary: Ag2857 Expression of the NOV 10 gene is 
low/undetectable in all samples on this panel. (CTs>35). (Data not shown.) 
5 Panel 1.3D Summary: Ag2857 Expression of the NOV 10 gene is low/undetectable in 

all samples on this panel. (CTs>35). (Data not shown.) 

Panel 2.2 Summary: Ag2857 Expression of the NOV 10 gene is low/undetectable in 
all samples on this panel. (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2857 Expression of the NOV 10 gene is low/undetectable in 
10 all samples on this panel. (CTs>35). (Data not shown.) 

NOV11 

Expression of gene NOV1 1 was assessed using the primer-probe set Ag2858, 
described in Table GA. 



Table GA. Probe Name Ag2858 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 * -cttaagtcagtcctggcagttg-3 1 


22 


674 


1021 


Probe 


TET-5 1 -aaattatttcagacctgcatctccca-3 ■ - 
TAMRA 


26 


716 


1022 


Reverse 


5 ' -agaacacaaggacagcacagat-3 ' 


22 


743 


1023 



15 

CNS_neurodegeneration_vl.O Summary: Ag2858 Expressin of the NOV1 1 gene is 
low/undetectable in all samples on this panel (CTs>35). (Data not shown.) The amp plot 
indicates that there is a high probability of a probe failure. 

Panel 1.3D Summary: Ag2858 Expression of the NOV1 1 gene is low/undetectable in 
20 all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

Panel 2.2 Summary: Ag2858 Expression of the NOV1 1 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 



917 



11 Ir 



rj; t "J? 1 11? 



Panel 4D Summary: Ag2858 Expression of the NOV1 1 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

NOV12 

5 Expression of gene NOV 12 was assessed using the primer-probe set Ag2867, 

described in Table HA. Results of the RTQ-PCR runs are shown in Tables HB and HC. 



Table HA. Probe Name Ag2867 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 
Probe 


5 ' -gtcacaccttgtgactgcaa-3 ' 


20 


176 


1024 


TET-5 ' -acttacttgtgtgcagtgccatgcct-3 1 - 
TAMRA 


26 


196 


1025 


Reverse; 


5 ' -cattggaaacctttcaggaaa-3 ' 


21 


236 


1026 



Table HB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag2867, 
Run 208699903 


Tissue Name 


Rel. Exp.(%) Ag2867, 
Run 208699903 


AD 1 Hippo 


7.9 


Control (Path) 3 
Temporal Ctx 


1.5 


AD 2 Hippo 


20.3 


Control (Path) 4 
Temporal Ctx 


22.1 


AD 3 Hippo 


5.0 


AD 1 Occipital Ctx 


13.4 


AD 4 Hippo 


6.6 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


65.5 


AD 3 Occipital Ctx 


3.6 


AD 6 Hippo 


44.4 


AD 4 Occipital Ctx 


12.9 


Control 2 Hippo 


14.3 


AD 5 Occipital Ctx 


15.4 


Control 4 Hippo 


9.1 


AD 6 Occipital Ctx 


8.7 


Control (Path) 3 
Hippo 


5.6 


Control 1 Occipital 
Ctx 


0.7 


AD 1 Temporal Ctx i 


100.0 


Control 2 Occipital 
Ctx 


1.4 


AD 2 Temporal Ctx 


12.6 


Control 3 Occipital 
Ctx 


14.5 


AD 3 Temporal Ctx 


7.6 


Control 4 Occipital 
Ctx 


3.4 


AD 4 Temporal Ctx 


14.0 


Control (Path) 1 
Occipital Ctx 


62.0 


AD 5 Inf Temporal 
Ctx 


55.9 


Control (Path) 2 
Occipital Ctx 


8.1 


AD 5 SupTemporal 


71.2 


Control (Path) 3 


0.0 
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Ctx 




Occipital Ctx 




AD 6 Inf Temporal 
Ctx 


45.7 


Control (Path) 4 
Occipital Ctx 


21.0 


AD 6 Sup Temporal 
Ctx 


16.5 


Control 1 Parietal 
Ctx 


0 9 


Control 1 Temporal 
Ctx 


0.9 


Control 2 Parietal 
Ctx 


8 ^ 

O .J 


Control 2 Temporal 
Ctx 


5.4 


Control 3 Parietal 
Ctx 


4.4 


Control 3 Temporal 
Ctx 


2.5 


Control (Path) 1 
Parietal Ctx 


40.1 


Control 4 Temporal 
Ctx 


6.3 


Control (Path) 2 
Parietal Ctx 


20.7 


Control (Path) 1 
Temporal Ctx 


39.8 


Control (Path) 3 
Parietal Ctx 


5.2 


Control (Path) 2 
Temporal Ctx 


24.8 


Control (Path) 4 
Parietal Ctx 


21.3 



Table HC. Panel 4D 



Tissue Name 


Rel. 

Exp.(%) Ag2867, 
Run 164311002 


Tissue Name 


Rel. 

Exp.(%) Ag2867, 
Run 164311002 


act 


Secondary Thl 


5.6 


HUVEC IL-lbeta 


4.2 


act 


Secondary Th2 


15.7 


HUVEC IFN 

gamma 


1.9 


act 


Secondary Trl 


11.8 


HUVEC TNF alpha 
+ IFN gamma ' 


2.7 


rest 


Secondary Thl 


3.9 


HUVEC TNF alpha 

+ IL4 


2.9 


rest 


Secondary Th2 


17.3 


HUVEC IL-11 


2.3 


rest 


Secondary Trl 


12.2 


Lung Microvascular 
EC none 


6.1 


Primary Thl act 


38.4 


Lung Microvascular 
EC TNFalpha + IL-lbeta 


7.2 


Primary Th2 act 


22.2 


Microvascular 
Dermal EC none 


6.7 


Primary Trl act 


29.7 


Micros vasular 
Dermal EC TNFalpha + IL- j 
Ibeta 


3.9 


Primary Thl rest 


93.3 


Bronchial epithelium 
TNFalpha + IL1 beta 


9.0 


Primary Th2 rest 


55.9 


Small airway 
epithelium none 


1.7 
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Primary Trl rest 


39.2 


Small airway 
epithelium TNFalpha + IL- 
lbeta 


16.7 


CD45RA CD4 
lymphocyte act 




Coronery artery 
SMC rest 


^ 9 


CD45RO CD4 
lymphocyte act 


1 S 6 


Coronery artery 
SMC TNFalpha + IL-lbeta 


1 ^ 


CD8 lymphocyte 

act 


16.0 


Astrocytes rest 


8.8 


Secondary CD8 
lymphocyte rest 


17.2 


Astrocytes 
TNFalpha + IL-lbeta 


6.2 


Secondary CD8 
lymphocyte act 


13.6 


KU-812 (Basophil) 

rest 


0.7 


CD4 lymphocyte 

none 


22.4 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry 

Thl/Th2/Trl anti-CD95 
CH11 


24.0 


CCD1106 
^jveraiinocyres^ none 


7.0 


LAK cells rest 


23.7 


CCD 1106 
(Keratinocytes) TNFalpha + 
IL-lbeta 


2.4 


LAK cells IL-2 


29.9 


Liver cirrhosis 


3.6 


LAK cells IL- 
2+IL-12 


^9 R 


L»upus Kiuney 


1 1 
1 . 1 


LAK cells IL- 
2+IFN gamma 


SO 7 


1 n 1 - 1 1Z.VZ. none 


17 8 
1 / .0 


LAK cells IL-2+ 

IL-18 


^0 ft 


Mpi.l I9Q9 TT _zl 


K 
1 J.J 


LAK cells 
PMA/ionomycin 


4 1 




1 ^ Q 

1 J.7 


NK Cells IL-2 

rest 


22.5 


NCI-H292 IL-13 


7.2 


Two Way MLR 

3 day 


26.1 


NCI-H292 IFN 

gamma 


8.2 


Two Way MLR 

5 day 


9.3 


HPAEC none 


2.7 


Two Way MLR 

7 day 


12.0 


HPAEC TNF alpha 
+ 1L-1 beta 


2.7 


PBMC rest 


11.6 


Lung fibroblast none 


5.5 


PBMC PWM 


65.1 


Lung fibroblast TNF 
alpha 4- IL-1 beta 


2.8 


PBMC PHA-L 


17.1 


Lung fibroblast IL-4 


7.2 


Ramos (B cell) 

none 


13.9 


Lung fibroblast 1L-9 


6.9 


Ramos (B cell) 
ionomycin 


23.8 


Lung fibroblast IL- 
13 


6.5 
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B lymphocytes 

PWM 




Lung fibroblast IFN 

gamma 


4 f\ 


B lymphocytes 
CD40LandIL-4 


i on n 


Dermal fibroblast 
CCD 1070 rest 


1 9 7 


POT 1 rlhr-AIVIP 




Dermal fibroblast 
CCD 1070 TNF alpha 


dO 1 
tu. 1 


EOL-1 dbcAMP 
PMA/ionomycin 


5.2 


Dermal fibroblast 
CCD 1070 IL-1 beta 


6.0 


Dendritic cells 

none 


6.5 


Dermal fibroblast 
IFN gamma 


2.6 


Dendritic cells 

T DC 


3.1 


Dermal fibroblast 

TT A 


5.8 


jLJenoriLic ceiis 
anti-CD40 


4.2 


IBD Colitis 2 


2.7 


Monocytes rest 


8.5 


IBD Crohn's 


5.5 


Monocytes LPS 


5.0 


Colon 


22.5 


Macrophages rest 


4.2 


Lung 


6.7 


Macrophages 

LPS 


2.4 


Thymus 


6.7 


HUVEC none 


2.6 


Kidney 


92.7 


HUVEC starved 


12.7 







CNS_neurodegeneration vl.O Summary: Ag2867 The NOV 12 gene represents a 
novel G-protein coupled receptor (GPCR) with expression in the brain. The GPCR family of 
receptors contains a large number of neurotransmitter receptors, including the dopamine, 
5 serotonin, a and b-adrenergic, acetylcholine muscarinic, histamine, peptide, and metabotropic 
glutamate receptors. GPCRs are excellent drug targets in various neurologic and psychiatric 
diseases. All antipsychotics have been shown to act at the dopamine D2 receptor; similarly 
novel antipsychotics also act at the serotonergic receptor, and often the muscarinic and 
adrenergic receptors as well. While the majority of antidepressants can be classified as 

10 selective serotonin reuptake inhibitors, blockade of the 5-HT1 A and a2 adrenergic receptors 

increases the effects of these drugs. The GPCRs are also of use as drug targets in the treatment 
of stroke. Blockade of the glutamate receptors may decrease the neuronal death resulting from 
excitotoxicity; further more the purinergic receptors have also been implicated as drug targets 
in the treatment of cerebral ischemia. The b-adrenergic receptors have been implicated in the 

15 treatment of ADHD with Ritalin, while the a-adrenergic receptors have been implicated in 

memory. Therefore this gene may be of use as a small molecule target for the treatment of any 
of the described diseases. 
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In addition, this GPCR is found to be upregulated in the temporal cortex of Alzheimer's 
disease patients. Blockade of this receptor may be of use in the treatment of this disease and 
decrease neuronal death. 

References: 

5 El Yacoubi M, Ledent C, Parmentier M, Bertorelli R, Ongini E, Costentin J, Vaugeois 

JM. Adenosine A2A receptor antagonists are potential antidepressants: evidence based on 
pharmacology and A2A receptor knockout mice. Br J Pharmacol 2001 Sep;134(l):68-77 

1 . Adenosine, an ubiquitous neuromodulator, and its analogues have been shown to 
produce 'depressant' effects in animal models believed to be relevant to depressive disorders, 

10 while adenosine receptor antagonists have been found to reverse adenosine-mediated 

'depressant' effect. 2. We have designed studies to assess whether adenosine A2A receptor 
antagonists, or genetic inactivation of the receptor would be effective in established screening 
procedures, such as tail suspension and forced swim tests, which are predictive of clinical 
antidepressant activity. 3. Adenosine A2A receptor knockout mice were found to be less 

15 sensitive to 'depressant' challenges than their wildtype littermates. Consistently, the adenosine 
A2A receptor blockers SCH 58261 (1 - 10 mg kg(-l), i.p.) and KW 6002 (0.1 - 10 mg kg(-l), 
p.o.) reduced the total immobility time in the tail suspension test. 4. The efficacy of adenosine 
A2A receptor antagonists in reducing immobility time in the tail suspension test was 
confirmed and extended in two groups of mice. Specifically, SCH 58261 (1 - 10 mg kg(-l)) 

20 and ZM 241385 (15 -60 mg kg(-l)) were effective in mice previously screened for having 

high immobility time, while SCH 58261 at 10 mg kg(-l) reduced immobility of mice that were 
selectively bred for their spontaneous 'helplessness' in this assay. 5. Additional experiments 
were carried out using the forced swim test. SCH 58261 at 10 mg kg(-l) reduced the 
immobility time by 61%, while KW 6002 decreased the total immobility time at the doses of 1 

25 and 10 mg kg(-l) by 75 and 79%, respectively. 6. Administration of the dopamine D2 receptor 
antagonist haloperidol (50 - 200 microg kg(-l) i.p.) prevented the antidepressant-like effects 
elicited by SCH 58261 (10 mg kg(-l) i.p.) in forced swim test whereas it left unaltered its 
stimulant motor effects. 7. In conclusion, these data support the hypothesis that A2A receptor 
antagonists prolong escape-directed behaviour in two screening tests for antidepressants. 

30 Altogether the results support the hypothesis that blockade of the adenosine A2A receptor 
might be an interesting target for the development of effective antidepressant agents. 

Blier P. Pharmacology of rapid-onset antidepressant treatment strategies. Clin 
Psychiatry 2001 ;62 Suppl 15:12-7 
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Although selective serotonin reuptake inhibitors (SSRIs) block serotonin (5-HT) 
reuptake rapidly, their therapeutic action is delayed. The increase in synaptic 5-HT activates 
feedback mechanisms mediated by 5-HT1A (cell body) and 5-HT1B (terminal) autoreceptors, 
which, respectively, reduce the firing in 5-HT neurons and decrease the amount of 5-HT 
5 released per action potential resulting in attenuated 5-HT neurotransmission. Long-term 
treatment desensitizes the inhibitory 5-HT1 autoreceptors, and 5-HT neurotransmission is 
enhanced. The time course of these events is similar to the delay of clinical action. The 
addition of pindolol, which blocks 5-HT 1 A receptors, to SSRI treatment decouples the 
feedback inhibition of 5-HT neuron firing and accelerates and enhances the antidepressant 

10 response. The neuronal circuitry of the 5-HT and norepinephrine (NE) systems and their 

connections to forebrain areas believed to be involved in depression has been dissected. The 
firing of 5-HT neurons in the raphe nuclei is driven, at least partly, by alpha 1 -adrenoceptor- 
mediated excitatory inputs from NE neurons. Inhibitory alpha2 -adrenoceptors on the NE 
. neuroterminals form part of a feedback control mechanism. Mirtazapine, an antagonist at 

15 alpha2 -adrenoceptors, does not enhance 5-HT neurotransmission directly but disinhibits the 
NE activation of 5-HT neurons and thereby increases 5-HT neurotransmission by a 
mechanism that does not require a time-dependent desensitization of receptors. These 
neurobiological phenomena may underlie the apparently faster onset of action of mirtazapine 
compared with the SSRIs. 

20 Tranquillini ME, Reggiani A. Glycine-site antagonists and stroke. Expert Opin Investig 

Drugs 1999Nov;8(ll):1837-1848 

The excitatory amino acid, (S)-glutamic acid, plays an important role in controlling 
many neuronal processes. Its action is mediated by two main groups of receptors: the 
ionotropic receptors (which include NMDA, AMPA and kainic acid subtypes) and the 

25 metabotropic receptors (mGluR(l-8)) mediating G-protein coupled responses. This review 
focuses on the strychnine insensitive glycine binding site located on the NMDA receptor 
channel, and on the possible use of selective antagonists for the treatment of stroke. Stroke is a 
devastating disease caused by a sudden vascular accident. Neurochemically, a massive release 
of glutamate occurs in neuronal tissue; this overactivates the NMDA receptor, leading to 

30 increased intracellular calcium influx, which causes neuronal cell death through necrosis. 
NMDA receptor activation strongly depends upon the presence of glycine as a co-agonist. 
Therefore, the administration of a glycine antagonist can block overactivation of NMDA 
receptors, thus preserving neurones from damage. The glycine antagonists currently identified 
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can be divided into five main categories depending on their chemical structure: indoles, 
tetrahydroquinolines, benzoazepines, quinoxalinediones and pyrida-zinoquinolines. 

Monopoli A, Lozza G, Forlani A, Mattavelli A, Ongini E. Blockade of adenosine A2A 
receptors by SCH 58261 results in neuroprotective effects in cerebral ischaemia in rats. 
5 Neuroreport 1998 Dec l;9(17):3955-9 

Blockade of adenosine receptors can reduce cerebral infarct size in the model of global 
ischaemia. Using the potent and selective A2A adenosine receptor antagonist, SCH 58261, we 
assessed whether A2A receptors are involved in the neuronal damage following focal cerebral 
ischaemia as induced by occluding the left middle cerebral artery. SCH 58261 (0.01 mg/kg 
10 either i.p. or i.v.) administered to normotensive rats 10 min after ischaemia markedly reduced 
cortical infarct volume as measured 24 h later (30% vs controls, p < 0.05). Similar effects were 
observed when SCH 58261 (0.01 mg/kg, i.p.) was administered to hypertensive rats (28% 
infarct volume reduction vs controls, p < 0.05). Neuroprotective properties of SCH 58261 
administered after ischaemia indicate that blockade of A2A adenosine receptors is a 
15 potentially useful biological target for the reduction of brain injury. 

Panel 1.3D Summary: Ag2867 Results from one experiment with the NOV 12 gene 
are not included. The amp plot indicates that there were experimental difficulties with this run. 

Panel 2.2 Summary: Ag2867 Results from one experiment with the NOV12 gene are 
not included. The amp plot indicates that there were experimental difficulties with this run. 
20 Panel 4D Summary: Ag2867 Expression of the NOV 12 gene is widespread among 

samples in this panel, with highest expression in B lymphocytes stimulated with CD40L and 
IL-4 (CT-31.1). 

This transcript is also highly expressed in activated B cells and primary resting Thl 
and Th2 T cells. The expression of this transcript in PBMC treated with the B cell mitogen, 

25 PWM, confirms the importance of CG54575-01 gene expression in activated B cells. In 

addition, this transcript is also abundantly expressed on primary resting Thl cells (to a lesser 
degree on primary resting Th2 cells). Therefore, it appears that this gene, encoding a GPCR 
homolog, is a potential new member of the chemokine receptor family. The expression of this 
protein in activated B cells suggests a role for this protein in their trafficking to appropriate 

30 sites where they can fully activate antigen specific T cells. Thus, the protein encoded by this 
gene is likely to participate in the development of immune or inflammatory reactions. 

In addition, the high expression of this gene in the kidney suggests that the putative 
GPCR encoded for by this gene could allow cells within the kidney to respond to specific 
microenvironmental signals (For example, ref. 1). Therefore, antibody or small molecule 



924 



Ill 



n 



111 iP 



therapies designed with the protein encoded for by this gene could modulate kidney function 
and be important in the treatment of inflammatory or autoimmune diseases that affect the 
kidney, including lupus and glomerulonephritis. 
References: 
5 References: 

Mark M.D., Wittemann S., Herlitze S. (2000) G protein modulation of recombinant 
P/Q-type calcium channels by regulators of G protein signalling proteins. J. Physiol. 528 Pt 1 : 
65-77. 

1. Fast synaptic transmission is triggered by the activation of presynaptic Ca2+ 

10 channels which can be inhibited by Gbetagamma subunits via G protein-coupled receptors 

(GPCR). Regulators of G protein signalling (RGS) proteins are GTPase-accelerating proteins 
(GAPs), which are responsible for >1 00-fold increases in the GTPase activity of G proteins 
and might be involved in the regulation of presynaptic Ca2+ channels. In this study we 
investigated the effects of RGS2 on G protein modulation of recombinant P/Q-type channels 

15 expressed in a human embryonic kidney (HEK293) cell line using whole-cell recordings. 2. 
RGS2 markedly accelerates transmitter-mediated inhibition and recovery from inhibition of 
Ba2+ currents (IBa) through P/Q-type channels heterologously expressed with the muscarinic 
acetylcholine receptor M2 (mAChR M2). 3. Both RGS2 and RGS4 modulate the prepulse 
facilitation properties of P/Q-type Ca2+ channels. G protein reinhibition is accelerated, while 

20 release from inhibition is slowed. These kinetics depend on the availability of G protein alpha 
and betagamma subunits which is altered by RGS proteins. 4. RGS proteins unmask the Ca2+ 
channel beta subunit modulation of Ca2+ channel G protein inhibition. In the presence of 
RGS2, P/Q-type channels containing the beta2a and beta3 subunits reveal significantly altered 
kinetics of G protein modulation and increased facilitation compared to Ca2+ channels 

25 coexpressed with the beta lb or beta4 subunit. 
PMID: 11018106 

NOV13a and NOV13b 

Expression of gene NOV 13a and variant NOV 13b was assessed using the primer-probe 
set Ag2869, described in Table IA. Results of the RTQ-PCR runs are shown in Table IB. 

30 Table IA. Probe Name Ag2869 



Primers 


Sequences 


j Length 


Start 
Position 


SEQ ID NO: 


Forward; 


5 ' -tgtctgtggtagacaccacctt-3 ■ 


I 22 


475 


1027 


Probe 


TET-5 ' -ctgaggctaccctaccgaggcagtaa- 3 1 


"I 26 


501 


1028 



925 



t -m s "7 sp si i 'Pi ... n ;ii ""r t o p* 



(tamra 


i i 




Reversej5 1 - cacaaaagaaatgagcaatgct - 3 ' 


22 }528 


1029 



Table IB. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2869, Run 
164311007 


Tissue Name 


Rel. Exp.(%) 
Ag2869, Run 
164311007 


Secondary Thl act 


5.5 


HUVEC IL-lbeta 


1.7 


Secondary Th2 act 


68.3 


HUVEC IFN gamma 


9.5 


Secondary Trl act 


33.9 


HUVEC TNF alpha + IFN 
gamma 


5.6 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


8.9 


Secondary Th2 rest 


7.3 


HUVEC IL-11 


5.5 


Secondary Trl rest 


3.0 


Lung Microvascular EC 
none 


11.0 


Primary Thl act 


40.9 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


12.0 


Primary 1 hz act 


19.1 


Microvascular Dermal EC 
none 


D. 1 


Primary Trl act 


51.4 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


1 2 


Primary Th 1 rest 


23.5 


Bronchial epithelium 
TNFalpha + ILlbeta 


2.4 


Primary Th2 rest 


jSmall airway epithelium 

1 1 .8 

jnone 


0.8 


Primary Trl rest 


17.2 


Small airway epithelium 
TNFalpha + IL-lbeta 


9.6 


CD45RA CD4 
lymphocyte act 


1.6 


Coronery artery SMC rest 


5.5 


CD45RO CD4 
lymphocyte act 


12.8 


Coronery artery SMC 
TNFalpha + IL-lbeta 


2.5 


CD8 lymphocyte act 


1.0 


Astrocytes rest 


6.3 


Secondary CD8 
lymphocyte rest 


2.7 


Astrocytes TNFalpha + 
IL-lbeta 


1.8 


Secondary CD8 
lymphocyte act 


3.4 


KU-8 12 (Basophil) rest 


1.8 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


21.6 


2ryThl/Th2/Trl anti- 
CD95 CH11 


1.5 


CCD1 106 (Keratinocytes) 
none 


6.3 


LAK cells rest 


0.7 


CCD 1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


1.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


5.4 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.9 
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1.nfi'/?n 1 P ,.. O:: 



LAK cells IL-2+IFN 
gamma 


3.1 


NCI-H292 none 

- - — - ~.„„ ~ ~ ~ 


34.2 


t a 1S~ I1„ If Oi TT 10 

LAK cells IL-2+ 1L-1 5 


ZA 


\(py T-J9Q9 TT A 




LAK cells 
PMA/ionomycin 


0.0 


NC1-H292 IL-9 


29.7 


NK Cells IL-2 rest 


1 .3 


XT/^T T-T9Q9 TT 1 1 


99 8 


Two Way MLR 3 day 


0.6 


iNL,i-jrizvz iriN gamma 


9^ 9 
Zj.Z 


Two Way MLR 5 day 


1.1 


HPAEC none 


18.0 


Two Way MLR 7 day 


3.1 


HPAEC TNF alpha + IL-1 
beta 


19.9 


PBMC rest 


0.0 


Lung fibroblast none 


0.8 


PBMC PWM 


4.1 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


9.9 


Lung fibroblast IL-4 


8.9 


Ramos (B cell) none 


50.7 


Lung fibroblast IL-9 


7.9 


Ramos (B cell) 
ionomycin 


1 AA A 


T » ■ *-i rr TiKt'rvnl'lpf TT 1 ^ 

i^ung iiDroDiasi il<-i-> 


4 Q 


B lymphocytes PWM 


1 1 
1 . 1 


Lung fibroblast IFN 
gamma 




B lymphocytes CD40L 
and IL-4 


3.4 


Dermal fibroblast 
CCD 1070 rest 


42.6 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


39.8 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
rrniAin tt 1 k^+o 

ccdiu/u tl-1 oeta 


13.4 

_ L. _ 


Dendritic cells none 


2.1 


Dermal fibroblast IFN 
gamma 


2.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


2.6 


Dendritic cells anti- 
CD40 


0.6 


IBD Colitis 2 


1.1 


Monocytes rest 


0.0 


IBD Crohn's 


2.2 


Monocytes LPS 


0.0 


Colon 


3.2 


Macrophages rest 


4.8 


Lung 


1.0 


Macrophages LPS 


2.4 


Thymus 


4.4 


HUVEC none 


16.5 


Kidney 


16.3 


HUVEC starved 


17.6 







Panel 1.3D Summary: Ag2869 Expression of the NOV 13a gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 
5 Panel 2.2 Summary: Ag2869 Expression of the NOV 13a gene is low/undetectable in 

all samples on this panel (CTs>35). (Data not shown.) 
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Panel 4D Summary: Ag2869 Expression of the NOV 13a gene is widespread among 
the samples in this panel, with highest expression in the B cell line Ramos treated with 
ionomycin (CT=31 .1). Lower but still significant levels of expression are seen in untreated 
Ramos B cells. B cells represent a principle component of immunity and contribute to the 
5 immune response in a number of important functional roles, including antibody production. 
For example, production of antibodies against self-antigens is a major component in 
autoimmune disorders such a systemic lupus erythematosus, with B cells playing a major role. 
Since B cells play an important role in autoimmunity, inflammatory processes and 
inflammatory cascades, therapeutic modulation of this gene product may reduce or eliminate 

10 the symptoms of patients suffering from asthma, allergies, chronic obstructive pulmonary 
disease, emphysema, Crohn's disease, ulcerative colitis, rheumatoid arthritis, psoriasis, 
osteoarthritis, and other autoimmune disorders including systemic lupus erythematosus. 

Significant levels of expression are also seen in IL-4, IL-9, IL-13 and IFN gamma 
activated-NCI-H292 mucoepidermoid cells as well as untreated NCI-H292 cells. Moderate 

1 5 expression is also detected in both treated and untreated human pulmonary aortic endothelial 
cells The expression of this gene in cells derived from or within the lung suggests that this 
gene may be involved in normal conditions as well as pathological and inflammatory lung 
disorders that include chronic obstructive pulmonary disease, asthma, allergy and emphysema 

NOV14 

20 Expression of gene NOV 14 was assessed using the primer-probe set Ag2870, 

described in Table JA. Results of the RTQ-PCR runs are shown in Tables JB and JC. 

Table JA. Probe Name Ag2870 



Primers; 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward? 


5 1 -ttctcctgagagcaagaagttg-3 ' 


22 


823 


1030 


Probe 


TET-5 ■ -tgtgactcccatgttgaaccccatta-3 ' - 
TAMRA 


26 


865 


1031 


Reverse 


5 1 -tcttcacctcgctatttctcaa-3 ' 


22 


899 


1032 



Table JB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag2870, 
Run 167646334 


Tissue Name 


Rel. Exp.(%) Ag2870, 
Run 167646334 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


0.0 


Pancreas 


0.0 


Renal ca. 786-0 


2.5 


Pancreatic ca. CAP AN 

2 


0.0 


Renal ca. A498 


0.0 
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AHrpnal p"1anH 


0 0 


Renal ca RXF 191 


0.0 


T'h vroiH 

I 1 IV 1 KJ IU 


0.0 


Renal ca ACHN 


0.0 


Salivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


A A 

0.0 


Kenal ca. I K-l 0 


l.b 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.0 


Liver (fetal) 


0.0 


Brain (amygdala) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain (hippocampus) 


0.0 


Lung (fetal) 


0.0 


Brain (substantia nigra) 


A (\ 

o.o 


Lung ca. (small cell) 
LX-l 


A A 
0.0 


Brain (thalamus) 


1 


Lung ca. (small cell) 
NCI-H69 


A A 
0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


A A 
0.0 


Spinal cord 


A A 

0.0 


Lung ca. (large 
cell)NCI-H460 


A A 
0.0 


glio/astro U5/-JV10 


OA A 

20.0 


Lung ca. (non-sm. 
cell) A549 


A A 
0.0 


, T |' . /,, IT 1 1 O TV A 

glio/astro u-llo-JVl(j 


A A 

0.0 


Lung ca. (non-s.cell) 
NCI-H23 


A A 
0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.0 


Mammary gland 


0.0 


glioma U251 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


3.8 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) 

r r/i7n 


0.0 


Heart 


0.0 


Breast ca. BT-549 


0.0 


Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 ! 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 

3 


0.0 


Thymus 


8.7 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 


28.9 
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5 




Lymph node 


0.0 


Ovarian ca. OVCAR- 
8 


13.4 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


21.3 


Small intestine 


0.0 


Uterus 


0.0 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca,* 
SW620(SW480 met) 


100.0 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


5.0 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03 866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.0 


Melanoma* (met) 
SK-MEL-5 


33.9 


Kidney 


0.0 jAdipose 


0.0 



Table JC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2870, Run 
164328103 


Tissue Name 


Rel. Exp.(%) 
Ag2870, Run 
164328103 


Secondary Th 1 act 


9.7 


HUVEC IL-lbeta 


9.3 


Secondary Th2 act 


30.4 


HUVEC IFN gamma 


11.6 


Secondary Trl act 


25.2 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Th 1 rest 


0.2 


HUVEC TNF alpha + IL4 


8^2 


Secondary Th2 rest 


1.9 


HUVEC IL-1 1 


0.0 


Secondary Trl rest j 


2.0 


Lung Microvascular EC 
none 


11.5 


Primary Th 1 act 


16.7 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


3.6 


Primary Th2 act 


12.3 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


29.1 


Microsvasular Dermal EC 


2.6 



930 








TNFalpha + IL-lbeta 




rruiidry 1 11 1 icsi 




Bronchial epithelium 
TNFalpha + ILlbeta 




Primary Th2 rest 


1.1 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


4.5 


Small airway epithelium 
TNFalpha + IL-lbeta 


13.3 


CD45RA CD4 
lymphocyte act 


3.2 


Coronery artery SMC rest 


1.7 


CD45RO CD4 
lymphocyte act 


6.5 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


1.6 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


A A 
U.U 


Astrocytes TNFalpha + 
IL-lbeta 


A A 
U.U 


Secondary CD8 
lymphocyte act 


1.4 


KU-812 fBasonhin rest 


2.4 


CD4 Ivmnhocvte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


8.7 


2ryThl/Th2/Trl anti- 
CD95 CHI 1 


4.7 


CCD1 106 (Keratinocytes) 
none 


6.6 


LAK cells rest 


0.0 


CCD1 106 (Keratinocytes) 
1 Nr alpha + 1L- 1 beta 


0.0 


T A 1/ 11., II 

LAK cells 1L-2 


0.0 


Liver cirrhosis 


1.2 


T A IS ~ ~ 1 1 ^. IT 1 i If 

LAK cells 1L-2+IL-I2 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0.6 


NCI-H292 none 


30.8 


LAK cells 1L-Z+ 1L-1 o 


A A 


XT^T T-JOQO TT A 


dZ.j 


LAK cells 
PMA/ionomycin 


0.4 


NCI-H292 IL-9 


26.8 


INK Cells iL-z rest 


A A 
U.U 




111 
1 1 .J 


i wo way mlk 3 day 


A A 
U.U 


XTf^T UOQO IT7XT rrorv^mo 

INLJ-Hzyz lriN gamma 


1 1 1 

Zl.j 


Two Way MLR 5 day 


0.0 


HPAEC none 


9.2 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


8.0 


PBMC rest 


0.0 


Lung fibroblast none 


1.9 


PBMC PWM 


0.9 


Lung fibroblast TNF alpha 
+ IL-1 beta 


1.8 


PBMC PHA-L 


5.0 


Lung fibroblast IL-4 


7.5 


Ramos (B cell) none 


36.3 


Lung fibroblast IL-9 


7.0 


Ramos (B cell) 
ionomycin 


100.0 


Lung fibroblast IL-1 3 


0.5 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


1.4 


B lymphocytes CD40L 
and IL-4 


3.1 


Dermal fibroblast 
CCD 1070 rest 


26.8 
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;i ii I n 7 r* n 3 p , o t n .1 si p s 



EOL-l dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


23.2 


BOL-l dbcAMP 


0.0 


Dermal fibroblast 


14.6 


PN/f A /innnmurin 
r lVLr\J lUIHJIIIj'Olll 


fTTH 070 it 1 Kpta 


Dendritic cells none 


0.0 


nprmal fihrnhtaQt TFTsJ 
LyCIIIldl IlUlUulciSl iriN 

gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


2.3 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.4 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


1.8 


Macrophages rest 


3.5 


Lung 


0.0 


Macrophages LPS 


0.0 


Thymus 


0.3 


HUVEC none 


12.9 


Kidney 


8.1 


HUVEC starved 


22.7 







Panel 1.3D Summary: Ag2870 Expression of the NOV 14 gene is restricted to a 
sample derived from a colon cancer cell line (CT=34.4). Thus, expression of this gene could 
be used to differentiate between this sample and other samples on this panel and as a marker to 
5 detect the presence of colon cancer. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of colon cancer. 

Panel 2.2 Summary: Ag2870 Expression of the NOV 14 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2870 Expression of the NOV14 gene is highest in the B cell 
10 line Ramos treated with ionomycin (CT=30.2). . Lower but still significant levels of 

expression are seen in untreated Ramos B cells. . B cells represent a principle component of 
immunity and contribute to the immune response in a number of important functional roles, 
including antibody production. For example, production of antibodies against self-antigens is a 
major component in autoimmune disorders such a systemic lupus erythematosus, with B cells 
15 playing a major role. Since B cells play an important role in autoimmunity, inflammatory 

processes and inflammatory cascades, therapeutic modulation of this gene product may reduce 
or eliminate the symptoms of patients suffering from asthma, allergies, chronic obstructive 
pulmonary disease, emphysema, Crohn's disease, ulcerative colitis, rheumatoid arthritis, 
psoriasis, osteoarthritis, and other autoimmune disorders including systemic lupus 
20 erythematosus. 

Significant levels of expression are also seen in IL-4, IL-9, IL-13, IFN gamma 

activated and untreated NCI-H292 mucoepidermoid cells, IL-4, IL-9, IL-13 and IFN gamma 

activated lung fibroblasts, human pulmonary aortic endothelial cells (treated and untreated), 
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treated small airway epithelium and lung microvascular endothelial cells (treated and 
untreated). The expression of this gene in cells derived from or within the lung further 
suggests that this gene may be involved in normal conditions as well as pathological and 
inflammatory lung disorders that include chronic obstructive pulmonary disease, asthma, 
5 allergy and emphysema. 

NO VI 5a and NO VI 5b 

Expression of gene NOV 15a and variant NOV 15b was assessed using the primer-probe 
set Ag2875, described in Table KA. Results of the RTQ-PCR runs are shown in Table KB. 



Table KA. Probe Name Ag2875 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -gaacatcatctcctaccctgaa-3 1 


22 


268 


1033 


Probe 


TET-5 ' -tgcatgactcagctctacttcttcctcg- 
3 1 -TAMRA 


28 


290 


1034 


Reverse 


5' -atgtgacactctgcaatagcaa-3 1 


22 


321 


1035 



10 



Table KB. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2875, Run 
164311029 


Tissue Name 


Rel. Exp.(%) 
Ag2875, Run 
164311029 


Secondary Th 1 act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


3.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Th 1 rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 * 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 
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CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-l beta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


yj.y) 


Astrocytes TNFalpha + 
IL-l beta 


u.u 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


24.0 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


5.1 


CCD1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


CCD1 106 (Keratinocytes) 
1 N b alpha + 1L- 1 beta 


0.0 


T A I 7 1 1 TT 

LAK cells IL-2 


0.0 


Liver cirrhosis 


12.2 


T A IV 11^ TT ^ i II 1 >-» 

LAK cells 1L-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


20.7 


T A I y — 1 1 _ TT O i TT 1 O 

LAK cells IL-2+ IL-l 8 


0.0 


NCI-H292 IL-4 


a a 

0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


IVT/^T 1 IO AO TI 1 

NC1-H292 1L-13 


a a 

0.0 


Two Way MLR 3 day 


a a 

0.0 


NC1-H292 IrN gamma 


A A 

0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-l 
beta 


1.4 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-l beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0 0 


T una ti r\rr\riitict' TT _ 1 ^ 
ijUllg HUlUUlaM 1 L< 1 -J 


0 0 


o lympnocytes r w ivi 




Lung fibroblast IFN 
gamma 




B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


3.1 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-l beta 


0.0 


Dendritic cells none 


33.2 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


2.4 


Dermal fibroblast IL-4 


1.9 
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Dendritic cells anti- 
CD40 


100.0 


IBD Colitis 2 


16.3 


Monocytes rest 


0.0 


IBD Crohn's 


6.7 


Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest 


44.1 


Lung 


21.8 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


3.1 


Kidney 


0.0 


HUVEC starved 


0.0 





Panel 1.3D Summary: Ag2875 Results from one experiment with the NOV 15a gene 
are not included. The amp plot indicates that there were experimental difficulties with this run. 

Panel 2.2 Summary: Ag2875 Expression of the NOV 15a gene is low/undetectable in 
5 all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2875 Highest expression of the NOV15a is in anti-CD40 
treated dendritic cells (CT=33.2), with much lower expression in untreated dendritic cells. 
Thus, this gene product may be important in dendritic cell activation. Significant expression of 
this gene is also seen in liver cirrhosis. This gene encodes a putative GPCR; therefore, 
10 antibodies or small molecule therapeutics could reduce or inhibit fibrosis that occurs in liver 
cirrhosis. In addition, antibodies to this putative GPCR could also be used for the diagnosis of 
liver cirrhosis. In addition, significant expression of this gene is seen in resting macrophages. 
The putative GPCR encoded for by this transcript may therefore be important in macrophage 
detection of chemokine gradients and trafficking into specific sites within a tissue and in 
15 activation. Antibody or protein therapeutics designed against the protein encoded for by this 
transcript could reduce or inhibit inflammation in asthma, emphysema, allergy, psoriasis, 
arthritis, or any other condition in which macrophage localization/activation is important. 

NOV16A: Olfactory Receptor 

Expression of gene NOV 16a was assessed using the primer-probe set Ag2876, 
20 described in Table LA. Results of the RTQ-PCR runs are shown in Tables LB and LC. 

Table LA. Probe Name Ag2876 



Primers 


Sequences 


1 Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -acatcatctcctaccctgaatg-3 1 


J22 


273 


1036 


Probe 


TET-5 1 -catgactcagctttacttcttcctcatt- 
3 * -TAMRA 


1 


295 


1037 


Reverse 


5 ' -tacagccaacatgtgacactct-3 ' 


I 22 


334 


1038 



935 



■J 11017 311.1 r r ... O 1 
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Table LB. Panel 1.3D 



Tissue Name 


Rcl. Exp.(%) Ag2876, 
Run 167646343 


Tissue Name 


Rel. Exp.(%) Ag2876, 
Run 167646343 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


o.u 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


0.0 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 


Salivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.0 


Liver (fetal) 


0.0 


Brain (amygdala) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Dltllll ^IlippUUdlllJJU^ 


0 0 


T iin? f fetal ^ 


0.0 


Brain (substantia nigra) 


0.0 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


1.7 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 

~»"— — . — — ™ 


glio/astroU-118-MG 


—— — — . — . — , 

0.0 


Lung ca. (non-s.cell) 
NCI-H23 


. 0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.0 


Mammary gland 


0.0 


glioma U251 


1.4 


Breast ca.* (pl.ef) 
MCF-7 


19.2 


glioma SF-295 


2.6 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) 
T47D 


100.0 


Heart 


0.0 


Breast ca. BT-549 


| 0.0 
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Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


2.1 


Ovary 


0.0 


Bone marrow 


u.u 


Ovarian ca. OVCAR- 

3 


0 0 


Thymus 


n ft 
U.U 


Ovarian ca. OVCAR- 
4 


u.u 


Snleen 

Lf Ivvl 1 


0.0 


Ovarian ca. OVCAR- 
5 


0.0 


T vmnh noHo 

L/ J 1 1 1 1 1 1 1 \J V.I V> 


0.0 


Ovarian ca. OVCAR-i 
8 


0.0 


Cnlnrpotal 


2.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


0.0 


Uterus 


0.0 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


0.0 


Colon ra HT?Q 


0.0 


Prostate ca.* (bone 
met)PC-3 


3.6 


Colon ca. HCT-116 


0.0 


Testis 


8.5 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


24.7 


Gastric ca.* (liver met) 
NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.0 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.0 


Adipose 


0.0 



Table LC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2876, Run 
164328133 


Tissue Name 


Rel. Exp.(%) 
Ag2876, Run 
164328133 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


o.b 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 



937 



SppnnHarv Trl rp<:t 

OV^V^V^l 1 VJ c 1 1 J 111 It-M 


0 0 


Lung Microvascular EC 
none 


0 0 
yj. \j 


Pnmarv Th 1 


0 0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


A A 


Pnmarv Th'} art 
i i imai j iii^ awl 


0 0 

VJ.VJ 


Microvascular Dermal EC 
none 


a o 

u.u 


Pnmar\/ TV1 art 
1 1 lllldl 111 clCL 


A A 

u.u 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


n a 
u.u 


Primar\/ T^Vi 1 tv^ct 
riiiiidiy liii iCoi 


a a 

U.U 


Bronchial epithelium 
TNFalpha + IL1 beta 


a a 
u.u 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


2.1 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


9.7 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


u.u 


Astrocytes TNFalpha + 
IL-lbeta 


A A 
U.U 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


30.6 


2ry Thl/Th2/Trl anti- 
CD95CH11 


1.5 


CCD 1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


CCD1 106 (Keratinocytes) 
i JNr aipna * il>- i oeta 


0.0 


T Ak' rrllc TT 7 
l_y/\IV 1J-/-Z 


a a 
u.u 


Liver cirrnosis 


lUU.U 




a n 
u.u 


T lltMIC If 1 /T f\ C*\ / 

Lupus Kiuney 


u.u 


LAK cells IL-2+IFN 

JJ,ctll IIIlcl 


0.0 


NCI-H292 none 


0.0 


T AK opIIq TT _?+ TT -1 8 


a a 

u.u 


TsJr'T T-F9Q9 TT A 


A A 
U.U 


LAK cells 

rivirv/ 1 vjiivjin y em 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 

X ^ -ft. ^ I 1 J A J / Z-r ft V J ft* 


0.0 


NCI-H292 IL-13 


0 0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


8.8 


Two Way MLR 5 day 1 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + 1L-1 
beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 



■i rsi!i"..v';pn m z» ,. n i n sit* 

m 
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Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


4.8 


Ramos (B cell) 
ionomycin 


0 0 


I uno fiVMvihliact TT —1 1 
l^UHg I1UI vJUldc>l 1J_/ 1 _> 




o iy iiipinjL/y ics> r wivi 




Lung fibroblast IFN 
gamma 


1 ^ A. 
1 3.D 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD1 070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 


5.4 


PMA/ionomycin 


ccuiu/UJL-i beta 


Dendritic cells none 


10.2 


Dermal fibroblast IrN 
gamma 

<-—,C* 1 11111 ct 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


19.3 


Dendritic cells anti- 
CD40 


18.8 


IBD Colitis 2 


15.3 


Monocytes rest 


0.0 


IBD Crohn's 


2.2 


Monocytes LPS 


20.6 


Colon 


4.1 


Macrophages rest 


14.2 


Lung 


21.0 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







Panel 1.3D Summary: Ag2876 Expression of the NOV 16a gene is restricted to a 
sample derived from a breast cancer cell line (CT=32.5). Thus, expression of this gene could 
be used to differentiate between this sample and other samples on this panel and as a marker to 
5 detect the presence of breast cancer. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of breast cancer. 

Panel 4D Summary: Ag2876 Significant expression of the NOV 16a gene is detected 
in a liver cirrhosis sample (CT = 33.5). Furthermore, expression of this gene is not detected in 
normal liver in Panel 1.3D, suggesting that its expression is unique to liver cirrhosis. This gene 
10 encodes a putative GPCR; therefore, antibodies or small molecule therapeutics could reduce or 
inhibit fibrosis that occurs in liver cirrhosis. In addition, antibodies to this putative GPCR 
could also be used for the diagnosis of liver cirrhosis. 

NOV17a and NOV17b 

Expression of gene NOV 17a and variant NOV 17b was assessed using the primer-probe 
1 5 set Ag2969, described in Table MA. Results of the RTQ-PCR runs are shown in Tables MB 
and MC. 
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Table MA. Probe Name Ag2969 



Primers 


Sequences 




Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -cacccatgtatttcctgcttag-3 ■ 


22 


184 


1039 


Probe 


TET-5 ■ -tcagctctccctcattgacctaaatt-3 ' - 
TAMRA 


26 


206 


1040 


Reverse; 


5 1 -tcagaagccatcttaggaacaa-3 1 


22 


244 


1041 



Table MB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag2969, 
Run 209778982 


Tissue Name 


Rel. Exp.(%) Ag2969, 
Run 209778982 


AD 1 Hippo 


0.5 


Control (Path) 3 
Temporal Ctx 


0.5 


AD 2 Hippo 


9.1 


Control (Path) 4 
Temporal Ctx 


23.7 


AD 3 Hippo 


0.7 


AD 1 Occipital Ctx 


4.2 


AD 4 Hippo 


1.6 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


100.0 


AD 3 Occipital Ctx 


1.9 


AD 6 Hippo 


24.3 


AD 4 Orrinital CYv 




Control 2 Hippo 




AD 5 Occipital Ctx 


17.8 


v^oniroi *+ riippo 


Z. 1 


AD o Uccipital Ctx 


14.5 


Control (Path) 3 

Hinnn 


1.1 


Control 1 Occipital 

l^lX 


0.8 


AD 1 Temporal Ctx 


2.0 


Control 2 Occipital 
t^tx 


29.7 


AD 2 Temporal Ctx 


15.5 


Control "3 Oppinital 

Ctx 


13.9 


AD 3 Temporal Ctx 


2.4 


Control 4 Occipital 
Ctx 


1.0 


AD 4 Temporal Ctx 


9.7 


Control (Path) 1 
Occipital Ctx 


65.5 


AD 5 Inf Temporal 
Ctx 


91.4 


Control (Path) 2 
Occipital Ctx 


4.8 


AD 5 Sup Temporal \ 
Ctx 


27.9 


Control (Path) 3 
Occipital Ctx 


0.0 


AD 6 Inf Temporal 
Ctx 


42.9 


Control (Path) 4 
Occipital Ctx 


13.6 


AD 6 Sup Temporal 
Ctx 


39.2 


Control 1 Parietal 
Ctx 


1.2 


Control 1 Temporal ; 
Ctx 


0.0 


Control 2 Parietal 
Ctx 


28.9 


Control 2 Temporal 
Ctx 


11.8 


Control 3 Parietal 
Ctx 


9.6 


Control 3 Temporal 


9.3 


Control (Path) 1 


48.0 
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Ctx 




Parietal Ctx 




Control 3 Temporal 
Ctx 


2.5 


Control (Path) 2 
Parietal Ctx 


12.0 


Control (Path) 1 
Temporal Ctx 


38.4 


Control (Path) 3 
Parietal Ctx 


0.0 


Control (Path) 2 
Temporal Ctx 


12.2 


Control (Path) 4 
Parietal Ctx 


41.2 



Table MC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2969, Run 
164402668 


Tissue Name 


ReL Exp.(%) 
Ag2969, Run 
164402668 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


3.0 


HUVEC IFN gamma 


2.8 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


6.1 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


4.1 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


U.U 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


u.u 


Primary Thl rest 


10.5 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


13.6 


Small airway epithelium 
none 


0.0 ! 


Primary Trl rest 


7.9 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


2.0 


CD45RO CD4 
lymphocyte act 


7.5 


Coronery artery SMC 
TNFalpha + IL-lbeta 


3.4 


CD8 lymphocyte act 


4.9 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


4.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-8 12 (Basophil) rest 


8.5 


CD4 lymphocyte none 


16.7 


KU-8 12 (Basophil) 
PM A/ionom y c i n 


51.1 


2ry Thl/Th2/Trl_anti- 


3.2 


CCD1 106 (Keratinocytes) j 


0.0 
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CD95 CH11 




none 




LAK cells rest 


23.2 


CCD1 106 (Keratinocytes) 
i iNraipna -r il- i oeia 


0.0 




9Q ^ 


Liver cirrhosis 


i o /: 
I O.O 


1 v/\ rv CeiiS 1 1_,-Z ^1L- 1 Z 


zz.z 


Lupus kidney 


0.0 


LAK cells 1L-2+IFN 
gamma 


28.5 


NCI-H292 none 


0.0 


L/viv ceils 1L-Z+ il-io 


") 1 A 

Z 1 .U 


NCl-Hz9z IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


in is. L^ens iiv-z rest 


/.z 




A A 
0.0 


iwo way mlk 3 day 


J 1 .4 


NC1-H292 IrN gamma 


0.0 


Two Way MLR 5 day 


18.4 


HPAEC none 


0.0 


Two Way MLR 7 day 


2.9 


HPAEC TNF alpha + IL-1 
beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


45.7 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


13.8 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 




Lung fibroblast IL-13 


A A 

U.U 


T-l Ivmnnnpvtpc F> \\/ \ A 
o iyni|jinjcy ici> i w ivi 


1 ? O 
I z.w 


Lung fibroblast IFN 
gamma 


u.u 


B lymphocytes CD40L 
and IL-4 


39.2 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


1.1 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
r ivj / vy lunoniyL'in 


0.0 


Dermal fibroblast 
lluiu/u il-i Deta 


0.0 


Dendritic cells none 


4.2 


i^ermai iiuroDiast irJN 
gamma 


0.0 


Dendritic cells LPS 


5.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


6.4 


IBD Colitis 2 


11.1 


Monocytes rest 


1.5 


IBD Crohn's 


0.0 


Monocytes LPS 


10.7 


Colon 


0.0 


Macrophages rest 


11.4 


Lung 


2.4 


Macrophages LPS 


3.7 


Thymus 


0.0 


HUVEC none 


0.0 


Kidney 


100.0 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag2969 The NOV17a gene represents a 
novel G-protein coupled receptor (GPCR) with expression in the brain. The GPCR family of 
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receptors contains a large number of neurotransmitter receptors, including the dopamine, 
serotonin, a and b-adrenergic, acetylcholine muscarinic, histamine, peptide, and metabotropic 
glutamate receptors. GPCRs are excellent drug targets in various neurologic and psychiatric 
diseases. All antipsychotics have been shown to act at the dopamine D2 receptor; similarly 
5 novel antipsychotics also act at the serotonergic receptor, and often the muscarinic and 
adrenergic receptors as. well. While the majority of antidepressants can be classified as 
selective serotonin reuptake inhibitors, blockade of the 5-HT1 A and a2 adrenergic receptors 
increases the effects of these drugs. The GPCRs are also of use as drug targets in the treatment 
of stroke. Blockade of the glutamate receptors may decrease the neuronal death resulting from 

10 excitotoxicity; further more the purinergic receptors have also been implicated as drug targets 
in the treatment of cerebral ischemia. The b-adrenergic receptors have been implicated in the 
treatment of ADHD with Ritalin, while the a-adrenergic receptors have been implicated in 
memory. Therefore this gene may be of use as a small molecule target for the treatment of any 
of the described diseases. 

15 In addition, this panel shows that this GPCR is upregulated in the temporal cortex of 

Alzheimer's disease patients. Therefore, blockade of this receptor may be of use in the 
treatment of this disease and decrease neuronal death. 
References: 

El Yacoubi M, Ledent C, Parmentier M, Bertorelli R, Ongini E, Costentin J, Vaugeois 
20 JM. Adenosine A2A receptor antagonists are potential antidepressants: evidence based on 
pharmacology and A2A receptor knockout mice. Br J Pharmacol 2001 Sep;134(l):68-77 

1. Adenosine, an ubiquitous neuromodulator, and its analogues have been shown to 
produce 'depressant' effects in animal models believed to be relevant to depressive disorders, 
while adenosine receptor antagonists have been found to reverse adenosine-mediated 
25 'depressant' effect. 2. We have designed studies to assess whether adenosine A2A receptor 

antagonists, or genetic inactivation of the receptor would be effective in established screening 
procedures, such as tail suspension and forced swim tests, which are predictive of clinical 
antidepressant activity. 3. Adenosine A2A receptor knockout mice were found to be less 
sensitive to 'depressant' challenges than their wildtype littermates. Consistently, the adenosine 
30 A2A receptor blockers SCH 58261 (1 - 10 mg kg(-l), i.p.) and KW 6002 (0.1 - 10 mg kg(-l), 
p.o.) reduced the total immobility time in the tail suspension test. 4. The efficacy of adenosine 
A2A receptor antagonists in reducing immobility time in the tail suspension test was 
confirmed and extended in two groups of mice. Specifically, SCH 58261 (1 - 10 mg kg(-l)) 
and ZM 241385 (15 -60 mg kg(-l)) were effective in mice previously screened for having 
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high immobility time, while SCH 58261 at 10 mg kg(- 1 ) reduced immobility of mice that were 
selectively bred for their spontaneous 'helplessness' in this assay. 5. Additional experiments 
were carried out using the forced swim test. SCH 58261 at 10 mg kg(-l) reduced the 
immobility time by 61%, while KW 6002 decreased the total immobility time at the doses of 1 
5 and 10 mg kg(-l) by 75 and 79%, respectively. 6. Administration of the dopamine D2 receptor 
antagonist haloperidol (50 - 200 microg kg(-l) i.p.) prevented the antidepressant-like effects 
elicited by SCH 58261 (10 mg kg(-l) i.p.) in forced swim test whereas it left unaltered its 
stimulant motor effects. 7. In conclusion, these data support the hypothesis that A2A receptor 
antagonists prolong escape-directed behaviour in two screening tests for antidepressants. 

1 0 Altogether the results support the hypothesis that blockade of the adenosine A2A receptor 
might be an interesting target for the development of effective antidepressant agents. 

Blier P. Pharmacology of rapid-onset antidepressant treatment strategies. Clin 
Psychiatry 2001;62 Suppl 15:12-7 

Although selective serotonin reuptake inhibitors (SSRIs) block serotonin (5-HT) 

15 reuptake rapidly, their therapeutic action is delayed. The increase in synaptic 5-HT activates 
feedback mechanisms mediated by 5-HT1 A (cell body) and 5-HT1 B (terminal) autoreceptors, 
which, respectively, reduce the firing in 5-HT neurons and decrease the amount of 5-HT 
released per action potential resulting in attenuated 5-HT neurotransmission. Long-term 
treatment desensitizes the inhibitory 5-HT1 autoreceptors, and 5-HT neurotransmission is 

20 enhanced. The time course of these events is similar to the delay of clinical action. The 
addition of pindolol, which blocks 5-HT1 A receptors, to SSRI treatment decouples the 
feedback inhibition of 5-HT neuron firing and accelerates and enhances the antidepressant 
response. The neuronal circuitry of the 5-HT and norepinephrine (NE) systems and their 
connections to forebrain areas believed to be involved in depression has been dissected. The 

25 firing of 5-HT neurons in the raphe nuclei is driven, at least partly, by alpha 1 -adrenoceptor- 
mediated excitatory inputs from NE neurons. Inhibitory alpha2-adrenoceptors on the NE 
neuroterminals form part of a feedback control mechanism. Mirtazapine, an antagonist at 
alpha2 -adrenoceptors, does not enhance 5-HT neurotransmission directly but disinhibits the 
NE activation of 5-HT neurons and thereby increases 5-HT neurotransmission by a 

30 mechanism that does not require a time-dependent desensitization of receptors. These 

neurobiological phenomena may underlie the apparently faster onset of action of mirtazapine 
compared with the SSRIs. 

Tranquillini ME, Reggiani A. Glycine-site antagonists and stroke. Expert Opin Investig 
Drugs 1999Nov;8(ll):1837-1848 
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The excitatory amino acid, (S)-glutamic acid, plays an important role in controlling 
many neuronal processes. Its action is mediated by two main groups of receptors: the 
ionotropic receptors (which include NMDA, AMPA and kainic acid subtypes) and the 
metabotropic receptors (mGluR(l-8)) mediating G-protein coupled responses. This review 
5 focuses on the strychnine insensitive glycine binding site located on the NMDA receptor 

channel, and on the possible use of selective antagonists for the treatment of stroke. Stroke is a 
devastating disease caused by a sudden vascular accident. Neurochemically, a massive release 
of glutamate occurs in neuronal tissue; this overactivates the NMDA receptor, leading to 
increased intracellular calcium influx, which causes neuronal cell death through necrosis. 

10 NMDA receptor activation strongly depends upon the presence of glycine as a co-agonist. 
Therefore, the administration of a glycine antagonist can block overactivation of NMDA 
receptors, thus preserving neurones from damage. The glycine antagonists currently identified 
can be divided into five main categories depending on their chemical structure: indoles, 
tetrahydroquinolines, benzoazepines, quinoxalinediones and pyrida-zinoquinolines. 

15 Monopoli A, Lozza G, Forlani A, Mattavelli A, Ongini E. Blockade of adenosine A2A 

receptors by SCH 58261 results in neuroprotective effects in cerebral ischaemia in rats. 
Neuroreport 1998 Dec l;9(17):3955-9 

Blockade of adenosine receptors can reduce cerebral infarct size in the model of global 
ischaemia. Using the potent and selective A2A adenosine receptor antagonist, SCH 58261, we 

20 assessed whether A2A receptors are involved in the neuronal damage following focal cerebral 
ischaemia as induced by occluding the left middle cerebral artery. SCH 58261 (0.01 mg/kg 
either i.p. or i.v.) administered to normotensive rats 10 min after ischaemia markedly reduced 
cortical infarct volume as measured 24 h later (30% vs controls, p < 0.05). Similar effects were 
observed when SCH 58261 (0.01 mg/kg, i.p.) was administered to hypertensive rats (28% 

25 infarct volume reduction vs controls, p < 0.05). Neuroprotective properties of SCH 58261 
administered after ischaemia indicate that blockade of A2A adenosine receptors is a 
potentially useful biological target for the reduction of brain injury. 

Panel 1.3D Summary: Ag2878 Expression of the NOV 17a gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) 

30 Panel 4D Summary: Ag2878 Expression of the NOV17a gene is restricted to a few 

samples in this panel, with highest expression in the kidney (CT=33.1). Thus, the putative 
GPCR encoded for by this gene could allow cells within the kidney to respond to specific 
microenvironmental signals (For example, ref. 1). Therefore, antibody or small molecule 
therapies designed with the protein encoded for by this gene could modulate kidney function 
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and be important in the treatment of inflammatory or autoimmune diseases that affect the 
kidney, including lupus and glomerulonephritis. 

Furthermore, significant levels of expression are also seen in the PMA and ionomycin 
treated basophil cell line KU-812. GPCR-type receptors are important in multiple 
5 physiological responses mediated by basophils (ref 2). Therefore, antibody or small molecule 
therapies designed with the protein encoded for by this gene could also block or inhibit 
inflammation or tissue damage due to basophil activation in response to asthma, allergies, 
hypersensitivity reactions, psoriasis, and viral infections. 
References: 

10 1. Mark M.D., Wittemann S., Herlitze S. (2000) G protein modulation of recombinant 

P/Q-type calcium channels by regulators of G protein signalling proteins. J. Physiol. 528 Pt 1 : 
65-77. 

1. Fast synaptic transmission is triggered by the activation of presynaptic Ca2+ 
channels which can be inhibited by Gbetagamma subunits via G protein-coupled receptors 

15 (GPCR). Regulators of G protein signalling (RGS) proteins are GTPase-accelerating proteins 
(GAPs), which are responsible for > 100-fold increases in the GTPase activity of G proteins 
and might be involved in the regulation of presynaptic Ca2+ channels. In this study we 
investigated the effects of RGS2 on G protein modulation of recombinant P/Q-type channels 
expressed in a human embryonic kidney (HEK293) cell line using whole-cell recordings. 2. 

20 RGS2 markedly accelerates transmitter-mediated inhibition and recovery from inhibition of 
Ba2+ currents (IBa) through P/Q-type channels heterologously expressed with the muscarinic 
acetylcholine receptor M2 (mAChR M2). 3. Both RGS2 and RGS4 modulate the prepulse 
facilitation properties of P/Q-type Ca2+ channels. G protein reinhibition is accelerated, while 
release from inhibition is slowed. These kinetics depend on the availability of G protein alpha 

25 and betagamma subunits which is altered by RGS proteins. 4. RGS proteins unmask the Ca2+ 
channel beta subunit modulation of Ca2+ channel G protein inhibition. In the presence of 
RGS2, P/Q-type channels containing the beta2a and beta3 subunits reveal significantly altered 
kinetics of G protein modulation and increased facilitation compared to Ca2+ channels 
coexpressed with the beta lb or beta4 subunit. 

30 PMID: 11018106 

2. Heinemann A., Hartnell A., Stubbs V.E., Murakami K., Soler D., LaRosa G., 
Askenase P.W., Williams T.J., Sabroe I. (2000) Basophil responses to chemokines are 
regulated by both sequential and cooperative receptor signaling. J. Immunol. 165: 7224-7233. 
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To investigate human basophil responses to chemokines, we have developed a 
sensitive assay that uses flow cytometry to measure leukocyte shape change as a marker of cell 
responsiveness. PBMC were isolated from the blood of volunteers. Basophils were identified 
as a single population of cells that stained positive for IL-3Ralpha (CDwl23) and negative for 
5 HLA-DR, and their increase in forward scatter (as a result of cell shape change) in response to 
chemokines was measured. Shape change responses of basophils to chemokines were highly 
reproducible, with a rank order of potency: monocyte chemoattractant protein (MCP) 4 (peak 
at /= eotaxin-2 = eotaxin-3 >/= eotaxin > MCP-1 = MCP- 3 > macrophage-inflammatory 
protein- 1 alpha > RANTES = MCP-2 - IL-8. The CCR4-selective ligand macrophage-derived 

10 chemokine did not elicit a response at concentrations up to 10 nM. Blocking mAbs to CCR2 
and CCR3 demonstrated that responses to higher concentrations (>10 nM) of MCP-1 were 
mediated by CCR3 rather than CCR2, whereas MCP-4 exhibited a biphasic response 
consistent with sequential activation of CCR3 at lower concentrations and CCR2 at 10 nM 
MCP-4 and above. In contrast, responses to MCP-3 were blocked only in the presence of both 

15 mAbs, but not after pretreatment with either anti-CCR2 or anti-CCR3 mAb alone. These 

patterns of receptor usage were different from those seen for eosinophils and monocytes. We 
suggest that cooperation between CCRs might be a mechanism for preferential recruitment of 
basophils, as occurs in tissue hypersensitivity responses in vivo. 
PMID: 11120855 
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NOV17c 



Expression of gene NOV 17c, also known as CG56659-02, was assessed using the 
primer-probe set Ag2970, described in Table NA. Results of the RTQ-PCR runs are shown in 
Table NB. 



Table NA. Probe Name Ag2970 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -acatccatctccacacacctat-3 ' 


22 


90 


1042 


Probe 


TET- 5 1 -agtcagctctccctcattgacctaaa-3 1 - 
TAMRA | 


26 


125 


1043 


Reverse^ 


5 1 -taaaccatctttggaacaatgg-3 1 


22 


162 


1044 



25 



Table NB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag2970, 
Run 211008706 


Tissue Name 


Rel. Exp.(%) Ag2970, 
Run 211008706 


AD 1 Hippo 


2.3 


Control (Path) 3 


2.1 
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Temporal Ctx 




AD 2 Hippo 


16.7 


control ^rainj h- 
Temporal Ctx 


38.2 


AU j Jrlippo 




A n 1 C\r+r* mitnl pfu 

/-yl/ i i^iccipitai v^ix 


'X 7 


AD 4 Hippo 


2.2 


AD 2 Occipital Ctx 
^iviissingj 


0.0 


AD 5 hippo 


100.0 


AD 3 Occipital Ctx 


4.1 


AD 6 Hippo 


28.1 


AD 4 Occipital Ctx 


20.4 


Control 2 Hippo 


10.2 


AD 5 Occipital Ctx 


11.2 


Control 4 Hippo 


1.9 


AD 6 Occipital Ctx 


18.8 


Control (Path) 3 
Hippo 


0 A 


Control 1 Occipital 
Ctx 




r\U i 1 CllipOial V^IX 


^ A 


Control 2 Occipital 
Ctx 


77 7 
Z / .Z 


z i emporai v^ix 


Lj.y) 


Control 3 Occipital 
Ctx 


1/1 A 


Ai7 j l emporai v^tx 


1 .U 


Control 4 Occipital 
Ctx 


A C 


r\Y-j *+ i empordi v_^ix 


1 o.Z 


Control (Path) 1 
Occipital Ctx 


B7 7 


AD 5 Inf Temporal 
Ctx 


QQ ^ 


Control (Path) 2 
Occipital Ctx 


Q Q 


AD 5 SupTemporal 
Ctx 


^U. 1 


Control (Path) 3 
Occipital Ctx 


ft ^ 


AD 6 Inf Temporal 
Ctx 


/.u 


Control (Path) 4 
Occipital Ctx 


ii <: 
I J.J 


AD 6 Sup Temporal 
Ctx 


^0 Q 

J7.7 


Control 1 Parietal 
Ctx 


1 . 1 


Control 1 Temporal 
Ctx 


U.VJ 


Control 2 Parietal 
Ctx 


A(\ 1 


Control 2 Temporal 
Ctx 


17 0 


Control 3 Parietal 
Ctx 


13 2 


Control 3 Temporal 
Ctx 


10.7 


Control (Path) 1 
Parietal Ctx 


51.1 


Control 4 Temporal 
Ctx 


6.9 


Control (Path) 2 
Parietal Ctx 


14.9 


Control (Path) 1 
Temporal Ctx 


48.0 


Control (Path) 3 
Parietal Ctx 


0.0 


Control (Path) 2 
Temporal Ctx 


17.6 


Control (Path) 4 
Parietal Ctx 


38.4 



CNSneurodegenerationvl.O Summary: Ag2970 The NOV 17c gene represents a 
novel G-protein coupled receptor (GPCR) with expression in the brain. The GPCR family of 
receptors contains a large number of neurotransmitter receptors, including the dopamine, 
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serotonin, a and b-adrenergic, acetylcholine muscarinic, histamine, peptide, and metabotropic 
glutamate receptors. GPCRs are excellent drug targets in various neurologic and psychiatric 
diseases. All antipsychotics have been shown to act at the dopamine D2 receptor; similarly 
novel antipsychotics also act at the serotonergic receptor, and often the muscarinic and 
5 adrenergic receptors as well. While the majority of antidepressants can be classified as 

selective serotonin reuptake inhibitors, blockade of the 5-HT1 A and a2 adrenergic receptors 
increases the effects of these drugs. The GPCRs are also of use as drug targets in the treatment 
of stroke. Blockade of the glutamate receptors may decrease the neuronal death resulting from 
excitotoxicity; further more the purinergic receptors have also been implicated as drug targets 

10 in the treatment of cerebral ischemia. The b-adrenergic receptors have been implicated in the 
treatment of ADHD with Ritalin, while the a-adrenergic receptors have been implicated in 
memory. Therefore this gene may be of use as a small molecule target for the treatment of any 
of the described diseases. 

In addition, this GPCR is upregulated in the temporal cortex of Alzheimer's disease 

15 patients. Therefore, blockade of this receptor may be of use in the treatment of this disease and 
decrease neuronal death. 
References: 

El Yacoubi M, Ledent C, Parmentier M, Bertorelli R, Ongini E, Costentin J, Vaugeois 
JM. Adenosine A2A receptor antagonists are potential antidepressants: evidence based on 

20 pharmacology and A2A receptor knockout mice. Br J Pharmacol 2001 Sep;134(l):68-77 
1 . Adenosine, an ubiquitous neuromodulator, and its analogues have been shown to 
produce 'depressant' effects in animal models believed to be relevant to depressive disorders, 
while adenosine receptor antagonists have been found to reverse adenosine-mediated 
'depressant' effect. 2. We have designed studies to assess whether adenosine A2A receptor 

25 antagonists, or genetic inactivation of the receptor would be effective in established screening 
procedures, such as tail suspension and forced swim tests, which are predictive of clinical 
antidepressant activity. 3. Adenosine A2A receptor knockout mice were found to be less 
sensitive to 'depressant' challenges than their wildtype littermates. Consistently, the adenosine 
A2A receptor blockers SCH 58261 (1 - 10 mg kg(-l), i.p.) and KW 6002 (0.1 - 10 mg kg(-l), 

30 p.o.) reduced the total immobility time in the tail suspension test. 4. The efficacy of adenosine 
A2A receptor antagonists in reducing immobility time in the tail suspension test was 
confirmed and extended in two groups of mice. Specifically, SCH 58261 (1 -10 mg kg(-l)) 
and ZM 241385 (15 -60 mg kg(-l)) were effective in mice previously screened for having 
high immobility time, while SCH 58261 at 10 mg kg(-l) reduced immobility of mice that were 
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selectively bred for their spontaneous 'helplessness' in this assay. 5. Additional experiments 
were carried out using the forced swim test. SCH 58261 at 10 mg kg(-l) reduced the 
immobility time by 61%, while KW 6002 decreased the total immobility time at the doses of 1 
and 10 mg kg(-l) by 75 and 79%, respectively. 6. Administration of the dopamine D2 receptor 
5 antagonist haloperidol (50 - 200 microg kg(-l) i.p.) prevented the antidepressant-like effects 
elicited by SCH 58261 (10 mg kg(-l) i.p.) in forced swim test whereas it left unaltered its 
stimulant motor effects. 7. In conclusion, these data support the hypothesis that A2A receptor 
antagonists prolong escape-directed behaviour in two screening tests for antidepressants. 
Altogether the results support the hypothesis that blockade of the adenosine A2A receptor 

10 might be an interesting target for the development of effective antidepressant agents. 

Blier P. Pharmacology of rapid-onset antidepressant treatment strategies. Clin 
Psychiatry 2001;62 Suppl 15:12-7 

Although selective serotonin reuptake inhibitors (SSRIs) block serotonin (5-HT) 
reuptake rapidly, their therapeutic action is delayed. The increase in synaptic 5-HT activates 

15 feedback mechanisms mediated by 5-HT1A (cell body) and 5-HT1B (terminal) autoreceptors, 
which, respectively, reduce the firing in 5-HT neurons and decrease the amount of 5-HT 
released per action potential resulting in attenuated 5-HT neurotransmission. Long-term 
treatment desensitizes the inhibitory 5-HT1 autoreceptors, and 5-HT neurotransmission is 
enhanced. The time course of these events is similar to the delay of clinical action. The 

20 addition of pindolol, which blocks 5-HT1A receptors, to SSRI treatment decouples the 

feedback inhibition of 5-HT neuron firing and accelerates and enhances the antidepressant 
response. The neuronal circuitry of the 5-HT and norepinephrine (NE) systems and their 
connections to forebrain areas believed to be involved in depression has been dissected. The 
firing of 5-HT neurons in the raphe nuclei is driven, at least partly, by alpha 1 -adrenoceptor- 

25 mediated excitatory inputs from NE neurons. Inhibitory alpha2-adrenoceptors on the NE 
neuroterminals form part of a feedback control mechanism. Mirtazapine, an antagonist at 
alpha2 -adrenoceptors, does not enhance 5-HT neurotransmission directly but disinhibits the 
NE activation of 5-HT neurons and thereby increases 5-HT neurotransmission by a 
mechanism that does not require a time-dependent desensitization of receptors. These 

30 neurobiological phenomena may underlie the apparently faster onset of action of mirtazapine 
compared with the SSRIs. 

Tranquillini ME, Reggiani A. Glycine-site antagonists and stroke. Expert Opin Investig 
Drugs 1999Nov;8(ll):1837-1848 
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The excitatory amino acid, (S)-glutamic acid, plays an important role in controlling 
many neuronal processes. Its action is mediated by two main groups of receptors: the 
ionotropic receptors (which include NMDA, AMPA and kainic acid subtypes) and the 
metabotropic receptors (mGIuR(l-8)) mediating G-protein coupled responses. This review 
5 focuses on the strychnine insensitive glycine binding site located on the NMDA receptor 

channel, and on the possible use of selective antagonists for the treatment of stroke. Stroke is a 
devastating disease caused by a sudden vascular accident. Neurochemically, a massive release 
of glutamate occurs in neuronal tissue; this overactivates the NMDA receptor, leading to 
increased intracellular calcium influx, which causes neuronal cell death through necrosis. 

10 NMDA receptor activation strongly depends upon the presence of glycine as a co-agonist. 
Therefore, the administration of a glycine antagonist can block overactivation of NMDA 
receptors, thus preserving neurones from damage. The glycine antagonists currently identified 
can be divided into five main categories depending on their chemical structure: indoles, 
tetrahydroquinolines, benzoazepines, quinoxalinediones and pyrida-zinoquinolines. 

1 5 Monopoli A, Lozza G, Forlani A, Mattavelli A, Ongini E. Blockade of adenosine A2A 

receptors by SCH 58261 results in neuroprotective effects in cerebral ischaemia in rats. 
Neuroreport 1998 Dec l;9(17):3955-9 

Blockade of adenosine receptors can reduce cerebral infarct size in the model of global 
ischaemia. Using the potent and selective A2A adenosine receptor antagonist, SCH 58261, we 

20 assessed whether A2A receptors are involved in the neuronal damage following focal cerebral 
ischaemia as induced by occluding the left middle cerebral artery. SCH 58261 (0.01 mg/kg 
either i.p. or i.v.) administered to normotensive rats 10 min after ischaemia markedly reduced 
cortical infarct volume as measured 24 h later (30% vs controls, p < 0.05). Similar effects were 
observed when SCH 58261 (0.01 mg/kg, i.p.) was administered to hypertensive rats (28% 

25 infarct volume reduction vs controls, p < 0.05). Neuroprotective properties of SCH 58261 
administered after ischaemia indicate that blockade of A2A adenosine receptors is a 
potentially useful biological target for the reduction of brain injury. 

Panel 1.3D Summary: Ag2970 Expression of the NOV17c gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) 

30 Panel 4D Summary: Ag2970 Expression of the NOV17c gene is low/undetectable in 

all samples on this panel (CTs>35). (Data not shown.) 

NOV19a 
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Expression of gene NOV 1 9a was assessed using the primer-probe set Ag2972, 
described in Table OA. Results of the RTQ-PCR runs are shown in Tables OB, OC, and OD. 



Table OA. Probe Name Ag2972 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -atgtgtttcagcttccattctg-3 1 


22 


543 


1045 


Probe 


TET-5 ' -taggtctcggctcattaaccactttt-3 ■ - 
TAMRA 


26 


565 


1046 


Reverse 


5 1 -tgtcctgacacaccaatgatag-3 ■ 


22 


611 


1047 



Table OB, CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag2972, 
Run 211008969 


Tissue Name 


j Rel. Exp.(%) Ag2972, 
Run 211008969 


AD 1 Hippo 


2.1 


Control (Path) 3 
Temporal Ctx 


1.9 


AD 2 Hippo 


13.7 


Control (Path) 4 
Temporal Ctx 


j 70.2 


AD 3 Hippo 


2.2 jAD 1 Occipital Ctx 


3.3 


AD 4 Hippo 


1 4 |AD 2 Occipital Ctx 
f J(Missing) 


0.0 


AD 5 Hippo 


75.8 jAD 3 Occipital Ctx 


2.2 


AD 6 Hippo 


33.4 JAD 4 Occipital Ctx 


46.3 


Control 2 Hinno 


11.9 jAD 5 Occipital Ctx 




Control 4 Hippo 


2.5 jAD 6 Occipital Ctx 


19.8 


Control (Path) 3 
Hippo 


1 ~ tControl 1 Occipital 
JCtx 


1.1 


AD 1 Temporal Ctx 


9 . jControl 2 Occipital 
jCtx 


42.0 


AD 2 Temporal Ctx 


7R ~ jControl 3 Occipital 

|ctx 


21.9 


AD 3 Temporal Ctx 


1 4 jControl 4 Occipital ; 

(Ctx 


10.4 


AD 4 Temporal Ctx 


44 8 Control (Path) 1 
jOccipital Ctx 


100.0 


AD 5 Inf Temporal 
Ctx 


78 5 jControl (Path) 2 
jOccipital Ctx 


3.5 


AD 5 Sup Temporal 
Ctx 


12 0 jControl (Path) 3 
jOccipital Ctx 


1.1 


AD 6 Inf Temporal 
Ctx 


33 7 jControl (Path) 4 
jOccipital Ctx 


26.6 


AD 6 Sup Temporal 
Ctx 


c , jControl 1 Parietal 

563 jctx ! 


0.6 


Control 1 Temporal 
Ctx 


0 0 IControl 2 Parietal 

jCtx 


18.8 
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Control 2 Temporal 
Ctx 


10.4 


Control 3 Parietal 
Ctx 


1 7 7 


Control 3 Temporal 
Ctx 


11.5 


Control (Path) 1 
Parietal Ctx 


77.4 


Control 3 Temporal 
Ctx 


7.8 


Control (Path) 2 
Parietal Ctx 


10.3 


Control (Path) 1 
Temporal Ctx 


52.9 


Control (Path) 3 
Parietal Ctx 


0.7 


Control (Path) 2 
Temporal Ctx 


7.9 


Control (Path) 4 
Parietal Ctx 


57.8 



Table OC. Panel 4D 



Tissue Name 


ReL Exp.(%) 
Ag2972, Run 
164314417 


Tissue Name 


Rel. Exp.(%) 
Ag2972, Run 
164314417 


Secondary Th 1 act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


11.3 


Secondarv Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0 0 

\J.\J 


Secondary Th 1 rest 


4.1 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


2.1 


HUVEC IL-1 1 


0.0 


SlprnnHflrv Trl rp<vt 


0 0 


Lung Microvascular EC 
none 


u.u 


Primary Thl act 


12.2 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


4.2 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th 1 rest 


37.4 


Bronchial epithelium 
TNFalpha + ILlbeta 


7.6 


Primary Th2 rest 


46.7 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


9.7 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


21.2 


Coronery artery SMC rest 


22.2 


CD45RO CD4 
lymphocyte act 


16.3 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


8.4 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


11.6 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


20.3 



953 




CD4 lvmohocvte none 


25.5 


KU-812 (Basophil) 
PMA/ionomycin 


34.6 


2ry Thl/Th2/Trl anti- 
CD95CH11 


0.0 


CCD 1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


26.1 


CCD 1 106 (Keratinocytes) 
TNFalpna + IL-1 beta 


8.5 


¥ A IS 11 IT ^ 

LAK cells IL-2 


46 J 


Liver cirrhosis 


22.8 


LAK cells IL-2+IL-12 


32.5 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


49.0 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-1 8 


38.2 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


5.7 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


7.7 


NCI-H292 IL-1 3 


0.0 


Two Way MLR 3 day 


73.2 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


24.8 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


0.0 


PBMC rest 


1.7 


Lung fibroblast none 


0.0 


PR MC PWM 

l L/1V1V/ A YY IVl 




Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


41.2 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


1U.U 


Lung tibroblast o 


U.U 


r> lymphocytes 1 WM 


A f\ 1 

4U. 1 


Lung fibroblast IFN 
gamma 


U.U 


B lymphocytes CD40L 
and IL-4 


21.6 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


6.6 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


21.9 


Dermal tibroblast IrN 

CL Cll 11111 C4- 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


2.9 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


9.2 


Colon 


7.6 


Macrophages rest 


19.6 : 


Lung 


0.0 


Macrophages LPS 


8.1 


Thymus 


11.8 


HUVEC none 


0.0 


Kidney 


71.7 


HUVEC starved 


0.0 
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Table OD. Panel CNS 1 



Tissue Name 


Rel. Exp.(%) Ag2972, 
Run 171670020 


1 issue JName 


Rel. Exp.(%) Ag2972, 
Run 171670020 


BA4 Control 


17.8 


BA17PSP 


52.5 


BA4 Control2 


8.6 


BA17PSP2 


3.1 


BA4 

Alzheimer's2 


7.1 


Sub Nigra Control 


31.0 


BA4 Parkinson's 


42.9 


Sub Nigra Control2 ; 


14.7 


BA4 

Parkinson's2 


51.1 


Sub Nifn*a 
Alzheimer's2 


5.1 


BA4 

Huntington's 


7.4 


Sub Nigra 
Parkin son's2 


28.9 


BA4 

Huntington f s2 


0.0 


Sub Nigra 
Huntington's 


100.0 


BA4 PSP 


0.0 


Sub Niora 
Huntington's2 


11.2 


RA4 PSP2 


3.7 


Sub Nisra PSP2 


1.3 


BA4 Depression 


16.7 


Sub Nigra 
Denression 


7.4 


BA4 

Dpnrps^inn? 


0.0 


Sub Nigra 
Oenression2 


2.5 


BA7 Control 


15.7 


Glob Palladus 
Control 


7.8 


BA7 Control2 


15.7 


Glob Palladus 
Control2 


0.0 


BA7 

Alzheimer's2 


0.0 


Glob Palladus 
Alzheimer's 


0.0 


BA7 Parkinson's 


24.3 


Glob Palladus 
Alzheimer's2 


2.0 


BA7 

Parkinson's2 


z9.v 


Glob Palladus 
Parkinson's 


J /.U 


BA7 

Huntington's 


43.5 


Glob Palladus 
Parkinson's2 


i ft 

D.O 


BA7 

Huntington's2 


1.1 


Lrloo Faiiaaus ror 


u.u 


BA7 PSP 


20.2 


Glob Palladus PSP2 


3.4 j 


BA7 PSP2 


2.1 


Glob Palladus 
Depression 


19.5 


BA7 Depression 


5.2 


Temp Pole Control 


0.0 


BA9 Control 


0.0 


Temp Pole Control2 


3.3 


BA9 Control2 


22.7 


Temp Pole 
Alzheimer's 


0.0 


BA9 Alzheimer's 


1.3 


Temp Pole 
Alzheimer's2 


8.6 


BA9 


9.0 


Temp Pole 


0.0 
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AIzheimer's2 j 


[Parkinson's 




BA9 Parkinson's 


1 1 j Temp Pole 
' |Parkinson's2 


23.0 


BA9 

Parkinson's2 


23 7 jTemp Pole 
jHuntington's 


/.Z 


BA9 

Huntington's 


12.4 jTemp Pole PSP 




BA9 

Huntington's2 


7.0 jTemp Pole PSP2 


i.O 


BA9 PSP 


13.6 


Temp Pole 
Depression2 


5.9 


BA9 PSP2 


5.6 


Cing Gyr Control 


51.4 


BA9 Depression 


12.6 


Cing Gyr ControI2 


0.0 


BA9 

Depression2 


2.7 


C^\r\a frvr 
V^lllei vjy J 

Alzheimer's 


6.7 


BA 17 Control 


75.8 


Cing Gyr 
Alzheimer's2 


2.7 


BA17Control2 


15.8 


Cing Gyr Parkinson's 


38.4 


BA17 

Alzheimer f s2 


8.4 


Cing Gyr 
Parkinson's2 


11.6 


BA17 
Parkinson's 


32.3 


Cing Gyr 
Huntington's 


60.3 


BA17 

Parkinson's2 


51.4 


Cing Gyr 
Huntington's2 


0.0 


BA17 

Huntington's 


22.4 


Cing Gyr PSP 


23.7 


BA17 

Huntington's2 


5.2 


Cing Gyr PSP2 


0.0 


BA17 
Depression 


17.6 


Cing Gyr Depression 


8.5 


BA17 

Depression2 


16.6 


Cing Gyr 
Depression2 


8.1 



CNSneurodegenerationvl.O Summary: Ag2972 The NOV 19a represents a novel 
G-protein coupled receptor (GPCR) with expression in the brain. The GPCR family of 
receptors contains a large number of neurotransmitter receptors, including the dopamine, 
5 serotonin, a and b-adrenergic, acetylcholine muscarinic, histamine, peptide, and metabotropic 
glutamate receptors. GPCRs are excellent drug targets in various neurologic and psychiatric 
diseases. All antipsychotics have been shown to act at the dopamine D2 receptor; similarly 
novel antipsychotics also act at the serotonergic receptor, and often the muscarinic and 
adrenergic receptors as well. While the majority of antidepressants can be classified as 
10 selective serotonin reuptake inhibitors, blockade of the 5-HT1 A and a2 adrenergic receptors 
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increases the effects of these drugs. The GPCRs are also of use as drug targets in the treatment 
of stroke. Blockade of the glutamate receptors may decrease the neuronal death resulting from 
excitotoxicity; further more the purinergic receptors have also been implicated as drug targets 
in the treatment of cerebral ischemia. The b-adrenergic receptors have been implicated in the 
5 treatment of ADHD with Ritalin, while the a-adrenergic receptors have been implicated in 

memory. Therefore this gene may be of use as a small molecule target for the treatment of any 
of the described diseases. 



10 JM. Adenosine A2A receptor antagonists are potential antidepressants: evidence based on 
pharmacology and A2A receptor knockout mice. Br J Pharmacol 2001 Sep;134(l):68-77 

1 . Adenosine, an ubiquitous neuromodulator, and its analogues have been shown to 
produce 'depressant' effects in animal models believed to be relevant to depressive disorders, 
while adenosine receptor antagonists have been found to reverse adenosine-mediated 

1 5 'depressant' effect. 2. We have designed studies to assess whether adenosine A2A receptor 

antagonists, or genetic inactivation of the receptor would be effective in established screening 
procedures, such as tail suspension and forced swim tests, which are predictive of clinical 
antidepressant activity. 3. Adenosine A2A receptor knockout mice were found to be less 
sensitive to 'depressant' challenges than their wildtype littermates. Consistently, the adenosine 

20 A2A receptor blockers SCH 58261 (1 - 10 mg kg(-l), i.p.) and KW 6002 (0.1 - 10 mg kg(-l), 
p.o.) reduced the total immobility time in the tail suspension test. 4. The efficacy of adenosine 
A2A receptor antagonists in reducing immobility time in the tail suspension test was 
confirmed and extended in two groups of mice. Specifically, SCH 58261 (1 - 10 mg kg(-l)) 
and ZM 241385 (15 - 60 mg kg(-l)) were effective in mice previously screened for having 

25 high immobility time, while SCH 58261 at 10 mg kg(-l) reduced immobility of mice that were 
selectively bred for their spontaneous 'helplessness' in this assay. 5. Additional experiments 
were carried out using the forced swim test. SCH 58261 at 10 mg kg(-l) reduced the 
immobility time by 61%, while KW 6002 decreased the total immobility time at the doses of 1 
and 10 mg kg(-l) by 75 and 79%, respectively. 6. Administration of the dopamine D2 receptor 

30 antagonist haloperidol (50 - 200 microg kg(-l) i.p.) prevented the antidepressant-like effects 
elicited by SCH 58261 (10 mg kg(-l) i.p.) in forced swim test whereas it left unaltered its 
stimulant motor effects. 7. In conclusion, these data support the hypothesis that A2A receptor 
antagonists prolong escape-directed behaviour in two screening tests for antidepressants. 



References: 



El Yacoubi M, Ledent C, Parmentier M, Bertorelli R, Ongini E, Costentin J, Vaugeois 
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Altogether the results support the hypothesis that blockade of the adenosine A2A receptor 
might be an interesting target for the development of effective antidepressant agents. 

Blier P. Pharmacology of rapid-onset antidepressant treatment strategies. Clin 
Psychiatry 2001 ;62 Suppl 15:12-7 
5 Although selective serotonin reuptake inhibitors (SSRIs) block serotonin (5-HT) 

reuptake rapidly, their therapeutic action is delayed. The increase in synaptic 5-HT activates 
feedback mechanisms mediated by 5-HT1 A (cell body) and 5-HT1B (terminal) autoreceptors, 
which, respectively, reduce the firing in 5-HT neurons and decrease the amount of 5-HT 
released per action potential resulting in attenuated 5-HT neurotransmission. Long-term 

10 treatment desensitizes the inhibitory 5-HT1 autoreceptors, and 5-HT neurotransmission is 
enhanced. The time course of these events is similar to the delay of clinical action. The 
addition of pindolol, which blocks 5-HT 1 A receptors, to SSRI treatment decouples the 
feedback inhibition of 5-HT neuron firing and accelerates and enhances the antidepressant 
response. The neuronal circuitry of the 5-HT and norepinephrine (NE) systems and their 

15 connections to forebrain areas believed to be involved in depression has been dissected. The 
firing of 5-HT neurons in the raphe nuclei is driven, at least partly, by alphal-adrenoceptor- 
mediated excitatory inputs from NE neurons. Inhibitory alpha2 -adrenoceptors on the NE 
neuroterminals form part of a feedback control mechanism. Mirtazapine, an antagonist at 
alpha2 -adrenoceptors, does not enhance 5-HT neurotransmission directly but disinhibits the 

20 NE activation of 5-HT neurons and thereby increases 5-HT neurotransmission by a 

mechanism that does not require a time-dependent desensitization of receptors. These 
neurobiological phenomena may underlie the apparently faster onset of action of mirtazapine 
compared with the SSRIs. 

Tranquillini ME, Reggiani A. Glycine-site antagonists and stroke. Expert Opin Investig 

25 Drugs 1999Nov;8(ll):1837-1848 

The excitatory amino acid, (S)-glutamic acid, plays an important role in controlling 
many neuronal processes. Its action is mediated by two main groups of receptors: the 
ionotropic receptors (which include NMDA, AMPA and kainic acid subtypes) and the 
metabotropic receptors (mGluR(l-8)) mediating G-protein coupled responses. This review 

30 focuses on the strychnine insensitive glycine binding site located on the NMDA receptor 

channel, and on the possible use of selective antagonists for the treatment of stroke. Stroke is a 
devastating disease caused by a sudden vascular accident. Neurochemical^, a massive release 
of glutamate occurs in neuronal tissue; this overactivates the NMDA receptor, leading to 
increased intracellular calcium influx, which causes neuronal cell death through necrosis. 



958 




NMDA receptor activation strongly depends upon the presence of glycine as a co-agonist. 
Therefore, the administration of a glycine antagonist can block overactivation of NMDA 
receptors, thus preserving neurones from damage. The glycine antagonists currently identified 
can be divided into five main categories depending on their chemical structure: indoles, 
5 tetrahydroquinolines, benzoazepines, quinoxalinediones and pyrida-zinoquinolines. 

Monopoli A, Lozza G, Forlani A, Mattavelli A, Ongini E. Blockade of adenosine A2A 
receptors by SCH 58261 results in neuroprotective effects in cerebral ischaemia in rats. 
Neuroreport 1998 Dec l;9(17):3955-9 

Blockade of adenosine receptors can reduce cerebral infarct size in the model of global 

10 ischaemia. Using the potent and selective A2A adenosine receptor antagonist, SCH 58261, we 
assessed whether A2A receptors are involved in the neuronal damage following focal cerebral 
ischaemia as induced by occluding the left middle cerebral artery. SCH 58261 (0.01 mg/kg 
either i.p. or i.v.) administered to normotensive rats 10 min after ischaemia markedly reduced 
cortical infarct volume as measured 24 h later (30% vs controls, p < 0.05). Similar effects were 

15 observed when SCH 58261 (0.01 mg/kg, i.p.) was administered to hypertensive rats (28% 
infarct volume reduction vs controls, p < 0.05). Neuroprotective properties of SCH 58261 
administered after ischaemia indicate that blockade of A2A adenosine receptors is a 
potentially useful biological target for the reduction of brain injury. 

Panel 1.3D Summary: Ag2972 Expression of the NOV19a gene is low/undetectable 

20 in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2972 Expression of the the NOV19a gene is restricted to a 
few samples in this panel, with highest expression in peripheral blood mononuclear cells 
(PBMC) treated with the B cell selective pokeweed mitogen. No significant levels of 
expression of the transcript are seen in PBMC that contain normal B cells. Therefore, the 

25 putative GPCR encoded by this gene could potentially be used diagnostically to identify 

activated B cells. In addition, the gene product could also potentially be used therapeutically in 
the treatment of diseases in which B cells are activated. 

Panel CNS_1 Summary: Ag2972 This panel confirms expression of the NOV 19a 
gene in the brains of an independent group of subjects. Please see panel 1.3d for a discussion 

30 of utility of this gene in the central nervous system. 

NOV20 



Expression of gene NOV20 was assessed using the primer-probe set Ag2973, 
described in Table PA. Results of the RTQ-PCR runs are shown in Tables PB and PC. 
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Table PA. Probe Name Ag2973 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5' -gctgtgtggctcaagtctattt- 3 ' 


22 1287 


1048 


Probe 


TET-5 ' -ttctctgcctttgcatctgctgagct-3 ' - , 
TAMRA 


26 


310 


1049 


Reverses 


5 ' -agcggtcataagacatgacagt-3 1 


22 


346 


1050 



Table PB. CNS_neurodegeneration_vl.O 



Tissue Name 


ReL Exp.(%) Ag2973, 
Run 209779096 


i issue i>dnic 


Rel. Exp.(%) Ag2973, 
Run 209779096 


An t II : 
AD 1 1 UppO 


A O 


Control (Path) 3 
Temporal Ctx 


17 i 


AD Z Hippo 


1 ^ o 

lo.z 


Control (Path) 4 
Temporal Ctx 


HO. / 


AD 3 Hippo 


2.2 


AD 1 Occipital Ctx j 


3.9 


AD 4 Hippo 


3.7 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


50.7 


AD 3 Occipital Ctx 


1.7 


AD 6 Hippo 


16.4 


AD 4 Occipital Ctx \ 


19.1 


Control 2 Hippo 


8.2 


AD 5 Occipital Ctx 


24.5 


Control 4 T-Tir>r*r* 


0 5 


AD 6 Occipital Ctx 1 


2.7 


Control (Path) 3 
Hippo 


0.0 


Control 1 Occipital 
Ctx 


0.0 


AD 1 Temporal Ctx 


4.5 


Control 2 Occipital 

Ctx 


17.8 


AD 2 Temporal Ctx 


27.7 


Control 3 Occipital 
Ctx 


8.7 


AD 3 Temporal Ctx 


3.6 


Control 4 Occipital 
Ctx 


0.7 


AD 4 Temporal Ctx 


17.6 


Control (Path) 1 
Occipital Ctx 


74.7 


AD 5 Inf Temporal 
Ctx 


64.2 


Control (Path) 2 
Occipital Ctx 


13.2 


AD 5 Sup Temporal 
Ctx 


21.8 


Control (Path) 3 
Occipital Ctx 


0.0 


AD 6 Inf Temporal 
Ctx 


24.3 


Control (Path) 4 
Occipital Ctx 


9.2 


AD 6 Sup Temporal 
Ctx 


22.2 


Control 1 Parietal 
Ctx 


1.5 


Control 1 Temporal 
Ctx 


1.4 


Control 2 Parietal 
Ctx 


27.9 


Control 2 Temporal 
Ctx 


33.9 


Control 3 Parietal 
Ctx 


16.8 


Control 3 Temporal 


18.4 


Control (Path) 1 


95.3 
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Ctx 




Parietal Ctx j 


Control 3 Temporal 
Ctx 


3.2 


Control (Path) 2 J 
Parietal Ctx I 


23.7 


Control (Path) 1 
Temporal Ctx 


100.0 


Control (Path) 3 
Parietal Ctx ] 


1.4 


Control (Path) 2 
Temporal Ctx 


68.8 


Control (Path) 4 
Parietal Ctx j 


33.7 



Table PC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2973, Run 
164329850 


Tissue Name 


Rel. Exp.(%) 
Ag2973, Run 
164329850 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


oeconuary in aci 




HUVEC TNF alpha + IFN 
gamma 


0 0 


Secondary Thl rest 


11.7 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL- 11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalphaH- IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta V 


0.0 


Primary Th2 rest 


6.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl_anti- 


0.0 


CCD 1 106 (Keratinocytes) 


0.0 
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CD95 CH11 




none 




LAK cells rest 


0.0 


V_^\_, \_J 1 1 \J\J 1 PwCI dill l\J\sJ LC3^ 

TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


9.7 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCT-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


UD A CP T\l 17 olnKo -4- TT 1 

Mr /\Jbi^ iJNr aipna^ii^-i 
beta 


0.0 


rtJIVlC rest 


u.u 


i^ung TiDrouiasi none 




PBMC PWM 


o.o • 


Lung fibroblast TNF alpha 
+ oeta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


POT 1 HK^ A\/fP 

PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 j 0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 0.0 


Monocytes rest 


0.0 


IBD Crohn's 


j 0.0 


Monocytes LPS 


0.0 


Colon 


8.7 


Macrophages rest 


0.0 


Lung 


; 6.6 


Macrophages LPS 


0.0 


Thymus 


| 13.9 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag2973 The NOV20 gene represents a 
novel G-protein coupled receptor (GPCR) with expression in the brain. The GPCR family of 
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receptors contains a large number of neurotransmitter receptors, including the dopamine, 
serotonin, a and b-adrenergic, acetylcholine muscarinic, histamine, peptide, and metabotropic 
glutamate receptors. GPCRs are excellent drug targets in various neurologic and psychiatric 
diseases. All antipsychotics have been shown to act at the dopamine D2 receptor; similarly 
5 novel antipsychotics also act at the serotonergic receptor, and often the muscarinic and 
adrenergic receptors as well. While the majority of antidepressants can be classified as 
selective serotonin reuptake inhibitors, blockade of the 5-HT1 A and a2 adrenergic receptors 
increases the effects of these drugs. The GPCRs are also of use as drug targets in the treatment 
of stroke. Blockade of the glutamate receptors may decrease the neuronal death resulting from 

10 excitotoxicity; further more the purinergic receptors have also been implicated as drug targets 
in the treatment of cerebral ischemia. The b-adrenergic receptors have been implicated in the 
treatment of ADHD with Ritalin, while the a-adrenergic receptors have been implicated in 
memory. Therefore this gene may be of use as a small molecule target for the treatment of any 
of the described diseases. 

15 References: 

El Yacoubi M, Ledent C, Parmentier M, Bertorelli R, Ongini E, Costentin J, Vaugeois 
JM. Adenosine A2A receptor antagonists are potential antidepressants: evidence based on 
pharmacology and A2A receptor knockout mice. Br J Pharmacol 2001 Sep;134(l):68-77 

1. Adenosine, an ubiquitous neuromodulator, and its analogues have been shown to 

20 produce 'depressant' effects in animal models believed to be relevant to depressive disorders, 
while adenosine receptor antagonists have been found to reverse adenosine-mediated 
'depressant' effect. 2. We have designed studies to assess whether adenosine A2A receptor 
antagonists, or genetic inactivation of the receptor would be effective in established screening 
procedures, such as tail suspension and forced swim tests, which are predictive of clinical 

25 antidepressant activity. 3. Adenosine A2A receptor knockout mice were found to be less 

sensitive to 'depressant' challenges than their wildtype littermates. Consistently, the adenosine 
A2A receptor blockers SCH 58261 (1 - 10 mg kg(-l), i.p.) and KW 6002 (0.1 - 10 mg kg(-l), 
p.o.) reduced the total immobility time in the tail suspension test. 4. The efficacy of adenosine 
A2A receptor antagonists in reducing immobility time in the tail suspension test was 

30 confirmed and extended in two groups of mice. Specifically, SCH 58261 (1 - 10 mg kg(-l)) 
and ZM 241385 (15 - 60 mg kg(-l)) were effective in mice previously screened for having 
high immobility time, while SCH 58261 at 10 mg kg(-l) reduced immobility of mice that were 
selectively bred for their spontaneous 'helplessness' in this assay. 5. Additional experiments 
were carried out using the forced swim test. SCH 58261 at 10 mg kg(-l) reduced the 
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immobility time by 61%, while KW 6002 decreased the total immobility time at the doses of 1 
and 10 mg kg(-l) by 75 and 79%, respectively. 6. Administration of the dopamine D2 receptor 
antagonist haloperidol (50 - 200 microg kg(-l) i.p.) prevented the antidepressant-like effects 
elicited by SCH 58261 (10 mg kg(-l) i.p.) in forced swim test whereas it left unaltered its 
5 stimulant motor effects. 7. In conclusion, these data support the hypothesis that A2A receptor 
antagonists prolong escape-directed behaviour in two screening tests for antidepressants. 
Altogether the results support the hypothesis that blockade of the adenosine A2A receptor 
might be an interesting target for the development of effective antidepressant agents. 
Blier P. Pharmacology of rapid-onset antidepressant treatment strategies. Clin 

10 Psychiatry 2001;62 Suppl 15:12-7 

Although selective serotonin reuptake inhibitors (SSRIs) block serotonin (5-HT) 
reuptake rapidly, their therapeutic action is delayed. The increase in synaptic 5-HT activates 
feedback mechanisms mediated by 5-HT1 A (cell body) and 5-HT1B (terminal) autoreceptors, 
which, respectively, reduce the firing in 5-HT neurons and decrease the amount of 5-HT 

15 released per action potential resulting in attenuated 5-HT neurotransmission. Long-term 
treatment desensitizes the inhibitory 5-HT1 autoreceptors, and 5-HT neurotransmission is 
enhanced. The time course of these events is similar to the delay of clinical action. The 
addition of pindolol, which blocks 5-HT1 A receptors, to SSRI treatment decouples the 
feedback inhibition of 5-HT neuron firing and accelerates and enhances the antidepressant 

20 response. The neuronal circuitry of the 5-HT and norepinephrine (NE) systems and their 

connections to forebrain areas believed to be involved in depression has been dissected. The 
firing of 5-HT neurons in the raphe nuclei is driven, at least partly, by alpha 1 -adrenoceptor- 
mediated excitatory inputs from NE neurons. Inhibitory alpha2-adrenoceptors on the NE 
neuroterminals form part of a feedback control mechanism. Mirtazapine, an antagonist at 

25 alpha2-adrenoceptors, does not enhance 5-HT neurotransmission directly but disinhibits the 
NE activation of 5-HT neurons and thereby increases 5-HT neurotransmission by a 
mechanism that does not require a time-dependent desensitization of receptors. These 
neurobiological phenomena may underlie the apparently faster onset of action of mirtazapine 
compared with the SSRIs. 

30 Tranquillini ME, Reggiani A. Glycine-site antagonists and stroke. Expert Opin Investig 

Drugs 1999 Nov;8(ll): 1837- 1848 

The excitatory amino acid, (S)-glutamic acid, plays an important role in controlling 
many neuronal processes. Its action is mediated by two main groups of receptors: the 
ionotropic receptors (which include NMDA, AMPA and kainic acid subtypes) and the 
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metabotropic receptors (mGluR(l-8)) mediating G-protein coupled responses. This review 
focuses on the strychnine insensitive glycine binding site located on the NMDA receptor 
channel, and on the possible use of selective antagonists for the treatment of stroke. Stroke is a 
devastating disease caused by a sudden vascular accident. Neurochemical^, a massive release 
5 of glutamate occurs in neuronal tissue; this overactivates the NMDA receptor, leading to 
increased intracellular calcium influx, which causes neuronal cell death through necrosis. 
NMDA receptor activation strongly depends upon the presence of glycine as a co-agonist. 
Therefore, the administration of a glycine antagonist can block overactivation of NMDA 
receptors, thus preserving neurones from damage. The glycine antagonists currently identified 

10 can be divided into five main categories depending on their chemical structure: indoles, 
tetrahydroquinolines, benzoazepines, quinoxalinediones and pyrida-zinoquinolines. 

Monopoli A, Lozza G, Forlani A, Mattavelli A, Ongini E. Blockade of adenosine A2A 
receptors by SCH 58261 results in neuroprotective effects in cerebral ischaemia in rats. 
Neuroreport 1998 Dec l;9(17):3955-9 

15 Blockade of adenosine receptors can reduce cerebral infarct size in the model of global 

ischaemia. Using the potent and selective A2A adenosine receptor antagonist, SCH 58261, we 
assessed whether A2A receptors are involved in the neuronal damage following focal cerebral 
ischaemia as induced by occluding the left middle cerebral artery. SCH 58261 (0.01 mg/kg 
either i.p. or i.v.) administered to normotensive rats 10 min after ischaemia markedly reduced 

20 cortical infarct volume as measured 24 h later (30% vs controls, p < 0.05). Similar effects were 
observed when SCH 58261 (0.01 mg/kg, i.p.) was administered to hypertensive rats (28% 
infarct volume reduction vs controls, p < 0.05). Neuroprotective properties of SCH 58261 
administered after ischaemia indicate that blockade of A2A adenosine receptors is a 
potentially useful biological target for the reduction of brain injury. 

25 Panel 1.3D Summary: Ag2973 Expression of the NOV20 gene is low/undetectable in 

all samples on this panel (CTs>35). (Data not shown.) 

Panel 3D Summary: Ag2973 Expression of the NOV20 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2973 Significant expression of the NOV20 gene is detected in 

30 a liver cirrhosis sample (CT = 32.7). Furthermore, expression of this gene is not detected in 

normal liver in Panel 1.3D, suggesting that its expression is unique to liver cirrhosis. This gene 
encodes a putative GPCR; therefore, antibodies or small molecule therapeutics could reduce or 
inhibit fibrosis that occurs in liver cirrhosis. In addition, antibodies to this putative GPCR 
could also be used for the diagnosis of liver cirrhosis. 
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Panel CNS1 Summary: Ag2973 Expression of the NOV20 gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) 

NOV16b 

r 

Expression of gene NOV 1 6b was assessed using the primer-probe sets Ag2875 and 
5 Ag30l0, described in Tables QA and QB. Results of the RTQ-PCR runs are shown in Table 
QC. 

Table QA. Probe Name Ag2875 



Primers 


Sequences 


Length 


j Start 
j Position 


SEQ ID NO: 


Forward 


5 1 -gaacatcatctcctaccctgaa-3 1 


22 


I 267 


1051 


Probe 


TET- 5 1 -tgcatgactcagctctacttcttcctcg- ; 
3 1 -TAMRA 


28 


I 289 


1052 


Reverse 


5' -atgtgacactctgcaatagcaa-3 1 


22 


J 320 


1053 


Table QB. Probe Name Ag3010 


Primers 


Sequences 


Length 


1 Start 
j Position 


SEQ ID NO: 


Forward^' -ggggaaagtatcctctgtgttt- 3 ' 


22 


"|810 


1054 


Probe 


TET- 5 ' -ttattgtgcccatgttgaaccctctg-3 1 - 
TAMRA 


26 


js39 


1055 


Reversejs ' - cagggaaacatggacatct tta- 3 ■ 


22 


I 882 


1056 



10 

Table QC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2875, Run 
164311029 


Tissue Name 


Rel. Exp.(%) 
Ag2875, Run 
164311029 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


3.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Th 1 rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 
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Primary Th 1 rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-1 beta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


24.0 


zry inl/lnz/lrl anti- 
CD95 CH11 


5.1 


CCD 1 1 Uo (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


CCD 1 1 Uo (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


72.2 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


20.7 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


upAFf TNF alnha -h TT -1 

beta 


1.4 






T 1 in (t fi l~\t-f"\hl a cl" nnnp 
J-^llllg llUIDUIdbl 1HJI1C 


0 0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 


0.0 


PRMC PHA-T 


0 0 


T imo fihrnhlaQt TT -4 


0 0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


3.2 


B lymphocytes CD40L [ 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


3.1 


Dermal fibroblast 


0.0 
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rTT»1ft7ft TKIP alnha 
^V^UIv/U 1 INr aipilct 




FOI -1 HhcAMP 
PMA/ionomycin 


0.0 


L/cinidi iiuiuuictoi 

CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


-~ — "-"-^ — = — = — - — * 

33.2 


Dermal fibroblast 1FN 
gamma 


0.0 


Dendritic cells LPS 


2.4 


Dermal fibroblast IL-4 


1.9 


Dendritic cells anti- 
CD^ 


100.0 


1BD Colitis 2 


16.3 


Monocytes rest 


0.0 


IBD Crohn's 


6.7 


Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest * 


44.1 


Lung 


21.8 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


3.1 


Kidney 


0.0 


HUVEC starved 


0.0 







Panel 1.3D Summary: Ag2875/Ag3010 Results from two experiments with the 
NOV 16b gene are not included. The amp plots indicate that there were experimental 
difficulties with these runs. 
5 Panel 2.2 Summary: Ag2875 Expression of the NOV16b gene is low/undetectable in 

all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2875 Highest expression of the NOV 15a is in anti-CD40 
treated dendritic cells (CT=33.2), with much lower expression in untreated dendritic cells. 
Thus, this gene product may be important in dendritic cell activation. Significant expression of 

10 this gene is also seen in liver cirrhosis. This gene encodes a putative GPCR; therefore, 

antibodies or small molecule therapeutics could reduce or inhibit fibrosis that occurs in liver 
cirrhosis. In addition, antibodies to this putative GPCR could also be used for the diagnosis of 
liver cirrhosis. In addition, significant expression of this gene is seen in resting macrophages. 
The putative GPCR encoded for by this transcript may therefore be important in macrophage 

15 detection of chemokine gradients and trafficking into specific sites within a tissue and in 

activation. Antibody or protein therapeutics designed against the protein encoded for by this 
transcript could reduce or inhibit inflammation in asthma, emphysema, allergy, psoriasis, 
arthritis, or any other condition in which macrophage localization/activation is important. A 
second experiment with the probe/primer set Ag3010 shows low/undetectable expression in all 

20 samples on this panel (CTs>35). (Data not shown.) 

NOV21a 



968 



Expression of gene NOV21a was assessed using the primer-probe sets Ag2963 and 
Ag1292, described in Tables RA and RB. Results of the RTQ-PCR runs are shown in Tables 
RC, RD and RE. 



Table RA. Probe Name Ag2963 



Primers! 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -ggtaccaccctt tgt tccaat - 3 1 


21 


903 


1057 


Probe V 


TET-5 1 -aaacacagcccctccaccctagct-3 1 - 
TAMRA 


24 


924 


1058 


Reverse 


5 ' -gtcctcgctgtgacactga-3 ' 


19 


963 


1059 



Table RB. Probe Name Agl292 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 * - cagctgtgagaggctggata- 3 ' 


20 


731 


1060 


Probe 


TET-5 1 -atccaggagtgaccaccacgtgact-3 ' - 
TAMRA 


25 


760 


1061 


Reverse 


5 ' - tcctgttgctttcacgtagagt-3 ■ 


22 


797 


1062 



Table RC. CNSneurodegenerationvl.O 



Tissue Name 


Rel. Exp.(%) 
Ag2963, Run 
209778824 


Rel. Exp.(%) 
Ag2963, Run 
230512509 


Tissue 
Name 


Rel. Exp.(%) 
Ag2963, Run 
209778824 


Rel. Exp.(%) 
Ag2963, Run 
230512509 


AD 1 Hippo 


6.5 


6.8 


Control 
(Path) 3 
Temporal 
Ctx 


6.1 


3.6 


AD 2 Hippo 


21.2 


16.8 


Control 
(Path) 4 
Temporal 
Ctx 


32.5 


26.2 


AD 3 Hippo 


5.5 


6.3 


AD 1 

Occipital 

Ctx 


5.3 


4.9 


AD 4 Hippo 


7.4 


7.9 


AD 2 

Occipital 

Ctx 

(Missing) 


0.0 


0.0 


AD 5 hippo 


21.0 


22.2 


AD 3 

Occipital 

Ctx 


7.9 


6.2 


AD 6 Hippo 


12.2 


19.6 


AD 4 

Occipital 

Ctx 


12.1 


13.8 


Control 2 


58.6 


44.8 


AD 5 


6.7 


3.3 
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Hippo 






Occipital 
Ctx 






Control 4 
Hippo 


12.9 


12.9 


AD 6 

Occipital 

Ctx 


73.7 


81.8 


Control (Path) 
3 Hippo 


2.0 


4.5 


Control 1 
Occipital 
Ctx 


5.9 


2.5 


AD 1 Temporal 
Ctx 


2.3 


6.3 


Control 2 
Occipital 
Ctx 


89.5 


100.0 


AD 2 Temporal 

Ctx 


13.2 


19.5 


Control 3 
Occipital 
Ctx 


27.4 


25.7 


AD 3 Temporal 
Ctx 


3.3 


7.1 


Control 4 
Occipital 
Ctx 


11.3 


12.4 


AD 4 Temporal 
Ctx 


15.8 


20.7 


Control 
(Path) 1 
Occipital 
Ctx 


100.0 


93.3 


AD 5 Inf 
Temporal Ctx 


16.7 


21.5 


Control 
(Path) 2 
Occipital 
Ctx 


8.4 


11.0 


AD 5 

SupTemporal 

Ctx 


12.5 


11.8 


Control 
(Path) 3 
Occipital 
Ctx 


4.4 


2.2 


AD 6 Inf 
Temporal Ctx 


25.0 


25.2 


Control 
(Path) 4 
Occipital 
Ctx 


16.7 


20.6 


AD 6 Sup 
Temporal Ctx 


22.7 


17.1 


Control 1 
Parietal Ctx 


9.8 


4.6 


Control 1 
Temporal Ctx 


2.1 


6.1 


Control 2 
Parietal Ctx 


8 2 


8 4 


Control 2 
Temporal Ctx 


82.4 


51.8 


Control 3 
Parietal Ctx 


26.6 


24.7 


Control 3 
Temporal Ctx 


31.2 


20.2 


Control 
(Path) 1 
Parietal Ctx 


94.6 


66.9 


Control 4 
Temporal Ctx 


11.6 


12.9 


Control 
(Path) 2 
Parietal Ctx 


19.3 


24.8 


Control (Path) 
1 Temporal Ctx 


77.9 


56.3 


Control 
(Path) 3 
Parietal Ctx 


2.6 


3.8 
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Control (Path) 
2 Temporal Ctx 


17.0 


12.3 


Control 
(Path) 4 
Parietal Ctx 


35.8 


23.0 



Table RD. Panel 1.3D 



l issue rxame 


Rel. Exp.(%) Ag2963, 
Run 167809191 


i issue rName 


Rel. Exp.(%) Ag2963, 
Run 167809191 


Liver adenocarcinoma 


82.4 


Kidney (fetal) 


45.4 


Pancreas 


50.0 


Renal ca. 786-0 


53.6 


Pancreatic ca. CAP AN 
2 


10.7 


Renal ca. A498 


19.6 


Adrenal gland 


14.2 


Renal ca. RXF 393 


21.2 


Thyroid 


26.2 


Renal ca. ACHN 


12.5 


Salivary gland 


21.2 


Renal ca. UO-31 


13.7 


Piti i itn r\/ ex 1 c5 n H 


^1 6 


JVCIldl L>d. 1 JV- 1 \J 




Brain (fetal) 


40.9 


Liver 


49.7 


Brain (whole) 


71.7 


Liver (fetal) 


45.1 


Brain (amygdala) 


11.6 


Liver ca. 

(hepatoblast) HepG2 


16.3 


Brain (cerebellum) 


62.4 


Lung 


26.4 


Brain (hippocampus) 


47.0 


Lung (fetal) 


66.0 


Brain (substantia nigra) 


89.5 


Lung ca. (small cell) 
LX-1 


42.0 


Brain (thalamus) 


53.2 


Lung ca. (small cell) 
NCI-H69 


21.6 


Cerebral Cortex 


92.7 


Lung ca. (s.cell var.) 
SHP-77 


59.9 


Spinal cord 


30.6 


Lung ca. (large 
cell)NCI-H460 


12.9 


glio/astro U87-MG 


28.5 


Lung ca. (non-sm. 
cell) A549 


37.1 


glio/astro U-118-MG 


44.4 


Lung ca. (non-s.cell) 
NCI-H23 


47.3 


astrocytoma SW1783 


50.3 


Lung ca. (non-s.cell) 
HOP-62 


11.8 


neuro*; met SK-N-AS 


40.6 


Lung ca. (non-s.cl) 
NCI-H522 


16.8 


astrocytoma SF-539 


37.1 


Lung ca. (squam.) 
SW 900 


29.3 


astrocytoma SNB-75 


49.3 


Lung ca. (squam.) 
NCI-H596 


51.1 


glioma SNB-19 


31.0 


Mammary gland 


70.2 


glioma U25 1 


52.9 


Breast ca.* (pl.ef) 
MCF-7 


15.2 
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II 5.1 



-a 



glioma SF-295 


27.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


29.7 


Heart (fetal) 


90.8 


r>reasi ca. ^pi.eij 
T47D 


41.2 


neari 


ZZ. / 


r>reasi ca. r> i -dh y 


Zoo 


Skeletal muscle (fetal) 


83.5 


Breast ca. MDA-N 


54.3 


Skeletal muscle 


57.0 


Ovary 


47.0 


Bone marrow 


25.2 


Ovarian ca. OVCAR- 

3 


26.1 


Thymus 


74.7 


Ovarian ca. OVCAR- 
4 


18.7 


Spleen 


28.1 


Ovarian ca. OVCAR- 


82.9 


Lymph node 


54.7 


Ovarian ca. OVCAR- 

o 
O 


3.4 


Colorectal 


7.9 


Ovarian ca. IGROV- 
1 


22.2 


Stomach 


7.0 


Ovarian ca.* (ascites) 
SK-OV-3 


100.0 


Small intestine 


Zd.5 


Uterus 


Z5. i 


Colon ca. SW480 


25.9 


Placenta 


3.2 


Colon ca.* 
owozU(bW4dU met) 


40.1 


Prostate 


22.1 


Colon ca. HT29 


24.1 


Prostate ca.* (bone 
metjrL.-3 


29.3 


Colon ca. HCT-116 


35.8 


Testis 


19.3 


Colon ca. CaCo-2 


52.9 


Melanoma 
Hs688(A).T 


19.5 


Colon ca. 


4.4 


Melanoma* (met) 


22.4 


Colon ca. HCC-2998 


31.9 


Melanoma UACC-62 


42.9 


Gastric ca.* (liver met) 
NCI-N87 


27.2 


Melanoma M14 


8.3 


Bladder 


20.6 


Melanoma LOX 
IMVI 


25.0 


Trachea 


11.7 


Melanoma* (met) 
SK-MEL-5 


23.3 


Kidney 


56.6 


Adipose 


21.0 



Table RE. Panel 4D 





Rel. 


Rel. 




Rel. 


Rel. 


Tissue Name 


Exp.(%) 


Exp.(%) 


Tissue Name 


Exp.(%) 


Exp.(%) 


Agl292, 


Ag2963, 


Agl292, 


Ag2963, 




Run 


Run 




Run 


Run 
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] 138719232 


164333805 




138719232 


164333805 


Secondary Thl act ] 14.5 


18.6 


HUVEC IL-lbeta 


7.6 


3.0 


Secondary Th2 act 


11.3 


47 Q Ihuvec ifn 

' Jgamma 


13.5 


31.6 


Secondary Trl act 


23.2 


29.1 


HUVEC TNF 
aloha + IFN 
gamma 


2.2 


26.1 


Secondary Th 1 rest 


4.0 


8.8 


HUVEC TNF 
alpha + IL4 


5.3 


D J.Z 


Secondary Th2 rest 


5.0 


1 o o 

1 8.2 


HUVLC 1L-1 1 


2.8 


26.6 


Secondary Trl rest 




2.2 


Lung 

Microvascular EC 
none 


7.1 


62.0 


Primary Th 1 act 


19.1 


55.9 


Lung 

Microvascular EC 

T^X T „ 1 1 i TT 

TNFalpha + IL- 
lbeta 


2.7 


40.9 


Primary Th2 act 


16.4 


33.4 


Microvascular 
Dermal EC none 


8.8 


41.2 


Primary Trl act 


25.3 


66.9 


Microsvasular 
Dermal EC 
l JNralpna + IL- 
lbeta 


7.5 


31.6 


Primary Th 1 rest 


13.6 


33.0 


Bronchial 
epithelium 
TNFalpha + 
ILlbeta 


8.7 


37.1 


Primary Th2 rest 


9.3 


16.7 


Small airway 
epithelium none 


1.7 


8.4 


Primary Trl rest 


18.8 


23.0 


omau airway 
epithelium 
TNFalpha + IL- 
lbeta 


ft ft 


14 ? 


CD45RA CD4 
lymphocyte act 


7.9 


39.0 


Coronery artery 
SMC rest 


■S^ 9 




CD45RO CD4 
lymphocyte act 


10.3 


27.2 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


5.3 


14.6 


CD8 lymphocyte 
act 


9.9 


29.7 


Astrocytes rest 


5.8 


24.1 


Secondary CD8 
lymphocyte rest 


11.9 


44.4 


Astrocytes 
TNFalpha + IL- 
lbeta 


4.4 


19.5 


Secondary CD8 
lymphocyte act 


13.0 


22.8 


KU-812 
(Basophil) rest 


16.0 


66.9 


CD4 lymphocyte 
none 


6.3 


21.3 


KU-812 
(Basophil) 


8.1 


87.7 
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PMA/ionomycin 






2ry 

Thl/Th2/Trl anti- 
CD95CH11 


7.9 


20.3 


CCD1 106 

(Keratinocytes) 

none 


5.8 


— - 

41.2 


LAK cells rest 


10.1 


39.5 


CCD 11 06 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


6.5 


11.5 


LAK cells IL-2 


5.7 


33.2 


Liver cirrhosis 


4.9 


7.9 


LAK cells IL-2+IL- 
12 


9.0 


34.2 


Lupus kidney 


4.5 


9.2 


LAK cells IL- 
2+IFN gamma 


12.4 


31.9 


NCI-H292 none 


11.5 


63.7 


LAK cells IL-2+ 
IL-18 


16.8 


16.2 


NCI-H292 IL-4 


17.7 


77.9 


LAK cells 
PMA/ionomycin 


15.4 


50.7 


NCI-H292 1L-9 


59.5 


54.3 


NK Cells IL-2 rest 


7.6 


22.8 


NCI-H292 IL-13 


44.1 


63.7 


Two Way MLR 3 
day 


7.7 


49.0 


NCI-H292 I FN 
gamma 


10.4 


89.5 


Two Way MLR 5 
day 


5.4 


29.9 


HPAEC none 


8.2 


28.1 


Two Way MLR 7 
day 


5.3 


17.0 


HPAEC TNF 
alpha + IL-1 beta 


9.6 


30.1 


PBMC rest 


5.2 


11.2 


Lung fibroblast 
none 


8.7 


28.7 


PBMC PWM 


17.1 


100.0 


Lung fibroblast 
TNF alpha + IL-1 
beta 


4.5 


6.9 


PBMC PHA-L 


10.9 


51.4 


Lung fibroblast 
IL-4 


10.2 


53.6 


Ramos (B cell) 
none 


24.1 


88.3 


Lung fibroblast 
IL-9 


9.7 


49.7 


Ramos (B cell) 
ionomycin 


14.6 


92,7 


Lung fibroblast 
IL-13 


22.1 


44.1 


B lymphocytes 
PWM 


11.8 


59.0 


Lung fibroblast 
IFN gamma 


9.5 


29.5 


B lymphocytes 
CD40L and IL-4 


20.7 


21.8 


Dermal fibroblast 
CCD 1070 rest 


32.3 


37.6 


EOL-1 dbcAMP 


23.3 


32.8 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


100.0 


39.0 


EOL-1 dbcAMP 
PMA/ionomycin 


4.2 


25.5 


Dermal fibroblast 
CCD1 070 IL-1 
beta 


14.3 


28.7 


Dendritic cells 


4.3 


22.8 


Dermal fibroblast 


7.5 


17.8 
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none 




jlFN gamma 






Dendritic cells LPS 


4.4 


10.5 


jDermal fibroblast 
[IL-4 


30.4 


48.6 


Dendritic cells anti- 
CD40 


6.4 


24.3 


JlBD Colitis 2 


1.3 


1.2 


Monocytes rest 


7.6 


38.4 


jlBD Crohn's 


2.6 


2.2 


Monocytes LPS 


4.5 


8.0 


jColon 


7.3 


37.9 


Macrophages rest 


6.3 


21.5 


[Lung 


5.8 


22.8 


Macrophages LPS 


5.5 


1.9 


|Thymus 


12.7 


39.2 


HUVEC none 


4.8 


30.8 


jKidney 


19.1 


98.6 


HUVEC starved 


7.8 


27.7 j 







CNS neurodegeneratiou_vl.O Summary: Ag2963 The NOV21a gene, a secretory 
protease homolog, appears to be downregulated in the temporal cortex of Alzheimer's disease 
patients when compared to non-demented controls. Up regulation of this protease may 
5 therefore be of use in the treatment of Alzheimer's, particularly because Alzheimer's disease is 
believed to result at least in part from the improper processing of proteins (APP, Tau). This 
protease may serve to lower the levels of these disease proteins and ameliorate the 
dementia/pathology associated with Alzheimer's. 

Panel 1.3D Summary: Ag2963 The NOV21a gene, a putative secretory serine- 
10 protease, is widely expressed in this panel. Highest expression is in an ovarian cancer cell line 
(CT=32), with expression detected in all cancer cell lines in this panel. Thus, inhibition of the 
protease domain might lead to a decrease in cell survival and proliferation and serve as a small 
molecule target in cancer. 

This gene product also has low levels of expression in pancreas, thyroid, pituitary, 
15 adipose, and adult and fetal types of heart, skeletal muscle and liver. Therefore, this serine 

protease-like gene product may be a small molecule target for the treatment of endocrine and 
metabolic diseases, including obesity and Types 1 and 2 diabetes. 

The expression in this panel further confirms expression of this gene in the CNS. 
Please see CNS neurodegeneration for discussion of utility of this gene in the CNS. 
20 Panel 4D Summary: Agl292/Ag2693 The NOV21a transcript is expressed on most 

tissues in panel 4D. This widespread expression is consistent with the results in Panel 1 .3D. 
This transcript encodes a serine protease like protein with potential enzymatic activity and 
may important in maintaining normal cellular functions in a number of tissues. Therefore, 
therapies designed with the protein encoded by this transcript could be important in regulating 
25 cellular viability or function. 
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NOV22a 



Expression of gene NOV22a was assessed using the primer-probe set Ag2964, 
described in Table SA. 

Table SA. Probe Name Ag2964 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gaaatacgcaggaaggaaatct- 3 1 


22 


473 


1063 


Probe 


TET-5 ' -cgtgggcatcatagaccagaaaacct-3 1 - j 
TAMRA 


26 


507 


1064 


Reverse 


5 1 -ggtctgtgagggagaagttgta- 3 1 


22 


544 


1065 



CNS_neurodegeneration_vl.O Summary: Ag2964 Expression of the NOV22a gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) The amp plot 
indicates that there is a high probability of a probe failure. 

Panel 1.3D Summary: Ag2964 Expression of the NOV22a gene is low/undetectable 
10 in all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

Panel 4D Summary: Ag2964 Expression of the NOV22a gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 



15 



NOV23a 



Expression of gene NOV23a was assessed using the primer-probe set Ag2967, 
described in Table TA. 

Table TA. Probe Name Ag2967 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' - ctcaaggaggacttccgtaag- 3 1 j 


21 i 


478 


1066 


Probe 


TET-5 ' -atcttcctctgcagaaagccactgtg-3 1 - i 
TAMRA 


26 


500 


1067 


Reverse 


5 ' -agtgagtgaatggccgtaca-3 1 


20 


557 


1068 



20 CNS_neurodegeneration_vl.O Summary: Ag2967 Expression of the NOV23a gene 

is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) The amp plot 
indicates that there is a high probability of a probe failure. 
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Panel 1.3D Summary: Ag2967 Expression of the NOV23a gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

Panel 4D Summary: Ag2967 Expression of the NOV23a gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

NOV23a and NOV23b 

Expression of gene NOV23a and variant NOV23b was assessed using the primer-probe 
set Ag2996, described in Table UA. Results of the RTQ-PCR runs are shown in Table UB. 



Table UA. Probe Name Ag2996 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' - cagaggttacgccctgaagt - 3 1 


20 


S4 


1069 


Probe 


TET-5 * - ctcagatcctgggccaggcactg- 3 1 - 
TAMRA 


- ! 


77 


1070 


Reverse 


5 1 -cagaaagaggtcctgctcatg-3 ' 




113 ' 


1071 



Table UB. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2996, Run 
165296353 


Tissue Name 


Rel. Exp.(%) 
Ag2996, Run 
165296353 


Secondary Th 1 act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Th 1 rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


23.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


< 

Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 


0.0 
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[ |TNFalpha + IL-1 beta j 


CD45RA CD4 
lymphocyte act 


! 0.0 


|coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


j 0.0 


Coronery artery SMC 
TNFalpha + IL-1 beta 


0.0 


CD8 lymphocyte act 


1 0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lyiiipiiocy ic aci 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


9r\/ Th 1 /ThO /Trl cinti 

zry ini/inz/iri ami- 
CD95CH11 


0.0 


CCD1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


CCD 1 106 (Keratinocytes) 

1 INI CI I LJ1 \CL 1 1 L/ 1 YJ\*> Lcl 


0.0 


LAK cells IL-2 






inn n 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 II -1 3 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


A A 


HrAhL none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


0.0 


PR MP rp<;t 


0.0 


Lung fibroblast none 


0.6 


PBMC PWM 


11.3 


Lung fibroblast TNF alpha! 
+ IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


14.6 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


15.3 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 
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Dendritic cells LPS 


0.0 


[Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


JlBD Colitis 2 


0.0 


Monocytes rest 


0.0 


|IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


jColon 


0.0 


Macrophages rest 


0.0 


jLung 


49.0 


Macrophages LPS 


0.0 


jThymus 


0.0 


HUVEC none 


0.0 


jKidney 


0.0 


HUVEC starved 


0.0 j 





Panel 1.3D Summary: Ag2996 Results from one experiment with the NOV23a gene 
are not included. The amp plot indicates that there were experimental difficulties with this run. 

Panel 4D Summary: Ag2996 Significant expression of the NOV23a gene is detected 
5 in a liver cirrhosis sample and normal lung tissue(CTs=33-35). Thus, antibodies to this protein 
product could potentially be used for the diagnosis of liver cirrhosis or as a marker of normal 
lung tissue. Furthermore, therapeutic modulation of the expression or function of this gene 
may reduce or inhibit fibrosis that occurs in liver cirrhosis. 

NOV24a 

10 Expression of gene NOV24a was assessed using the primer-probe set Ag2934, 

described in Table VA. 

Table VA. Probe Name Ag2934 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gaatctaattccgcaggaaatg-3 * 


22 


1645 


1072 


Probe 


TET-5 1 -cgtgtttgccaaattctcagcgttta-3 ' - 
TAMRA 


26 


1668 


1073 


Reverse 


5' -tcttttcatgaacctcatttgc-3 ' 


22 


1714 


1074 



CNS_neurodegeneration_vl.O Summary: Ag2934 Expression of the NOV24a gene 
15 is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 1.3D Summary: Ag2934 Expression of the NOV24a gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2934 Expression of the NOV24a gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
20 high probability of a probe failure. 



NOV25 
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Expression of gene NOV25 was assessed using the primer-probe sets Ag2935 and 
Ag3039, described in Tables WA and WB. Results of the RTQ-PCR runs are shown in Tables 
WC, WD and WE. 



Table WA. Probe Name Ag2935 



5 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5' -ctggactctacctcggaaactt-3 1 


22 


88 


1075 


Probe 


TET-5 ' -ctgggccgaaataagatcacacacat-3 • - 
TAMRA 


26 


135 


1076 


Reverse 


5 ' -gggtgactcatggatagagatg-3 1 


22 


161 


1077 


Table WB. Probe Name Ag3039 


Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 * -gccgaaataagatcacacacat-3 ' 


22 


139 


1078 


Probe 


TET-5 * -tctatccatgagtcaccccagcctct-3 ' - ; 
TAMRA 


26 


165 


1079 


Reversed 


5 1 -atgcgaaggtaggtgatatcct-3 1 


22 jl96 


1080 



Table WC. CNS_neurodegeneration_vl.O 



Tissue 
Name 


Rel. Exp.(%) 
Ag2935, Run 
209777519 


Rel. Exp.(%) 
Ag3039, Run 
211012103 


Tissue 
Name 


Rel. Exp.(%) 
Ag2935, Run 
209777519 


Rel. Exp.(%) 
Ag3039, Run 
211012103 


AD 1 Hippo 


34.2 


18.4 


Control 
(Path) 3 
Temporal 
Ctx 


11.9 


8.2 


AD 2 Hippo 


59.5 


48.0 


Control 
(Path) 4 
Temporal 
Ctx 


33.7 


36.3 


AD 3 Hippo 


15.3 


9.8 


AD 1 

Occipital 

Ctx 


24.3 


9.5 


AD 4 Hippo j 


18.7 


13.6 


AD 2 

Occipital 

Ctx 

(Missing) 


0.0 


0.0 


AD 5 Hippo 


90.8 


70.2 


AD 3 

Occipital 

Ctx 


12.9 


6.3 


AD 6 Hippo | 


50.7 


69.3 


AD 4 

Occipital 

Ctx 


32.8 


20.9 


Control 2 


36.3 


25.5 


AD 5 


66.9 


18.3 



980 



1 



Hippo 






Occipital 
Ctx 






Control 4 
Hippo 


33.4 


24.0 


AD 6 

Occipital 

Ctx 


28.9 


43.2 


Control 
(Path) 3 
Hippo 


7.2 


7.6 


Control 1 
Occipital 
Ctx 


7.6 


6.0 


AD 1 

Temporal 

Ctx 


20.7 


24.3 


Control 2 
Occipital 
Ctx 


90.8 


57.0 


AD 2 

Temporal 

Ctx 


53.2 


36.9 


Control 3 
Occipital 
Ctx 


24.7 


18.7 


AD 3 

Temporal 

Ctx 


5.8 


4.7 


Control 4 
Occipital 
Ctx 


17.7 


13.9 


AD 4 

Temporal 

Ctx 


35.8 


24.5 


Control 
(Path) 1 
Occipital 
Ctx 


93.3 


74.2 


AD 5 Inf 
Temporal 
Ctx 


100.0 


100.0 


Control 
(Path) 2 
Occipital 
Ctx 


19.1 


14.8 


AD 5 Sup 

T" 1 1 

Temporal 
Ctx 


81 .2 


62.9 


Control 
(Path) 3 
Occipital 
Ctx 


8.6 


4.3 


AL) o int 
Temporal 
Ctx 


62.4 


58.2 


Control 
(Path) 4 
Occipital 
Ctx 


27.9 


25.2 


AD 6 Sup 
Temporal 
Ctx 


50.3 


49.3 


Control 1 
Parietal Ctx 


16.2 


15.9 


Control 1 
Temporal 
Ctx 


14.5 


11.6 


Control 2 
Parietal Ctx 


68.3 


58.2 


Control 2 
Temporal 
Ctx 


51.4 


34.4 


Control 3 
Parietal Ctx 


31.0 


32.1 


Control 3 
Temporal 
Ctx 


33.0 


20.0 


Control 
(Path) 1 
Parietal Ctx 


69.7 


66.9 


Control 3 
Temporal 
Ctx 


19.6 


20.7 


Control 
(Path) 2 
Parietal Ctx 


31.2 


39.0 
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Control 
(Path) 1 
Temporal 
Ctx 


55.1 


44.4 


Control 
(Path) 3 
Parietal Ctx 


3.1 


4.6 


Control 
(Path) 2 
Temporal 
Ctx 


56.3 


30.4 


Control 
(Path) 4 
Parietal Ctx 


61.6 


35.8 



Table WD. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Ag2935, Run 

lO / 04004!/ 


Rel. Exp.(%) 
Ag3039, Run 

JLo /VOlolo 


Tissue Name 


Rel. Exp.(%) 
Ag2935, Run 

10 /040o.4V 


Rel. Exp.(%) 
Ag3039, Run 
lb /yololo 


Liver 

adenocarcinoma 


1.4 


1.7 


Kidney (fetal) 


49.3 


38.2 


Pancreas 


1.2 


2.0 


Kenai ca. /oo- 
0 


0.0 


0.0 


Pancreatic ca. 

v_x/-Vx /-VIM A 


0.0 


0.0 


Renal ca. 


1.7 


0.4 


Adrenal gland 


3.1 


1.2 


Renal ca. RXF 

393 


0.0 


0.1 


Thyroid 


11.7 


6.0 


Renal ca. 
ACHN 


5.7 


1.5 


Salivary gland 


1.3 


0.8 


Renal ca. UO- 

"> i 
31 


0.0 


0.0 


Pituitary gland 


6.3 


3.0 


Kenai ca. l is.- 
10 


0.0 


0.0 


Brain (fetal) 


10.0 


7.4 


Liver 


0.7 


0.3 


Brain (whole) 


12.8 


7.7 


Liver (fetal) 


1.8 


0.7 


Brain (amygdala) 


6.9 


6.0 


Liver ca. 

(hepatoblast) 

HepG2 


1.0 


0.1 


Brain (cerebellum) 


9.3 


5.9 ! 


Lung 


1.3 


1.3 


Brain 

(hippocampus) 


12.1 


3.9 


Lung (fetal) 


6.5 


3.0 


Brain (substantia 
nigra) 


24.0 


16.0 


Lung ca. 
(small cell) 
LX-1 


0.3 


0.0 


Brain (thalamus) 


5.1 


3.8 


Lung ca. 
(small cell) 
NCI-H69 


2.8 


1.5 


Cerebral Cortex 


15.6 


10.8 


Lung ca. 
(s.cell var.) 
SHP-77 


7.2 


3.7 


Spinal cord 


37.4 


15.2 


Lung ca. (large 


0.0 


0.1 



.-i! il I! IT- 
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t 


cell)NCI-H46o| | 


glio/astro U87-MG 


0.0 


0.2 


Lung ca. (non- j 
sm. cell) A549J 


1.6 


glio/astro U-118- 
MG 


0.4 


0.1 


Lung ca. (non- I 

s.cell) NCI- 4.0 

H23 1 


0.4 


astrocytoma 
SW1783 


3.5 


0.6 


Lung ca. (non- 1 1 1 
s.cell) HOP-62 1 


0.0 


neuro*; met SK-N- 
AS 


0.8 


0.3 


Lung ca. (non- I 

s.cl) NCI- | 3.9 

H522 J 


1.4 


astrocytoma SF- 
539 


3.0 


1.5 


Lung ca. 
(squam.) SW 
900 


2.7 


0.7 


astrocytoma SNB- 

/J 


2.5 


1.2 


Lung ca. 
(squam.) NCI- 
H596 


12.2 


5.7 


„1 ' , m „ C\1D 1 C\ 

glioma c>J\lr>-lv 


2.7 


O 1 

2.1 


Mammary 
gland 


3.9 


2.4 


glioma UzM 


1 .0 


0.3 


Breast ca.* 
(pl.ef) MCF-7 


0.7 


0.7 


glioma SF-295 


2.9 


2.3 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


0.4 


0.1 


Heart (fetal) 


24.0 


14.5 


Breast ca.* 

i T/i 7r\ 
(pi.eij 


23.3 


13.5 


Heart 


5.6 


3.4 


DlCclol Ld. L> 1 

549 


0.5 


1.2 


Skeletal muscle 
(fetal) 


11.3 


5.1 


Breast ca. 
MDA-N 


6.2 


6.4 


Skeletal muscle 


1.1 


0.0 


Ovary 


11.4 


3.9 


Bone marrow 


0.0 


0.4 


Ovarian ca. 
OVCAR-3 


1.0 


0.3 




1 A 


n o 
u.z 


Ovarian ca. 
OVCAR-4 


21.3 


1 J.O 


Spleen 


3.5 


3.1 


Ovarian ca. 
OVCAR-5 


5.6 


1.5 


Lymph node 


0.7 


0.9 


Ovarian ca. 
OVCAR-8 


0.1 


0.0 


Colorectal 


1.5 


0.4 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach 


2.5 


0.6 


Ovarian ca.* 
(ascites) SK- 
OV-3 


0.4 


0.4 


Small intestine 


3.4 


1.0 


Uterus 


3.0 


2.2 


Colon ca. SW480 


8.8 


6.7 


Placenta 


0.6 


0.0 
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Colon ca.* 

SW620(SW480 

met) 


0.0 


0.0 


Prostate 


3.1 


2.7 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met)PC- 
3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Testis 


100.0 


100.0 


Colon ca. CaCo-2 


23.0 


12.2 


Melanoma 
Hs688(A).T 


0.0 


0.1 


Colon ca. 
tissue(OD03866) 


1.3 


1.2 


Melanoma* 
(met) 

Hs688(B).T 


0.0 


0.0 


Colon ca. HCC- 
2998 


6.4 


3.0 


Melanoma 
UACC-62 


12.0 


5.2 


Gastric ca.* (liver 
met) NCI-N87 


2.7 


1.2 


Melanoma 
M14 


2.7 


0.8 


Bladder 


3.1 


1.8 


Melanoma 
LOX IMVI 


0.0 


0.0 


Trachea 


1.4 


0.6 


Melanoma* 
(met) SK- 
MEL-5 


6.5 


2.0 


Kidney 


84.7 


41.8 


Adipose 


5.0 


0.6 



Table WE. Panel 4D 



Tissue Name 


Rel. 
Exp.(%) 
Ag2935, 
Run 
164403313 


Rel. 
Exp.(%) 
Ag3039, 
Run 
162427949 


Tissue Name 


Rel. 

Exp.(%) 
Ag2935, 
Run 
164403313 


Rel. 

Exp.(%) 
Ag3039, 
Run 
162427949 


Secondary Th 1 act 


0.0 


0.0 


HUVECIL-lbeta 


0.2 


0.0 


Secondary Th2 act 


0.0 


0.0 


HUVEC IFN 
gamma 


0.2 


0.3 


Secondary Trl act 


0.0 


0.0 


HUVEC TNF 
alpha + IFN 
gamma 


0.0 


0.0 


Secondary Th 1 rest 


0.0 


0.0 


HUVEC TNF 
alpha + IL4 


0.0 


0.0 I 


Secondary Th2 rest 


0.0 


0.0 


HUVEC IL-1 1 


0.0 


0.3 


Secondary Trl rest 


0.0 


0.0 


Lung 

Microvascular EC 
none 


o:o 


1.3 


Primary Th 1 act 

., i 


0.0 


0.0 


Lung 

Microvascular EC 
TNFalpha + 1L- 
lbeta 


0.0 


0.0 
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Primary Th2 act 


0.0 


0.0 


Microvascular 
Dermal EC none 


0.3 


0.3 


Primary Trl act 


0.0 


0.0 


Microsvasular 
Dermal EC 
TNFalpha + IL- 
lbeta 


0.3 


0.4 


Primary Thl rest 


0.0 


0.0 


Bronchial 
epithelium 
TNFalpha + 
ILlbeta 


1.7 


1.4 


Primary Th2 rest 


0.0 


0.0 


Small airway 
epithelium none 


0.2 


1.1 


P r t rvi M r a/ TVI rf*ct 

JTlllIiClIjr 111 1 Col 


0 0 

u.u 


u.u 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


n t. 
\j.j> 


n 'x 
yj.D 


CD45RA CD4 
lymphocyte act 


0 4 


0 0 
u.u 


Coronery artery 
SMC rest 


u.z, 


u.u 


CD45RO rr>4 
lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


0.0 


0.0 


CD8 lymphocyte 
act 


0 1 

V/. 1 


0 0 

u.u 




1 0 

1 . y 


H-.U 


Secondary CD8 

Ivmnhnrvtp rp^t 


0.0 


0.0 


Astrocytes 
TNFalpha + IL- 
lbeta 


2.4 


2.3 


Secondary CD8 
lymphocyte act 


0.0 


0.0 


KU-812 
(Basophil) rest 


0.0 


0.0 


UD4 lymphocyte 
none 


0.0 


0.0 


KU-812 

(Basophil) 

PMA/ionomycin 


0.0 


0.0 


2ry 

Thl/Th2/Trl anti- 
CD95CH11 


0.0 


0.0 


CCD 1106 

(Keratinocytes) 

none 


0.8 


0.4 


LAK cells rest 


0.6 


0.1 


CCD 1106 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


0.5 


0.4 


LAN cells 1I_,-Z 


u.u 


A /i 

0.4 


Liver cirrhosis 


A A 

U.U 


U. / 


LAK cells IL-2+IL- 
12 


0.3 


0.0 


Lupus kidney 


4.0 


5.2 


LAK cells IL- 
2+IFN gamma 


0.0 


0.4 


NCI-H292 none 


1.4 


3.5 


LAK cells IL-2+ 
IL-18 


0.8 


0.8 


NCI-H292 IL-4 


1.2 


0.9 


LAK cells 
PMA/ionomycin 


0.3 


0.0 


NC1-H292 IL-9 


1.6 


1.8 
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NK Cells IL-2 rest 


0.0 


0.0 


NCI-H292 IL-13 


2.1 


1.2 


Two Way MLR 3 
day 


0.0 


0.0 


NCI-H292 IFN 
gamma 


2.3 


3.6 


Two Way MLR 5 
day 


0.0 


0.0 


HPAEC none 


0.0 


0.0 


Two Way MLR 7 
day 


0.0 


0.0 


HPAEC TNF 
alpha + IL-1 beta 


0.0 


0.0 


PBMC rest 


0.0 


0.3 


Lung fibroblast 
none 


0.0 


0.0 


PBMC PWM 


0.2 


2.5 


Lung fibroblast 
TNF alpha + IL-1 
beta 


0.0 


0.0 


PBMC PHA-L 


0.4 


1.6 


Lung fibroblast 
IL-4 


0.0 


0.0 


Ramos (B cell) 
none 


0.0 


0.0 


Lung fibroblast 
IL-9 


0.0 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


0.0 


Lung fibroblast 
IL-13 


0.0 


0.5 


B lymphocytes 
PWM 






Lung fibroblast 
IFN gamma 


u.u 


U.U 


B lymphocytes 
CD40L and IL-4 


1 .u 




Dermal fibroblast 
CCD 1070 rest 


U.U 


u.u 


EOL-1 dbcAMP 


0.5 


0.6 


Dermal fibroblast 
CCD1070 TNF 
alpha 


0.0 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.3 

■ 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 
beta 


0.0 


0.0 


. . ■ s - 
Dendritic cells 

none 


0.0 


0.0 


Dermal fibroblast i 
IFN gamma 


0.0 


0.0 


Dendritic cells LPS 


0.1 


0.4 


Dermal fibroblast j 
IL-4 


0.0 


0.0 


Dendritic cells anti- 
CD40 


0.3 


0.0 


IBD Colitis 2 


0.0 


0.0 


Monocytes rest 


0.0 


0.0 


IBD Crohn's 


0.2 


1.1 


Monocytes LPS 


0.0 


0.4 


Colon 


2.1 


2.5 


Macrophages rest 


1.1 


1.0 


Lung 


2.1 


5.1 


Macrophages LPS 


0.3 


0.4 


Thymus 


100.0 


100.0 


HUVEC none 


0.0 


0.6 


Kidney 


3.7 


3.1 


HUVEC starved 


0.0 


0.0 







CNS neurodegeneration vl.O Summary: Ag2935/Ag3039 No differential 
expression of the NOV25 gene is detected between the postmortem brains of Alzheimer's 
diseased patients and those of non-demented controls. However, this panel confirms the 
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expression of this gene in the CNS. Please see panel 1 3D for a discussion of utility of this 
gene in the central nervous system. 

Panel 1.3D Summary: Ag2935/Ag3039 The expression of the NOV25 gene was 
assessed in two independent runs with good concordance between runs. Highest expression is 
5 seen in the testis (CTs=29). In addition, expression of this gene is extremely low in renal and 
brain cancer cell lines but is expressed in the normal brain and kidney tissues on this sample. 
Therefore, this gene may be used as a diagnostic marker for brain and kidney cancer and 
prostate tissue. Furthermore, therapeutic modulation of the expression or function of this gene 
may be effective in the treatment of brain and renal cancers. 
10 This gene encodes a novel protein phosphatase expressed at moderate to low levels in 

the CNS that may therefore be a small molecule target for the treatment of neurologic 
diseases. 

In addition, this gene is expressed at low levels in metabolic tissues including pancreas, 
adrenal, thyroid, pituitary, adult and fetal heart, and adipose. This novel protein phosphatase 

15 may be a small molecule target for the treatment of metabolic and endocrine disease, including 
obesity and Types 1 and 2 diabetes. This gene is also differentially expressed in fetal skeletal 
muscle (CT values = 32-33) when compared to expression in adult skeletal muscle (CT values 
= 35-40). Therefore, expression of this gene may also be useful for the differentiation of adult 
and fetal skeletal muscle. 

20 Panel 4D Summary: Ag2935/Ag3039 Expression of the NOV25 gene is highest and 

almost exclusive to the thymus (CTs=29-30). Expression of this gene could be used to 
distinguish thymus from the other samples on this panel. The putative phosphatase encoded by 
this gene may play an important role in T cell development. Small molecule therapeutics 
designed against the protein encoded by this gene could therefore be utilized to modulate 

25 immune function (T cell development) and be important for organ transplant, AIDS treatment 
or post chemotherapy immune reconstitution. 

NOV26a and NOV26b 

Expression of gene NOV26a and variant NOV26b was assessed using the 
primer-probe set Ag2936, described in Table XA. 
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Table XA. Probe Name Ag2936 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ■ -gttcctgctgtctggactttt-3 1 


21 


1 


1081 
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Probe l TET ~ 5 ' " cccactgagacgcagctgtattctgt-3 1 - 
r ° e jTAMRA 


26 ^27 


1082 


Reversejs • - 1 cgccaaatcatatt tcacact - 3 * 


22 j57 


1083 



CNSneurodegenerationvl.O Summary: Ag2936 Expression of the NOV26a gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) The amp plot 
indicates that there is a high probability of a probe failure. 

Panel 1.3D Summary: Ag2936 Expression of the NOV26a gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

Panel 4D Summary: Ag2936 Expression of the NOV26a gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

NOV24a and NOV24b 

Expression of gene NOV24a and variant NOV24b was assessed using the primer-probe 
set Ag2966, described in Table YA. Results of the RTQ-PCR runs are shown in Tables YB 
and YC. 

Table YA. Probe Name Ag2966 



i 

Primers J Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forwardjs ' -agcgggcaactct tcatcta- 3 1 


20 


1356 


1084 


Probe j TE T- 5 ' -atgcagtcagtgaccagctccctg-3 ' - 
r ° e JTAMRA 


24 


1377 


1085 


Reversejs 1 - caggacaaagactgcagtcact - 3 ' 


22 


1409 
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Table YB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Ag2966, Run 
160658385 


Rel. Exp.(%) 
Ag2966, Run' 
165701959 ! 


Tissue Name 


Rel. Exp.(%) 
Ag2966, Run 
160658385 


Rel. Exp.(%) 
Ag2966, Run 
165701959 


Liver 

adenocarcinoma 


0.0 


0.0 


Kidney (fetal) 


1.0 


0.0 


Pancreas 


0.2 


1.3 


Renal ca. 786- '■ 
0 


0.2 


0.0 


Pancreatic ca. 
CAPAN 2 


0.0 


0.0 


Renal ca. 
A498 


0.0 


0.3 


Adrenal gland 


0.2 


0.0 


Renal ca. RXF 

393 


0.0 


0.5 


Thyroid 


0.3 


0.0 


Renal ca. 
ACHN 


0.3 


0.0 
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Salivary gland 


0.1 


0.4 


Renal ca. UO- 
j 1 


0.0 


0.0 


Pituitary gland 


0.0 


0.0 


Renal ca. TK- 


0.0 


0.0 


Brain (fetal) 


0.0 


0.0 


Liver 


0.6 


0.9 


Brain (whole) 


0.0 


0.3 


Liver (fetal) 


0.4 


0.0 


Brain (amygdala) 


0.0 


0.0 


Liver ca. 

(hepatoblast) 

HepG2 


0.3 


0.0 


Brain (cerebellum) 


0.0 


0.0 


Lung 


0.8 


0.6 


Brain 

(hippocampus) 


0.5 


0.3 


Lung (fetal) 


1.4 


1.4 


Brain (substantia 
nigra j 


0.0 


0.0 


Lung ca. 
(small cell) 
LX-1 


0.6 


0.0 


Brain (thalamus) 


0.0 


0.5 


Lung ca. 
(small cell) 
NCI-H69 


0.0 


0.0 


Cerebral Cortex 


0.0 


0.0 


Lung ca. 
(s.cell var.) 
SHP-77 


0.3 


0.4 


Spinal cord 


0.2 


0.2 


Lung ca. (large 
cell)NCI-H460 


0.2 


0.0 


giio/asiro U0/-IVIO 




u.u 


Lung ca. (non- 
sm. cell) A549 


U.o 


u. / 


glio/astro U-118- 

IVlvJ 


0.0 


0.0 


Lung ca. (non- 
s.cell) NCI- 
H23 


0.0 


0.0 


astrocytoma 
SW1783 


0.0 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


1.6 


2.2 


neuro*; met SK-N- 
AS 


0.0 


0.4 


Lung ca. (non- 
s.cl) NCI- 
H522 


0.5 


0.0 


a^trnrvtnma 'sp. 

CI .3 11 W^jr" IW1I1C1 OA 

539 


0.0 


0.0 


Lung ca. 
(squam.) SW 
900 


0.0 


0.0 


doll \J\^y LLHIld OIND- 

75 


0.0 


0.0 


Lung ca. 
(squam.) NCI- 
H596 


0.0 


0.0 


glioma SNB-19 


0.0 


0.4 


Mammary 
gland 


0.3 


0.7 


glioma U251 


0.0 


0.4 


Breast ca.* 
(pl.ef) MCF-7 


0.2 


0.0 


glioma SF-295 


0.0 


0.0 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


* 0.0 


0.0 


Heart (fetal) 


0.0 


0.0 


Breast ca.* 


0.0 


0.0 
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(pl.ef) T47D 






Heart 


0.0 


0.0 


Breast ca. BT- 
549 


0.2 


0.0 


Skeletal muscle 
(fetal) 


3.9 


0.3 


Breast ca. 
MDA-N 


0.0 


0.0 


Skeletal muscle 


0.0 


0.0 


Ovary 


0.5 


0.0 


Bone marrow 


6.9 


2.5 


Ovarian ca. 
OVCAR-3 


0.0 


0.0 


Thymus 


1.9 


0.9 


Ovarian ca. 
OVCAR-4 


0.0 


0.4 


Spleen 


4.2 


1.0 


Ovarian ca. 
OVCAR-5 


0.3 


0.0 


Lymph node 


3.0 


5.4 


Ovarian ca. 
OVCAR-8 


0.0 


0.0 


Colorectal 


0.5 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach 


0.9 


0.0 


Ovarian ca.* 
(ascites) SK- 
OV-3 


0.0 


0.0 


Small intestine 


1.5 


0.5 


Uterus 


0.0 


0.5 


Colon ca. SW480 


0.8 


0.0 


Placenta 


0.0 


0.0 


Colon ca.* 

SW620(SW480 

met) 


0.0 


0.0 


Prostate 


0.0 


0.0 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met)PC- 
3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Testis 


1.6 


0.8 


Colon ca. CaCo-2 


0.0 


0.0 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03866) 


0.3 


0.3 


Melanoma* 
(met) 

Hs688(B).T 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.3 


0.0 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 
met) NCI-N87 


0.2 


0.7 


Melanoma 
M14 


0.0 


0.0 


Bladder 


0.2 


0.3 


Melanoma 
LOX IMVI 


0.0 


0.0 


Trachea 


0.6 


0.0 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Kidney 


100.0 


100.0 


Adipose 


. 0.1 


0.6 
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Table YC. Panel 4D 



Tissue Name 


ReL Exp.(%) 
Ag2966, Run 
160660646 


Tissue Name 


Rel. Exp.(%) 
Ag2966, Run 
160660646 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.3 


Secondary Th2 act 


0.0 


HUVEC 1FN gamma 


0.9 


Secondary Trl act 


0.7 


HUVEC TNF alpha + IFN 
gamma 


0.3 


Secondary Thl rest 


0.2 


HUVEC TNF alpha + IL4 


0.5 


Secondary Th2 rest 


1.6 


HUVEC IL-11 


0.8 


Secondary Trl rest 


1.5 


Lung Microvascular EC 
none 


2.0 


Primary Thl act 


0.3 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.8 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


1.9 


Primary Trl act 


0.3 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


1.7 


Primary Th 1 rest 


A O 


Bronchial epithelium 
TNFalpha + IL1 beta 


ft ft 


Primary inz rest 


Z.I 


Small airway epithelium 
none 


ft ft 


Primary Trl rest 


1 A 

j .4 


Small airway epithelium 
TNFalpha + IL-lbeta 


ft 1 


CD45RA CD4 
lymphocyte act 


ft £ 


i^oronery artery oivii^ rest 


ft ft 


CD45RO CD4 
lymphocyte act 


1 1 
1 . 1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


ft 1 


CD8 lymphocyte act 


0.3 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


1 'X 

1 .D 


Astrocytes TNFalpha + 
IL-lbeta 


U.J 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.3 


CD4 lymphocyte none 


1.7 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


0.8 


CCD1 106 (Keratinocytes) 
none 


0.1 


LAK cells rest 


2.1 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.4 


Liver cirrhosis 


0.4 


LAK cells IL-2+IL-12 


0.3 


Lupus kidney 


1.0 


LAK cells IL-2+IFN 
gamma 


0.6 


NCI-H292 none 


0.6 


LAK cells IL-2+ 1L-18 


0.6 


NCI-H292 IL-4 


0.0 


LAK cells 


0.0 


NCI-H292 IL-9 


0.0 
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PMA/ionomycin 








NK Cells IL-2 rest 


0.6 


NCI-H292 IL-13 


~ 0.0 


Two Way MLR 3 day 


1.1 


NC1-H292 I FN gamma 


0.0 


Two Way MLR 5 day 


1.0 


HPAEC none 


0.6 


Two Way MLR 7 day 


0.5 


ri i a i iNr aipna ^ ii_,-i 
beta 


0.0 


r jdivil^ rest 


u.o 


Lung iiDroDidSL none 


u.u 


PBMC PWM 


i.i 


Lung fibroblast TNF alpha; 

-4-11 1 K/=»tc» 

t Lj- i oeia 


0.0 


PBMC PHA-L 


0.2 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


1.7 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


1.1 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


1.4 


Lung fibroblast IFN 
gamma 


0.2 


B lymphocytes CD40L 
and IL-4 


4.7 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
ccuiu/u irsir aipna 


1.4 


hUL-1 dbcAMP 
PK4 A/ionomvcin 


0.1 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


0.5 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.5 


IBD Colitis 2 


0.2 


Monocytes rest 


1.2 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


4.9 


Macrophages rest 


0.2 


Lung 


3.1 


Macrophages LPS 


0.0 


Thymus 


100.0 


HUVEC none 


0.5 


Kidney 


4.6 


HUVEC starved 


0.3 







Panel 1.3D Summary: Ag2966 Two experiments both show that expression of the 
NOV24a gene, a sodium-glucose cotransporter homolog, is limited to the kidney (CTs=29). 
This restricted expression is in agreement with published data which has shown that secondary 
5 active transport of glucose in the kidney is mediated by sodium glucose cotransporter. (See 

ref. 1). Thus, expression of this gene could be used as a marker for kidney tissue. Furthermore, 
the protein product may be important for normal function of the kidney. Thus, therapeutic 
modulation of the expression or function of this protein may be useful in treating diseases that 
affect the kidney, including diabetes. 
10 References: 
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Bissonnette P, Noel J, Coady MJ, Lapointe JY. Functional expression of tagged human 
Na+-glucose cotransporter in Xenopus laevis oocytes. J Physiol 1999 Oct 15;520 Pt 2:359-71 

1. High-affinity, secondary active transport of glucose in the intestine and kidney is 
mediated by an integral membrane protein named SGLT1 (sodium glucose cotransporter). 
5 Though basic properties of the transporter are now defined, many questions regarding the 
structure- function relationship of the protein, its biosynthesis and targeting remain 
unanswered. In order to better address these questions, we produced a functional hSGLTl 
protein (from human) containing a reporter tag. 2. Six constructs, made from three tags (myc, 
haemaglutinin and poly-His) inserted at both the C- and N-terminal positions, were thus tested 

10 using the Xenopus oocyte expression system via electrophysiology and 

immunohistochemistry. Of these, only the hSGLTl construct with the myc tag inserted at the 
N-terminal position proved to be of interest, all other constructs showing no or little transport 
activity. A systematic comparison of transport properties was therefore performed between the 
myc-tagged and the untagged hSGLTl proteins. 3. On the basis of both steady-state (affinities 

15 for substrate (glucose) and inhibitor (phlorizin) as well as expression levels) and presteady- 
state parameters (transient currents) we conclude that the two proteins are functionally 
indistinguishable, at least under these criteria. Immunological detection confirmed the 
appropriate targeting of the tagged protein to the plasma membrane of the oocyte with the 
epitope located at the extracellular side. 4. The myc-tagged hSGLTl was also successfully 

20 expressed in polarized MDCK cells. alpha-Methylglucose uptake studies on transfected cells 
showed an exclusively apical uptake pathway, thus indicating that the expressed protein was 
correctly targeted to the apical domain of the cell. 5. These comparative studies demonstrate 
that the myc epitope inserted at the N-terminus of hSGLTl produces a fully functional protein 
while other epitopes of similar size inserted at either end of the protein inactivated the final 

25 protein. 

PMID: 10523405 

Panel 2D Summary: Ag2966 Expression of the NOV24a gene is predominantly 
limited to the kidney. This result is in agreement with the expression seen in Panel 1 .3D. 

Panel 3D Summary: Ag2966 Results from one experiment with the NOV24a gene are 
30 not included. The amp plot indicates that there were experimental difficulties with this run. 

Panel 4D Summary: Ag2966 Expression of the NOV24a gene is predominantly found 
in normal tissue from thymus, lung, colon and kidney. This expression profile suggests that the 
protein product may be involved in glucose transport in these tissues. Therefore, therapeutic 
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modulation of the expression or function of this protein may be useful in treating diseases that 
affect these organs. 

NOV28 

Expression of gene NOV28 was assessed using the primer-probe set Ag2891, 
5 described in Table ZA. Results of the RTQ-PCR runs are shown in Tables ZB, ZC, ZD and 
ZE. 



Table ZA. Probe Name Ag2891 



Primers; 


Sequences 


Length 


j Start 
1 Position 


SEQ ID NO: 


Forward 


5 ' -ccggaaattaatctaccatcaa-3 1 


22 


j 252 


1087 


Probe 


TET-5* -ccccgggtaacaactgtttcagatct-3 1 - 
TAMRA 


26 




1088 


Reverse 


5 1 -gatggaaaagtccatgttggt-3 • 


21 


|325 


1089 



Table ZB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag2891, 
Run 224116294 


Tissue Name 


Rel. Exp.(%) Ag2891, 
Run 224116294 


AD 1 Hippo 


13.9 


Control (Path) 3 
Temporal Ctx 


43.2 


AD 2 Hippo 


27.0 


Control (Path) 4 
Temporal Ctx 


20.9 


AD 3 Hippo 


36.1 


AD 1 Occipital Ctx 


6.4 


AD 4 Hippo 


16.5 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


35.1 


AD 3 Occipital Ctx 


19.1 


AD 6 Hippo 


29.9 


AD 4 Occipital Ctx 


51.1 


Control 2 Hippo 


6.0 


AD 5 Occipital Ctx \ 


0.0 


Control 4 Hippo 


34.4 


AD 6 Occipital Ctx 


11.0 


Control (Path) 3 
Hippo 


17.4 


Control 1 Occipital 
Ctx 


20.6 


AD 1 Temporal Ctx 


76.3 


Control 2 Occipital 
Ctx 


18.2 


AD 2 Temporal Ctx 


23.3 


Control 3 Occipital 
Ctx 


0.0 


AD 3 Temporal Ctx 


0.0 


Control 4 Occipital 
Ctx 


17.3 


AD 4 Temporal Ctx 


34.6 


Control (Path) 1 
Occipital Ctx 


65.1 


AD 5 Inf Temporal 
Ctx 


24.5 


Control (Path) 2 
Occipital Ctx 


13.5 


AD 5 SupTemporal 


58.2 


Control (Path) 3 


0.0 
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Ctx 




Occipital Ctx 




AD 6 Inf Temporal 
Ctx 


45.1 


Control (Path) 4 
Occipital Ctx 


17.6 


AD 6 Sup Temporal 
Ctx 


95.9 


Control 1 Parietal 
Ctx 


0.0 


Control 1 Temporal 
Ctx 


5.2 


Control 2 Parietal 
Ctx 


19.2 


Control 2 Temporal 
Ctx 


0.0 


Control 3 Parietal 
Ctx 


37.9 


Control 3 Temporal 
Ctx 


12.4 


Control (Path) 1 
Parietal Ctx 


100.0 


Control 4 Temporal 
Ctx 


21.6 


Control (Path) 2 
Parietal Ctx 


1 25.9 


Control (Path) 1 
Temporal Ctx 


65.1 


Control (Path) 3 
Parietal Ctx 


0.0 


Control (Path) 2 
Temporal Ctx 


55.9 


Control (Path) 4 
Parietal Ctx 


11.2 



Table ZC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Ag2891, Run 
160898914 


Rel. Exp.(%) 
Ag2891, Run 
165701349 ! 


Tissue Name 


Rel. Exp.(%)| 
Ag2891, Run 
160898914 


Rel. Exp.(%) 
Ag2891, Run 
165701349 


Liver 

adenocarcinoma 


0.0 


0.0 


Kidney (fetal) 


0.0 


0.0 


Pancreas 


0.0 


0.0 


Renal ca. 786- 
0 


72.7 


17.7 


Pancreatic ca. 
CAPAN 2 


0.0 


0.0 


Renal ca. 
A498 


0.0 


0.0 


Adrenal gland 


0.0 


0.0 


Renal ca. RXF 
393 


98.6 


100.0 


Thyroid 


0.0 


0.0 


Renal ca. 
ACHN 


35.6 


21.3 


Salivary gland 


0.0 


0.0 


Renal ca. UO- 
31 


43.5 


17.9 


Pituitary gland 


0.0 


0.0 


Renal ca. TK- 
10 


0.0 


0.0 


Brain (fetal) 


0.0 


0.0 


Liver 


0.0 


0.0 


Brain (whole) 


0.0 


0.0 


Liver (fetal) 


0.0 


0.0 


Brain (amygdala) 


18.4 


0.0 


Liver ca. 

(hepatoblast) 

HepG2 


0.0 


0.0 


Brain (cerebellum) 


1.9 


10.2 


Lung 


0.0 


0.0 


Brain 

(hippocampus) 


19.6 


12.2 


Lung (fetal) 


0.0 


0.0 


Brain (substantia 


0.0 


0.0 


Lung ca. 


0.0 


0.0 
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nigra) 






(small cell) 
LX-1 






Brain (thalamus) 


0.0 


0.0 


Lung ca. 
(small cell) 
NCI-H69 


0.0 


0.0 


Cerebral Cortex 


100.0 


11.8 


Lung ca. 
(s.cell var.) 
SHP-77 


0.0 


12.4 


Spinal cord 


14.9 


13.5 


Lung ca. (large 
cell)NCI-H460 


0.0 


0.0 


gho/astro Uo/-MCji 


U.U ! 


U.U 


Lung ca. (non- 
sm. cell) A549 


U.U 


^26 1 

ZO. 1 


glio/astroU-118- \ 

MO 


0.0 


0.0 


Lung ca. (non- 
s.cell) NCI- 
H23 


16.3 


0.0 


astrocytoma 
SW1783 


0.0 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


0.0 


0.0 


neuro*; met SK-N- 


0.0 


0.0 


Lung ca. (non- 
s.cl) NCI- 
H522 


19.8 


0.0 

■ — ■ 


astrocytoma SF- 
539 


0.0 


10.5 


Lung ca. 
(squam.) SW 
900 


— s 

0.0 


0.0 


astrocytoma SNB- 
75 


0.0 - 


0.0 


Lung ca. 
(squam.) NCI- 
H596 


0.0 


0.0 


glioma SNB-19 


0.0 


0.0 


Mammary 
gland 


0.0 


37.4 


glioma U251 


14.4 


0.0 


Breast ca.* 
(pl.ef) MCF-7 


0.0 


0.0 


glioma SF-295 


0.0 


0.0 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


0.0 


0.0 


Heart (fetal) 


0.0 


0.0 


Breast ca.* 
(pl.ef) T47D 


14.6 


0.0 


Heart 


0.0 


0.0 


Breast ca. BT- 
549 


0.0 


0.0 


Skeletal muscle 
^ieiai ) 


15.8 


0.0 


Breast ca. 


0.0 


0.0 


Skeletal muscle 


0.0 


0.0 


Ovary 


58.6 


0.0 


Bone marrow 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


4.6 


Thymus 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


0.0 


Spleen 


0.0 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


0.0 


Lymph node 


0.0 


0.0 


Ovarian ca. 


0.0 


0.0 
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OVCAR-8 






Colorectal 


26.2 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach 


0.0 


0.0 


Ovarian ca.* 
(ascites) SK- 
OV-3 


0.0 


0.0 


Small intestine 


0.0 


30.4 


Uterus 


0.0 


0.0 


Colon ca. SW480 


6.7 


23.3 


Placenta 


0.0 


0.0 


Colon ca.* 

SW620(SW480 

met) 


0.0 


0.0 


Prostate 


0.0 


0.0 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met)PC- 
3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Testis 


92.0 


71.2 


Colon ca. CaCo-2 


0.0 


0.0 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


0.0 


Melanoma* 
(met) 

Hs688(B).T 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.0 


0.0 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 
met) NC1-N87 


0.0 


0.0 


Melanoma 
M14 


3.3 


0.0 


Bladder 


0.0 


0.0 


Melanoma 
LOX IMVI 


0.0 


0.0 


Trachea 


0.0 


0.0 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Kidney 


0.0 


0.0 


Adipose 


0.0 


0.0 



Table ZD. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2891,Run 
160899401 


Tissue Name 


Rel. Exp.(%) 
Ag2891, Run 
160899401 


Normal Colon 


0.0 


Kidney Margin 
8120608 


18.2 


CC Well to Mod Diff 
(OD03866) 


15.6 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


0.0 


Kidney Margin 
8120614 


37.1 


CC Gr.2 rectosigmoid 
(OD03868) 


0.0 


Kidney Cancer 
9010320 


100.0 


CC Margin (OD03868) 


0.0 


Kidney Margin 


6.3 
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9010321 




CC Mod Diff (0003920)] 


0.0 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 06401 1 


0.0 


CC Gr.2 ascend colon 
(OD03921) 


0.0 


Normal Thyroid 


0.0 


CC Margin (OD03921) 


0.0 


Thyroid Cancer 
064010 


0.0 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


0.0 


Thyroid Cancer 
A302152 


20.6 


Liver Margin (ODO4309) 


0.0 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


0.0 


Lung Margin (OD0445 1 - \ 
02) 


0.0 


Breast Cancer 
(OD04566) 


0.0 


Normal Prostate 6546-1 


0.0 


Breast Cancer 
(OD04590-01) 


0.0 


Prostate Cancer 
(OD04410) 


0.0 


Breast Cancer Mets 
(OD04590-03) 


0.0 


Prostate Margin 
(OD04410) 


17.2 


Breast Cancer 

Metastasis 

(OD04655-05) 


0.0 


Prostate Cancer 
(OD04720-01) 


0.0 


Breast Cancer 064006 


0.0 


Prostate Margin 
(OD04720-02) 


u.u 


Breast Cancer 1024 


0.0 


Normal Lung 061010 


24.7 


Breast Cancer - 

y 1 UUzOD 


0.0 


Lung Met to Muscle 


15.7 


r>reast iviargin 
9100265 


0.0 


Muscle Margin 
(UUU4ZOO) 


0.0 


Breast Cancer 
A209073 


0.0 


Lung Malignant Cancer 
(D\Jv)3 iZo) 


0.0 


Breast Margin 
A2090734 


0.0 


T imp Margin fOD03126^ 


0.0 


Normal Liver 


0.0 


Lung Cancer (OD04404) 


0.0 


Liver Cancer 064003 


0.0 


Lung Margin (OD04404) 


34.4 


Liver Cancer 1025 


8.7 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


36.6 


Lung Margin (OD04565) 


8.4 


Liver Cancer 6004-T 


8.0 


Lung Cancer (OD04237- 
01) 


0.0 


Liver Tissue 6004-N 


0.0 


Lung Margin (OD04237- 
02) 


0.0 


Liver Cancer 6005-T 


8.2 


Ocular Mel Met to Liver 


17.0 


Liver Tissue 6005-N 


0.0 
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(ODO4310) 






Liver Margin (ODO4310) 


0.0 


Normal Bladder 


8.3 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


U.U 


Lung Margin (OD04321) 


45.7 


Bladder Cancer 
A302173 


10.6 


Normal Kidney 


41.5 


Bladder Cancer 
(OD047 18-01) 


5.2 


Kidney Ca, Nuclear grade 

Z [VJLJVHDJO) 


28.3 


Bladder Normal 
Adjacent (OD04718- 
03) 


0.0 


Kidney Margin 
(OD04338) 


47.3 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2(OD04339) 


62.9 


Ovarian Cancer 
064008 


9.1 


Kidney Margin 
(OD04339) 


40.9 


Ovarian Cancer 
(OD04768-07) 


0.0 


Kidney Ca, Clear cell 
type (OD04340) 


97.3 


Ovary Margin 
(OD04768-08) 


0.0 


Kidney Margin 
(OD04340) 


17 4 


Normal Stomach 


35.1 


Kidney Ca, Nuclear grade 
3 (OD04348) 


0 0 


Gastric Cancer 
9060358 


0.0 


Kidney Margin 
(OD04348) 


3 2 


Stomach Margin 
9060359 


0.0 


Kidney Cancer 
(OD04622-01) 


8.3 


Gastric Cancer 
9060395 


0.0 


Kidney Margin 
(OD04622-03) 


17.7 


Stomach Margin 
9060394 


23.5 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


8.1 


Kidney Margin 
(OD04450-03) 


4.3 


Stomach Margin 
9060396 


0.0 


Kidney Cancer 8120607 


10.6 


Gastric Cancer 
064005 


0.0 



Table ZE. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2891, Run 
159632970 


Tissue Name 


Rel. Exp.(%) 
Ag2891, Run 
159632970 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC 1FN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + 1FN 
gamma 


0.0 


Secondary Th 1 rest 


0.0 


HUVEC TNF alpha + 1L4 


0.0 
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Secondary Th2 rest 


0.0 


HUVECIL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Micro vascular EC 
none 


0.0 


Primary ThI act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


iVlicro vascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Tlronphifil pnithplinm 

I J 1 \Jl l\s 1 1 1 Cl 1 wl/Ivl 1 v 1 1 LI 1 11 

TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


lymphocyte rest 


0.0 


IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


3.5 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD 1 106 (Keratinocytes) 
none 


0.0 


T ART r* M q rp ct 


0 O 

yj . w 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0 0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


41.8 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0 0 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-4 


1.5 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


19.5 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 
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PBMC PHA-L 


0.0 Lung fibroblast IL-4 1 0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 j 0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL- 13 j 0.0 


R Ivmnhocvtes PWM 


u.u 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


u.u 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


7.3 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
rivi/v ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


2.2 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


4.5 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 JColon 


100.0 


Macrophages rest 


0.0 |Lung 


32.3 


Macrophages LPS 


| 1 .3 jThymus 


40.3 


HUVEC none 


J 0.0 jKidney 


0.0 


HUVEC starved 


\ o.o I j 



CNS_neurodegeneration_vl.O Summary: Ag2891 The NOV28 gene is expressed at 
very low levels in the CNS. No differential expression is detected in the postmortem brains of 
Alzheimer's patients when compared with non-demented controls. The widespread expression 
5 in the brain however suggests that this gene may be of utility in the treatment of neurological 
diseases. 

Panel 1.3D Summary: Ag2891 Two experiments with the same probe and primer set 
produce results that are in excellent agreement, with highest expression of the NOV28 gene in 
a lung cancer cell line and the brain (CTs=33-34). Significant expression is also seen in the 
10 testis and a cluster of lung cancer cell lines. Thus, expression of this gene could be used to 
differentiate these samples from other samples on this panel, and as a marker of testis tissue 
and lung cancer. 

Panel 2D Summary: Ag2891 Expression of the NOV28 gene is limited to samples 

derived from kidney cancer (CTs=33-34). Thus, expression of this gene could be used to 

15 differentiate between these samples and other samples on this panel and as a marker to detect 

the presence of kidney cancer. Furthermore, therapeutic modulation of the expression or 

function of this gene may be effective in the treatment of kidney cancer. 
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Panel 3D Summary: Ag2891 Expression of the NOV28 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2891 The NOV28 transcript is expressed at low but 
significant levels in the colon and thymus (CTs-33-35). Thus, the transcript or the protein it 
5 encodes could be used for detection of these tissues. The protein encoded by this transcript 
may also play an important role in the normal homeostasis of these tissues. Therefore, 
therapeutics designed with the protein encoded by this transcript could be important for 
modulating T cell development in the thymus or maintaining or restoring normal function to 
these organs during inflammation due to inflammatory bowel disease in the colon. 

10 NOV29a 

Expression of gene NOV29a was assessed using the primer-probe set Ag2892, 
described in Table AAA. Results of the RTQ-PCR runs are shown in Tables AAB, A AC, 
AAD and AAE. 

Table AAA. Probe Name Ag2892 



Primers 


Sequences 


Length i 


Start 
Position 


SEQ ID NO: 


Forward: 


5' -ggcagctcgaaactttgac-3 ' 


19 j 


171 


1090 


Probe 


TET-5 1 - cagaaat ccgaagacatgc tccgaag- 3 1 - 
TAMRA 


26 


193 


1091 


Reverse 


5' -gacaatgttgtccaggtcttgt-3 1 


22 J 


240 


1092 



15 

Table AAB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag2892, 
Run 165721697 


Tissue Name 


Rel. Exp.(%) Ag2892, 
Run 165721697 


Liver adenocarcinoma 


4.6 


Kidney (fetal) 


4.4 


Pancreas 


0.7 


Renal ca. 786-0 


3.6 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


0.0 


Adrenal gland 


4.2 


Renal ca. RXF 393 


7.0 


Thyroid 


0.0 


Renal ca. ACHN 


2.1 


Salivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


41.5 


Brain (whole) 


0.6 


Liver (fetal) 


76.8 


Brain (amygdala) 


4.0 


Liver ca. 

(hepatoblast) HepG2 


8.0 


Brain (cerebellum) 


2.0 


Lung 


26.8 


Brain (hippocampus) 


0.0 


Lung (fetal) 


11.5 
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Brain (substantia nigra) 


19 1 
1.9 


Lung ca. (small cell) 
LX-1 


100.0 


Brain (thalamus) 


41 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


A 9 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


z.o 


Lung ca. (large 
cell)NCI-H460 


19.1 


glio/astro Uo/-mcj 


u.u 


Lung ca. (non-sm. 
cell) A549 


16.6 


glio/astro U-l 18-MCj 


u.u 


Lung ca. (non-s.cell) 
NCI-H23 


28.3 


astrocytoma SW1783 


2.1 


Lung ca. (non-s.cell) 
HOP-62 


10.4 


neuro*; met SK-N-AS 


1.3 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


0.0 


a<:trorvtnmfi STsTR-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.0 


Mammary gland 


0.0 


glioma U251 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.7 


Heart (fetal) 


jBreast ca.* (pl.ef) 
73 |T47D 


0.0 


Heart 


0.0 jBreast ca. BT-549 


0.0 


Skeletal muscle (fetal) 


0.0 jBreast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


77.4 


Ovarian ca. OVCAR- 
3 


0.0 


Thymus 


0.0 


Ovarian ca. OVCAR- 
4 


2.2 


Spleen 


0.0 


Ovarian ca. OVCAR- 
5 


0.0 


Lymph node 


0.0 


Ovarian ca. OVCAR- 
8 


0.0 


Colorectal 


0.0 


Ovarian ca. 1GROV- 
1 


0.0 


Stomach 


5.6 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


0.0 


Uterus 


2.7 


Colon ca. SW480 


32.3 


Placenta 


0.0 


Colon ca.* 


31.6 


Prostate | 0.0 
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SW620(SW480 met) 




1 


Colon ca HT29 


1.6 F 
r 


Vostate ca.* (bone 
net)PC-3 


25.0 


Colon ca. HCT-116 


28.9 J\ 


"estis 


39.2 


Colon ca. CaCo-2 


0.0 J 


vlelanoma 
4s688(A).T 


u.u 


Colon ca. 
tissue(OD03866) 


; 


vlelanoma* (met) 
-ls688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 I 


Vlelanoma UACC-62 


14.2 


Gastric ca.* (liver met) 
NCI-N87 


3.4 1 


s4elanoma M14 


0.0 


Bladder 


o.o 1 


Vlelanoma LOX 
[MVI 


0.0 


Trachea 


jMelanoma* (met) 
U0 jSK-MEL-5 


4.4 


Kidney 


3.2 ) Adipose 


4.1 


Table AAC. Panel 2D 


Tissue Name 


Kei. i^xp.^ /o ) 
Ag2892, Run 
160942674 


Tissue Name 


Rel Exd (%) 
Ag2892, Run 
160942674 


Normal Colon 


1.8 


Kidney Margin 
8120608 


0.9 


CC Well to Mod Diff 
(OD03866) 


1.0 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


0.0 


Kidney Margin 
8120614 


0.7 


CC Gr.2 rectosigmoid 
(OD03868) 


0.0 


Kidney Cancer 
9010320 


2.0 


CC Margin (OD03868) 


0.0 


Kidney Margin 
9010321 


0.0 


CC Mod Diff (OD03 920) 


2.4 


Normal Uterus 


0.0 


CC Margin (ODO3920)_ 


0.0 


Uterus Cancer 064011 


1.1 


CC Gr.2 ascend colon 
(OD03921) 


1.0 


Normal Thyroid 


0.0 


CC Margin (OD03921) 


0.0 


Thyroid Cancer 
064010 


1.5 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


5.3 


Thyroid Cancer 
A302152 


1.6 


Liver Margin (ODO4309) 


100.0 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


5.6 


Normal Breast 


0.7 


Lung Margin (OD04451- 
02) 


4.2 


Breast Cancer 
(OD04566) 


0.0 
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Normal Prostate 6546-1 


0.0 ] 


3reast Cancer 
OD04590-01) 


0.0 


Prostate Cancer 
(OD04410) 


0.0 


Breast Cancer Mets 
PD04590-03) 


0.0 


Prostate Margin 

/ s~\ rA/"\ AAA f\\ 

(OD04410) i 


] 

0.7 1 


Breast Cancer 

Metastasis 

(OD04655-05) 


3.3 


Prostate Cancer 
(OD04720-01) 


u.u 


Rrea<;t Cancer 064006 

DIvujI VytlllUv] \j\j—t\j\j\j 


0.0 


Prostate Margin 
(OD04720-02) 


u. / 


Rreast Cancer 1024 


0.0 


Normal 1 unp 061 010 


6.7 


Breast Cancer 
9100266 


0.0 


Lung Met to Muscle 
(OD04286) 


3.4 


Breast Margin 
y i UUZO-) 


0.0 


Muscle Margin 
(OD04286) 


■ - ■ — 

0.0 


tsreast cancer 
A209073 


0.0 


Lung Malignant Cancer 
(OD03126) 


1.0 


Jtsreast iviargin 
A2090734 


0.0 


Lung Margin (OD03126) 


9.2 


iNormai i_>iver 


59 0 


Lung Cancer (OD04404) 


3.5 


Liver Cancer 064003 


19.2 


Lung Margin (OD04404) 


8.2 


Liver Cancer 1025 


60.7 


Lung Cancer (OD04565) 


1.4 


Liver Cancer 1026 


39.5 


Lung Margin (OD04565) 


12.3 


Liver Cancer 6004-T 


73.2 


Lung Cancer (OD04237- 
01) 


0.0 


Liver Tissue 6004-N 


9.8 


Lung Margin (OD04237- 
02) 


7.2 


Liver Cancer 6005-T 


39.5 


Ocular Mel Met to Liver 
(ODO4310) 


0.6 


Liver Tissue 6005-N 


7.9 


Liver Margin (ODO4310) 


23.0 


Normal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


1.0 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


18.8 


Bladder Cancer 
A302173 


0.0 


Normal Kidney 


2.0 


Bladder Cancer 
(OD047 18-01) 


6.1 


Kidney Ca, Nuclear grade 
2(OD04338) 


1.2 


Bladder Normal 
Adjacent (OD04718- 
03) 


0.0 


Kidney Margin 
(OD04338) 


1.0 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


0.0 


Ovarian Cancer 
064008 


1.8 


Kidney Margin 
(OD04339) 


0.0 


Ovarian Cancer 
(OD04768-07) 


0.0 
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Kidney Ca, Clear cell 
type (OD04340) 


n n 

u.u 


Ovary Margin 
(OD04768-08) 


0.6 


Kidney Margin 
(OD04340) 


.5. J 


Normal Stomach 


1.6 


Kidney Ca, Nuclear grade! 
3 (OD04348) 


u.u 


Gastric Cancer 
9060358 


0.6 


Kidney Margin 
(OD04348) 


u.u 


Stomach Margin 
9060359 


1.7 


Kidney Cancer 
(OD04622-01) 


0.0 


Gastric Cancer 
9060395 


0.0 


Kidney Margin 
(OD04622-03) 


0.8 


Stomach Margin 
9060394 


0.6 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


0.7 


Kidney Margin 
(OD04450-03) 


2.7 


Stomach Margin 
9060396 


0.9 


Kidney Cancer 8120607 


1.4 


Gastric Cancer 
064005 


0.0 



Table AAD. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Ag2892,Run 
164629840 


Tissue Name 


Rel. Exp.(%) 
Ag2892, Run 
164629840 


Daoy- Medulloblastoma 


5.0 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


0.0 


TE671- Medulloblastoma 


0.0 


ES-2- Ovarian clear cell 
carcinoma 


0.0 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498- CNS 


0.0 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.0 


SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


0.0 


Daudi- Burkitt's lymphoma 


0.0 


T98G- Glioblastoma 


0.0 


U266- B-cell plasmacytoma 


0.0 


SK-N-SH- 

Neuroblastoma 

(metastasis) 


0.0 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.0 


RL- non-Hodgkin's B-cell 
lymphoma 


0.0 


Cerebellum 


2.2 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


3.2 


Jurkat- T cell leukemia 


0.0 


NCI-H292- 


0.0 


TF-1- Erythroleukemia 


0.0 
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Mucoepidermoid lung 
carcinoma 








DMS-1 14- Small cell 
lung cancer 


ZZ.o J 


HI IT 78- T-rell Ivmnhoma 

I y_) 1 / o 1 ^v^ll i yuipiiv/iiiw 


0.0 


DMS-79- Small cell lung : 
cancer 


u.u 


TQ^7- Ffi^tincvtic Ivmnhoma 


0.0 


NCI-H146- Small cell 
lung cancer 


0.0 


fCU-812- Myelogenous 
eukemia 


14.5 


NCI-H526- Small cell 
lung cancer 


0.0 


769-P- Clear cell renal 
carcinoma 


0.0 


NCI-N417- Small cell 
lung cancer 


0.0 


Caki-2- Clear cell renal 
carcinoma 


2.0 


NCT-H82- Small cell 
lung cancer 


0.0 


SW 839- Clear cell renal 
carcinoma 


1 . / 


cell lung cancer 
(metastasis) 


20.4 


G401- Wilms' tumor 


0.5 


NCI-HI 155- Large cell 
lung cancer 


0.0 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


1.0 


NCI-H1299- Large cell 
lung cancer 


0.0 


CAPAN-1- Pancreatic 
adenocarcinoma (liver • 
metastasis) 


2.0 


NCI-H727- Lung 
carcinoid 


U.U 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


0.5 


NCI-UMC-ll-Lung 
carcinoid 


u.u 


BxPC-3- Pancreatic 
adenocarcinoma 


0.0 


LX-1- Small cell lung 
cancer 


1UU.U 


HPAC- Pancreatic 
adenocarcinoma 


0.0 


Co1o-2Ud- Colon cancer 


u.u 


MIA PaCa-2- Pancreatic 
carcinoma 


2.0 


KM 12- Colon cancer 


n n 
u.u 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


0.7 

! - 


KJvlzUl^i- colon cancer 


U.v> 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


6.5 


JNCl-ri / io- colon cancer 


0 A 

U.H- 


T24- Bladder carcinma 
(transitional cell) 


0.0 


SW-48- Colon 
adenocarcinoma 


6.6 


5637- Bladder carcinoma 


0.0 


SW111 6- Colon 
adenocarcinoma 


0.0 


HT-1 197- Bladder carcinoma 


0.0 


LS 174T- Colon 
adenocarcinoma 


0.0 


UM-UC-3- Bladder carcinma 
(transitional cell) 


4.4 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


! 0.0 


SW-480- Colon 
adenocarcinoma 


2.8 


HT-1 080- Fibrosarcoma 


1 0.0 



1007 




IN V-/ I o IN vJ ~> VJCloll 1V^ 

carcinoma 


63.7 


MG-63- Osteosarcoma 

— a — ^sbs^j^s^s a zaga^S; -™ - 


4.3 


IVrv 1 KJ 111 - vjcloli I U 

carcinoma 


23.7 


C l<f _I A/fQ_l_ T p i nm vn<;fi rrnm :i 

OJV JLIVIO I J-^tl^Jlilj'VJoClI V^VJl 1 1CI 

(vulva) 


0.0 


l N V^l~01 > U 1 \J VJcljLl 1L/ 

carcinoma 


1.3 


o j rvi i j) \j ivi icacvjl/i 1 1 y vjcii ^/Ui 1 1 ci 

(met to bone marrow) 


0.0 


l>V^l~kMN v_J 1 VJdbll IC 

carcinoma 


0.0 


A431- Epidermoid carcinoma 


6.9 


DC 1 /^nctri/- 

adenocarcinoma 


0.0 


WM266-4- Melanoma 


13.1 


rvr -ho- vjdsinc 
adenocarcinoma 


0.0 


LJ\J 1 H- J- .r rOblaLC CarClIIOIIia 

(brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


0.0 


fVfDA-MR-468- Rreast 
adenocarcinoma 


5.0 


NCLN87- Gastric" ~ 
carcinoma 


2.1 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 


SCC-15- Squamous cell 
carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


8.8 


CAL 27- Squamous cell 
carcinoma of tongue 


0.0 



Table AAE. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2892, Run 
164033139 


Tissue Name 


Rel. Exp.(%) 
Ag2892, Run 
164033139 


Secondary Th 1 act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Th 1 act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


21.5 
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Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


65.1 


CD45RA CD4 
lymphocyte act 


6.7 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta j 


0.0 


Secondary CD8 
lymphocyte act 


0.0 JkU-8 1 2 (Basophil) rest 


32.5 


CD4 Ivmohocvte none 


KU-812 (Basophil) 
(PMA/ionomycin 


31.6 


2ry Thl/Th2/Trl anti- 
CD95CH11 


0.0 


CCD1 106 (Keratinocytes) ] 
none 


0.0 


LAK cells rest 


0.0 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


3.8 


LAK cells IL-2 


0.0 


Liver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


6.8 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


11.0 


LAK cells IL-2+IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HP A EC none 


0.0 


Two Way MLR 7 day 


iHPAEC TNF alpha + IL- 1 
00 jbeta 


0.0 


PBMC rest 


0.0 |Lung fibroblast none 


0 0 


PBMC PWM 


Lung fibroblast TNF alpha 
00 |+IL-lbeta 


8.1 


PBMC PHA-L 


0.0 jLung fibroblast IL-4 


10.4 


Ramos (B cell) none 


0.0 JLung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 Lung fibroblast IL-13 


4.3 


B lymphocytes PWM 


„ _ iLung fibroblast IFN 
4.5 

j gam ma 


5.8 


B lymphocytes CD40L 
and IL-4 


jDermal fibroblast 
0 0 jCCD 1070 rest 


0.0 


EOL-1 dbcAMP 


JDermal fibroblast 
0 0 JCCD1070 TNF alpha 


10.4 


EOL-1 dbcAMP 
PMA/ionomycin 


jDermal fibroblast 
00 ]CCD1070IL-1 beta 


0.0 


Dendritic cells none 


0.0 jDermal fibroblast IFN 


13.0 
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] gam ma 




L/enunuc cens i > r o 


0.0 


Dermal fibroblast IL-4 


21.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


12.2 


Macrophages rest 


0.0 


Lung 


63.7 


Macrophages LPS 


0.0 


Thymus 


43.8 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







Panel 1.3D Summary: Ag2892 Highest expression of the NOV29a gene is seen in a 
lung cancer cell line (CT=32.7). Significant expression is also seen in a colon cancer cell line 
and the liver. Thus, expression of this gene could be used to differentiate between these 
samples and other samples on this panel and as a diagnostic marker for the presence of colon 



5 and lung cancer. Furthermore, therapeutic modulation of the expression or function of this 
gene may be effective in the treatment of colon and lung cancers. Results from a second run 
with the same probe and primer set are not included because of a potential problem in one of 
the sample wells. 

Panel 2D Summary: Ag2892 Expression of the NOV29a gene is restricted to liver 
10 derived tissue, with highest expression in normal liver tissue (CT=32.4). Significant 

expression is also seen in liver cancer samples. Thus, expression of this gene could be used to 
differentiate liver derived samples from other samples on this panel and from other tissue 
samples. 

Panel 3D Summary: Ag2892 Highest expression of the NOV29a gene is seen in a 
15 lung cancer cell line (CT-31.4). Significant expression is also seen in a gastric cancer cell line. 
Thus, expression of this gene could be used to differentiate between these samples and other 
samples on this panel and as a diagnostic marker for the presence of gastric and lung cancer. 
Furthermore, therapeutic modulation of the expression or function of this gene may be 
effective in the treatment of gastric and lung cancers. 
20 Panel 4D Summary: Ag2892 Expression of the NOV29a gene is restricted to liver 

cirrhosis (CT=34.8). This liver specific expression is in agreement with the expression in 
Panels 1 .3D and 2D. Thus, expression of this gene could be used to differentiate between this 
sample and other samples on this panel, and as a marker of liver tissue. 

NOV29c 
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Expression of gene NOV29c was assessed using the primer-probe set Ag2893 ? 
described in Table ABA. Results of the RTQ-PCR runs are shown in Tables ABB, ABC, 
ABD, ABE and ABF. 



Table ABA. Probe Name Ag2893 



Primers! 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward! 


5 ' -gcccaatcctgatgactacttc-3 1 


22 


39 


1093 


Probe 


TET-5 1 -ctccaagctcggagctttgacctg-3 ' - 
TAMRA 


24 


73 


1094 


Reverse; 


5 ' -ctcagcatgtcctctgatttct-3 1 


22 


98 


1095 



Table ABB. CNSneurodegenerationvl.O 



Tissue Name 


Rel. Exp.(%) Ag2893, 
Run 224116295 


Tissue Name 


Rel. Exp.(%) Ag2893, 
Run 224116295 


AD 1 Hippo 


18.6 


Control (Path) 3 
Temporal Ctx 


15.8 


AD 2 Hippo 


59.0 


Control (Path) 4 
Temporal Ctx 


43.5 


AD 3 Hippo 


25.3 


AD 1 Occipital Ctx 


41.8 


AU 4 Hippo 




AD 2 Occipital Ctx 
(Missing) 


ft n 


AD 5 Hippo 


95.9 


AD 3 Occipital Ctx 


27.5 


AD 6 Hippo 


86.5 


AD 4 Occipital Ctx { 


58.6 


Control 2 Hippo 


26.1 


AD 5 Occipital Ctx 


47.0 


Control 4 Hippo 


59.5 


AD 6 Occipital Ctx 


31.2 


Control (Path) 3 
Hippo 


17.3 


Control 1 Occipital 
Ctx 


20.9 


AD 1 Temporal Ctx 


48.6 


Control 2 Occipital 
Ctx 


63.7 


AD 2 Temporal Ctx 


61.1 


Control 3 Occipital 
Ctx 


62.0 


AD 3 Temporal Ctx 


25.2 


Control 4 Occipital 
Ctx 


28.1 


AD 4 Temporal Ctx 


70.7 


Control (Path) 1 
Occipital Ctx 


82.9 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


24.5 


AD 5 Sup Temporal 
Ctx 


55.9 


Control (Path) 3 
Occipital Ctx 


15.0 


AD 6 Inf Temporal 
Ctx 


86.5 


Control (Path) 4 
Occipital Ctx 


28.3 


AD 6 Sup Temporal 
Ctx 


92.7 


Control 1 Parietal 
Ctx 


20.4 
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Ctx 


24.7 


Pontrol 7 Parietal 

Ctx 


55.1 


Control 2 Temporal 
Ctx 


53.2 


Control 3 Parietal 
Ctx 


30.4 


Control 3 Temporal 
Ctx 


26.8 


Control (Path) 1 
Parietal Ctx 


74.2 


Control 3 Temporal 
Ctx 


40.6 


Control (Path) 2 
Parietal Ctx 


35.8 


Control (Path) 1 
Temporal Ctx 


63.7 


Control (Path) 3 
Parietal Ctx 


6.9 


Control (Path) 2 
Temporal Ctx 


44.1 


Control (Path) 4 
Parietal Ctx 


38.4 



Table ABC. Panel 1.3D 



Tissue Name 


Kel. iLxp.{vo)\ 
Ag2893, Run 
160944329 


Kel. E/Xp.( 7o) 
Ag2893, Run 
165701489 


Tissue Name 


i<ei. n.xp.( /o) 
Ag2893, Run 
160944329 


Kei. JL*Xp.^ /o ) 
Ag2893, Run 
165701489 


Liver 

adenocarcinoma 


0.0 


0.0 


Kidney (fetal) 


19.5 


4.3 


Pancreas 


2.5 


4.0 


Renal ca. 786- 
n 


73.7 


64.2 


Pancreatic ca. 
CAPAN 2 


0.0 


0.0 


Renal ca. 
A498 


0.5 


0.6 


Adrenal gland 


0.8 


1.2 


Renal ca. RXF 

393 


100.0 


100.0 


Thyroid 


0.5 


0.3 


Renal ca. 
ACHN 


28.1 


18.8 


Salivary gland 


0.0 


0.0 


Renal ca. UO 1 
31 


23.5 


24.3 


Pituitary gland 


2.0 ■ 


1.8 


Renal ca. TK- 
10 


21.2 


8.9 


Brain (fetal) 


0.9 


2.0 


Liver 


0.0 


1.3 


Brain (whole) 


3.6 


6.3 


Liver (fetal) 


1.2 


1.6 


Brain (amygdala) 


3.6 


6.1 


Liver ca. 

(hepatoblast) 

HepG2 


10.9 


8.1 


Brain (cerebellum) 


6.3 


6.2 


Lung 


8.9 


4.8 


Brain 

(hippocampus) 


9.3 


10.4 


Lung (fetal) 


6.6 


3.6 


Brain (substantia 
nigra) 


2.6 


5.4 


Lung ca. 
(small cell) 
LX-1 


0.0 


0.0 


Brain (thalamus) 


7.2 


13.8 


Lung ca. 
(small cell) 
NCI-H69 


0.0 


0.0 
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Cerebral Cortex 


25.5 


3.3 I 


Lung ca. 
(s.cell var.) 
SHP-77 


0.0 


0.0 


Spinal cord 


17.0 


10.7 


Lung ca. (large' 
=ell)NCI-H460 


0.0 


0.0 


glio/astro U87-MG 


0.0 


0.0 


Lung ca. (non- \ 
sm. cell) A549 


u.u 


u.u 


glio/astro U-l 18- 
MG 


0.0 


0.0 


Lung ca. (non- j 
s.cell) NCI- 
H23 


2.3 


1.1 


astrocytoma 
SW1783 


0.6 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


3.6 


2.5 


neuro*; met SK-N- 
AS 


0.0 


0.0 


Lung ca. (non- 
s.cl) NCI- 
H522 


0.0 


0.0 


astrocytoma SF- 
539 


0.0 


0.0 


Lung ca. 
(squam.) SW 
900 


0.0 


2.4 


astrocytoma SNB- 
75 


0.0 


0.0 


Lung ca. 
(squam.) NCI- 
H596 


0.8 


0.0 


glioma SNB- 19 


0.0 


0.0 


Mammary 
gland 


0.5 


0.0 


glioma U251 


0.0 


0.0 


Breast ca.* 
(pl.ef) MCF-7 


0.0 


0.0 


glioma SF-295 


0.0 


0.0 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


0.4 


1.7 


Heart (fetal) 


1.4 


0.0 


Breast ca.* 
(pl.ef) T47D 


4.8 


0.9 


Heart 


0.0 


0.0 


Breast ca. BT- 
549 


0.0 


0.0 


Skeletal muscle 
(fetal) 


3.3 


0.0 


Breast ca. 
MDA-N 


0.0 


0.0 


Skeletal muscle 


0.4 


0.0 


Ovary 


9.9 


2.4 


DUI1C IlldllUW 


0 0 


0.3 


Ovarian ca. 
OVCAR-3 


3.2 


2.2 


Thymus 


1.4 


0.2 


! Ovarian ca. 
OVC AR-4 


0.0 


0.0 


Spleen 


1.7 


0.6 


Ovarian ca. 
OVCAR-5 


0.0 


0.0 


Lymph node 


0.0 


1.1 


! Ovarian ca. 
OVCAR-8 


1.0 


1.2 


Colorectal 


0.0 


0.0 


! Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach 


0.0 


2.0 


Ovarian ca.* 
(ascites) SK- 


0.0 


0.0 
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OV-3 1 | 


Small intestine 


0.0 


2.3 


Uterus | 


1.9 J 


5.4 


Colon ca. SW480 


2.3 


0.5 


Placenta 


0.5 


0.9 


Colon ca.* 

SW620(SW480 

met) 


0.0 


0.6 


Prostate 


1.0 


1.0 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
f bone mef)PC- ' [ 
3 


0.0 


0.0 


Colon ca. HCT- j 
116 


0.0 


0.0 


Testis 


25.7 


9.3 


Colon ca. CaCo-2 


0.9 


0.0 


Hs688(A).T 


0.0 


0.0 


Colon'ca. 
tissue(OD03866) 


1.5 


0.0 


Melanoma* 
(met} 

Hs688(B).T 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.0 


0.0 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 
met) NCI-N87 


0.0 


0.6 


Melanoma 
M14 


0.0 


0.0 


Bladder 


11.3 


5.0 


Melanoma 
LOX IMVI 


0.0 


0.0 


Trachea 


2.5 


0.0 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Kidney 


— , „ — „ 
26.2 


5.3 


Adipose 


1.0 


0.0 



Table ABD. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2893, Run 
160966072 


Tissue Name 


Rel. Exp.(%) 
Ag2893, Run 
160966072 


Normal Colon 


2.5 


Kidney Margin 
8120608 


16.8 


CC Well to Mod Diff 
(OD03866) 


0.0 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


0.5 


Kidney Margin 
8120614 


18.0 


CC Gr.2 rectosigmoid 
(OD03868) 


0.0 


Kidney Cancer 
9010320 


74.7 


CC Margin (OD03868) 


0.0 


Kidney Margin 
9010321 


18.3 


CC Mod Diff (OD03 920) 


0.5 


Normal Uterus 


0.7 


CC Margin (ODO3920) 


0.4 


Uterus Cancer 06401 1 


10.8 


CC Gr.2 ascend colon 
(ODQ3921) 


0.5 


Normal Thyroid 


0.0 
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CC Margin (OD03921) 


0.0 ( 


rhyroid Cancer 
364010 


1.7 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


0.7 


rhyroid Cancer 
A302152 


3.4 


Liver Margin \\jL}\j^\D\)y ) 


0 Q 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


5.0 


Normal Breast 


2.0 


Lung Margin (OD04451- 
02) 


18.0 


Breast Cancer 
(OD04566) 


0.2 


Normal Prostate 6546-1 


0.7 


Breast Cancer 
(OD04590-01) 


0.2 


Prostate Cancer 
(OD04410) 


2.0 


Breast Cancer Mets 
(OD04590-03) 


0.8 


Prostate Margin 
(OD04410) 


8.7 


Breast Cancer 

Metastasis 

(OD04655-05) 


0.6 


Prostate Cancer 
(OD04720-01) 


D.o 


t5 r poc+ panrer 064006 


0.3 


Prostate Margin 
(OD04720-02) 






0.5 


Normal Lune 061010 


12.9 


Breast Cancer 
9100266 


0.5 


Lung Met to Muscle 
(OD04286) 


0.0 


Breast Margin 


1.1 


Muscle Margin 
(OD04286) 


0.3 


DlCdbL V^dlIL/CI 

A209073 


0.2 


Lung Malignant Cancer 
(OD03126) 


8.7 


A2090734 


0.4 


Lung Margin (OJJU3 Izo) 


zi .u 


?\J r^rm "A 1 1 i vpr 


0.0 


Lung Cancer (OD04404) 


8.0 


Liver Cancer 064003 


0.8 


Lung Margin (OD04404) 


28.7 


Liver Cancer 1025 




Lung Cancer (OD04565) 


0.7 


Liver Cancer 1026 


Q 1 
0.1 


Lung Margin (OD04565) 


33.0 


Liver Cancer 6004-T 


1.2 


Lung Cancer (OD04237- 
01) 


1.3 


Liver Tissue 6004-N 


0.0 


Lung Margin (OD04237- 
02) 


11.3 


Liver Cancer 6005-T 


5.4 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.3 


Liver Margin (ODO4310) 


0.4 


Normal Bladder 


17.3 


Melanoma Mets to Lung 
(OD04321) 


1.0 


Bladder Cancer 1023 


0.2 


Lung Margin (OD04321) 


32.1 


Bladder Cancer 
A302173 


' 1.3 



1015 




Normal Kidney 


42.0 


Bladder Cancer 
(OD047 18-01) 


7.9 


Kidney Ca, Nuclear grade 
2 (OD04338) 


55.5 


Bladder Normal 
Adjacent (OD047 18- 
03) 


0.4 


Kidney Margin 
(OD04338) 


40.9 


Normal Ovary 


Z.K) 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


61 . 1 


Ovarian Cancer 
064008 


1 A T 

1U. / 


Kidney Margin 
(OD04339) 


22.8 


Ovarian Cancer 
(OD04768-07) 


0.3 


Kidney Ca, Clear cell 
type (OD04340) 


100.0 


Ovary Margin 
(OD04768-08) 


A O 

4.2 


Kidney Margin 
(OD04340) 


31. 9 


Normal Stomach 


3.1 


Kidney Ca, Nuclear grade 
3 (OD04348) 


3.3 


Gastric Cancer 
9060358 


0.4 


Kidney Margin 
(OD04348) 


27.0 


Stomach Margin 
9060359 


1.1 


Kidney Cancer 
(OD04622-01) 


TA A 

3U.4 


Gastric Cancer 
9060395 


u.z 


Kidney Margin 
(OD04622-03) 


17.4 


Stomach Margin 
9060394 


3.0 


Kidney Cancer 
(OD04450-01) 


21.5 


Gastric Cancer 
9060397 


0.2 


Kidney Margin 
(OD04450-03) 


21.3 


Stomach Margin 
9060396 


2.6 


Kidney Cancer 8120607 


70.7 


Gastric Cancer 
064005 


0.8 



Table ABE. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Ag2893, Run 
165924139 


Tissue Name 


Rel. Exp.(%) 
Ag2893, Run 
165924139 


Daoy- Medulloblastoma 


2.0 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


0.0 


TE671- Medulloblastoma: 


0.0 


ES-2- Ovarian clear cell 
carcinoma 


0.0 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498- CNS 


0.0 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.0 
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• 








SNB-78- Glioma 


0.0 Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


0.9 J 


Daudi- Burkitt's lymphoma 


0.0 


T98G- Glioblastoma 


0.0 i 


U266- B-cell plasmacytoma ■ 


0.0 


SK-N-SH- 

O IN. 1> Oil 

Neuroblastoma 
(metastasis) 


0.0 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.0 


RL- non-Hodgkin's B-cell 
lymphoma 


ft ft 


Cerebellum 


6.4 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


9.9 


Jurkat- T cell leukemia 


0.0 


_ ~—~4 

NCUH292- 

Mucoepidermoid lung 

carcinoma 


2.0 


TF- 1 - Erythroleukemia 


0.0 


DMS-114- Small cell 
lung cancer 


0.0 


HUT 78- T-cell lymphoma 


0.0 


DMS-79- Small cell lung 
cancer 


1.4 


U937- Histiocytic lymphoma j 


, 0.0 


NCI-H146- Small cell 
lung cancer 


0.0 


J\U Oil. 1V1 V tlUg,^! IUUO 

leukemia 


0.0 


NCI-H526- Small cell 
lung cancer 


4.5 


/ O/ - ! - v^icai ecu iciidi 
carcinoma 


7.7 


NCI-N417- Small cell 
lung cancer 


0.0 


Caki-2- Clear cell renal 
uarc/iiiuiHct 


100.0 


NCI-H82- Small cell 
lung cancer 


0.0 


SW 839- Clear cell renal 

Udl V/lllUIll<a 


73.2 


NCT-H157- Sauamous 
cell lung cancer 
(metastasis) 


ft ft 


G401- Wilms* tumor 

VJTV/ 1 VV 111110 lUlllvl 


0.0 


NCI-H1155- Large cell 
lung cancer 


0.0 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


0.0 


NCI-H1299- Large cell 
lung cancer 


0.0 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


5.3 


NCI-H727- Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


1.3 


NCI-UMC-ll-Lung 
carcinoid 




BxPC-3- Pancreatic 
adenocarcinoma 


0.0 


LX-1- Small cell lung 
cancer 


0.0 


HPAC- Pancreatic 
adenocarcinoma 


0.0 


Colo-205- Colon cancer 


0.0 


MIA PaCa-2- Pancreatic 
carcinoma 


0.0 


KM 12- Colon cancer 


0.0 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


1.6 


JS.1V1ZULZ- v_-oion cancer 


j 0.0 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


0.0 


NCI-H716- Colon cancer 


] 0.0 


T24- Bladder carcinma 


j 0.0 
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(transitional cell) 




SW-48- Colon 
adenocarcinoma 


a a 
U.U 


jOj /- oiauaer carcinoma 


0 0 


SW1116- Colon 
adenocarcinoma 


a a 

U.U 


ii i-i iv/- oiaaaer carcinoma 


0 0 

\J.\J 


LS 174T- Colon 
adenocarcinoma 


a a 
U.U 


UM-UC-3- Bladder carcinma 
(transitional cell) 


0 0 


SW-948- Colon 
adenocarcinoma 


A A 

U.U 


/vzu^f- kji a Duomyo sarcoma 


ft 0 


SW-480- Colon 
adenocarcinoma 


a a 
U.U 


H 1 -lUoU- Fibrosarcoma 


U.U 


NCI-SNU-5- Gastric 
carcinoma 


y, / 


MG-63- Osteosarcoma 


U.U 


KATO III- Gastric 
carcinoma 


U.U 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


U.U 


NC1-SNU-16- Gastric 
carcinoma 


A A 

U.U 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


U.U 


NCI-SNU-1- Gastric 
carcinoma 


A A 
U.U 


A431- Epidermoid carcinoma 


U.U 


RF-1- Gastric 
adenocarcinoma 


0.0 


WM266-4- Melanoma 


A A 
U.U 


RF-48- Gastric 
adenocarcinoma 


U.U 


DU 145- Prostate carcinoma 
(brain metastasis) 


A A 

U.U 


MKN-45- Gastric 
carcinoma 


0 0 


MDA-MB-468- Breast 
adenocarcinoma 


0.0 


NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 


SCC-15- Squamous cell 
carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous cell 
carcinoma of tongue 


0.0 



Table ABF. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2893, Run 
159633002 


Tissue Name 


Rel. Exp.(%) 
Ag2893, Run 
159633002 


Secondary Th 1 act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC TFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Th 1 rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


1.2 


HUVEC IL-1 1 


0.0 
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Secondary Trl rest 


u.u 


Lung Microvascular EC 
none 


0 0 


Primary Th 1 act 


f\ n 
u.u 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0 0 


Primary l nz act 


u.u 


Microvascular Dermal EC 
none 


0 0 


Primary Trl act 


u.u 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


ft 0 


Primary Thl rest 


U.U 


Bronchial epithelium 
TNFalpha + lLlbeta 


0 0 


Primary Th2 rest 


0.7 


Small airway epithelium 
none 


1.7 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


3.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.3 


Coronery artery SMC 
TNFalphaH- IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


4.4 


Secondary CD8 
lymphocyte rest 


U.z 


Astrocytes TNFalpha + 
IL-lbeta 


->.o 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ryThl/Th2/Trl anti- 
CD95 CH11 


0.8 


CCD 11 06 (Keratinocytes) 
none 


0.0 


LAK cells rest 


1.0 


CCD1 106 (Keratinocytes) 
TJNraipna » li^-ioeid 


0.0 


LAK cells 1L-2 


U.U 


Liver cirrhosis 


1 ? 7 


LAK cells lL-z+lL-lz 


U.4 


Lupus kidney 


1 R ft 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


6.5 


1 A \S 1 TT Q i Tf 1 O 

LAK cells 1L-Z+ IL-lo 


U.U 


XTfT T-TOQ9 TT A 




LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


6.8 




0 0 

V/.V 


NCI-H292 IL-13 


3.6 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


2.9 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


0.0 


PBMC rest 


0.3 


Lung fibroblast none 


0.0 


PBMC PWM 


0.4 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 
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Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


U.U 


idling TioroDiasi il-i j 


0 0 


B lymphocytes PWM 


0.8 


Lung fibroblast IFN 
gamma 




B lymphocytes CD40L 
and IL-4 


0.6 


Dermal fibroblast 
CCD 1070 rest 


0.7 


FOT -1 dhcAMP 


0.0 


Dermal fibroblast 
CCD1070TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD^ 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


1.2 


Monocytes LPS 


0.0 


Colon 


0.8 


Macrophages rest 0.0 


|Lung | 34.2 


Macrophages LPS 0.0 


[Thymus j 100.0 


HUVECnone 0.0 jKidney 


1 . 00 


HUVEC starved 0.0 f 





CNS_neurodegeneration_vl.O Summary: Ag2893 This panel does not show 
differential expression of the NOV29c gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for a discussion 
5 of utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag2893 Two experiments with the same probe and primer set 
produce results that are in excellent agreement, with highest expression of the NOV29c gene 
in a renal cancer cell line (CTs=28-30). Significant expression is also seen in a cluster of renal 
cancer cell lines. Thus, expression of this gene could be used to differentiate between this 
10 sample and other samples on this panel and as a marker to detect the presence of renal cancer. 
Furthermore, therapeutic modulation of the expression or function of this gene may be 
effective in the treatment of renal cancer. 

This gene also is expressed at low, but significant levels in the brain. Expression of the 
NOV29C gene in the cerebral cortex suggests a role in CNS-specific processes. Homology to 
15 the tocopherol-associated protein (TAP) transcription factor suggests a role for NOV29C in 
tocopherol mediated gene transcription. Tocopherol is an essential vitamin involved in many 
CNS processes that may be mediated by both its antioxidant properties and ability to regulate 
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gene transcription via NOV29c. Genetic disruption of tocopherol processing results in 
tocopherol deficiency and CNS disorders such as ataxia and neurodegeneration. Agents that 
modulate NOV29c may thus have utility in the treatment of ataxia and neurodegenerative 
diseases. 
5 References: 

Yamauchi J, Iwamoto T, Kida S, Masushige S, Yamada K, Esashi T. Tocopherol- 
associated protein is a ligand-dependent transcriptional activator. Biochem Biophys Res 
Commun 2001 Jul 13;285(2):295-9 

Vitamin E is a term that encompasses a group of potent, lipid-soluble, chain-breaking 

10 antioxidants. Structural analysis reveals that molecules having vitamin E activity include four 
isomers (alpha, beta, gamma, and delta) of both tocopherols and tocotrienols. Alpha- 
tocopherol has been shown to have the highest biological vitamin E activity in mammalian 
tissues based on fetal resorption assays, and it reverses vitamin E deficiency symptoms. 
Although the molecular functions fulfilled specifically by alpha-tocopherol have yet to be 

1 5 fully described, it is unlikely that they are limited to general antioxidant functions. Here we 
show the functional characterization of alpha-tocopherol associated protein, TAP, which 
displays significant sequence similarity to the alpha-tocopherol transfer protein. Ligand 
competition analysis showed that recombinant TAP binds to alpha-tocopherol but not to other 
isomers of tocopherols. Using GFP fusion protein expression system, we observed that TAP 

20 translocates from cytosol to nuclei in alpha-tocopherol-dependent fashion. Transient 

transfection experiment showed that TAP activates transcription of the reporter gene in alpha- 
tocopherol-dependent manner. These results suggest that the biological function of alpha- 
tocopherol is not only as an antioxidant but also as a transcriptional regulator of gene 
expression via association with a transcription factor TAP. 

25 Yokota T, Igarashi K, Uchihara T, Jishage K, Tomita H, Inaba A, Li Y, Arita M, 

Suzuki H, Mizusawa H, Arai H. Delayed-onset ataxia in mice lacking alpha -tocopherol 
transfer protein: model for neuronal degeneration caused by chronic oxidative stress. Proc Natl 
Acad Sci U S A 2001 Dec 18;98(26):1 5 185-90 

alpha-Tocopherol transfer protein (alpha-TTP) maintains the concentration of serum 

30 alpha-tocopherol (vitamin E), one of the most potent fat-soluble antioxidants, by facilitating 
alpha-tocopherol export from the liver. Mutations of the alpha-TTP gene are linked to ataxia 
with isolated vitamin E deficiency (AVED). We produced a model mouse of AVED by 
deleting the alpha-TTP gene, which showed ataxia and retinal degeneration after 1 year of age. 
Because the brain alpha-TTP functions in maintaining alpha-tocopherol levels in the brain, 
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alpha-tocopherol was completely depleted in the alpha-TTP(-/-) mouse brain, and the 
neurological phenotype of alpha-TTP(-/-) mice is much more severe than that of wild-type 
mice when maintained on an alpha-tocopherol-deficient diet. Lipid peroxidation in alpha- 
TTP(-/-) mice brains showed a significant increase, especially in degenerating neurons, alpha- 
5 Tocopherol supplementation suppressed lipid peroxidation and almost completely prevented 
the development of neurological symptoms. This therapy almost completely corrects the 
abnormalities in a mouse model of human neurodegenerative disease. Moreover, alpha-TTP(- 
/-) mice may prove to be excellent animal models of delayed onset, slowly progressive 
neuronal degeneration caused by chronic oxidative stress. 

10 Panel 2D Summary: Ag2893 Highest expression of the NOV29c gene is seen in a 

sample derived from a kidney cancer cell line (CT=29.5). In addition, this sample is more 
highly expressed in kidney cancer than in adjacent normal tissue. Thus, expression of this gene 
could be used to differentiate between this sample and other samples on this panel and as a 
marker to detect the presence of kidney cancer. Furthermore, therapeutic modulation of the 

1 5 expression or function of this gene may be effective in the treatment of kidney cancer. 

Panel 3D Summary: Ag2893 Expression of the NOV29c gene is detected primarily in 
samples derived from kidney cancer cell lines (CTs=30). Thus, expression of this gene could 
be used to differentiate between these samples and other samples on this panel and as a marker 
to detect the presence of kidney cancer. Furthermore, therapeutic modulation of the expression 

20 or function of this gene may be effective in the treatment of kidney cancer. 

Panel 4D Summary: Ag2893 The NOV29c transcript is expressed at low but 
significant levels in the lung and thymus and in lupus kidney and cirrhotic liver. Thus, the 
transcript or the protein it encodes could be used for detection of these tissues. The expression 
of this gene suggests that the protein encoded by this transcript may play an important role in 

25 the normal homeostasis of the thymus and lung tissues. Therefore, therapeutics designed with 
the protein encoded by this transcript could be important for modulating T cell development in 
the thymus and for maintaining or restoring normal function to these lung during inflammation 
due to diseases such as asthma and emphysema. Additionally, induction of this transcript in 
other tissues such as the kidney and liver may be detrimental and antagonistic therapies 

30 designed with the protein encoded for by this transcript could be important in the treatment of 
diseases of these tissues. 

NOV24b 
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Expression of gene NOV24b was assessed using the primer-probe set Agl688, 
described in Table ACA. Results of the RTQ-PCR runs are shown in Tables ACB, ACC and 
ACD. 



Table ACA. Probe Name Agl688 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' - tcagaagggaatcatgatatcg-3 1 


22 ; 


577 


1096 


Probe 


TET- 5 1 - ccttgataaaactccaggctcctttga- 
3 ' -TAMRA 


27 


550 


1097 


Reverse 


5 1 - tttggaaggtaggcatattgg-3 ' 


21 


509 „ > 


1098 



Table ACB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Agl688, 
Run 147249266 


Tissue Name 


Rel. Exp.(%) Agl688, 
Run 147249266 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


9.2 


Pancreas 


6.7 


Renal ca 786-0 


0.0 


Pancreatic ca. CAP AN 

2 


0.2 


Renal ca. A498 


1.7 


Adrenal gland 


1.8 


Renal ca. RXF 393 


0.0 


Thyroid 


1 Q 

3.o 


Kenai ca. ali iin 


U.U 


^la 1 ivarv crlanrl 
ociiivcii^y ^itii i\j 


1 < 

l . -J 




0 0 


Pituitary gland 


6.1 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.5 


Liver 


100.0 


Brain (whole) 


3.6 


Liver (fetal) 


99.3 


Brain (amygdala) 


3.3 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.4 


Lung 


1.3 


Brain (hippocampus) 


6.2 


Lung (fetal) 


1.8 


Brain (substantia nigra) 


1.0 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


2.1 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


6.3 


Lung ca. (s.cell var.) 
SHP-77 


0.8 


Spinal cord 


3.1 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.2 


gIio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.2 


Lung ca. (non-s.cl) 


0.0 
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NC1-H522 




astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


0.2 


astrocytoma SNB-75 


0.1 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.2 


Mammary gland 


2.9 


glioma U251 


1.2 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


Heart (fetal) 


0.2 


Breast ca.* (pl.ef) 
T47D 


0.0 


Heart 


1.6 


Breast ca. BT-549 


0.0 


Skeletal muscle (fetal) 


0.7 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


1.2 


Ovary 


0.0 


Bone marrow 


U.J 


Ovarian ca. OVCAR- 

3 


U.z 


Thymus 


3.2 


Ovarian ca. OVCAR- 
4 


U.U 


Spleen 


1.0 


Ovarian ca. OVCAR- 

5 


0.3 


Lymph node 


2.9 


Ovarian ca. OVCAR- 
8 


0.0 


Colorectal 


0.8 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


3.3 


Ovarian ca.* (ascites) 
SK-OV-3 


1.0 


Small intestine 


6.2 


Uterus 


1.4 


Colon ca. SW480 


0.0 


Placenta 


0.4 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


1.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


6.1 


Colon ca. CaCo-2 


0.2 


Melanoma 
Hs688(A).T 


0.4 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.9 


Colon ca. HCC-2998 


0.2 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


4.4 


Melanoma M14 


0.0 


Bladder 


3.1 


Melanoma LOX 
IMVI 


0.0 


Trachea 


3.0 


Melanoma* (met) 
SK-MEL-5 


0.0 
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Kidney 



6.8 



|Adip< 



ose 



0.5 



Table ACC. Panel 2D 



Tissue Name 


Dpi Fvn f°/.A 

Agl688, Run 
162646059 


Tissue Name 


Rel Fvn (%\ 

Agl688, Run 
162646059 


Normal Colon 


1.7 


Kidney Margin 
8120608 


0.7 


CC Well to Mod Diff 
(OD03866) 


0.0 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


0.2 


Kidney Margin 
8120614 


0.5 


CC Gr.2 rectosigmoid 
(OD03868) 


0.2 


Kidney Cancer 
9010320 


0.2 


CC Margin (OD03868) 


0.1 


Kidney Margin 
9010321 


1.0 


CC Mod Diff (ODO3920) 


0.1 


Normal Uterus 


0.2 


v^K^ iviargiii ^L/iJv^j vzu j 


o o 


ULerus cancer uohui i 


n r 


CC Gr.2 ascend colon 


0.1 


Normal Thyroid 


0.9 


CC Margin (OD03921) 


0.1 


Thyroid Cancer 


0.2 


CC from Partial 
Hepatectomy (ODO4309) 

IVlt-lO 


4.7 


Thyroid Cancer 
A302152 


0.5 


Liver Margin (ODO4309) 


100.0 


Thyroid Margin 
A3021 S3 


1.0 


Colon mets to lung 
(OD04451-0n 


0.1 


Normal Breast 


0.3 


Lung Margin (OD04451- 
02^ 


0.1 


Breast Cancer 
rOD04566^ 


0.1 


Normal Prostate 6546-1 


2.1 


Breast Cancer 
(OD04590-01) 


0.1 


Prostate Cancer 
(OD04410) 


0.6 


Breast Cancer Mets 
(OD04590-03) 


0.4 


Prostate Margin 
(OD04410) 


0.5 


Breast Cancer 

Metastasis 

(OD04655-05) 


0.9 


Prostate Cancer 
(OD04720-01) 


1.1 


Breast Cancer 064006 


0.6 


Prostate Margin 
(OD04720-02) 


1.6 


Breast Cancer 1 024 


1.2 


Normal Lung 061010 


2.0 


Breast Cancer 
9100266 


0.1 


Lung Met to Muscle 


0.0 


Breast Margin 


0.1 
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(OLHJ4z8o) 




y J UkjZod 




Muscle Margin 
(OD04286) 


— — . 

0.2 


Breast Cancer 
A209073 


0.3 


Lung Malignant Cancer 
(OD03126) 


0.1 


Breast Margin 
A2090734 


0.3 


Lung (Vlargin (<JDU312o) ; 


O.D 


Normal Liver 


AO 1 

oy. / 


Lung Cancer (OD04404) j 


0.1 


Liver Cancer 064003 


13.7 


Lung Margin (OD04404) 


0.2 


Liver Cancer 1025 


18.0 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


1.2 


Lung Margin (OD04565) 


0.1 


Liver Cancer 6004-T 


22.2 


Lung Cancer (OD04237- ! 
01) 


0.1 


Liver Tissue 6004-N 


1.0 


Lung Margin (OD04237- 
02) 


0.4 


Liver Cancer 6005-T 


1.9 


Ocular Mel Met to Liver 
(ODO4310) 


0.1 


Liver Tissue 6005-N 


4.2 


Liver Margin (ODO4310) 


77.4 


Normal Bladder 


2.7 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


0.1 


Bladder Cancer 
A302173 


0.2 


Normal Kidney 


12.9 


Bladder Cancer 
(OD047 18-01) 


0.1 


Kidney Ca, Nuclear grade 


3.8 


Bladder Normal 
Adjacent (OD04718- 
03) 


0.5 


Kidney Margin 
(OD04338) 


1 ^ 

1 .o 


Normal Ovary 


ft n 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


0 ft 
Z.o 


Ovarian Cancer 
064008 


ft i 


Kidney Margin 
(OD04339) 




Ovarian Cancer 
(OD04768-07) 


ft 9 


Kidney Ca, Clear cell 
type (OD04340) 


1 A 


Ovary Margin 
(OD04768-08) 


0 1 

V/. 1 


Kidney Margin 
(OD04340) 


A 1 


iNormai oiomacn 


ft ^ 


Kidney Ca, Nuclear grade 
3 (OD04348) 


0.1 


Gastric Cancer 
9060358 


0.1 


Kidney Margin 
(OD04348) 


3.8 


Stomach Margin 
9060359 


0.0 


Kidney Cancer 
(OD04622-01) 


0.2 


Gastric Cancer 
9060395 


0.2 


Kidney Margin 
(OD04622-03) 


0.7 


Stomach Margin 
9060394 


0.3 


Kidney Cancer 


0.2 


Gastric Cancer 


0.3 
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(OD04450-01) 




9060397 




Kidney Margin 
(OD04450-03) 


2.6 


Stomach Margin 
9060396 


0.0 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


1.1 



Table ACD. Panel 5 Islet 



Tissue Name 


Kel. n,xp.( /o) 
Agl688, Run 
226587524 


Tissue Name 


Ivcl. H#A|J.^ /O ) 

Agl 688, Run 
226587524 


97457_Patient- 


41.2 


94709_Donor 2 AM - A adipose 


0.0 


97476_Patient- 
07<;k" 'skeletal muscle 


9.9 


94710_Donor 2 AM - B_adipose 


0.0 


97477_Patient- 
07nt ntprii^ 


8.1 


9471 1 Donor 2 AM - C_adipose 


0.0 


97478_Patient- 
07n1 nlacenta 

\J / VJ 1 L/1CI \s V>1 1 LC4- 


0.0 


94712_Donor 2 AD - A_adipose 


11.4 


99 167_Bayer Patient 1 


84.7 


94713_Donor 2 AD - B_adipose 


0.0 


y /4oZ_ratient- 
08ut_uterus 


2.4 


94714_Donor 2 AD - C_adipose 


29.1 


y/^foJ x*aiieni- 
08pl_placenta 


0.0 


Q474? Donor ^ I F - 

AMesenchymal Stem Cells 


19.2 


y / Hoo_r aueni- 
09sk_skeletal muscle 


8.0 


0474^ Donor ^TI- 

B Mesenchymal Stem Cells 


0.0 


y/^ro/ raLlCnL- 

09ut_uterus 


9.6 


94730_Donor 3 AM - A adipose 


15.0 


97488 Patient- 
09pl_placenta 


0.0 


94731_Donor 3 AM - Badipose 


37.9 


97492_Patient- 
1 Oututerus 


0.0 


94732JDonor 3 AM - C adipose 


0.0 


97493_Patient- 
10pl_placenta 


0.0 


94733_Donor 3 AD - A_adipose 


39.2 


97495_Patient- 
1 1 go adipose 


0.0 


94734_Donor 3 AD - B adipose 


11.4 


97496_Patient- 

1 1 skskeletal muscle 


52.9 


94735_Donor 3 AD - C_adipose 


34.4 


97497_Patient- 
1 1 ututerus 


35.8 


771 38_Liver_HepG2untreated 


8.4 


97498_Patient- 
1 1 pl_placenta 


10.5 


73556_Heart_Cardiac stromal 
cells (primary) 


0.0 


97500 Patient- 
12go_adipose 


0.0 


81735_Small Intestine 


100.0 


97501_Patient- 
12sk_skeletal muscle 


35.4 


72409_Kidney_Proximal 
Convoluted Tubule 


9.9 
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97502 Patient- 
1 2ut_uterus 


20.7 


82685_Small intestine_Duodenum 


70.2 


97503_Patient- 
1 2pl_placenta 


0.0 


90650_Adrenal_Adrenocortical 
adenoma 


25.5 


94721_Donor2 U - 
A Mesenchymal Stem 
Cells 


0.0 


72410_Kidney_HRCE 


10.4 


94722_Donor 2 U - 
B Mesenchymal Stem 
Cells 


0.0 


72411 Kidney HRE 


7.2 


94723_Donor 2 U - 
C Mesenchymal Stem 
Cells 


0.0 


73139_Uterus_Uterine smooth 
muscle cells 


0.0 



Panel 1.3D Summary: Agl688 Expression of the NOV24b gene, a plasma kallikrein 
homolog, is significantly higher in liver (CTs=28) than in any other sample on this panel. 
Thus, expression of this gene could be used as a marker of liver tissue. Plasma kallikrein is a 
5 serine protease that, among other roles, plays a part in blood coagulation, fibrinolysis, and 
complement activation and has been implicated in adipose differentiation by remodelling of 
the fibronectin-rich ECM of preadipocytes. Therefore, an antagonist to this gene product may 
be beneficial in the treatment of obesity. 

References: 

10 Hoover-Plow J, Yuen L. Plasminogen binding is increased with adipocyte 

differentiation. Biochem.Biophys.Res.Commun. (2001) 284, 389-394 

The purpose of this study was to examine the role of the plasminogen system in the 
development of adipose tissue. Plasminogen binding capacity was determined in differentiated 
and undifferentiated cells from adipose tissue of plasminogen deficient mice and 3T3 cells, a 

15 well-characterized tissue culture model. In 3T3 cells, plasminogen binding was fivefold higher 
in differentiated cells compared to the undifferentiated cells. Inhibition of binding by 
carboxyl-terminal lysine analogs was similar for the differentiated and undifferentiated cells 
with tranexamic acid > EACA > lysine. The binding of plasminogen was concentration- 
dependent and approaches saturation in the both cell types. The number of plasminogen 

20 binding sites was tenfold higher in the differentiated compared to the undifferentiated cells. In 
isolated mature fat cells and stromal cell cultures from mouse adipose tissue, plasminogen 
binding was also higher in the differentiated mature fat cells and differentiated stromal cells 
compared to undifferentiated stromal cells. Plasminogen binding was elevated in the 
differentiated cells from the Pig-/- mice compared to cells from the WT mice. These results 
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suggest that the plasminogen system plays an important role in adipose tissue development. 
Copyright 2001 Academic Press. 
PMID: 11394891 

Selvarajan S, Lund LR, Takeuchi T, Craik CS, Werb Z.A plasma kallikrein-dependent 
5 plasminogen cascade required for adipocyte differentiation. Nature Cell Biol. (2001) 3, 267- 

275. 

Here we show that plasma kallikrein (PKal) mediates a plasminogen (Pig) cascade in 
adipocyte differentiation. Ecotin, an inhibitor of serine proteases, inhibits cell-shape change, 
adipocyte-specific gene expression, and lipid accumulation during adipogenesis in culture. 

10 Deficiency of Pig, but not of urokinase or tissue-type plasminogen activator, suppresses 

adipogenesis during differentiation of 3T3-L1 cells and mammary-gland involution. PKal, 
which is inhibited by ecotin, is required for adipose conversion, Pig activation and 3T3-L1 
differentiation. Human plasma lacking PKal does not support differentiation of 3T3-L1 cells. 
PKal is therefore a physiological regulator that acts in the Pig cascade during adipogenesis. 

1 5 We propose that the Pig cascade fosters adipocyte differentiation by degradation of the 
fibronectin-rich preadipocyte stromal matrix. 
PMID: 11231576 

Panel 2D Summary: Agl688 The expression of the NOV24b gene appears to be 
highest in a sample derived from a sample of normal liver tissue adjacent to a metastatic colon 

20 cancer CT=26.2). In addition, there is substantial expression in other samples of normal liver, 
and to a much lesser degree, malignant liver tissue. This liver specific expression is consistent 
with the expression seen in Panel 1.3D. Thus, the expression of this gene could be used to 
distinguish liver derived tissue from the other samples in the panel, and more specifically the 
expression of this gene could be used to distinguish normal liver from malignant liver tissue. 

25 Moreover, therapeutic modulation of this gene, through the use of small molecule drugs, 
protein therapeutics or antibodies might be of benefit in the treatment of liver cancer. 

Panel 5 Islet Summary: Agl688 Expression of the NOV24b gene is limited to 
pancreatic islets and small intestines. Please see Panel 1 .3 for discussion of utility of this gene 
in metabolic disease. 

30 NOV30 

Expression of gene NOV30 was assessed using the primer-probe set Ag2894, 
described in Table ADA. Results of the RTQ-PCR runs are shown in Tables ADB, ADC and 
ADD. 
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Table ADA. Probe Name Ag2894 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 * -cctgagtcaatccaagaaactg-3 1 


22 


94 


1099 


Probe 


TET-5 1 -aggtcatcaacccaggaccgcctag- 3 ' - 
TAMRA 


25 


116 


1100 


Reverse 


5 1 -tccagtagggatctggagaagt-3 ' 


22 


149 


1101 



Table ADB. Panel 1.3D 



Tissue Name 


ReL Exp.(%) Ag2894, 
Run 160968507 


Tissue Name 


Rel. Exp.(%) Ag2894, 
Run 160968507 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


0.0 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


0.0' 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


50.3 


Salivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


8.1 


Brain (whole) 


0.0 


Liver (fetal) 


6 4 


Brain (amygdala) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain fhinnoppmniiQ^ 


u.u 


Lung (fetal) 


o. / 


Brain (substantia nigra) ! 


0.0 


Lune ca (small celH 
LX-1 


0.0 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


9.7 


glio/astroU-118-MG 


0.0^ 


Lung ca. (non-s.cell) 
NCI-H23 


19.8 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


3.4 


Lung ca. (squam.) 
SW 900 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 


0.0 
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NC1-H596 




glioma oiNtJ-iy 


o.o 


Mammary gland 


0.0 


glioma U251 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


3.3 


Heart (fetal) 


3.5 


Breast ca.* (pl.ef) 
T47D 


0.0 


Heart 


0.0 


Breast ca. BT-549 


0.0 


Skeletal muscle (fetal) 


1.2 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


6.5 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 
3 


0 0 


Thymus 


0.0 


Ovarian ca. OVCAR- 
4 


0 0 

\J,\J 


Spleen 


0.0 


Ovarian ca. OVCAR- 
5 


0.0 


Lymph node 


0.0 


Ovarian ca. OVCAR- 
8 


0.0 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


0.0 


Uterus 


0.0 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
oWozu(oW4oU met) 


0.0 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


100.0 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866 > l 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


5.7 


Melanoma M 1 4 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


14.0 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.0 Adipose 


4.2 
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Table ADC. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2894, Run 
160966709 


Tissue Name 


Rel. Exp.(%) 
Ag2894, Run 
160966709 


i yyji iiicii v^vjiuii 




Kidney Margin 
8120608 


U.U 


CC Well to Mod Diff 
(OD03866) 


0.0 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


0.0 


Kidney Margin 
8120614 


0.0 


CC Gr.2 rectosigmoid 
(OD03868) 


0.0 


Kidney Cancer 
9010320 


0.0 


iVldXglll ^UL/VJ OOO j 


u.u 


Kidney Margin 
9010321 


A A 

U.U 


CC Mod Diff (ODO3920) 


0.0 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 064011 


0.0 


CC Gr.2 ascend colon 
(OD03921) 


6.4 


Normal Thyroid 


0.0 


CC Margin (OD03921) 


0.0 


Thyroid Cancer 
064010 


0.0 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


3.0 


Thyroid Cancer 
A302152 


0.0 


Liver Margin (ODO4309) 


40.3 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


0.0 


Lung Margin (OD04451- 
02) 


0.0 


Breast Cancer 
(OD04566) 


0.0 


Normal Prostate 6546-1 


0.0 


Breast Cancer 
(OD04590-01) 


0.0 


Prostate Cancer 
(OD04410) 


0.0 


Breast Cancer Mets 
(OD04590-03) 


0.0 


Prostate Margin 
(OD04410) 


0.0 


Breast Cancer 

Metastasis 

(OD04655-05) 


0.0 


Prostate Cancer 
(OD04720-01) 


0.0 


Breast Cancer 064006 


0.0 


Prostate Margin 
(OD04720-02) 


0.0 


Breast Cancer 1024 


0.0 


Normal Lung 061010 


9.7 


Breast Cancer 
9100266 


0.0 


Lung Met to Muscle 
(OD04286) 


3.9 


Breast Margin 
9100265 


0.0 


Muscle Margin 
(ODQ4286) 


0.0 


Breast Cancer 
A209073 


0.0 
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Lung Malignant Cancer 
(OD03126) 


4.4 


Breast Margin 
A2090734 


0.0 


Lung Margin (OD03126) 


25.0 


Normal Liver 


100.0 


Lung Cancer (OD04404) 


12.5 


Liver Cancer 064003 


5.3 


i_,ung iviargin ^kjljv^^\j^) 




Liver Cancer 1025 


2U.z 


L<ung cancer ^wj->ju^oo»> ) 


n n 
u.u 


Liver Cancer 1026 


12. / 


Lung Margin (OD04565) 


8.1 


Liver Cancer 6004-T 


13.0 


Lung Cancer (OD04237- 
01) 


2.0 


Liver Tissue 6004-N 


0.0 


Lung Margin (OD04237- 
02) 


0.0 


i ■ - 

Liver Cancer 6005-T 


0.0 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Margin (OD043 1 0) 


19.3 


Normal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


16.2 


Bladder Cancer 
A302173 


15.7 


Normal Kidney 


0.0 


Bladder Cancer 
(OD047 18-01) 


0.0 


Kidney Ca, Nuclear grade 
2 (OD04338) 


2.9 


Bladder Normal 
Adjacent (OD04718- 
03) 


0.0 


Kidney Margin 
(OD04338) 


0.0 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


3.7 


Ovarian Cancer 
064008 


0.0 


Kidney Margin 
(OD04339) 


0.0 


Ovarian Cancer 
(OD04768-07) 


0.0 


Kidney Ca, Clear cell 
type (OD04340) 


0.0 


Ovary Margin 
(OD04768-08) 


0.0 


Kidney Margin 
(OD04340) 


0.0 


Normal Stomach 


4.1 


Kidney Ca, Nuclear grade! 
3 (OD04348) 


0.0 


Gastric Cancer 
9060358 


0.0 


Kidney Margin 


0.0 


Stomach Margin 
9060359 


0.0 


Kidney Cancer 
(OD04622-01) 


0.0 


Gastric Cancer 
9060395 


0.0 


Kidney Margin 
(OD04622-03) 


0.0 


Stomach Margin 
9060394 


0.0 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


0.0 


Kidney Margin 
(OD04450-03) 


0.0 


Stomach Margin 
9060396 


0.0 
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Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


0.0 



Table ADD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2894, Run 
164033148 


Tissue Name 


Rel. Exp.(%) 
Ag2894, Run 
164033148 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC TFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primarv Th'? ar*t 

1 1 1111(11 1IIZ, dV^L 


\J.\J 


Microvascular Dermal EC 
none 


n n 
u.u 


Primarv Tr1 apt 

A 1 llllul V 111 Cl Vw L 


0 0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0 0 


Primarv Thl rest 

j. i 1 1 1 1 u i y till i V/O t 


0.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


0 0 


Primarv Th2 rest 

A 1 1111 Wl Y 11 1 <>-~ 1 vJV 


0.0 


Small airway epithelium 
none 


0 0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0 0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronerv arterv SMC re»«st 


0 0 


CD45RO CD4 
lympnocyie act 


17.3 


Coronery artery SMC 
iiNraipna + iL-ioeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl anti- 1 
CD95 CH11 


0.0 


CCD 1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 
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LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 

PM A/ion omyc in 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HJrAbL- 1 INr alptia + 

beta 


0.0 


rDJVlt rest 


A A 
0.0 


Lung fibroblast none 


A A 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ 1L-J beta 


0.0 


PBMC PHA-L 


13.8 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1U/U INr alpha 


0.0 

. — , 


EAJL- 1 GDC/VLVLr 


0.0 


Dermal fibroblast 


0.0 


PMA/ionomycin 


CCD 1070 IL-1 beta 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


7.6 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


23.0 


Macrophages rest 


0.0 


Lung 


13.8 


Macrophages LPS 


0.0 


Thymus 


12.8 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







Panel 1.3D Summary: Ag2894 Expression of the NOV30 gene is restricted to the 
testis and a renal cancer cell line(CTs=33-35). Thus, expression of this gene could be used to 
differentiate these samples from other samples on this panel and as a marker for testis tissue 
5 and renal cancer. 

Panel 2D Summary: Ag2894 Expression of the NOV30 gene is restricted to normal 

liver tissue (CTs=33-35). This gene enodes a ryudocan homolog. Ryudocan is a cell-surface 
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heparan sulfate proteoglycan, which is involved in regulation of blood coagulation, among 
other biological functions. Thus, based on its expression profile, expression of this gene could 
be used to identify liver tissue and to differentiate between normal and malignant liver. 
Furthermore, this gene product may be involved in normal homeostasis of the liver. Thus, 
5 therapeutic modulation of the expression or function of this gene product may be effective in 
the treatment of liver disease and liver cancer. 
References: 

Kojima T, Inazawa J, Takamatsu J, Rosenberg RD, Saito H.Human ryudocan core 
protein: molecular cloning and characterization of the cDNA, and chromosomal localization of 

10 the gene. Biochem Biophys Res Commun 1993 Feb 15;190(3):814-22 

We have isolated a series of overlapping cDNA clones encoding a 2,628 bp transcript, 
which potentially codes for a 198 amino acid protein with predicted molecular mass of 21,641 
daltons, for the human ryudocan core protein. The deduced core proteins of the human and the 
rat ryudocan have high structural conservation, particularly in the NH2 and COOH terminus 

15 regions of the putative mature core protein, including the combined 

transmembrane/cytoplasmic domains with conserved positions of all 4 tyrosine groups and 3 
conserved glycosaminoglycan chain attachment regions, which might serve important roles for 
biological function of ryudocan. A major 2.7 kb transcript was detected in all tissues tested, 
with relatively high levels of expression observed in mRNA from lung, liver, skeletal muscle 

20 and kidney. A minor 1 .9 kb transcript was also observed in some of tissues, which would be 
caused by alternative polyadenylation. Human ryudocan gene has been localized on the 
chromosome 20ql2 by fluorescence in situ hybridization. 
PMID: 7916598 

Kojima T. Molecular biology of ryudocan, an endothelial heparan sulfate proteoglycan. 

25 Semin Thromb Hemost 2000;26(l):67-73 

Ryudocan is a type I integral membrane heparan sulfate proteoglycan, which was 
originally cloned from rat microvascular endothelial cells. We have cloned the cDNA of rat 
ryudocan. The deduced amino acids of ryudocan has homologous transmembrane and 
intracellular domains with syndecan but very distinct extracellular regions. We also cloned the 

30 human ryudocan cDNA, of which the gene localizes on the chromosome 20ql2. To better 
understand the regulation of ryudocan expression, we have determined the structural 
organization of the human ryudocan gene. The human ryudocan gene extends approximately 
24 kb and is divided into five exons that appear conserved in syndecan family members. The 
5'-flanking sequences of the human ryudocan gene contain a variety of potential binding sites 
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for transcription factors and are capable of functioning as a promoter. We purified human 
ryudocan and evaluated its interactions with several extracellular ligands. It was found that 
basic fibroblast growth factor (bFGF), midkine, and tissue factor pathway inhibitor exhibited 
significant ryudocan bindings. Heparitinase, but not chondroitin ABC lyase treatment, 
5 destroyed those ryudocan bindings; thus, the heparan sulfate chains of ryudocan appear to be 
responsible for those bindings. Immunohistochemical analysis revealed that ryudocan is 
expressed in peripheral nerve tissues, fibrous connective tissues, and placental trophoblasts. 
These findings suggest that ryudocan may possess multiple biologic functions, such as bFGF 
modulation, neurite growth promotion, and anticoagulation, via heparan sulfate-binding 
10 effectors in the cellular microenvironment. 
PMID: 10805285 

Panel 4D Summary: Ag2894 Expression of the NOV30 gene is restricted to a sample 
derived from liver cirrhosis. This gene is also expressed in normal liver in panel 2D. This 
expression suggests that this protein product, a ryudocan homolog, is essential to liver 
15 function. Thus, expression of this gene could be used as a marker for liver derived tissue. 

Furthermore, therapeutic modulation of the expression or function of this gene product may be 
effective in the treatment of diseases that affect the liver, including liver cirrhosis. 

NOV31 

Expression of gene NOV3 1 was assessed using the primer-probe set Ag2922, 
20 described in Table AEA. Results of the RTQ-PCR runs are shown in Table AEB. 

Table AEA. Probe Name Ag2922 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -cttcaggctgtgggtcatc-3 ' 


19 


407 


1102 


Probe 


TET-5 1 -caagccctactgctcccagtccag-3 1 - 
TAMRA 


24 


447 


1103 


Reverse 


5 ' - cagcaacagggcttacaaca- 3 ' 


20 


472 


1104 



Table AEB. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag2922, Run 
171619741 


Tissue Name 


Rel. Exp.(%) 
Ag2922, Run 
171619741 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


1.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


1.3 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 
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Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.6 


Lung Microvascular EC 
none 


3.2 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


1.9 


Primarv Th2 act 


0.4 


Microvascular Dermal EC 
none 


0.0 


Primarv Trl act 

X 1 1111 14-1 j X 1 X d-w L 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primarv Th1 rest 

x iiin cxi y x 1 1 x l jl ^ 


1.1 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


1.3 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.7 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.6 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


1.8 


CD45RO CD4 
lymphocyte act 


1.2 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


1.6 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


2.1 


Astrocytes TNFalpha + 
IL-lbeta 


0.6 


Secondary CD8 
lymphocyte act 


1.2 


KU-812 (Basophil) rest 


3.4 


CD4 lymphocyte none 


0.4 


KU-8 12 (Basophil) 
PMA/i onomy c in 


1.8 


2ry Thl/Th2/Trl anti- 
CD95CHH 


2.3 


CCD! 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.9 


CCD1 106 (Keratinocytes) 
TNFalnha + IL-lbeta 


0.0 


LAK cells IL-2 

X— 4JT 11Y ww 1 1 0 X 1 j ^ 


0.8 


I ,iver cirrhosis 

J j 1 V wl vll 1 llVOl O 


0.7 


LAK cells IL-2+IL- 12 \ 


2.0 


NCI-H292 none 


2.2' 


LAK cells IL-z+lrlN 

gamma 


1.9 


NCI-H292 IL-4 


4.3 


LAK cells IL-2+IL- 18 \ 


0.0 


NCI-H292 IL-9 


1.0 


i_v/vrv cens 
PMA/ionomycin 


0.0 


NCI-H292 IL-13 


1.8 


NK Cells IL-2 rest 


2.0 


NCI-H292 IFN gamma 


1.3 


Two Way MLR 3 day 


3.4 


HPAEC none 


0.4 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


0.6 


Two Way MLR 7 day 


0.6 


Lung fibroblast none 


0.6 


PBMC rest 


0.5 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 
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roMl r WM 


U.U 


Lung TiDroDiasi il-4 


'•' _ 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL- 1 3 


3.5 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN 
gamma 


2.8 


B lymphocytes PWM 


0.0 


Dermal fibroblast 
CCJJIU/U rest 


2.2 


B lymphocytes CD40L 

_ „ J IT A 

and IL-4 


2.4 


Dermal fibroblast 

(~*f**T\ 1 f\Hf\ TXT 17 , , 1 ,^ K , , 

CCD1U/U liNr alpha 


7.3 


EOL-1 dbcAMP 


5.3 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


FOT -1 dhcAMP 
PMA/ionomycin 


1.2 


Dprmal fibroblast TFTvJ 

iVvl 111U1 XI UL V/L/lUiJl 11 1 ^ 

gamma 


0.0 


Dendritic cells none 


2.5 


Dermal fibroblast IL-4 


1.3 


Dendritic cells LPS 


0.9 


Dermal Fibroblasts rest 


0.5 


Dendritic cells anti- 
CD40 


2.3 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


0.0 


Neutrophils rest 


r 1.2 


Monocytes LPS 


0.0 


Colon 


3.1 


Macrophages rest 


1.8 


Lung 


6.7 


Macrophages LPS 


0.9 


Thymus 


32.5 


HUVEC none 


0.8 


Kidney 


100.0 


HUVEC starved 


0.5 


i 



CNS_neurodegeneration_vl.O Summary: Ag2922 Expression of the NOV31 gene is 
low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 1.3D Summary: Ag2922 Expression of the NOV31 gene is low/undetectable in 
5 all samples on this panel (CTs>35). (Data not shown.) 

Panel 2D Summary: Ag2922 Results from one experiment with the NOV31 gene are 
not included. The amp plot indicates that there were experimental difficulties with this run. 

Panel 4.1D Summary: Ag2922 Expression of the NOV31 gene is restricted to normal 
thymus and kidney (CTs=32-33). Thus, expression of this gene could be used as a marker for 
10 kidney and thymic tissue. 

Panel 4D Summary: Ag2922 Results from one experiment with the NOV31 gene are 
not included. The amp plot indicates that there were experimental difficulties with this run. 

NOV36a and NOV36b 
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Expression of gene NOV36a and variant NOV36b was assessed using the 
primer-probe sets Agl 136 and Ag2999, described in Tables AFA and AFB. Results of 
the RTQ-PCR runs are shown in Tables AFC, AFD, AFE and AFF. 



Table AFA. Probe Name Agl 136 



Primers 


Sequences 


Length : 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -tcatcaaagtgcaagacatcaa-3 1 


22 


455 


1105 


Probe 


TET-5 ' -ttttccccttgggccctaccatg-3 * - j 
TAMRA 


23 


492 


1106 


Reverse 


5 ' -atgtaccgacattggacatctc-3 1 


22 


526 


1107 



Table AFB. Probe Name Ag2999 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gccctacgatacttttgtgtgt-3 ' 


22 


1488 


1108 


Probe 


TET-5 * -ctcctggccagctgattcaggtcat-3 ' - 
TAMRA 


25 


1520 


1109 


Reverse: 


5 ' -gctactgttgccaacttcatct-3 ' 


22 


1560 


1110 



Table AFC. CNSneurodegenerationvl.O 



Tissue Name 


Rel. Exp.(%) Agl 136, 
Run 206992276 


Tissue Name 


Rel. Exp.(%) Agl 136, 
Run 206992276 


AD 1 Hippo 


14.4 


Control (Path) 3 
Temporal Ctx 


6.2 


AD 2 Hippo 


44.4 


Control (Path) 4 
Temporal Ctx 


25.7 


AD 3 Hippo 


7.6 


AD 1 Occipital Ctx 


12.2 


AD 4 Hippo 


6.8 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


91.4 


AD 3 Occipital Ctx 


9.0 


AD 6 Hippo 


48.3 


AD 4 Occipital Ctx 


17.7 


Control 2 Hippo 


35.6 


AD 5 Occipital Ctx 


12.5 


Control^ Hippo 


14.3 


AD 6 Occipital Ctx 


35.4 


Control (Path) 3 
Hippo 


6.4 


Control 1 Occipital 
Ctx 


13.6 


AD 1 Temporal Ctx 


16.7 


Control 2 Occipital 
Ctx 


61.1 


AD 2 Temporal Ctx 


31.6 


Control 3 Occipital 
Ctx 


25.5 


AD 3 Temporal Ctx 


15.5 


Control 4 Occipital 
Ctx 


8.4 


AD 4 Temporal Ctx 


21.6 


Control (Path) 1 
Occipital Ctx 


73.7 
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AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


9.2 


AD 5 SupTemporal 
Ctx 


41.8 


Control (Path) 3 
Occipital Ctx 


4.6 


AD 6 Inf Temporal 
Ctx 


57.0 


Control (Path) 4 
Occipital Ctx 


21.3 


AD 6 Sup Temporal 
Ctx 


37.4 


Control 1 Parietal 
Ctx 


10.6 


Control 1 Temporal 
Ctx 


16.5 


Control 2 Parietal 
Ctx 


46.0 


Control 2 Temporal 
Ctx 


45.7 


Control 3 Parietal 
Ctx 


18.7 


Control 3 Temporal 
Ctx 


20.4 


Control (Path) 1 
Parietal Ctx 


62.9 


Control 4 Temporal 
Ctx 


12.6 


Control (Path) 2 
Parietal Ctx 


17.9 


Control (Path) 1 
Temporal Ctx 


51.1 


Control (Path) 3 
Parietal Ctx 


10.1 


Control (Path) 2 
Temporal Ctx 


31.6 


Control (Path) 4 
Parietal Ctx 


39.5 



Table AFD. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Agll36, 
Run 165528214 


Tissue Name 


Rel. Exp.(%) Agll36, 
Run 165528214 


Liver adenocarcinoma 


36.6 


Kidney (fetal) 


19.1 


Pancreas 


4.9 


Renal ca. 786-0 


30.1 


Pancreatic ca. CAPAN 
2 


12.0 


Renal ca. A498 


25.9 


Adrenal gland 


4.7 


Renal ca. RXF 393 


18.6 


Thyroid 


4.5 


Renal ca. ACHN 


22.2 


Salivary gland 


0.6 


Renal ca. UO-31 


22.4 


Pituitary gland 


10.0 


Renal ca. TK-10 


31.4 


Brain (fetal) 


53.2 


Liver 


0.0 


Brain (whole) 


76.3 


Liver (fetal) 


0.9 


Brain (amygdala) 


43.5 


Liver ca. 

(hepatoblast) HepG2 


4.3 


Brain (cerebellum) 


48.3 


Lung 


9.7 


Brain (hippocampus) 


42.6 


Lung (fetal) 


9.7 


Brain (substantia nigra) 


13.9 


Lung ca. (small cell) 
LX-1 


6.7 


Brain (thalamus) 


37.6 


Lung ca. (small cell) 
NCI-H69 


23.3 


Cerebral Cortex 


28.1 


Lung ca. (s.cell var.) 
SHP-77 


49.7 
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Spinal cord 


7.6 


Lung ca. (large 
cell)NCI-H460 


13.9 


glio/astro U87-MG . 


12.7 


Lung ca. (non-sm. 
cell) A549 


8.4 


glio/astro U- 1 1 8-MG 


85.9 


Lung ca. (non-s.cell) 
NCI-H23 


19.2 


astrocytoma SW1783 


33.9 


Lung ca. (non-s.cell) 
HOP-62 


30.4 


neuro*; met SK-N-AS 


72.2 


Lung ca. (non-s.cl) 
NCI-H522 


32.5 


astrocytoma SF-539 


51.1 


Lung ca. (squam.) 
SW 900 


47.6 


astrocytoma SNB-75 


39.5 


Lung ca. (squam.) 

XI I TIC OA 


80.7 


glioma SNB-19 


17.5 


Mammary gland 


1 /.z 


glioma U25 1 


100.0 


Breast ca.* (pl.ef) 
MCF-7 


38.4 


glioma SF-295 


13.5 


Breast ca.* (pl.ef) 
MDA-MB-231 


43.5 


Heart (fetal) 


12.3 


Breast ca.* (pl.ef) 
T47D 


13.8 


Heart 


13.5 


Breast ca. B 1 o4y 


/£1 1 


Skeletal muscle (fetal) 


8.0 


Breast ca. MDA-N 


3.2 


Skeletal muscle 


0.0 


Ovary 


5.2 


Bone marrow 


1.6 


Ovarian ca. OVCAR- 

3 


19.2 


Thymus 


26.4 


Ovarian ca. OVCAR- 
4 


35.8 


Spleen 


15.1 


Ovarian ca. OVCAR- 

<: 
D 


46.0 


Lymph node 


23.3 


Ovarian ca. OVCAR- 

5? 
o 


54.7 


Colorectal 


6.3 


Ovarian ca. IGROV- 

i 
i 


10.6 


m _ 

Stomach 


19.8 


V_JVarid.Il Ca. ^aJ>ClLCbJ 

SK-OV-3 


24.8 




40.1 


T Itprii's 

w Ivl UO 


82.9 


Colon ca. SW480 


16.0 


Placenta 


18.0 


Colon ca.* 
SW620(SW480 met) 


4.7 


Prostate 


5.0 


Colon ca. HT29 


3.0 


Prostate ca.* (bone 
met)PC-3 


15.8 


Colon ca. HCT-116 


46.0 


Testis 


13.3 


Colon ca. CaCo-2 


10.1 


Melanoma 
Hs688(A).T 


3.6 
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Colon ca. 

UbSUC^WL/vJJ) OUU ) 


25.7 


Melanoma* (met) 


6.2 


Colon ca. HCC-2998 


52.1 


Melanoma UACC-62 


8.8 


Gastric ca.* (liver met) 
NCI-N87 


26.2 


Melanoma M14 


6.9 


Bladder 


9.0 


Melanoma LOX 
IMVI 


5.6 


Trachea 


20.6 


Melanoma* (met) 
SK-MEL-5 


0.8 


Kidney 


8.5 


Adipose 


9.9 



Table AFE. Panel 2D 



Tissue Name 


ReL Exp.(%) 
Agll36, Run 
162599391 


Tissue Name 


Rel. Exp.(%) 
Agll36, Run 
162599391 


Normal Colon 


36.1 


Kidney Margin 
8120608 


10.9 


CC Well to Mod Diff 
(OD03866) 


29.7 


Kidney Cancer 
8120613 


9.7 


CC Margin (OD03866) 


12.2 


Kidney Margin 
8120614 


1 1 Q 


CC Gr.2 rectosigmoid 
(OD03868) 


16.3 


Kidney Cancer 
9010320 


I D.J 


CC Margin (OD03868) 


7.0 


Kidney Margin 
9010321 


29.1 


CC Mod Diff (OD03 920) 


24.7 


Normal Uterus 


13.4 


CC Margin (ODO3920) 


10.2 


Uterus Cancer 0640 11 


42.9 


CC Gr.2 ascend colon 
(OD03921) 


31.9 


Normal Thyroid 


12.8 


CC Margin (OD03921) 


10.4 


Thyroid Cancer 
064010 


20.4 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


9.1 


Thyroid Cancer 
A302152 


27.0 


Liver Margin (ODO4309) 


2.3 


Thyroid Margin 
A302153 


23.2 


Colon mets to lung 
(OD04451-01) 


22.1 


Normal Breast 


31.9 


Lung Margin (OD04451- 
02) 


13.2 


Breast Cancer 
(OD04566) 


37.9 


Normal Prostate 6546-1 


85.9 


Breast Cancer 
(OD04590-01) 


31.6 


Prostate Cancer 
(OD04410) 


34.9 


Breast Cancer Mets 
(OD045 90-03) 


44.1 


Prostate Margin 


25.9 


Breast Cancer 


26.6 i 
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(OD04410) 




Metastasis 
(OD04655-05) 




Prostate Cancer 
(OD04720-01) 


24.1 


Breast Cancer 064006 


21.6 


Prostate Margin 
(OD04720-02) 


26.4 


Breast Cancer 1024 


21.6 


Normal Lung 061010 


40.3 


Breast Cancer 
y 1 UUZoo 


38.7 ! 


Lung Met to Muscle 
(0004286^ 


100.0 


Breast Margin 
9100265 


21.9 


\/fiiQr k lf > lV/forcxin 

IVlUoLlL IVlcTlt^llI 

(OD04286) 


8.2 


Breast Cancer 
A209073 


41.8 


T nno lVtalicrnant Cancpr 

1 / U 1 1 1 V 1 CM. I 1 fcil 1 CI 1 1 L V. CI 1 1 V V 1 

(OD03126) 


28.3 


Breast Margin 
A2090734 


14.4 


I una Margin fOD03126"l 


28.1 


Normal Liver 


2.2 


Lung Cancer (OD04404) j 


51.4 


Liver Cancer 064003 


4.2 


Lung Margin (OD04404) ; 


J. 6 


Liver Cancer 1 UZD 


^ i 
j. i 


Lung Cancer (OD04565) 


27.7 


Liver Cancer 1026 


Q 1 
0. 1 


Lung Margin (OD04565) 


6.2 


Liver Cancer 6004-T 


1.3 


Lung Cancer (OD04237- 
01) 


84.1 


Liver Tissue 6004-N 


3.6 


Lung Margin (OD04237- 
02) 


10.6 


Liver Cancer 6005-T 


20.6 


Ocular Mel Met to Liver 
(ODO4310) 


4.5 


Liver Tissue 6005-N 


1.2 


Liver Margin (ODO4310) 


3.5 


Normal Bladder 


30.1 


Melanoma Mets to Lung 
(OD04321) 


19.8 


Bladder Cancer 1023 


16.3 


Lung Margin (OD04321) 


21.9 


Bladder Cancer 
A302173 


22.2 


Normal Kidney 


29.7 


Bladder Cancer 
(OD047 18-01) 


37.1 


Kidney Ca, Nuclear grade 
2 (OD04338) 


30.4 


Bladder Normal 
Adjacent (OD04718- 
03) 


12.1 


Kidney Margin 
(OD04338) 


16.2 


Normal Ovary 


25.2 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


17.9 


Ovarian Cancer 
064008 


33.2 


Kidney Margin 
(OD04339) 


18.2 


Ovarian Cancer 
(OD04768-07) 


77.9 


Kidney Ca, Clear cell 
type (OD04340) 


21.6 


Ovary Margin 
(OD04768-08) 


11.9 


Kidney Margin 
(OD04340) 


18.8 


Normal Stomach 


16.5 
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Kidney Ca, Nuclear grade; 
3 (OD04348) 


10.4 


Gastric Cancer 
9060358 


8.4 


Kidney Margin 
(OD04348) 


22.1 


Stomach Margin 
9060359 


5.8 


Kidney Cancer 
(OD04622-01) 


8.4 


Gastric Cancer 
9060395 


25.3 


Kidney Margin 
(OD04622-03) 


5.6 


Stomach Margin 
9060394 


14.7 


Kidney Cancer 
(OD04450-01) ! 


42.0 


Gastric Cancer 
9060397 


42.3 


Kidney Margin 
(OD04450-03) 


13.6 


Stomach Margin 
9060396 


9.7 


Kidney Cancer 8120607 


20.2 


Gastric Cancer 
064005 


21.9 



Table AFF. Panel 4D 



Tissue Name 


Rel. 
Exp.(%) 
Agll36, 
Kun 
164037277 


Rel. 

Exp.(%) 
Ag2999, 
xviin 
165296355 


Tissue Name 


Rel. 

Exp.(%) 
Agll36, 

XVUIl 

164037277 


Rel. 

Exp.(%) 
Ag2999, 

Run 

IN. till 

165296355 


Secondary Thl act 


4.9 


3.8 


HUVEC IL-lbeta \ 


15.8 


9.5 


Secondary Th2 act 


10.7 


8.0 


HUVEC IFN 
gamma 


47.0 


61.1 


Secondary Trl act 


12.2 


10.2 


HUVEC TNF 
alpha + IFN 
gamma 


10.5 


8.9 


Secondary Thl rest 


4.9 


5.8 


HUVEC TNF 
alpha + IL4 


24.1 


15.1 


Secondary Th2 rest 


7.5 


3.0 


HUVEC IL-11 


37.9 


29.5 


Secondary Trl rest 


7.7 


4.2 


Lung 

Microvascular EC 
none 


85.9 


72.2 


Primary Th 1 act 


5.7 


7.3 


Lung 

Microvascular EC 
TNFalpha + IL- 
lbeta 


37.1 


33.9 


Primary Th2 act 


9.1 


6.7 


Microvascular 
Dermal EC none 


85.9 


24.7 


Primary Trl act 


12.2 


9.2 


Microsvasular 
Dermal EC 
TNFalpha + 1L- 
lbeta 


26.6 


14.8 


Primary Thl rest 


21.0 


20.7 


Bronchial 
epithelium 
TNFalpha + 


21.8 


15.8 
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ILlbeta 






Primary Th2 rest 


13.6 


8.5 


Small airway 
epithelium none 


3.1 


0.6 


Primary Trl rest 


10.0 


16.3 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


OCT 

25.3 


1 5.0 


CD45RA CD4 
lymphocyte act 


10.7 


8.2 


Coronery artery 
SMC rest 


ICO 

15.3 


5.7 


CD45RO CD4 
lymphocyte act 


5.2 


2.6 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


3.1 


4.6 


CD8 lymphocyte 
act 


4.5 


3.2 


Astrocytes rest 


33.2 


1 A O 

14.8 


Secondary CD8 
lymphocyte rest 


2.8 


3.0 


Astrocytes 
TNFalpha + IL- 
lbeta 


18.7 


15.4 


Secondary CD8 
lymphocyte act 


6.1 


2.2 


KU-812 
(Basophil) rest 


10.2 


11.0 


CD4 lymphocyte 
none 


2.4 


1.6 


KU-812 

(Basophil) 

PMA/ionomycin 


41.8 


22.2 


2ry 

Thl/Th2/Trl anti- 
CD95 CH11 


7.5 


10.0 


CCD1106 

(Keratinocytes) 

none 


48.3 


52.1 


T A r'P'llc rpct 




1 S 


CCD1106 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


47.3 


46.7 


LAK cells IL-2 


4.7 


3.0 


Liver cirrhosis 


5.7 


1.8 


LAK cells IL-2+IL- 
12 


2.5 


0.8 


Lupus kidney 


5.0 


2.8 


LAK cells IL- 
2+IFN gamma 


4.1 


2.3 


NCI-H292 none 


22.8 


11.4 


LAK cells IL-2+ 
IL-18 


3.1 


1.0 


NCI-H292 IL-4 


40.1 


26.1 


LAK cells 
PMA/ionomycin 


6.3 


4.3 


NCI-H292 IL-9 


32.1 


23.2 


NK Cells IL-2 rest 


11.2 


7.7 


NCI-H292 IL-13 


20.9 


24.0 


Two Way MLR 3 
day 


2.9 


3.0 


NCI-H292 IFN 
gamma 


20.4 


16.6 


Two Way MLR 5 
day 


2.7 


2.4 . 


HPAEC none 


67.8 


66.4 


Two Way MLR 7 
day 


2.9 


0.8 


HPAEC TNF 
alpha + IL-1 beta 


16.5 


17.1 


PBMC rest 


0.7 


2.8 


Lung fibroblast 
none 


19.8 


15.8 
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PBMC PWM 


12.7 


6.6 


Lung fibroblast 
TNF alpha + IL-1 
beta 


5.0 


6.7 


PBMC PHA-L 


6.7 


3.2 


Lung fibroblast 
IL-4 


42.9 


30.1 


Ramos (B cell) 
none 


0.8 


0.0 


Lung fibroblast 
11. -9 


49.3 


29.5 ' 


Ramos (B cell) 
ionomycin 


1.0 


2.2 


Lung fibroblast 
IL-13 


24.7 


21.0 


B lymphocytes 
PWM 


9.7 


A O 

4.3 


Lung fibroblast 
IFN gamma 


A C\ 1 

40.3 


33.9 


B lymphocytes 
CD40L and IL-4 1 


5.2 


5.0 


Dermal fibroblast 
CCD 1070 rest 


75.3 


52.5 


EOL-1 dbcAMP 


11.9 


3.7 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


55.5 


22.2 


CUL-l U 1V1 1 

PMA/ionomycin 


17.8 


7.7 


Dermal fibroblast 
CCD 1070 IL-1 
beta 


18.6 


18.9 


Dendritic cells 
none 


0.0 


0.0 


Dermal fibroblast 
iriN gdiniTid 


12.3 


16.0 


Dendritic cells LPS 


0.4 


0.0 


ucrmdi iiDruDiabi 
IL-4 


16.6 


13.2 


Dendritic cells anti- 
CD40 


0.0 


0.5 


IBD Colitis 2 


0.8 


2.2 


Monocytes rest 


0.0 


0.0 


IBD Crohn's 


1.2 


1.0 


Monocytes LPS 


0.6 


0.4 


Colon 


12.6 


6.9 


Macrophages rest 


0.8 


0.8 


Lung 


7.7 


4.0 


Macrophages LPS 


0.0 


0.4 


Thymus 


13.8 


9.5 


HUVEC none 


45.4 


54.0 


Kidney 


73.7 


39.0 


HUVEC starved 


100.0 


100.0 







CNS_neurodegeneration vl.O Summary: Agl 136 This panel does not show 
differential expression of the NOV36a gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system. 

Panel 1.3D Summary: Agl 136 Highest expression of the NOV36a gene, a cadherin 
1 1 homolog, is seen in a glioma cell line U251 (CT=30.1). There is also low but significant 
significant expression in clusters of ovarian, lung, breast, kidney and colon cancer cell lines. 
Thus, expression of this gene could potentially be used as a diagnostic marker for these 
10 cancers. Furthermore, therapeutic inhibition of this gene product may be useful in the 
treatment of these cancers. 
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This gene also exhibits brain-preferential expression, indicating a role in CNS-specific 
processes. Recent research has shown that genetic deletion of cadherin- 1 1 function acts to 
increase long term potentiation, a process thought to underlie learning and memory. Thus, 
drugs that target this gene prodcut may have utility as memory enhancing drugs. Such drugs 
5 would have utility in treatment of CNS disorders involving memory dysfunction, including 
Alzheimer's disease and normal aging. In behavioral tests, Cadherin 1 1 deletion mutant mice 
show reduced fear- or anxiety-related responses. Thus, inhibitory agents targeting this gene 
product may also have utility as sedatives or anxiolytic agents for the treatment of CNS 
disorders including anxiety. 

10 Among tissues with metabolic function, this gene has low levels of expression in 

pancreas, adrenal, thyroid, pituitary, adult and fetal heart, and adipose. Thus, this cadherin-like 
gene product may be important in the pathogenesis, diagnosis, and/or treatment of metabolic 
and endocrine disease, including obesity and Types 1 and 2 diabetes. Decreased levels of 
cadherin have been associated with decreased insulin secretion, suggesting that increasing 

15 cadherin levels may be a potent therapeutic for Type 2 diabetes. In addition, this gene is 
expressed at higher levels in fetal (CT=34) vs adult skeletal muscle (CT=40) and may be 
useful for differentiation between the two sources of tissue. Ag2999 Results from one 
experiment with the CG56003-01 gene are not included. The amp plot indicates that there 
were experimental difficulties with this run. 

20 References: 

Manabe T, Togashi H, Uchida N, Suzuki SC, Hayakawa Y, Yamamoto M, Yoda H, 
Miyakawa T, Takeichi M, Chisaka O. Loss of cadherin- 1 1 adhesion receptor enhances plastic 
changes in hippocampal synapses and modifies behavioral responses. Mol Cell Neurosci 2000 
Jun;15(6):534-46 

25 Cadherins organize symmetrical junctions between the pre- and postsynaptic 

membranes in central synapses. One of them, cadherin- 1 1 (cadi 1), is expressed in the limbic 
system of the brain, most strongly in the hippocampus. Immunohistochemical studies of the 
hippocampus showed that cadi 1 proteins were densely distributed in its synaptic neuropil 
zones; in cultured hippocampal neurons, their distribution often overlapped with that of 

30 synaptophysin, and also occasionally with that of GluRl at spines. To assess the role of cadi 1 
in synaptic formation and/or function, we analyzed brains of cadi 1 -deficient mice. In these 
mice, long-term potentiation (LTP) in the CA1 region of the hippocampus was, unexpectedly, 
enhanced; and the level of LTP saturation was increased. In behavioral tests, the mutant mice 
showed reduced fear- or anxiety-related responses. These results suggest that the cadi 1- 
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mediated junctions may modulate synaptic efficacy, confining its dynamic changes to a 
limited range, or these junctions are required for normal development of synaptic organization 
in the hippocampus. 

PMID: 10860580 

5 Yamagata K, Nammo T, Moriwaki M, Ihara A, Iizuka K, Yang Q, Satoh T, Li M, 

Uenaka R, Okita K, Iwahashi H, Zhu Q, Cao Y, Imagawa A, Tochino Y, Hanafusa T, 
Miyagawa Ji J, Matsuzawa Y. Overexpression of Dominant-Negative Mutant Hepatocyte 
Nuclear Factor- 1 alpha in Pancreatic beta-Cells Causes Abnormal Islet Architecture With 
Decreased Expression of E-Cadherin, Reduced beta-cell Proliferation, and Diabetes. Diabetes. 

10 2002 Jan;51(l):l 14-23. 

One subtype of maturity-onset diabetes of the young (MODY)-3 results from 
mutations in the gene encoding hepatocyte nuclear factor (HNF)-l alpha. We generated 
transgenic mice expressing a naturally occurring dominant-negative form of human HNF- 
lalpha (P291fsinsC) in pancreatic beta-cells. A progressive hyperglycemia with age was seen 

15 in these transgenic mice, and the mice developed diabetes with impaired glucose-stimulated 
insulin secretion. The pancreatic islets exhibited abnormal architecture with reduced 
expression of glucose transporter (GLUT2) and E-cadherin. Blockade of E-cadherin-mediated 
cell adhesion in pancreatic islets abolished the glucose-stimulated increases in intracellular 
Ca(2+) levels and insulin secretion, suggesting that loss of E-cadherin in beta-cells is 

20 associated with impaired insulin secretion. There was also a reduction in beta-cell number 

(50%), proliferation rate (15%), and pancreatic insulin content (45%) in 2-day-old transgenic 
mice and a further reduction in 4-week-old animals. Our findings suggest various roles for 
HNF-1 alpha in normal glucose metabolism, including the regulation of glucose transport, 
beta-cell growth, and beta-cell-to-beta-cell communication. 

25 PMID: 11756330 

Panel 2D Summary: Agl 136 The NOV36a gene is a good target for ovarian, gastric, 
breast, lung, colon, uterine and kidney cancers because it is expressed at a higher level in these 
cancers than the adjacent normal tissue. Therefore, expression of this gene could potentially be 
used as a diagnostic marker for these cancers and therapeutic inhibition may be useful in 

30 treatment of these cancers. 

Panel 4D Summary: Agl 136/Ag2999 Two experiments produce results that are in 
excellent agreement. The NOV36A transcript is expressed at low levels in hematopoietic cells 
and at higher levels in endothelial cells. IL-1 beta and TNFalpha treatment reduce transcript 
levels consistently in endothelium samples including HPAEC, HUVEC and lung 
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microvascular EC. Fibroblasts also express this transcript and dermal fibroblasts down 
regulate expression in response to IL-1 beta, gamma interferon and IL-4. This transcript 
encodes a putative cadherin 1 1 like molecule. Cadherins are adhesion molecules that regulate 
normal homeostasis. Loss of cadherin 1 1 expression can reduce the expression of factors such 
5 as VEGF-D in fibroblasts (see reference 1). Therapies designed with the protein encoded by 
this transcript could be important in the regulating endothelium function including leukocyte 
extravasation, a major component of inflammation during asthma, IBD, and psoriasis. 
Therapeutics designed with the protein encoded by this transcript could also be important in 
the treatment of osteoarthritis and osteoporosis since this protein may be important in 
10 maintaining bone density (see reference 2). 
References: 

Orlandini M, Oliviero S In fibroblasts Vegf-D expression is induced by cell-cell 
contact mediated by cadherin- 1 1. J Biol Chem 2001 Mar 2;276(9):6576-81 

Vascular endothelial growth factors (VEGFs) are a highly conserved family of growth 

15 factors all angiogenic in vivo with mitogenic and chemotactic activity on endothelial cells. 

VEGFs are expressed in fibroblasts either in hypoxia or in response to growth factors. Here we 
report that, differently from the other members of the family, Vegf-D is induced by cell-cell 
contact. By in situ hybridization we demonstrated that noninteracting fibroblasts express low 
levels of Vegf-D mRNA, whereas contacting cells express high levels of Vegf-D transcripts. 

20 By immunostaining we observed that the surface protein cadherin- 1 1 is localized at the 
opposite sites of interacting cell surfaces. Ca(2+) deprivation from the culture medium 
determined the loss of cadherin- 1 1 from the cell surfaces and down-regulation of Vegf-D 
mRNA. Moreover, a cadherin- 1 1 antisense RNA construct inhibited Vegf-D expression in 
confluent BALB/c fibroblasts, whereas in NIH 3T3 cells, which express low levels of 

25 cadherin- 1 1, Vegf-D induction could be obtained by overexpression of cadherin- 1 1. This 
suggests that cell interaction mediated by cadherin- 1 1 induces the expression of the 
angiogenic factor Vegf-D in fibroblasts. 
PMID: 11108717 

Kawaguchi J, Azuma Y, Hoshi K, Kii I, Takeshita S, Ohta T, Ozawa H, Takeichi M, 
30 Chisaka O, Kudo A. Targeted disruption of cadherin- 1 1 leads to a reduction in bone density in 
calvaria and long bone metaphyses. J Bone Miner Res 2001 Jul; 16(7): 1265-71 

The migration and adhesion of osteoblasts requires several classical cadherins. 
Cadherin- 1 1, one of the classical cadherins, was expressed in mouse osteoblasts in skull bone 
and femur, revealed by immunohistochemistry. To elucidate the function of cadherin- 1 1 in 
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osteohistogenesis, cadherin-1 1 null mutant mice were investigated. Although apparently 
normal at birth, Alizarin red staining of null mutant mice showed a reduced calcified area at 
the frontal suture that caused a round-shaped calvaria with increasing animal age to 3 months. 
Consequently, there was a reduction in bone density at the femoral metaphyses and the diploe 
5 of calvaria in null mutant mice. In the in vitro culture of newborn calvarial cells, the calcified 
area of mutant cells was smaller than those derived from wild-type littermates. These results 
show that absence of cadherin-1 1 leads to reduced bone density in some parts of skeletons 
including calvaria and long bone metaphyses, and thus suggest that cadherin-1 1 plays roles in 
the regulation of osteoblast differentiation and in the mineralization of the osteoid matrix. 
10 PMID: 11450702 

NOV37 

Expression of gene NOV37 was assessed using the primer-probe sets Ag047, Ag2679, 
Ag2728, Ag332, Ag47b, Ag712, Ag2732 and Ag2975, described in Tables AGA, AGB, AGC, 
AGD, AGE, AGF, AGG and AGH. Results of the RTQ-PCR runs are shown in Tables AGI, 
15 AGJ, AGK, AGL, AGM, AGN, AGO, AGP, AGQ and AGR. 



Table AGA. Probe Name Ag047 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -ccaatgacctggccacca-3 ' 


18 


1064 


1111 


Probe 


TET-5 1 -ccagagtccgttcagcttcaggacagc- 
3 1 - TAMRA 


27 


1084 


1112 


Reverse 


5 ' -gtggcacgttgctgtttagc-3 ' 


20 


1116 


1113 



Table AGB. Probe Name Ag2679 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 * -ttgacctcaggaacggtttac-3 * 


21 


1213 


1114 


Probe 


TET-5 1 -ctgctgcccaggaatactttctccag-3 1 - 
TAMRA 


26 


1249 


1115 


Reverse 


5 1 -agtatttggagggcttcttcag-3 ' 


22 


1288 


1116 



20 Table AGC. Probe Name Ag2728 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ggagcttggtctcatgaccta-3 ' 


21 


5178 


1117 


Probe 


TET-5 1 -actgggctcctggccaccaagag-3 ' - 
TAMRA 


23 


5209 


1118 


Reverse 


5 1 -agtcgtccatcctgtttcatc-3 ' 


21 


5233 


1119 
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Table AGD. Probe Name Ag332 



Primers 


Sequences 


Length j 


Start 
Position 


SEQ ID NO: 


Forward! 


5 ' -gctgccctgacttgtgcaa-3 1 


19 


2594 


1120 


Probe 


TET-5 ' -tctgacccagtgtgcatctcccgtt-3 ' - ; 
TAMRA 


25 j 


2617 


1121 


Reverse 


5 1 - ccggtctggcagacacact-3 ' jl9 


2651 


1122 


Table AGE. Probe Name Ag47b 


Primers 


Sequences 


Length ! 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gaacgccggagcatacaga-3 ' 


19 


1774 


1123 


Probe 


TET-5 1 -ccaggtactgcacaaacacggcttcat- 
3 ' -TAMRA 


27 


1805 


1124 


Reverse: 


5 ' -gatgccacaggcccaca-3 ' 




1833 


1125 


Table AGF. Probe Name Ag712 


Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -tgcaagggtacgagggatat-3 * 


20 


8369 


1126 


Probe 


TET-5 ' -cttcccgtggagcaatacccagag-3 * - 
TAMRA 


24 


8395 


1127 


Reverse 


5 ' - tggatgttgctgctactgtct-3 ' 


21 


8424 


1128 


Table AGG. Probe Name Ag2732 


Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ggagcttggtctcatgaccta-3 • 


21 


5178 


1129 


Probe 


TET-5 ' -actgggctcctggccaccaagag-3 1 - 
TAMRA 


23 


5209 


1130 


Reverse 


5 ■ -agtcgtccatcctgtttcatc-3 1 


21 


5233 


1131 


Table AGH. Probe Name Ag2975 


Primers 


| Sequences 


Length 


| Start 
Position 


SEQ ID NO: 


Forward 


| 5 1 -ggagcttggtctcatgaccta-3 1 


21 


[5178 


1132 


Probe 


Itet- 5 ' -actgggctcctggccaccaagag-3 1 - 
TAMRA 


23 


5209 


1133 


Reversejs * - agt cgtccatcctgt ttcatc - 3 ' 


21 


|5233 


1134 



10 



Table AGI. CNSneurodegenerationvl.O 





Rel. 


Rel. 


Rel. 




Rel. 


Rel. 


Rel. 


Tissue 


Exp.(%) 


Exp.(%) 


Exp.(%) 


Tissue 


Exp.(%) 


Exp.(%) 


Exp.(%) 


Name 


Ag047, 


Ag2679, 


Ag2728, 


Name 


Ag047, 


Ag2679, 


Ag2728, 




Run 


Run 


Run 




Run 


Run 


Run 
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206228021 


209751329 


206483376 




206228021 


209751329 


206483376 


AD 1 
Hippo 


10.3 


6.6 


8.4 


Control 
(Path) 3 
Temporal 
Ctx 


4.1 


3.2 


2.3 


AD 2 
Hippo 


18.0 


20.9 


17.3 


Control 
(Path) 4 
Temporal 
Ctx 


13.8 


33.9 


29.7 


AD 3 
Hippo 


6.6 


4.6 


6.4 


AD 1 

Occipital 

Ctx 


18.8 


16.8 


17.1 


AD 4 
Hippo 


5.6 


4.4 


7.7 


AD 2 

Occipital 

Ctx 

(Missing) 


0.0 


0.0 


0.0 


AD 5 
Hippo 


i no ii 


100.0 


95.9 


AD 3 

Occipital 

Ctx 


5.7 


3.4 


3.1 


AD 6 
Hippo 


^7 (\ 
j / .yj 


33.0 


31.4 


AD 4 

Occipital 

Ctx 


12.9 


15.1 


19.5 


Control 2 
Hippo 


99 1 
ZZ. 1 


17.3 


43.2 


AD 5 

Occipital 

Ctx 


25.2 


33.4 


51.4 


Control 4 
Hippo 


7 7 
/ . / 


6.4 


5.3 


AD 6 

Occipital 

Ctx 


3.2 


18.9 


19.8 


Control 
f Path i ^ 
Hippo 


1 5 


3.4 


3.7 


Control 1 
Occipital 
Ctx 


3.4 


3.5 


4.4 


AD 1 

Temporal 

Ctx 


8.1 


6.7 


8.6 


Control 2 
Occipital 
Ctx 


83.5 


82.9 


100.0 


AD 2 

Temporal 

Ctx 


21.2 


40.3 


21.2 


Control 3 
Occipital 
Ctx 


25.3 


20.2 


17.3 


f\VJ J 

Temporal 
Ctx 


3.3 


4.2 


4.2 


Control 4 
Occipital 
Ctx 


5.4 


4.2 


4.4 


AD 4 

Temporal 

Ctx 


18.7 


13.1 


20.0 


Control 
(Path) 1 
Occipital 
Ctx 


66.4 


74.7 


75.8 


AD 5 Inf 
Temporal 
Ctx 


27.0 


57.4 


42.9 


Control 
(Path) 2 
Occipital 
Ctx 


15.2 


11.3 


13.0 
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AD 5 Sup 
Temporal 
Ctx 


14.6 


27.0 


20.7 


Control 
(Path) 3 1 
Occipital 
Ctx 


3.0 


2.4 


1.5 


AD 6 Inf 
Temporal 
Ctx 


25.2 


16.8 


19.1 


Control 
(Path) 4 
Occipital 
Ctx 


38.7 


29.3 


19.6 


AD 6 Sup 
Temporal 
Ctx 


26.1 


18.9 


17.2 


Control 1 

Parietal 

Ctx 


4.4 


3.7 


4.1 


Control 1 
Temporal 
Ctx 


3.9 


3.3 


3.8 


Control 2 

Parietal 

Ctx 


31.4 


19.8 


18.2 


Control 2 
Temporal 
Ctx 


21.3 


27.4 


33.4 


Control 3 

Parietal 

Ctx 


7.9 


10.7 


9.6 


Control 3 
Temporal 
Ctx 


13.7 


10.0 


10.4 


Control 
(Path) 1 
Parietal 
Ctx 


46.3 


56.3 


58.6 


Control 3 
Temporal 
Ctx 


8.1 


7.9 


8.7 


Control 
(Path) 2 
Parietal 
Ctx 


13.2 


16.6 


18.4 


Control 
(Path) 1 
Temporal 
Ctx 


34.4 


42.3 


51.4 


Control 
(Path) 3 
Parietal 
Ctx 


2.5 


2.4 


1.2 


Control 
(Path) 2 
Temporal 
Ctx 


27.2 


29.9 


24.5 


Control 
(Path) 4 
Parietal 

Ctx 


35.1 


41.2 


35.6 



Table AGJ. Panel 1 



Tissue Name 


Rel. Exp.(%) ; 
Ag047, Run 
87354354 


Rel. Exp.(%) 
Ag047, Run 
87354779 


Rel. Exp.(%) 
Ag332, Run 
97803603 


Rel. Exp.(%) 
Ag332, Run 
98747043 


Rel. Exp.(%) 
Ag47b, Run 
88164379 


Endothelial cells 


0.0 


0.0 


0.0 


0.0 


0.0 


Endothelial cells 
(treated) 


0.0 


0.0 


0.0 


0.0 


0.0 


Pancreas 


0.0 


0.3 


0.0 


0.0 


0.0 


Pancreatic ca. 
CAPAN 2 


0.0 


0.0 


0.0 


0.0 


0.0 


Adrenal gland 


0.0 


1.3 


0.2 


0.0 


0.0 


Thyroid 


0.0 


0.4 


100.0 


0.1 


0.0 
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Salivary gland 


0.0 


0.2 


0.0 


0.0 


0.0 


Pituitary gland 


0.0 


0.1 


2.1 


0.0 


0.0 


t>rain (retal) 


A O 


1 Z>.\) 


^ A 


1 I .u 


ZI.O 


Brain (whole) 




O /.o 


■J ^ 

J.Z 


A ^ 


JZ.J 


Brain (amygdala) 


0.0 


8.8 


3.8 


8.7 


12.2 | 


Brain 

(cerebellum) 


0.0 


22.2 


1.4 


0.1 


14.3 


Brain 

(hippocampus) 


0.0 


24.7 


3.4 


8.7 


15.6 


Brain (substantia 
nigra) 


3.4 


3.6 


1.3 


2.3 


3.8 


Brain (thalamus) 


3.5 


4.7 


3.3 


5.0 


9.1 


Brain 

(hypothalamus) 


0.0 


0.7 


0.2 


0.0 


0.0 


Spinal cord 


0.7 


1.5 


0.9 


1.4 


1.1 


glio/astro U87- 
MG 


0.6 


2.6 


1.6 


1.7 


3.0 


glio/astro U-l 18- 
MG 


0.0 


0.6 


0.2 


1.1 


0.1 


astrocytoma 

C 1 VI 7 1 TOO 

SW 1 783 


33.7 


29.5 


9.0 


42.9 


36.1 


neuro*; met SK- 
N-AS 


0.0 


0.0 


0.0 


0.0 

— , — _ 


0.0 




astrocytoma SF- 
539 


31.0 


36.9 


10.3 


68.3 


48.3 


astrocytoma SNB- 
75 


33.7 


O 

32.8 


7.1 


23. U 




glioma SNB-19 


100.0 


100.0 


30.1 


100.0 


100.0 


glioma U251 


49.0 


44.1 


16.5 


57.8 


41.5 


glioma SF-295 


6.0 


8.1 


2.1 


19.8 


8.7 


Heart 


61.1 


26.8 


36.6 


70.7 


39.8 


Skeletal muscle 


0.0 


0.1 


0.2 


0.0 


0.0 


Bone marrow 


0.0 


0.1 


0.0 


0.0 


0.0 


Thymus 


18.7 


18.4 


1.1 


2.7 


17.2 


Spleen 


0.0 


0.0 


0.0 


0.0 


0.0 


Lymph node 


0.0 


0.2 


0.0 


0.0 


0.0 


Colon (ascending) 


0.5 


0.7 


0.7 


0.9 


4.5 


Stomach 


0.1 


1.1 


0.2 


0.2 


0.2 


Small intestine 


0.0 


0.1 


0.0 


0.0 


0.0 


Colon ca. SW480 


0.6 


1.0 


1.6 


7.0 


1.4 


Colon ca.* 
SW620 (SW480 
met) 


0.0 


0.0 


0.0 


0.0 


0.0 


Colon ca. HT29 


0.0 


0.0 


0.0 


0.0 


0.0 
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Colon ca. HCT- 
1 1 a 

1 I o 


0.0 


0.0 


0.0 


0.0 


0.0 


colon ca. LaLo-z 


u.u 


U. J 


u.u 


U.u 


ft ft 
u.u 


i^oion ca. rici-iz> 


u.u 


U. 1 


u.z 


u.u 


ft ft 


L^oion ca. riCC- 
2998 


0.0 


0.0 


0.0 


0.0 


0.0 


Gastric ca. * (liver 
met) NCI-N87 


0.0 


0.4 


0.2 


0.0 


0.0 


Bladder 


0.3 


1.0 


0.2 


0.0 


0.1 


Trachea 


0.0 


0.4 


0.2 


0.0 


0.0 


TC idnpv 


0.2 


0.9 


0.7 


1.2 


1.0 


Kidnev (fetaH 


1 3 


3 9 


0.8 


3.2 


2.7 


Renal ca 786-0 


10.6 


1 1.7 


3.4 


7.9 


12.2 


Renal ca. A498 


0.0 


0.1 


0.0 


0.0 


0.0 


Renal ca. RXF 


17.9 


14.0 


4.1 


16.6 


15.2 


Renal ca. ALHN 


A A 
0.0 


A A 
0.0 


A A 
0.0 


A A 


A A 
U.U 


Renal ca. UO-3 1 


A A 

0.0 


A O 

0.2 


A 1 

0.1 


A A 
0.0 


A O 
U.Z 


Renal ca. TK-10 


0.0 ] 


0.0 


0.0 


0.0 


0.0 


Liver 


0.0 


3.2 


0.8 


0.9 


1.9 


Liver (fetal) 


0.0 


0.1 


0.0 


0.0 


0.0 


Liver ca. 

(hepatoblast) 

HepG2 


0.0 


0.0 


0.0 


0.0 


0.0 


Lung 


0.0 


0.2 


0.0 


0.0 


0.0 


Lung (fetal) 


0.0 


0.5 


0.0 


0.0 


0.0 


Lung ca. (small 
cell) LX-1 


u.u 


u.u 


u.u 


0 0 
u.u 


0 0 
u.u 


Lung ca. (small 
cell) NCI-H69 


? 0 

Zr-U 


1 3 


1 4 


S 4 


2 9 


Lung ca. (s.cell 
var.) SHP-77 


0 0 


ft ft 


0 0 

v/.v/ 


0 0 


0.0 


Lung ca. (large 
cell)NCl-H460 


0 ft 

U.U 


ft 0 

U.U 


1 3 


0 7 

V/. / 


0 3 


Lung ca. (non-sm. 
cell) A549 


0 0 


ft ft 
u.u 


0 1 

U. I 


0 0 
u.u 


0 0 


Lung ca. (non- 
s.cell) NCI-H23 


0.0 


0.0 


0.0 


0.0 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


5.2 


4.7 


3.9 


11.3 


6.7 


Lung ca. (non- 
s.cl) NCI-H522 


1.7 


3.1 


1.5 


2.8 


4.8 


Lung ca. (squam.) 
SW 900 


0.0 


0.0 


0.0 


0.0 


0.0 


Lung ca. (squam.) 


1.3 


2.5 


2.0 


4.2 


3.0 
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NCI-H596 












Mammary gland 


11.9 


9.1 


4.6 


7.0 


10.6 


Breast ca.* (pl.ef) 
MCr-7 


0.0 


0.0 


0.0 


0.0 


0.0 


Breast ca.* (pl.ef) \ 
MDA-MB-23 1 


0.0 


0.1 


0.0 


0.0 


0.0 


Breast ca.* (pi. ef) 
T47D 


11.8 


7.5 


7.9 


31.6 


31.4 


Breast ca. BT-549 


0.0 


0.0 


4.5 


13.8 


28.3 


Breast ca. MDA- 
N 


0.0 


0.1 


0.0 


0.0 


0.0 


Ovary 


0.1 


0.6 


0.4 


0.1 


0.2 


Ovarian ca. 
OVCAR-3 


0.0 


0.0 


0.0 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


0.1 


1.3 


4.4 


0.0 


Ovarian ca. 
OVCAR-5 


73.2 


38.2 


14.4 


57.0 


36.1 


Ovarian ca. 
OVCAR-8 


0.0 


0.7 


1.0 


3.4 


0.2 


Ovarian ca. 


0.0 


0.0 


0.8 


2.6 


0.0 


Ovarian ca. 


0.0 


0.0 


0.0 


0.0 


0.0 


Uterus 


0.4 


1.2 


0.6 


1.7 


2.9 


Placenta 


0.0 


0.1 


u.u 


U.U 


A A 


Prostate 


0.9 


1.9 


0.7 


1.9 


1.7 


Prostate ca.* 
(bone met) rL-j 


0.0 


0.0 


0.1 


0.0 


0.0 


Testis 


25.7 


22.2 


0.7 


0.7 


26.4 


Melanoma 
Hs688(A).T 


23.8 


21.6 


5.0 


20.0 


28.9 


Melanoma* (met) 
Hs688(B).T 


4.6 


6.5 


3.4 


9.6 


9.2 


Melanoma 
UACC-62 


0.0 


0.0 


0.0 


0.0 


0.0 


Melanoma Ml 4 


0.0 


0.1 


0.0 


0.0 


0.0 


Melanoma LOX 
1MVI 


3.7 


3.4 


0.6 


0.2 


2.7 


Melanoma* (met) 
SK-MEL-5 


0.0 


0.0 


0.0 


0.0 


0.0 


Melanoma SK- 
MEL-28 


0.0 


0.0 


3.6 


17.4 


0.0 
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Table AGK. Panel 1.1 



Tissue Name 


Rel. Exp.(%) Ag047, 
Run 109663520 


Tissue Name 


Rel. Exp.(%) Ag047, 
Run 109663520 


Adrenal gland 


0.5 


Renal ca. UO-31 


0.1 


Bladder 


1.2 


Renal ca. RXF 393 


8.7 


Brain (amygdala) 


2.8 


Liver 


1.3 


Brain (cerebellum) 


4.1 


Liver (fetal) 


0.0 


Brain (hippocampus) 


9.6 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (substantia 
nigra) 


15.0 


Lung 


0.1 


Brain (thalamus) 


4.7 


Lung (fetal) 


0.4 


Cerebral Cortex 


48.3 


Lung ca. (non-s.cell) 
HOP-62 


26.6 


Brain (fetal) 


21.2 


Lung ca. (large 
cell)NCl-II460 


1.2 


Brain (whole) 


9.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


glio/astroU-118-MG 


0.7 


Lung ca. (non-s.cl) 
NCI-H522 


5.8 


astrocytoma SF-539 


38.4 


Lung ca. (non-sm. 
cell) A549 


0.0 


astrocytoma SNB-75 


16.4 


Lung ca. (s.cell var.) 
SHP-77 


0.0 
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Lung ca. (small cell) 
LX-1 


0.0 


glioma U251 


55.5 


Lung ca. (small cell) 
NCI-H69 


5.3 


glioma SF-295 


8.7 


Lung ca. (squam.) 
SW 900 


0.0 


glioma SNB-19 


100.0 


Lung ca. (squam.) 
NCI-H596 


5.0 


glio/astro U87-MG 


3.3 


Lymph node 


0.3 


neuro*; met SK-N-AS 


0.0 


Spleen 


0.0 


Mammary gland 


2.0 


Thymus 


0.6 


Breast ca. BT-549 


6.4 


Ovary 


0.6 


Breast ca. MDA-N 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Breast ca.* (pl.ef) 
T47D 


5.7 


Ovarian ca. OVCAR- 
3 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


Ovarian ca. OVCAR- 
4 


0.1 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


Ovarian ca. OVCAR- 
5 


49.0 


Small intestine 


0.0 


Ovarian ca. OVCAR- 


0.5 
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8 




Colorectal 


0.1 


Ovarian ca.* (ascites): 
SK-OV-3 


0.1 


Colon ca. HT29 


0.0 


Pancreas 


0.6 


Colon ca. CaCo-2 


0.0 


Pancreatic ca. 
CAPAN 2 


0.0 


Colon ca. HCT-15 


0.1 


Pituitary gland 


0.5 


Colon ca. HC1-1 16 


0.0 


Placenta 


U.U 


Colon ca. HCC-2998 


0.0 


Prostate 


U.o 


Colon ca. SW480 


1.2 


Prostate ca.* (bone 


0.1 


Pnlnn ra * SW690 

(SW480 met) 


0.0 


Salivary gland 


0.2 


lj IAJ 1 1 1 CIV^ 1 1 


0 1 




0.5 


NCI-N87 


0.1 


Spinal cord 


1.4 


n ccii i 


80 7 

Ov. / 




0.5 


Skeletal muscle (Fetal) 


1.6 


Thyroid 


0.3 


Skeletal muscle 


0.4 


Uterus 


0.0 


Endothelial cells 


0.0 


Melanoma Ml 4 


0.0 


Heart (Fetal) 


14.2 


Melanoma LOX 
IMVI 


0.7 


Kidney 


2.4 


Melanoma UACC-62 


0.0 


Kidney (fetal) 


1.5 


Melanoma SK-MEL- 
28 


0.0 


Renal ca. 786-0 


3.6 


Melanoma* (met) 
SK-MEL-5 


0.0 


Renal ca. A498 


0.0 


Melanoma 
Hs688(A).T 


16.5 


Renal ca. ACHN 


0.0 


Melanoma* (met) 
Hs688(B).T 


5.6 


Renal ca. TK-10 


0.0 







Table AGL. Panel 1.2 



Tissue Name 


ReL Exp.(%) 
Ag712, Run 
114986148 


Rel. Exp.(%) 
Ag712, Run 
119452123 


Tissue Name 


Rel. Exp.(%) 
Ag712, Run 
114986148 


Rel. Exp.(%) 
Ag712, Run 
119452123 


Endothelial cells 


0.0 


0.0 


Renal ca. 786- 
0 


4.0 


3.0 


Heart (Fetal) 


4.0 


2.1 


Renal ca. A498 


0.1 


0.1 


Pancreas 


2.1 


1.1 


Renal ca. RXF 

393 


4.9 


3.5 


Pancreatic ca. 
CAPAN 2 


0.0 


0.0 


Renal ca. 
ACHN 


0.6 


0.3 
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n a u ii i, ,:■ i; ;i .: (! si ,,. J; h 

m 



Adrenal Gland 


2.8 


2.2 


Renal ca. UO- 

1 1 

J I 


0.3 


0.1 


Thyroid 


2.1 


1.5 


Renal ca. TK- 

1 f\ 


0.2 


0.1 


Salivary gland 


1.9 


1.1 


Liver 


4.6 


2.9 


Pituitary gland 


4.6 


3.7 


Liver (fetal) 


0.4 


0.3 


Brain (fetal) 


22.2 


29.3 


Liver ca. 

(hepatoblast) 

HepG2 


0.0 


0.0 


Brain (whole) 


33.2 


26.1 


Lung 


0.8 


0.6 


Brain 

(amygdala) 


18.6 


12.9 


Lung (fetal) 


1.3 


1.2 


Brain 

(cerebellum) 


2.3 


1.8 


Lung ca. (small) 
cell) LX-1 


0.0 


0.0 


Brain 

(hippocampus) 


27.5 


18.0 


Lung ca. (small 
cell)NCI-H69 


4.4 


3.7 


Brain (thalamus) j 


10.4 


7.7 


Lung ca. (s.cell 
var.) SHP-77 


0.1 


0.0 


Cerebral Cortex 


25.3 


22.2 


Lung ca. (large 
cell)NCI-H460 


1.0 


0.6 


Spinal cord 


3.8 


3.2 


Lung ca. (non- 
sm. cell) A549 


0.3 


0.1 


1 " / j I TOT 

gho/astro U87- 
MG 


2.5 


2.1 


Lung ca. (non- 
s.cell) NCI- 
H23 


0.1 


0.1 


glio/astro U-118- 
MG 


U.O 


A A 


Lung ca. (non- 
s.cell) HOP-62 


7 9 
/ .Z 


J.o 


astrocytoma 
SW1783 


in Q 




Lung ca. (non- 
s.cl) NCI-H522 




4 0 


neuro*; met SK- 
N-AS 


0.1 


0.0 


Lung ca. 
(squam.) SW 
900 


0.0 


0.0 


astrocytoma SF- 
539 


36.1 


19.8 


Lung ca. 
(squam.) NCI- 
H596 


4.6 


3.7 


astrocytoma 
SNB-75 


10.7 


3.6 


Mammary 
gland 


7.6 


4.5 


glioma SNB-1 9 


71.2 


51.4 


Breast ca.* 
(pl.ef) MCF-7 


0.0 


0.0 


glioma U251 


38.7 


19.9 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


0.0 


0.0 


glioma SF-295 


4.6 


4.1 


Breast ca.* (pi. 
ef) T47D 


10.2 


7.8 


Heart 


100.0 


100.0 


Breast ca. BT- 
549 


8.4 


5.5 
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Skeletal Muscle 


1.4 


1.1 


Breast ca. 
MDA-N 


0.5 


0.1 


Bone marrow 


0.3 


0.0 


Ovary 


0.5 


0.3 


Thymus 


2.1 


1.0 


Ovarian ca. 
OVCAR-3 


0.4 { 


0.1 


Spleen 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


1.8 


1.4 


Lymph node 


0.8 


0.3 


Ovarian ca. 
OVCAR-5 


39.0 


30.1 


Colorectal 
Tissue 


0.0 


0.0 


Ovarian ca. 
OVCAR-8 


0.6 


0.3 


Stomach 


1 o 

1.2 


0.7 


Ovarian ca. 
1GROV-1 


Q O 

5.5 


8 1 
O.J 


Small intestine 


0.8 


0.4 


Ovarian ca. 
(ascites) SK- 
OV-3 


0.1 


0.0 


Colon ca. 
SW480 


2.5 


1.7 


Uterus 


0.3 


0.2 


Colon ca.* 
SW620 (SW480 
met) 


0.0 


0.0 


Placenta 


0.3 


0.0 


Colon ca. HT29 


0.0 


0.0 


Prostate 


5.1 


3.2 


Colon ca. HCT- 
116 


0.0 


0.0 


Prostate ca.* 
(bone met) PC- 

3 


0.1 


0.0 


Colon ca. CaCo- 
2 


U.J 


u.z 


Testis 




O.o 


Colon ca. Tissue 
(OD03866) 


A 1 




Melanoma 
Hs688(A).T 


1 z.u 


in a 


Colon ca. ricc- 
2998 


0.0 


0.0 


Melanoma* 
(met) 

Hs688(B).T 


5.0 


4.3 


Gastric ca.* 
(liver met) NCI- 
N87 


0.5 


0.2 


iVieiallUIIld. 

UACC-62 


0.0 


0.0 


Bladder 


0.9 


0.6 


Melanoma 
M14 


0.0 


0.0 


Trachea 


1.2 


0.9 


Melanoma 
LOX IMVI 


0.7 


0.2 


Kidney 


4.0 


2.7 


Melanoma* 
(met) SK- 
MEL-5 


1.0 


0.5 


Kidney (fetal) 


6.0 


0.0 
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Table AGM. Panel 1.3D 



Tissue 
Name 


Rel. 
Exp.(%) 
Ag2679, 
Run 
158633802 


Rel. 

Exp.(%) 
Ag2728, 
Run 
158560797 


Rel. 
Exp.(%) 
Ag2732, 

Run 
16240U87o 


Tissue 
Name 


Rel. 

Exp.(%) 
Ag2679, 
Run 


Rel. 

Exp.(%) 
Ag2728, 
Run 

IjojOU /y / 


Rel. 
Exp.(%) 
Ag2732, 

Run 


Liver 

adenocarci 

noma 


3.8 


2.0 




Kidney 
(fetal) 


U.Z 


1 .o 


1 ? 


Pancreas 


0.0 


0.0 


0.0 


Renal ca. 
786-0 


9.8 


11.5 


3.3 


Pancreatic 
ca. 

p a DAM 


0.0 


0.0 


0.0 


Renal ca. 
A498 


54.7 


40.6 


15.1 


Adrenal _ 
gland 


0.2 


0.5 


0.2 


Renal ca. 

ivAr j yj 


11.9 


7.3 


8.2 


Thyroid 


0.6 


1.3 


0.2 


Renal ca. 
ACHN 


0.0 


0.0 


0.0 


Salivary 
gland 


0.1 | 


0.3 


0.1 


Renal ca. 
UO-31 


0.8 


0.3 


0.0 


Pituitary 
gland 


0.4 


0.3 


0.0 


Renal ca. 
TK-10 


0.0 


0.0 


0.0 


Brain 

(fetal) 


12.4 


15.2 


2.5 


Liver 


0.7 


0.4 


0.4 


Brain 
(whole) 


7.2 


15.5 


6.1 


Liver 
(fetal) 


0.2 


0.4 


0.0 


Brain 

(amygdala) 


11.3 


17.4 


7.0 


Liver ca. 
(hepatoblas 
t) HepG2 


0.0 


0.0 


A A 

0.0 


Brain 

(cerebellu 

m) 


1.3 


1.0 


0.4 


Lung 


0.1 


a q 
0.O 


A 1 


Brain 

(hippocam 

pus) 


60.7 


80.1 


16.8 


Lung 
(fetal) 


0.4 


O.o 


A A 


Brain 

(substantia 
nigra) 


1.7 


1.7 


0.5 


Lung ca. 
(small cell) 

T V 1 


0.0 


0.0 


0.0 


Brain 
(thalamus) 


3.7 


10.4 


4.5 


Lung ca. 
(small cell) 
NCI-H69 


3.9 


14.0 


3.7 


Cerebral 
Cortex 


100.0 


100.0 


93.3 


Lung ca. 
(s.cell var.) 
SHP-77 


0.0 


0.2 


0.0 


Spinal cord 


1.8 


1.8 


1.3 


Lung ca. 
(large 
cell)NCI- 
H460 


0.2 


0.1 


0.5 
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glio/astro 
U87-MG 


3.0 


4.7 


1 

3.0 


Lung ca. 
(non-sm. 
sell) A549 


0.0 


0.2 


0.0 


glio/astro 1 
U-118-MG 


5.3 


5.4 


0.8 


Lung ca. 
non-s.cell) 
NCI-H23 


0.0 


0.0 


0.0 


astrocytom 
aSW1783 


99.3 , 


87.7 


100.0 


Lung ca. 

(non-s.cell) 

HOP-62 


6.4 


4.3 


3.6 


neuro*; 
met SK-N- 
AS 


0.1 


0.0 


0.0 


Lung ca. 
non-s.cl) 
NCI-H522 


1.3 


1.5 


0.9 


astrocytom 
a SF-539 


80.7 


75.3 


42.0 


Lung ca. 
^squam.) 
SW 900 


0.0 


0.0 


0.0 


astrocytom 
a SNB-75 


77.4 


64.6 


18.2 


Lung ca. 
(squam.) 
NCI-H596 


1.0 


2.5 


2.6 


glioma 
SNB-19 


94.0 


75.3 


63.7 


Mammary 
gland 


1.9 


4.3 


0.6 


glioma 
U251 


68.8 


62.4 


29.1 


(pl.ef) 
MCF-7 


0.0 


0.0 


0.0 


glioma SF- 
295 


8.1 


6.9 


4.0 


Breast ca.* 
(pl.ef) 
MDA-MB- 
231 


0.8 


0.5 


0.1 


Heart 
(fetal) 


34.6 


10.6 


11.9 


Breast ca.* 

(pl.ef) 

T47D 


15.9 


7.6 


4.9 


Heart 


12.9 


12.1 


20.2 


Breast ca. 
BT-549 


58.2 


36.6 


4.0 


Skeletal 

muscle 

(fetal) 


11.0 


7.0 


4.1 


Breast ca. 

IVIU/V-IN 


0.2 


0.0 


0.0 


Skeletal 
muscle 


A A 
0.0 


U.J 




Ovary 


1 8 

1 .o 


l s 

1 .~f 


1.4 


Bone 
marrow 


A A 


U.z 


U. 1 


Ovarian ca. 
OVCAR-3 


n n 

V/.v 


0 0 


0.0 


Thymus 


0.9 


1.2 


3.8 


Ovarian ca. 
OVCAR-4 


0.1 


0.5 


0.2 


Spleen 


0.0 


0.0 


0.0 


Ovarian ca. 
OVCAR-5 


41.8 


33.9 


14.3 


Lymph 
node 


0.1 


0.6 


0.0 


Ovarian ca. 
OVCAR-8 


1.0 


0.6 


0.5 


Colorectal 


1.3 


0.5 


0.9 


Ovarian ca. 
IGROV-1 


0.0 


4.5 


1.8 


Stomach 


0.4 


0.5 


0.1 


Ovarian 


0.1 


0.1 


0.0 
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ca.* 

(ascites) 
SK-OV-3 








Small 
intestine 


0.0 


0.6 


0.2 


Uterus 


0.0 


0.0 


0.0 


Colon ca 
SW480 


11.2 


5.4 


1.1 


Placenta 


0.1 


0.0 


0.0 


Colon ca.* 
SW620CS 
W480 met) 


0.0 


0.0 


0.0 


Prostate 


1.2 


0.5 


0.6 


Colon ca. 
HT29 


0.0 


0.0 


0.0 


Prostate 
ca.* (bone 
met)PC-3 


0.5 1 


0.0 


0.0 


Colon ca. 
HCT-1 16 


0.0 


0.0 


0.0 


Testis 


1.8 


1.4 


0.9 


Colon ca. 
CaCo-2 


0.4 


0.1 


0.1 


Melanoma 
Hs688(A). 
T 


42.3 


22.4 


17.1 


Colon ca. 
tissue(OD 
03866) 


0.4 


0.4 


0.3 


IVIC IuIIOII la 

* (met) 

Hs688(B). 

T 


1.7 


1.6 


1.0 


Colon ca. 
HCC-2998 


0.0 


0.0 


0.0 


Melanoma 
UACC-62 


0.0 


0.0 


0.0 


Gastric 
ca.* (liver 
met) NCI- 
N87 


0.2 


0.4 


0.1 


Melanoma 
M14 


0.0 


0.0 


0.0 


Bladder 


0.5 


0.3 


0.3 


Melanoma 
LOX IMVI 


2.4 


1.1 


0.4 


Trachea 


1.3 


1.5 


1.0 


Melanoma 
* (met) 
SK-MEL-5 


0.0 


0.0 


0.0 


Kidney 


0.7 


0.3 


0.2 


Adipose 


0.2 


0.7 


0.0 ! 



Table AGN. Panel 2.2 



Tissue Name 


Rel. Exp.(%) 
Ag2975, Run 
173763053 


Tissue Name 


Rel. Exp.(%) 
Ag2975, Run 
173763053 


Normal Colon 


2.5 


Kidney Margin 
(OD04348) 


29.7 


Colon cancer 
(OD06064) 


4.7 


Kidney malignant 
cancer (OD06204B) 


0.0 


Colon Margin 
(OD06064) 


4.3 


Kidney normal adjacent 
tissue (OD06204E) 


0.0 


Colon cancer 


2.2 


Kidney Cancer 


0.0 j 
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(OD06159) 




(OD04450-01) 




Colon Margin 
(OD06159) 


7.3 


Kidney Margin 
(OD04450-03) 


13.6 


Colon cancer 
(OD06297-04) 


0.0 


Kidney Cancer 8120613 


2.3 


Colon Margin 
(OD06297-015) 


0.0 


Kidney Margin 
8120614 


11.5 


CC Gr.2 ascend colon 
(OD03921) 


0.0 


Kidney Cancer 9010320 


23.8 


CC Margin (OD03 921) 


0.0 


Kidney Margin 
9010321 


20.0 


Colon cancer metastasis 
(OD06104) 


0.0 


Kidney Cancer 8120607 


2.0 


Lung Margin 
(OD06104) 


0.0 


Kidney Margin 
8120608 


0.0 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Uterus 


4.4 


Lung Margin 
(OD04451-02) 


2.0 


Uterine Cancer 06401 1 


0.0 


Normal Prostate 


7.5 |Normal Thyroid 


4.4 


Prostate Cancer 
(OD04410) 


2.3 


Thyroid Cancer 064010 


0.0 


Prostate Margin 
(OD04410) 


4.6 


Thyroid Cancer 
A302152 


2.2 


Normal Ovary 


2.2 


Thyroid Margin 
A302153 


3.6 


Ovarian cancer 
(OD06283-03) 


0.0 


Normal Breast 


55.1 


Ovarian Margin 
(OD06283-07) 


0.0 


Breast Cancer 
(OD04566) 


3.9 


Ovarian Cancer 064008 


13.4 


Breast Cancer 1024 


68.3 


Ovarian cancer 
(OD06145) 


18.8 


Breast Cancer 
(OD04590-01) 


5.3 


Ovarian Margin 
(OD06145) 


8.1 


Breast Cancer Mets 
(OD04590-03) 


0.0 


Ovarian cancer 
(OD06455-03) 


5.0 


Breast Cancer 
Metastasis (OD04655- 
05) 


13.1 


Ovarian Margin 
(OD06455-07) 


2.3 


Breast Cancer 064006 


12.5 


Normal Lung 


2.3 


Breast Cancer 9100266 


34.6 


Invasive poor diff. lung 
adeno (ODO4945-01 


0.0 


Breast Margin 9100265 


17.0 


Lung Margin 
(ODO4945-03) 


0.0 


Breast Cancer A209073 


16.7 


Lung Malignant Cancer 


1.4 


Breast Margin 


71.7 
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(OD03126) 




A2090734 




Lung Margin 
(OD03126) 


0.0 


Breast cancer 
(OD06083) 


20.7 


Lung Cancer 
(OD05014A) 


0.0 


Breast cancer node 
metastasis (OD06083) 


6.3 


Lung Margin 
(OD05014B) 


2.1 


Normal Liver 


0.0 


Lung cancer (OD06081) 


100.0 


Liver Cancer 1026 


8.1 


Lung Margin 
(OD06081) 


A 1 




2Q S 


Lung Cancer 
(OD04237-01) 


0.0 


Liver Cancer 6004-T 


8.4 


Lung Margin 
(OD04237r02) 


0.0 


Liver Tissue 6004-N 


4.7 


Ocular Melanoma 
Metastasis 


6.7 


Liver Cancer 6005-T 


23.0 


Ocular Melanoma 
Margin (Liver) 


2.8 


Liver Tissue 6005-N 


32.8 


Melanoma Metastasis 


2.2 


Liver Cancer 064003 


12.8 


Melanoma Margin 
(Lung) 


3.7 


Normal Bladder 


0.0 


Normal Kidney 


9.0 


Bladder Cancer 1023 


6.3 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


l y.j 


Bladder Cancer 
A302173 


\f> 7 


Kidney Margin 
(OD04338) 




INUIlllcil OlUUIaVTl 




Kidney Ca Nuclear 
grade 1/2 (OD04339) 


0.0 


Gastric Cancer 9060397 


0.0 


Kidney Margin 
(OD04339) 


9.3 


Stomach Margin 
9060396 


3.8 


Kidney Ca, Clear cell 
type (OD04340) 


0.0 


Gastric Cancer 9060395 


6.5 


Kidney Margin 
(OD04340) 


4.6 


Stomach Margin 
9060394 


2.3 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


77.4 


Gastric Cancer 064005 


0.0 



Table AGO. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag047, Run 
144771648 


Rel. Exp.(%) 
Ag047, Run 
152940364 


Rel. Exp.(%) 
Ag2679, Run 
158633803 


Rel. Exp.(%) 
Ag2728, Run 
158561830 


Normal Colon 


5.5 


8.9 


7.4 


10.4 


CC Well to Mod 
Diff(ODQ3866) 


1.4 


0.2 


0.2 


1.3 
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CC Margin 
(OD03866) 


0.4 


0.5 


0.4 


0.1 


CC Gr.2 

rectosigmoid 

(OD03868) 


0.4 


0.1 


1.7 


0.2 


CC Margin 
(OD03868) 


7 ^ 


n o 
v.y 


n 7 


n 7 


CC Mod Diff 
(ODO3920) 


0.0 


0.0 


0.0 


0.2 


CC Margin 
(ODO3920) 


0.7 


0.9 


1.1 


0.3 


CC Gr.2 ascend 
colon (OD03 921) 


0.0 


0.7 


0.6 


0.2 


CC Margin 
(OD03921) 


0.2 


0.2 


0.6 


0.8 


CC from Partial 
Hepatectomy 
(ODO4309) Mets 


0.6 


0.4 


1.1 


1.0 


Liver Margin 
(ODO4309) 


111 
11.1 


IJ.O 




71 7 
Zl.Z 


Colon mets to lung 
(OD04451-01) 


u.u 


n a 


n l 


n 7 


Lung Margin 
(OD04451-02) 




1 .u 




O 7 
u. / 


Normal Prostate 
6546-1 


1 Q 1 

i y. i 


n 7 


1 0 


7 1 


Prostate Cancer 
(OD04410) 


D .5 


7 0 


A 1 


^ A 


Prostate Margin 
(OD04410) 


A 7 


S 7 


7 7 


4 Q 


Prostate Cancer 
(OD04720-01) 


7 0 








Prostate Margin 
(OD04720-02) 


1? 5 


10 7 


8.3 


15.2 


Normal Lung 
061010 


1 < 
i .J 


7 1 




5.4 


Lung Met to Muscle 
(OD04286) 


0.3 


0.1 


0.0 


0.5 


Muscle Margin 
(OD04286) 


0.3 


0.2 


0.3 


0.6 


Lung Malignant 
Cancer (OD03126) 


0.4 


0.2 


0.8 


1.5 


Lung Margin 
(OD03126) 


0.4 


1.0 


1.2 


0.5 


Lung Cancer 
(OD04404) 


86.5 


100.0 


100.0 


100.0 
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;:.!■ ?r 



;* Si II'" 



Lung Margin 
(OD04404) 


18 3 


3.3 


2.2 


3.0 


Lung Cancer 
(OD04565) 


100 0 


52.1 


62.0 


77.9 


Lung Margin 
(OD04565) 


0 2 


0.1 


0.0 


0.6 


Lung Cancer 
(OD04237-01) 


6 3 


1.5 


3.2 


3.3 


Lung Margin 
(OD04237-02) 


1 4 




0 5 


0.6 


Ocular Mel Met to 
Liver (OD043 10) 


0.4 


0.3 


0.5 


0.7 


Liver Margin 
(ODO4310) 


2.3 


1.8 


3.5 


2.8 


Melanoma Mets to 
Lung (OD04321) 


0.0 


0.3 


0.4 


1.6 


Lung Margin 
(OD04321) 


9 1 


9 (\ 


9 9 


3 0 


Normal Kidney 


6.7 


4.9 


8.4 


7.0 


Kidney Ca, Nuclear 
grade2(OD04338) 


0.0 


0.0 


0.3 


1.4 


Kidney Margin 
(OD04338) 


3.5 


1.5 


4.1 


3.5 


Kidney Ca Nuclear 
grade 1/2 
(OD04339) 


0.0 


0.1 


0.3 


0.7 


Kidney Margin 
(OD04339) 


18.4 


10.3 


8.4 


15.5 


Kidney Ca, Clear 
cell type (OD04340) 


0.0 


0.8 


0.7 


1.1 


Kidney Margin 
(OD04340) 


6.5 


4.4 


4.5 


6.5 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


90.8 


36.1 


54.3 


50.3 


Kidney Margin 
(OD04348) 


4.6 


3.2 


3.8 


4.0 


Kidney Cancer 
(OD04622-01) 


2.7 


1.8 


4.1 


3.5 


Kidney Margin 
(OD04622-03) 


2.0 


0.2 


0.3 


1.1 


Kidney Cancer 
(OD04450-01) 


0.0 


0.0 


0.0 


0.0 


Kidney Margin 
(OD04450-03) 


3.1 


1.4 


6.9 


5.4 


Kidney Cancer 
8120607 


1.5 


0.3 


0.5 


1.8 
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Kidney Margin 
8120608 




0.8 


1.3 


2.2 


Kidney Cancer 
8120613 


0.4 


0.8 


4.3 


2.0 


Kidney Margin 
8120614 


2.8 


1.2 


4.9 


4.0 


Kidney Cancer 
9010320 


76.8 


39.0 


36.3 


52.9 


Kidney Margin 
9010321 


10.4 


5.1 


4.9 


4.3 


Normal Uterus 


0.0 


0.2 


0.3 


0.0 


Uterus Cancer 
064011 


0.9 


0.0 


0.3 


1.9 


Normal Thyroid 


0.3 


0.0 


0.9 


2.2 


Thyroid Cancer J 
064010 


0.0 


0.0 


0.0 


0.0 


Thyroid Cancer 
A302152 


1.4 


0.1 


0.4 


0.0 


Thyroid Margin 
A302153 


1.7 


0.9 


6.4 


41 


Normal Breast 


20.3 


5.7 


13.0 


20.9 


Breast Cancer 
(OD04566) 


0.3 


0.1 


0.5 


0.3 


Breast Cancer 
(OD04590-01) 




2.2 


1.1 


1.9 


Breast Cancer Mets 
(OD04590-03) 


0 7 
yj. i 


0 1 


0.3 


0.0 


Breast Cancer 

Metastasis 

(OD04655-05) 


6.9 


3.3 


7.7 


9.3 


Breast Cancer 
064006 


5.6 


8.3 


4.2 


4.9 


Breast Cancer 1024 


47.0 


19.3 


30.1 


23.5 


Breast Cancer 
9100266 


14.9 


9.3 


15.7 


21.8 


Breast Margin 
9100265 


4.6 


1.5 


4.4 


8.4 


Breast Cancer 
A209073 


48.3 


12.9 


28.3 


40.9 1 


Breast Margin 
A2090734 


38.7 


16.6 


27.5 


29.7 


Normal Liver 


0.2 


0.1 


0.0 


0.2 


Liver Cancer 064003 


11.8 


5.6 


4.6 


4.5 


Liver Cancer 1025 


4.5 


1.6 


4.2 


2.8 


Liver Cancer 1026 


6.2 


6.7 


9.0 


6.4 
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Liver Cancer 6004-T: 


15.6 


3.3 


3.5 J 2.8 


.Liver i issue ouu*+-in 


n 1 

U. 1 




0.2 


0.4 


Liver Cancer 6005-T 


14.6 


8.0 


8.7 


7.9 


Liver Tissue 6005-N 


6.4 


7.0 


5.9 


3.2 


Normal Bladder 


1.3 


0.9 


1.6 


0.9 


Bladder Cancer 1023 


0.4 


0.2 


0.3 


0.3 


Bladder Cancer 
A302173 


7 S 
/ .J 




12.1 


19.8 


Bladder Cancer 
(OD047 18-01) 


77 1 




35.6 


41.5 


Bladder Normal 
Adjacent (OD04718- 
03) 


0.4 


0.3 


0.6 


0.7 


Normal Ovary 


1.4 


0.4 


1.1 


0.3 


Ovarian Cancer 
064008 


1.3 


0.5 


1.3 


i ~j 
1.5 


Ovarian Cancer 
(OD04768-07) 


0.4 


0.2 


A A 

u.u 


A A 

U.U 


Ovary Margin 
(OD04768-08) 


1.0 


0.8 


A a 
U.O 


l .O 


Normal Stomach 


4.3 


2.0 


0.3 


3.1 


Gastric Cancer 
9060358 




0 7 


0.7 


1.2 


Stomach Margin 
9060359 


n 0 


0 0 


0.7 


4.3 


Gastric Cancer 
9060395 


0.9 


2.0 


1.9 


2.4 


Stomach Margin 
9060394 


0.2 


0.3 


1.1 


0.2 


Gastric Cancer 
9060397 


0.4 


0.5 


1.6 


0.7 


Stomach Margin 
9060396 


0.0 


0.1 


0.2 


0.2 


Gastric Cancer 
064005 


0.7 


0.7 


0.9 


1.5 



Table AGP. Panel 3D 



Tissue Name 


Rel.Exp.(%) 
Ag047, Run 
158634002 


Tissue Name 


Rel. Exp.(%) 
Ag047, Run 
158634002 


Daoy- Medulloblastoma 


0.9 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


0.3 


TE671- Medulloblastoma 


0.0 


ES-2- Ovarian clear cell 
carcinoma 


6.7 
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D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498- CNS 


0.0 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.0 


SNB-78- Glioma 


100.0 


Raji- Burkitt's lymphoma 


U.U 


SF-268- Glioblastoma 


79.0 


Daudi- Burkitt's lymphoma 


U.U 


T98G- Glioblastoma 


1.4 


U266- B-cell plasmacytoma 


0.0 


SK-N-SH- 

Neuroblastoma 

(metastasis) 


2.0 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.1 


RL- non-Hodgkin's B-cell 
lymphoma 


0.0 


Cerebellum 


4.5 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


2.9 


Jurkat- T cell leukemia 


0.0 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


3.8 


TF-1- Erythroleukemia 


0.0 


DMS-114- Small cell 
lung cancer 


0 0 


HUT 78- T-cell lvmohoma 


0.0 


DMS-79- Small cell lung 

cancer 


0 1 


T 1917- Histiocvtic lvmohoma 


0.0 


NCI-H146- Small cell 
lung cancer 


0.0 


KU-812- Myelogenous 
leukemia 


0.0 


NCI-H526- Small cell 
lung cancer 


1.0 


769-P- Clear cell renal 
carcinoma 


0.1 


NCI-N417- Small cell 
lung cancer 


0.0 


Caki-2- Clear cell renal 
carcinoma 


0.1 


NCI-H82- Small cell lung 
cancer 


/.o 


SW 839- Clear cell renal 
carcinoma 


0 0 


NCI-H157- Squamous 
cell lung cancer 
(metastasis) 


0.8 


G401- Wilms* tumor 


6.0 


NCI-H1155- Large cell 
lung cancer 


0.0 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


0.0 


NCI-H1299- Large cell 
lung cancer 


0.2 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


0.0 


NCI-H727- Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


0.1 


NCI-UMC-ll-Lung 
carcinoid 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


0.6 


LX-1- Small cell lung 
cancer 


0.0 


HP AC- Pancreatic 
adenocarcinoma 


0.0 
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0 0 


MIA PaCa-2- Pancreatic 
carcinoma 


0.0 




0.0 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


0.1 


rviviz, vjLvZ,- v^uiuii tciiitci 


0 0 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


0.0 




0 0 


T24- Bladder carcinma 
(transitional cell) 


0.0 


SW-48- Colon 
adenocarcinoma 


0 0 


5637- Rladder carcinoma 


3.8 


SW1 116- Colon 
adenocarcinoma 


0 0 

yj .\j 


HT-1 197- Bladder carcinoma 


0.0 


LS 174T- Colon 
adenocarcinoma 


0 0 

\J.VJ 


UM-UC-3- Bladder carcinma 
(transitional cell) 


10.3 


SW-948- Colon 
adenocarcinoma 


0 0 


A 904- RhahHnmvosarcoma 

rVAUT^ ivi lauu will j yj&cxi \s uui u 


0.0 


SW-480- Colon 
adenocarcinoma 


0 0 


HT-10R0- Fibrosarcoma 

111 lV/OU 1 1 KJl \J JOI vvl 1 1<* 


59.9 


NCI-SNU-5- Gastric 
carcinoma 


o 7 


1 VI V J UJ V^OUVV^Cll WVJ111C* 


0.5 


KATO 111- Gastric 
carcinoma 




SK-LMS-1- Leiomyosarcoma 
(vulva) 


4.2 


NCI-SNU-16- Gastric 
carcinoma 


0 9 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


0.3 


NCI-SNU-1- Gastric 
carcinoma 


0 0 


A43 1 - Rnidermoid carcinoma 


0.6 


RF-1- Gastric 
adenocarcinoma 


0.3 


WM266-4- Melanoma 


0.0 


RF-48- Gastric 
adenocarcinoma 


0.1 


DU 145- Prostate carcinoma 
(brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


0.0 


MDA-MB-468- Breast 
adenocarcinoma 


0.0 


NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous cell 
carcinoma of tongue 


0.2 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


1.1 


RL95-2- Uterine 
carcinoma 


4.0 


SCC-15- Squamous cell 
carcinoma of tongue 


0.3 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous cell 
carcinoma of tongue 


7.0 



Table AGQ. Panel 4.1D 



Tissue Name 



Rel. Exp.(%) 
Ag2975, Run 
171818669 



Tissue Name 



Rel. Exp.(%) 
Ag2975, Run 
171818669 



1072 




Secondary Th 1 act 


0.0 HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


U.U 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


^f=*r»r\nH TV1 rf^ct 

oCdJiicidry iii i Coi 


0.0 


Lung Microvascular EC 
none 


0.0 


.rrirnary 1111 act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


1.1 


r rim ary 1 nz aci 


0.0 


Microvascular Dermal EC 
none 


0.0 


rTimary in aci 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Jr nmary i n i rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


37.4 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


47.6 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalphaH- IL-lbeta 


100.0 


CD45RA CD4 
lymphocyte act 


0.8 


Coronery artery SMC rest 1 


0.8 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.6 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


51.8 


Secondary CD8 
lymphocyte rest 


0. 1 


Astrocytes TNFalpha + 
IL-lbeta 


78.5 


Secondary CDS 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 

, _ , ^ r. 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD 1 106 (Keratinocytes) 
none 


45.1 


LAK cells rest 


0.0 


CCD 1106 (Keratinocytes) 
i fNraipna ^ li^-ioeia 


33.7 


l^r\ IV CeilS 1L--Z 


0.0 


i^lVCr ClIIIlOMi* 


0.2 


LAK cells IL-2+IL-12 


0.0 


NC1-H292 none 


0.5 


LAK cells 1L-2+IFN 
gamma 


0.0 


NCI-H292 IL-4 


12.6 


LAK cells IL-2+ IL-18 


0.1 


NCI-H292 IL-9 


0.6 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-13 


10.9 


NK Cells IL-2 rest 


0.0 


NCI-H292 IFN gamma 


0.9 


Two Way MLR 3 day 


0.0 


HPAEC none 


1.2 


Two Way MLR 5 day 0.0 


HP A EC TNF alpha + IL-1 


0.3 
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beta 


Two Way MLR 7 day 


0.1 


Lung fibroblast none 


1.5 


PBMC rest 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PWM 


0.0 


Lung fibroblast IL-4 


0.1 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


0.2 


Ramos (B cell) none 


0.0 


Lung fibroblast TL- 1 3 


1.3 


Ramos (B cell) 
ionomycin 


U.U 


Lung fibroblast IFN 
gamma 


U.O 


B lymphocytes PWM 


0.0 


Dermal fibroblast 
CCD 1070 rest 


4.1 


B lymphocytes CD40L 
and IL-4 


0 0 


Dermal fibroblast 
CCD 1070 TNF alpha 


2.2 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.8 




EOL-1 dbcAMP 
P1VT A /innomvrin 


0.0 


Dermal fibroblast IFN 

galUIIld. 


2.4 


OpnHritif* ppII^ nnnp 

L/vl IU1 1 L1V.- L L 1 1 .> lIUllv 


0.0 


Dermal fibroblast IL-4 


5.4 


Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


9.3 


Dendritic cells anti- 
CD40 


0.1 


Neutrophils TNFa+LPS 


0.6 


Monocytes rest 


0.0 


Neutrophils rest 


0.0 


Monocytes LPS 


0.0 


Colon 


0.5 


Macrophages rest 


0.0 


Lung 


1.6 


Macrophages LPS 


0.0 


Thymus 


9.3 


HUVEC none 


0.0 


Kidney 


11.3 


HUVEC starved 


0.4 







Table AGR. Panel 4D 



Tissue 
Name 


Rel. 

Exp.(%) : 
Ag047, 
Run 
146087309 


Rel. 
Exp.(%) 
Ag047, 
Run 
151918119 


Rel. 

Exp.(%) 
Ag047, 
Run 
152893522 


Rel. 

Exp.(%) 
Ag2679, 
Run 
158535667 


Rel. 
Exp.(%) 
Ag2728, 
Run 
158562582 


Rel. 
Exp.(%) 
Ag2975, 
Run 
164314612 


Rel. 

Exp.(%) 
Ag47b, 
Run 
158664022 


Secondary 1 
Thl act 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.1 


Secondary 
Th2 act 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Secondary 
Trl act 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Secondary 
Thl rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Secondary 
Th2 rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 
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Secondary 
Trl rest 


u.u 


u.u 


ft ft 
u.u 


ft ft 

u.u 


u.u 


ft ft 
u.u 


ft ft 
u.u 


Primary 
Thl act 




u.u 


u.u 


ft ft 

u.u 


ft ft 
u.u 


ft ft 
u.u 


ft ft 
u.u 


Primary 
Th2 act 


u.u 


u.u 


u.u 


ft ft 
u.u 


ft ft 
u.u 


ft ft 
u.u 


ft ft 
u.u 


Primary 
Trl act 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Primary 
Thl rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Primary 
Th2 rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


r nmary 
Trl rest 


0.0 


0.0 


0.0 


* 0.0 


0.0 


0.0 


0.0 


CD45RA 

lymphocyt 
e act 


0.1 


0.2 


0.1 


0.3 


0.3 


0.0 


0.3 


CD45RO I 

lymphocyt 
e act 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.1 


CD8 

lympiiucy 1 
e act 


0 ft 

U.U 


0 ft 
u.u 


0 0 

\J.\J 


ft ft 

\j . yj 


ft ft 


ft ft 

yj .\j 


0 0 


Secondary 
TDK 

lymphocyt 
e rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Secondary 
CD8 

1 y iiiuiiu^y l 

e act 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


lymphocyt 
e none 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


2ry 

Thl/Th2/T 
rl anti- 
CD95 
CH11 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


LAK cells 
rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


LAK cells 
IL-2 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


LAK cells 
IL-2+IL- 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 
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12 
















LAK cells 

IL-2+1FN 

gamma 


0.1 


0.0 


0.0 


0.0 


0.0 


0.0 


0.1 


LAK cells 
IL-2+IL- I 
18 


0.0 


0.0 


0.0 


0.0 


0.1 


0.0 


0.0 


LAK cells 
PMA/iono 
mycin 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


NK Cells 
IL-2 rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Two Way 
MLR 3 
day 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Two Way 
MLR 5 * 
day 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 

, . — - 


0.0 


Two Way 
MLR 7 
day 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


PBMC rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


PBMC 
PWM 


0.0 


0.0 


0.0 


0.0 


0.1 


0.0 


0.0 


PBMC 
PHA-L 


0.0 


0.0 


0.0 


0.0 


A A 

0.0 


u.u 


u.u 


Ramos (B 
cell) none 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


rvcii i iu5 V 

cell) 

ionomycin 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


B 

lymphocyt 
es PWM 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


R 

lymphocyt 
es CD40L 
and IL-4 


0.0 


0.0 


0.0 


0.0 


0.1 


0.0 


0.0 


EOL-1 
dbcAMP 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.1 


EOL-1 
dbcAMP 
PMA/iono 
mycin 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Dendritic 
cells none 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Dendritic 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 j 0.0 



1076 




cells LPS 
















Dendritic 
cells anti- 
CD40 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Monocytes 
rest 


n a 
u.u 


A A 

u.u 


a a 
u.u 


A A 
U.U 


A A 
U.U 


A A 
U.U 


A A 
U.U 


Monocytes 
LPS 


a a 
u.u 


a a 

U.U 


a a 
u.u 


A A 
U.U 


A A 
U.U 


A A 
U.U 


A A 
U.U 


Macropha 
ges rest 


a a 
u.u 


A a 
U.U 


A A 

u.u 


A A 
U.U 


A A 
U.U 


A A 
U.U 


n A 
U.U 


Macropha 
ges LPS 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


HUVEC 
none 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


HUVEC 
starved 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


HUVEC 
IL-lbeta 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


IFN 
gamma 


0.0 


0.0 


0.0 


0.1 


0.0 


0.0 


0.0 


HUVEC 
TNF alpha 
+ IFN 
gamma 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


HUVEC 
TNF alpha 
+ IL4 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


HUVEC 
IL-1 1 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Lung 

Microvasc 
ular EC 

ii^jii^ 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Lung 

Microvasc 
ular EC 
TNFalpha 
+ IL-lbeta 


0.0 


7.3 


0.0 


0.0 


0.0 


0.0 


0.0 


Microvasc 
ular 

Dermal EC] 
none 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Microsvas 
ular 

Dermal EC 
TNFalpha 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 
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+ II .- 1 beta i 
















Bronchial 

epithelium 

TNFalpha 


8.6 


2.0 


1.4 


10.0 


4.6 


11.9 


13.3 


Small 
airway 
cpiincuuiii 
none 


21.8 


9.5 


17.8 


17.7 


17.8 


12.2 


17.2 


Small 

airway 

epithelium 

l iNrdipiid. 

+ IL-lbeta 


100.0 


100.0 


100.0 


100.0 


100.0 


100.0 


100.0 


v_.oronery 
artery 
SMC rest 


0.2 


0.2 


0.3 


0.4 


0.4 


0.2 


0.2 


v^oronery 

artery 

SMC 

TNFalpha 

H-IL-lbeta 


0.2 


0.3 


0.2 


0.2 


0.2 


0.1 


0.2 


Astrocytes 
rest 


23.3 


19.3 


20.4 


30.8 


20.7 


16.6 


21.8 


/vsirocyres 
TNFalpha 
-HL-lbeta 


27.0 


25.0 


25.3 


27.9 


22.7 


19.3 


28.5 


(Basophil) 
rest 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


KU-812 
(Basophil) 

mycin 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


CCD1 106 
(Keratinoc 
ytes) none 


18.4 


12.7 


14.3 


16.7 


12.5 


10.2 


13.5 


CCD 1106 
(Keratinoc 
ytes) 

TNFalpha 
+ 1L-Ibeta 


4.2 


1.4 


2.0 


7.1 


3.5 


5.1 


8.4 


Liver 
cirrhosis 


0.3 


0.2 


0.2 


0.3 


0.3 


0.2 


0.4 


Lupus 
kidney 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


NCI-H292 
none 


0.4 


0.2 


0.4 


0.3 


0.6 


0.1 


0.2 
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NCI-H292 
IL-4 


9.3 


4.8 


7.5 


8.9 


8.2 


5.3 


8.7 


NC1-H292 ! 
IL-9 


0.6 


0.3 


0.6 


0.7 


0.7 


0.5 


0.4 


NCI-H292 
IL-13 


4.0 


5.1 


3.9 


4.5 


4.2 


2.9 


3.3 


NCI-H292 

IFN 

gamma 


0.2 


0.2 


0.1 


0.3 


0.2 


0.2 


0.2 


HPAEC 
none 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.1 


hpapp 
TNF alpha 
+ IL-1 beta 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Lung 

fibroblast 

none 


0.1 


0.3 


0.3 


0.3 


0.3 


0.1 


0.2 


Lung 
fibroblast 
TNF alpha 
+ IL-1 beta 


0.0 


0.0 


0.1 


0.0 


0.0 


0.1 . 


0.1 


Lung 

fibroblast 

IL-4 


0.3 


0.1 


0.2 


0.3 


0.1 


0.2 


0.3 


Lung 

fibroblast 

IL-9 


0.0 


0.2 


0.1 


0.1 


0.1 


0.1 


0.1 


Lung 

Tl nlVA hi CI cl" 
1 lul UUIdol 

IL-13 


0 1 


0 1 


0 1 


0 2 


0.2 


0.0 


0.3 


Lung 
fibroblast 

gamma 


0.1 


0.0 


0.0 


0.1 


0.1 


0.2 


0.1 


Dermal 
fibroblast 

rest 


2.2 


1.3 


2.4 


3.1 


2.5 


1.5 


2.4 


Dermal 

L/vl 111(41 

fibroblast 
CCD 1070 
TNF alpha 


1.0 


0.8 


1.1 


2.1 


1.8 


0.9 


1.2 


Dermal 
fibroblast 
CCD 1070 
IL-1 beta 


0.3 


0.3 


0.6 


0.7 


0.6 


0.4 


0.6 


Dermal 
fibroblast 


0.6 


0.6 


1.1 


1.5 


1.1 


0.6 


0.7 




IFN 
gamma 
















Dermal 
fibroblast 
II -4 


1.8 


1.2 


2.0 


2.5 


1.5 


1.2 


1.4 


IBD 
Colitis 2 


0.1 


0.1 


0.0 


0.0 


0.0 


0.0 


0.0 


IBD 

Crohn's 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Colon 


0.0 


0.0 


0.1 


0.2 


0.2 


0.1 


0.2 


Lung 


0.3 


0.3 


0.2 


0.6 


0.3 


0.2 


0.2 


Thymus 


0.8 


1.5 


0.9 


1.4 | 


1.2 


1.0 j 


1.4 


Kidney 


3.0 


2.8 


3 : 6 


4.5 


3.3 


2.0 


2.6 



CNSneurodegenerationvl.O Summary: Ag047/Ag2679/Ag2728 This panel does 
not show differential expression of the NOV37 gene in Alzheimer's disease. However, this 
expression profile confirms the presence of this gene in the brain. Please see Panel 1 for 
5 discussion of utility of this gene in the central nervous system. 

Panel 1 Summary: Ag047/Ag332/Ag47b Multiple experiments with three different 
probe and primer sets produce results that are in excellent agreement, with highest expression 
of the NOV37 gene in a brain cancer cell line (CT=23-25). There is also significant expression 
in clusters of samples from melanoma, ovarian cancer, breast, lung, renal colon and brain 

10 cancer lines. Thus, expression of this gene may be associated with these forms of cancer and 
could potentially be used as a diagnostic marker for the presence of these cancers. 
Furthermore, therapeutic modulation of the expression or function of these genes may be 
useful in the treatment of ovarian, breast, lung, renal, and brain cancer and melanoma. 
In addition to significant expression in brain cancer cell lines, this gene is 

15 preferentially expressed in the brain. This expression profile suggests that this gene prodcut 

may play a role in CNS processes. This gene encodes a homolog of a member of the neurestin 
family, Ten M2, and may play a role in neuronal regeneration. Thus, agents that induce the 
expression or activity of NOV37 may have utility as neuronal regeneration drugs. Such agents 
would have utility in neurodegenerative diseases, stroke, and neuronal trauma. 

20 Among tissues with metabolic function, this gene shows consistent expression in 

thyroid, adult and fetal heart, liver and skeletal muscle. Thus, this gene product may be an 
antibody target for the treatment of metabolic and endocrine disease, including obesity and 
Types 1 and 2 diabetes. In addition, this gene is expressed at higher levels in adult liver than in 
fetal liver and may be useful for differentiating between the two sources of liver tissue. 
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References: 

Otaki JM, Firestein S. Neurestin: putative transmembrane molecule implicated in 
neuronal development. Dev Biol 1999 Aug 1;2 12(1): 165-81 

We have cloned a novel cDNA encoding a putative transmembrane protein, neurestin, 
5 from the rat olfactory bulb. Neurestin was identified based on a sequence similar to that of the 
second extracellular loops of odorant receptors in the cysteine-rich CC box located 
immediately after EGF-like motifs. Neurestin shows homology to a neuregulin gene product, 
human gamma-heregulin, a Drosophila receptor-type pair-rule gene product, Odd Oz (Odz) / 
Ten(m), and Ten(a), suggesting a possible function in synapse formation and morphogenesis. 

10 Recently, a mouse neurestin homolog has independently been cloned as DOC4 from the NIH- 
3T3 cell line. Northern blot analysis showed that neurestin is highly expressed in the brain and 
also in other tissues at much lower levels. In situ hybridization studies showed that neurestin is 
expressed in many types of neurons, including pyramidal cells in the cerebral cortex and tufted 
cells in the olfactory bulb during development. In adults, neurestin is mainly expressed in 

15 olfactory and hippocampal granule cells, which are known to be generated throughout 

adulthood. Nonetheless, in adults the expression of neurestin was experimentally induced in 
external tufted cells during regeneration of olfactory sensory neurons. These results suggest a 
role for neurestin in neuronal development and regeneration in the central nervous system. 

Panels 1.1/1.2/1.3D Summary: Ag047 The NOV37 gene is expressed in melanoma, 

20 ovarian cancer, breast, lung, renal and brain cancer lines with good concordance for 3 

independent runs. Expression of this gene might be associated with these forms of cancer and 
therapeutic modulation of this gene might be of use in the treatment of these cancers. Please 
note that results from one experiment on Panel 1.3D with the probe/primer set Ag2975 are not 
included. The amp plot suggests that there were experimental difficulties with this run. 

25 Panel 2.2 Summary: Ag2975 This gene appears to be expressed at a very low level in 

the samples used in this panel. Significant expression is only seen in lung, kidney and breast 
cancer samples. Expression of this gene might be associated with these forms of cancer and 
therapeutic modulation of this gene might be of use in the treatment of these cancers. 

Panel 2D Summary: Ag047 The expression of the NOV37 gene was assessed in 

30 multiple runs on this panel, with excellent concordance between the runs. This gene appears to 
be expressed at a higher level in gastric, bladder, and 2 samples each of lung and kidney 
cancer when compared to the normal adjacent tissue. Thus, expression of this gene might be 
associated with these forms of cancer and therapeutic modulation of this gene might be of use 
in the treatment of these cancers. 
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Panel 3D Summary: Ag047 The NOV37 gene is expressed in squamous cell 
carcinoma, glioma, small cell lung cancer cell lines. Thus, expression of this gene might be 
associated with these cancers and therapeutic modulation of this gene might be of use in the 
treatment of these cancers. 
5 Panels 4D/4.1D Summary: Ag047/Ag2679/Ag2728/Ag2975/Ag047b Multiple runs 

with different set of primers give very consistent expression data. Highest expression of the 
NOV37 transcript is found in small airway epithelium upon treatment with TNF-a and II- 1 
(CT=25). This expression is significantly up-regulated when compared to untreated tissue 
(CT=28). Moderate expression of this transcript is also found in keratinocytes and astrocytes. 

10 NOV37 encodes for a neurestin like molecule whose role in neuronal regeneration has been 
demonstrated. Therefore, the putative protein encoded by NOV37 may play an important role 
in the regeneration or repair mechanism of these tissues in inflammation. Thus, therapeutic 
modulation of the expression of this gene product may be beneficial for the treatment of 
inflammatory lung diseases such as bronchitis, chronic obstructive pulmonary disease, 

15 emphysema. Therapeutics designed against this putative protein may also be useful for in the 
CNS for reducing inflammation, including inflammation that results from multiple sclerosis or 
stroke. 

NOV38, NOV39a, and NOV39b 

Expression of gene NOV38 and variants NOV39a and NOV39b was assessed 
20 using the primer-probe set Ag3753, described in Table AHA. Results of the RTQ-PCR 
runs are shown in Tables AHB and AHC. 

Table AHA. Probe Name Ag3753 



Primers 


Sequences 


Length 


Start j SEQ ID NO: 
Position 1^ 


Forward 


5 ' -cactgcagtaattcagctggta-3 * 


22 


835 | 1135 


Probe 


TET-5 1 -agtatccagtcccgccatcccagtt-3 1 - ; 
TAMRA 


25 


! 1136 
797 J 


Reverse 


5 ' -aggcgagaccattacgtagact-3 ' i 


22 


769 J 1137 



Table AHB. General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) Ag3753, 
Run 216707728 


Tissue Name 


Rel. Exp.(%) Ag3753, 
Run 216707728 


Adipose 


0.0 


Renal ca. TK-10 


1.0 


Melanoma* 
Hs688(A).T 


0.3 


Bladder 


0.0 
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Melanoma* 
Hs688(B).T 


0.3 


Gastric ca. (liver met.) 
NCI-N87 


0.1 


Melanoma* Ml 4 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MbL-5 


0.0 


Colon ca. SW480 


0.2 


Squamous cell 
carcinoma SCC-4 


0.2 


Colon ca.* (SW480 
met) ^Vvozu 


2.6 


Testis Pool 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.1 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


A 1 

U.l 


C/Oion ca. ^oio-zto 




Ovarian ca. SK-OV- 

3 


A A 


L^oion ca. ^w-ho 


VJ.VJ 


Ovarian ca. 
OVCAR-4 


0.1 


Colon Pool 


0.0 


Ovarian ca. 
OVCAR-5 


100.0 


Small Intestine Pool 


0.0 


Ovarian ca. IGROV- 

1 


0.1 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


2.1 


Bone Marrow Pool 


0.0 


Ovary 


0.0 


Fetal Heart 


0.0 


Breast ca. JvlCr-/ 


A 1 
U. 1 


riean rOOl 


O 0 


Breast ca. MDA- 

JVlt>-/3 1 


0.0 


Lymph Node Pool 


0.0 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


0.5 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Breast Pool 


0.0 


Thymus Pool 


0.0 




0 0 

V/.v/ 


CNS cancer (glio/astro) 
U87-MG 


0.2 


Lung 


0.0 


CNS cancer (glio/astro) 
U-118-MG 


0.2 


Fetal Lung 


0.0 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.2 


CNS cancer (astro) SF- 
539 


1.1 


Lung ca. LX-1 


1.9 


CNS cancer (astro) 
SNB-75 


0.4 
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Lung ca.NCI-H 146 


0.1 


CNS cancer (glio) 

CMD 1 Q 

iM\J3- 1 y 


0.2 


Lung ca. SHP-77 


0.9 


CNS cancer (glio) SF- 


3.1 


i^ung ca. Aj4y 


U. I 


orain ^/\mygaaiaj rooi 


0 0 


Lung ca. JNCI-HjZo 


U.U 


rsrain ^cereDenum^ 


n i 

U. 1 


Lung ca. JNd-Hz3 


U.z 


tsrain (jetaij 


n n 
u.u 


Lung ca. NC1-H460 


- 0.1 


Brain (Hippocampus) 
Pool 


0.0 


Lune ca HOP-62 


0.0 


Cerebral Cortex Pool 


0.0 


Lung ca. NCI-H522 


0.2 


Brain (Substantia niera^ 
Pool 


0.0 


Liver 


1.2 


Brain (Thalamus) Pool 


0.0 


Fetal Liver 


0.9 


Brain (whole) 


0.1 


Liver ca. HepG2 


2.0 


Spinal Cord Pool 


0.0 


Kidney Pool 


0.0 


Adrenal Gland 


0.0 


Fetal Kidney 


0.0 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.1 


Pancreatic ca. CAPAN2 


0.0 


Renal ca. UO-3 1 


0.0 


Pancreas Pool 


0.0 



Table AHC. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag3753, Run 
172209242 


Tissue Name 


Rel. Exp.(%) 
Ag3753, Run 
172209242 


Secondary Th 1 act 


0.5 


HUVEC IL-lbeta 


0.5 i 


Secondary Th2 act 


2.1 


HUVEC IFN gamma 


1.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Th 1 rest 


1.2 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.2 


HUVEC IL- 11 


0.0 


Secondary Trl rest 


1.0 


Lung Microvascular EC 
none 


0.9 


Primary Th 1 act 


0.5 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.2 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th 1 rest 


0.5 


Bronchial epithelium 
TNFalpha + ILlbeta 


1.2 


Primary Th2 rest 


0.0 


Small airway epithelium 


0.0 
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none 




Primary Trl rest 


0.5 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


1.7 


Coronery artery SMC rest 


2.6 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


1.3 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


1.2 


Secondary CD8 
lymphocyte rest 


a o 
U.z 


Astrocytes TNFalpha + 
IL-lbeta 


n a 


Secondary CD8 
lymphocyte act 


3.6 


KU-812 (Basophil) rest 


0.0 


CD4 lvmohocvte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


0.5 


CCD1 106 (Keratinocytes) j 
none 


0.0 


LAK cells rest 


2.5 


CCD 1 106 (Keratinocytes) 
1 IN r alpha + IL-lbeta 


0.0 


T A 17" 1 1 TT '"I 

LAK cells IL-2 


0.0 


Liver cirrhosis 


5y.j 


LAK cells IL-2+IL-12 


1.3 


NCI-H292 none 


0.3 


LAK cells IL-2+IFN 
gamma 


1.0 


NCI-H292 IL-4 


1.1 


LAK cells IL-2+ IL-1 8 


0.5 


NCI-H292 IL-9 


0.0 


LAK cells 
PMA/ionomycin 


3.8 


NCI-H292 IL-13 


0.8 


NK Cells 1L-2 rest 


U.3 


JNCi-rizyz irJN gamma 




1 wo Way MLR 3 day 


yj.j 


nrAbL none 


u.u 


Two Way MLR 5 day 


2.6 


HP A EC TNF alpha + IL-1 
beta 


0.0 


Two Way MLR 7 day 


1.0 


Lung fibroblast none 


15.1 


PBMC rest 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


1.2 


PBMC PWM 


0.0 


Lung fibroblast IL-4 


0.8 


PBMC PHA-L 


0.5 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


12.8 


Lung fibroblast IL-13 


0.4 


Ramos (B cell) 
ionomycin 


1 no o 


Lung fibroblast IFN 
gamma 


0.8 


B lymphocytes PWM 


0.5 


Dermal fibroblast 
CCD 1070 rest 


3.7 


B lymphocytes CD40L 
and IL-4 


0.9 


Dermal fibroblast 
CCD 1070 TNF alpha 


2.3 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 IL-1 beta 


2.2 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 
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Dendritic cells none 


0.9 


Dermal fibroblast IL-4 


4.3 


Dendritic cells LPS 


6.2 


Dermal Fibroblasts rest 


0.9 


Dendritic cells anti- 
CD40 


3.0 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


0.0 


Neutrophils rest 


0.0 


Monocytes LPS 


0.5 


Colon 


0.8 


Macrophages rest 


3.7 


Lung 


0.0 


Macrophages LPS 


1.6 


Thymus 


1.1 


HUVEC none 


0.0 


Kidney 


15.3 


HUVEC starved 


0.0 







General_screening_panel_vl.4 Summary: Ag3753 Results from one experiment 
with the NOV38 gene are not included. The amp plot indicates that there were experimental 
difficulties with this run. 

5 Panel 4.1D Summary: Ag3753 Highest expression of the NOV38 transcript is found 

in Ramos B cell line activated with PMA and ionomycin (CT=29). However, expression is not 
seen in primary activated B cells. Therefore, epxression of this gene could potentially be used 
as a marker for activated B lymphoma. This gene is also expressed at lower levels in liver 
cirrhosis, lung fibroblasts and kidney. This transcript encodes for a molecule that belongs to 
10 the activin family, a member of the TGF beta superfamily. These factors influence growth and 
differentiation and are present in many cells and tissues. Therefore, therapeutics using the 
protein encoded by NOV38 could be important for the normal homeostasis of these tissues. 

NOV40 

Expression of gene NOV40 was assessed using the primer-probe set Ag2907, 
15 described in Table AIA. Results of the RTQ-PCR runs are shown in Tables AIB, AIC, AID 
and AIE. 

Table AIA. Probe Name Ag2907 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ggctcattcgaaactactggta-3 ' j 


22 


773 


1138 


Probe 


TET-5 ' -tggaatttcctcgcccactcttacct-3 ' - 
TAMRA \ 


26 


797 


1139 


Reverse 


5 * -ggttgacaggtttgcagtagag-3 1 


22 


844 


1140 



Table AIB. Panel 1.3D 



Tissue Name 


Rd. Exp.(%) 
Ag2907, Run 


Rel. Exp.(%) 
Ag2907, Run 


Tissue Name 


Rel. Exp.(%) 
Ag2907, Run 


Rel. Exp.(%) 
Ag2907, Run 
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157283423 


165701505 




157283423 


165701505 


Liver 

adenocarcinoma 


0 0 

u.u 


U.U 




0 0 


0 0 


I allCI tdi 


u.u 




Renal ca. 786- 
0 


0 0 
u.u 


KJ.YJ 


Pancreatic ca. 
CAPAN 2 


U.U 


U.U 


Renal ca. 
A498 


u.u 


0 0 


Adrenal gland 


0.0 


0.0 


Renal ca. RXF 

393 


0.0 


0.0 


Thyroid 


0.0 


0.0 


Renal ca. 
ACHN 


0.0 


0.0 


Salivary gland 


0.0 


0.0 


Renal ca. UO- 
31 


0.0 


0.0 


Pituitary gland 


100.0 


100.0 


Renal ca. TK- 
10 


2.0 


28.3 


Brain (fetal) 


0.0 


0.0 


Liver 


0.0 


0.0 


Brain (whole) 


0.0 


14.3 


Liver (fetal) 


0.0 


0.0 


Brain (amygdala) 


0.0 


0.0 


Liver ca. 

(hepatoblast) 

HepG2 


0.0 


0.0 


Brain (cerebellum) 


0.0 


11.9 


Lung 


0.0 


0.0 


Brain 

(hippocampus) 


1.9 


0.0 


Lung (fetal) 


0.0 


0.0 


Brain (substantia 
nigra) 


0.0 


0.0 


Lung ca. 
(small cell) 
LX-1 


0.0 


0.0 


Brain (thalamus) 


0.0 


0.0 


Lung ca. 
(small cell) 
NCI-H69 


1.1 


0.0 


Cerebral Cortex 


2.0 


0.0 


Lung ca. 
(s.cell var.) 
SHP-77 


0.0 


0.0 


Spinal cord 


1.3 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


0.0 


gho/astro U87-MG 


0.0 


12.2 


Lung ca. (non- 
sm. cell) A549 


0.0 


0.0 


glio/astro U-l 18- 
MG 


0.0 


0.0 


Lung ca. (non- 
s.cell) NCI- 

H23 ! 


0.0 


0.0 


astrocytoma 
SW1783 


0.0 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


0.0 


0.0 


neuro*; met SK-N- 
AS 


1.0 


0.0 


Lung ca. (non- 
s.cl) NCI- 
H522 


0.0 


0.0 


astrocytoma SF- 
539 


1.4 


0.0 


Lung ca. 
(squam.) SW 


0.9 


0.0 
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900 1 




astrocytoma SNB- 
75 * 


1.0 


jLung ca. 
0.0 (squam.) NC1- 
JH596 


0.0 


0.0 


glioma SNB-19 


1.8 


0 jMammary 
jgland 


0.0 


0.0 


glioma U251 


0.8 


iBreast ca.* 
0 0 1 

j(pl.ef) MCF-7 


1.4 


2.5 


glioma SF-295 


1.0 


jBreast ca.* 
0.0 j(pl.ef) MDA- ! 
|MB-231 


2.6 


0.0 


Heart (fetal) 


0.0 


jBreast ca.* 
0 0 j(pl-ef) T47D 


u.u 


0.0 


Heart 


0.0 


ft n jBreast ca. BT- j 
UU J549 




0.0 


Skeletal muscle 
(fetal) 


1.0 


n n jBreast ca. 
U jMDA-N 


U.U 


0.0 


Skeletal muscle 


0.0 


0.0 Jovary 


0.0 


0.0 


oone marrow 


u.u 


_ _ jOvarian ca. 
joVCAR-3 


2.9 


0.0 


Thymus 


0.0 


n _ jOvarian ca. 
UU ]OVCAR-4 


0.0 


0.0 


opieen 




0.0 


Ovarian ca. 
OVCAR-5 


3.U 


10.9 


Lymph node 


0.0 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


0.0 




0 0 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach 


0.0 


0.0 


Ovarian ca.* 
(ascites) SK- 
OV-3 


0.0 


0.0 


Small intestine 


0.0 


0.0 


Uterus 


0.0 


0.0 


Colon ca. SW480 


0.0 


0.0 ■ 


Placenta 


0.9 


0.0 


Colon ca.* 

SW620(SW480 

met) 


0.0 


0.0 


Prostate 


0.0 


0.0 


Colon ca. HT29 


1.0 


0.0 


Prostate ca.* 
(bone met)PC- 

3 


1.3 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Testis 


16.6 


9.5 


Colon ca. CaCo-2 


1.0 


0.0 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(ODQ3866) 


0.0 


11.0 


Melanoma* 
(met) 

Hs688(B).T 


0.0 


0.0 
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Colon ca. HCC- 
2998 


Z..O 




Melanoma 
UACC-62 


0 n 

l/.U 


O ft 


Gastric ca.* (liver 
met) NCI-N87 


1.2 


13.3 


Melanoma 
M14 


0.0 


0.0 


Bladder 


0.0 


6.7 


Melanoma 
LOX IMVI 


0.0 


0.0 


Trachea 


0.0 


0.0 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Kidney 


0.0 


0.0 


Adipose 


1.5 


0.0 



Table AIC. Panel 2D 



Tissue Name 


ReL Exp.(%) 
Ag2907, Run 
157284121 


Tissue Name 


Rel. Exp.(%) 
Ag2907, Run 
157284121 


Normal Colon 


4.5 


Kidney Margin 
8120608 


0.0 


CC Well to Mod Diff 
(OD03866) 


2.1 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


2.3 


Kidney Margin 
8120614 


0.0 


CC Gr.2 rectosigmoid 
(OD03868) 


1.7 


Kidney Cancer 
9010320 


0.0 


CC Margin (OD03868) 


0.0 


Kidney Margin 
9010321 


4.8 


CC Mod Diff (ODO3920) 


0.0 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 06401 1 


1.5 


CC Gr.2 ascend colon 
(OD03921) 


0.0 


Normal Thyroid 


0.0 


CC Margin (OD03921) 


6.8 


Thyroid Cancer 
064010 


0.0 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


0.0 


Thyroid Cancer 
A302152 


0.0 


Liver Margin (ODO4309) 


0.0 


Thyroid Margin 
A302153 


15.2 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


1.7 


Lung Margin (OD0445 1 - 
02) 


0.0 


Breast Cancer 
(OD04566) 


0.0 


Normal Prostate 6546-1 


0.0 


Breast Cancer 
(OD04590-01) 


0.0 


Prostate Cancer 
(OD04410) 


0.0 


Breast Cancer Mets 
(OD04590-03) 


2.2 


Prostate Margin 


0.0 


Breast Cancer 


2.8 
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(OD04410) 




Metastasis 




Prostate Cancer 
(UDU4 /zU-Ul ) 


100.0 


Breast Cancer 064006 


0.0 


Prostate Margin 
(OL)04 /zU-Uz) 


0.0 


Breast Cancer 1024 


0.0 


Normal Lung 061010 


7.2 


9100266 


2.4 


Lune Met to Muscle 
(OD04286) 


0.6 


Breast Margin 
9100265 


0.0 


Muscle Margin 
(OD04286) 


0.0 


Breast Cancer 
A209073 


0 n 


Lung Malignant Cancer 
(OD03126) 


3.4 


Breast Margin 
A2090734 


VJ.U 


Lung Margin (OD03 1 26) 


0.0 


Normal Liver 


0.0 


Lung Cancer (OD04404) 


0.0 


Liver Cancer 064003 


3.5 


Lung Margin (OD04404) 


4.1 


Liver Cancer 1025 


0.0 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


0.0 


Lung Margin (OD04565) J 0.0 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237- 
01) 


0.0 


Liver Tissue 6004-N 


0.5 


Lung Margin (OD04237- 
02) 


0.0 


Liver Cancer 6005-T 


0.0 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Margin (ODO4310) 


0.0 


Normal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


0.0 


Lung Margin (OD0432 1 ) 


0.0 


Bladder Cancer 
A302173 


22.1 


Normal Kidney 


0.0 


Bladder Cancer 
(OD047 18-01) 


2.4 


Kidney Ca, Nuclear grade 
2 (OD04338) 


2.0 


Bladder Normal 
Adjacent (OD04718- 


0.0 


Kidney Margin 
(OD04338) 


0.0 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


2.0 


Ovarian Cancer 
064008 


0.0 


Kidney Margin 
(OD04339) 


0.0 


Ovarian Cancer 
(OD04768-07) 


14.4 


Kidney Ca, Clear cell 
type (OD04340) 


0.0 


Ovary Margin 
(OD04768-08) 


0.0 


Kidney Margin 
(OD04340) 


2.4 


Normal Stomach 


0.0 
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Kidney Ca, Nuclear grade 
3 (OD04348) 


fGastric Cancer 
00 ]9060358 


0.0 


Kidney Margin 
(OD04348) 


0.0 


Stomach Margin 
9060359 


0.0 


Kidney Cancer 
(OD04622-01) 


0.0 


Gastric Cancer 
9060395 


0.0 


Kidney Margin 
(OD04622-03) 


0.0 


Stomach Margin 
9060394 


0.0 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


0.0 


Kidney Margin 
(OD04450-03) 


0.0 


Stomach Margin 
9060396 


0.0 


Kidney Cancer 8120607 


2.1 


Gastric Cancer 
064005 


2.9 



Table AID. Panel 3D 



Tissue Name 


ReL Exp.(%) 
Ag2907, Run 
164633936 


Tissue Name 


Rel. Exp.(%) 
Ag2907, Run 
164633936 


Daoy- Medulloblastoma 


0.0 


Ca oKi- cervical epiaermoiu 
carcinoma (metastasis) 


49.7 


TE671- Medulloblastoma 


0.0 


ES-2- Ovarian clear cell 
carcinoma 


0.0 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


15.7 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498- CNS 


0.0 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.0 


SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


0.0 


Daudi- Burkitt's lymphoma 


0.0 


T98G- Glioblastoma 


0.0 


U266- B-cell plasmacytoma 


0.0 


SK-N-SH- 

Neuroblastoma 

(metastasis) 


0.0 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.0 


RL- non-Hodgkin's B-cell 
lymphoma 


49.7 


Cerebellum 


0.0 


JMl- pre-B-cell lymphoma 


0.0 


Cerebellum 


0.0 


Jurkat- T cell leukemia 


0.0 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


94.0 


TF-l- Erythroleukemia 


0.0 


DMS-114- Small cell 
lung cancer 


0.0 


HUT 78- T-cell lymphoma 


0.0 
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DMS-79- Small cell lung 
cancer 


0.0 


U937- Histiocytic lymphoma 0.0 


NCI-H146- Small cell 
lung cancer 


0.0 


KU-8 1 2- Myelogenous I ^ A 
leukemia j 


NCI-H526- Small cell 
lung cancer 


0.0 


769-P- Clear cell renal 
carcinoma 


T7 o 

2. 1 \Z 


NC1-N417- Small cell 
lung cancer 


0.0 


Caki-2- Clear cell renal 
carcinoma 


A A 

0.0 


NCI-H82- Small cell 
lung cancer 


u.u 


S W 839- Clear cell renal 
carcinoma 


A A 

0.0 


NCI-H157- Squamous 
cell lung cancer 
(metastasis) 


0.0 


G401- Wilms* tumor 


0.0 


NCI-HI 155- Large cell 
lung cancer 


0.0 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


0.0 


NCI-H1299- Large cell 

Inner panrpr 


16.4 


C APAN- 1 - Pancreatic 
adenocarcinoma (liver 
metastasis) 


0.0 


NCI-H727- Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


0.0 


NCI-UMC-ll-Lung 
carcinoid 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


0.0 


LX-1- Small cell lung 
cancer 


0.0 


HPAC- Pancreatic 
adenocarcinoma 


0.0 


Colo-205- Colon cancer 


0.0 


MIA PaCa-2- Pancreatic 
carcinoma 


0.0 


KM 12- Colon cancer 


0.0 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


0.0 


KM20L2- Colon cancer 


0.0 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


0.0 


NCI-H716- Colon cancer 


0.0 


T24- Bladder carcinma 
(transitional cell) 


0.0 


SW-48- Colon 
adenocarcinoma 


0.0 


5637- Bladder carcinoma 


0.0 


SW1116- Colon 
adenocarcinoma 


0.0 


HT-1 197- Bladder carcinoma 


17.6 


LS 174T- Colon 
adenocarcinoma 


0.0 


UM-UC-3- Bladder carcinma 
(transitional cell) 


0.0 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


0.0 


SW-480- Colon 
adenocarcinoma 


0.0 


HT-1 080- Fibrosarcoma 


0.0 


NCI-SNU-5- Gastric 
carcinoma 


0.0 


MG-63- Osteosarcoma 


0.0 


KATO III- Gastric 
carcinoma 


0.0 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


0.0 
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NCI-SNU-16- Gastric 
carcinoma 


0.0 


SJRH30- Rhabdomyosarcoma! 
(met to bone marrow) 


0.0 


NCI-SNU-1- Gastric 
carcinoma 


0.0 


A431- Epidermoid carcinoma 


0.0 


RF-1- Gastric 
adenocarcinoma 


0.0 


WM266-4- Melanoma 


0.0 


RF-48- Gastric 
adenocarcinoma 


0.0 


DU 145- Prostate carcinoma i 
(brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


0.0 


MDA-MB-468- Breast 
adenocarcinoma 


37.6 


NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 


SCC-1 5- Squamous cell 
carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous cell 
carcinoma of tongue 


100.0 



Table AIE. Panel 4D 



Tissue Name 


ReL Exp.(%) 
Ag2907, Run 
157284733 


Tissue Name 


Rel. Exp.(%) 
Ag2907, Run 
157284733 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


12.3 


Lung Microvascular EC 
TNFa!pha + IL-lbeta 


10.4 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


11.2 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


11.7 


Bronchial epithelium 
TNFalpha + ILlbeta 


12.9 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


24.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


100.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 
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CD45RO CD4 
lymphocyte act 


6.1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0,0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


12.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


4.5 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


zry Inl/Inz/Irl anti- 
CD95CHH 


0.0 


CCD1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


CCD1 106 (Keratinocytes) 
TNFalnha + TT -Iheta 

nil uiuua ■ i i—i i uviu 


0.0 




1 ^ 4 


T ivpr r*irrhoQi<; 


1 


T AK relk IT -7+TT -1? 


0 0 


T urtnc J ri r"ir*\/ 

T_/U|JliJ IWUllC'jr 


0 0 


LAK cells 1L-2+IFN 

gcUIlIIld 


0.0 


NCI-H292 none 


33.0 


T AK relk TT -?+ TT -T 8 


0 0 






LAK cells 

T-^IV/T A / J on nm \/r* i n 
I 1 V 1 / \/ lUIHJUiy v^lll 


0.0 


NCI-H292 IL-9 


23.3 


NK Oik TT -? rest 


0 0 


NCT-H292 TT -1 3 

1 > 1 T TZ, 7jL 1 1 . 1 


21.2 




0 0 




1 1 6 

1 1 - V/ 


Two Way MLR 5 day 


0.0 


HPAEC none 


13.3 


Two Way MLR 7 day 


0.0 


HPALC 1 Nt alpha + 1L-1 
beta 


0.0 


rBMC rest 


0.0 


Lung fibroblast none 


U.U 


PBMC PWM 


11.7 


Lung fibroblast TNF alpha 

i TT 1 U , w _ 

+ IL-T beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL- 1 3 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


13.1 
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Monocytes rest 


0.0 


IBD Crohn's 


6.9 


Monocytes LPS 


26.1 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


0.0 


Macrophages LPS 


0.0 


Thymus 


16.0 


HUVEC none 


0.0 


Kidney 


13.1 


HUVEC starved 


0.0 







Panel 1.3D Summary: Ag2907 Results from two experiments with the same 
probe/primer set are in good agreement. Expression of the NOV40 gene is highest in a sample 
derived from pituitary tissue with little to no expression detected in any other tissue. Thus, 
5 expression of this gene could be used to distinguish pituitary gland from the other samples on 
this panel. 

The protein encoded for by this gene is most homologous to a glucuronosyltransferase, 
normally found in liver. UDP glycosyltransferases (UGT) are a superfamily of enzymes that 
catalyze the addition of the glycosyl group from a UTP-sugar to a small hydrophobic 

10 molecule. Glucuronosyltransferases are membrane-bound microsomal enzymes that catalyze 
the transfer of glucuronic acid to a wide variety of exogenous and endogenous lipophilic 
substrates. These enzymes are of major importance in the detoxification and subsequent 
elimination of xenobiotics such as drugs and carcinogens. The pituitary plays a major role in 
the physiology of many different systems in the body. Therefore, this gene may play an 

1 5 essential role in maintaining proper function of the pituitary gland and many of its secreted 

peptides. Furthermore, therapeutic modulation of the activity of this gene or its protein product 
using small molecule drugs may be useful for the treatment of diabetes and diabetes as well as 
growth, reproductive, and endocrine disorders. 

Panel 2D Summary: Ag2907 Expression of the NOV40 gene is highest and almost 

20 exclusive to a sample derived from a prostate cancer (CT = 31 .7). Thus, the expression of this 
gene could be used to distinguish prostate cancer from the other samples in the panel. 
Moreover, therapeutic modulation of the activity of this gene or its protein product, through 
the use of small molecule drugs, protein therapeutics or antibodies, might be of benefit in the 
treatment of prostate cancer. 

25 Panel 3D Summary: Ag2907 Expression of the NOV40 gene is highest in a sample 

derived from a squamous cell carcinoma cell line (CT = 33.8). Thus, the expression of this 
gene could be used to distinguish this sample from the other samples in the panel. 
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Panel 4D Summary: Ag2907 Expression of the NOV40 gene is detected at a very low 
level in small airway epithelium treated with the inflammatory cytokines TNF-a and IL-lb 
(CT = 34.2). Thus, expression of this gene may be a marker of inflammation in the lung. 

NOV41a and NOV41b 

5 Expression of gene NOV41a and variant NOV41 b was assessed using the primer-probe 

sets Agl361 and Ag2953, described in Tables AJA and AJB. Results of the RTQ-PCR runs 
are shown in Tables AJC, AJD and AJE. 



Table AJA. Probe Name Agl361 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5' -ctggtcaggtacctggatgtta-3 ' 


22 


1438 


1141 


Probe 


TET-5 * -tccatcaatgaagagcttcatattcg-3 ■ - 
TAMRA 


26 


1480 


1142 


Reverse 


5 1 -cagcctttaagtgatccatcag-3 1 


22 


1507 


1143 


Table AJB. Probe Name Ag2953 


Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ctggtcaggtacctggatgtta-3 ' 


22 


1438 


1144 


Probe 


TET-5 ' -tccatcaatgaagagcttcatattcg-3 ' - 
TAMRA 


26 


1480 


1145 


Reversejs ' - cagcctttaagtgat ccatcag- 3 * 


22 


1507 


1146 



Table AJC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Agl361, 
Run 152953143 


Tissue Name 


Rel. Exp.(%) Agl361, 
Run 152953143 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


2.1 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAP AN 

2 


0.0 


Renal ca. A498 


0.0 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 


Salivary gland 


1.6 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.1 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole). 


0.2 


Liver (fetal) 


0.0 


Brain (amygdala) 


0.7 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain (hippocampus) 


1.0 


Lung (fetal) 


0.0 
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Brain (substantia nigra) 


0.0 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Soinal cord 


0.2 


Lung ca. (large 
cell)NCI-H460 


0.0 


elio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 


elio/astro U-l 1 8-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.1 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.0 


Mammary gland 


0.0 


glioma U251 


0.0 


Breast ca.* (pl.et) 
MCF-7 


0.2 


glioma SF-295 


0.0 


Breast ca. (pl.et) 
MDA-MB-231 


0.0 


Heart (fetal) 


0.0 


Breast ca. (pl.et) 
T47D 


0.0 


T T a. 

Heart 


0.0 


Breast ca. B 1 


u.u 


Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N 

„ . 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 

3 


0.5 


Thymus 


0.4 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 

J 


0.7 


Lymph node 


0.0 


wvanan ca. v v^/vtv- 
8 


0.0 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


100.0 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


0.0 


Uterus 


0.0 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 


0.0 


Prostate 


0.0 
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bW620(SW450 met) 








Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)rC-3 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


2.1 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue^wLJ^jooo ) 


0.4 


Melanoma* (met) 
rlSOoo^D ). 1 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.4 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


12.8 


Adipose 


0.0 



Table AJD. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Agl361, Run 
152953321 


Tissue Name 


Rel. Exp.(%) 
Agl361, Run 
152953321 


Normal Colon 


0.2 


Kidney Margin 
8120608 


2T 


CC Well to Mod Diff 
(OD03866) 


0.0 


Kidney Cancer 
8120613 


0.5 


CC Margin (OD03866) 


0.0 


Kidney Margin 
8120614 


2.6 


CC Gr.2 rectosigmoid 
(OD03868) 


1.3 


Kidney Cancer 
9010320 


0.0 


CC Margin (OD03868) 


0.0 


Kidney Margin 
9010321 


2.9 


CC Mod Diff (ODO3920) 


0.6 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 064011 


0.2 


CC Gr.2 ascend colon 
(OD0392I) 


0.2 


Normal Thyroid 


0.0 


CC Margin (ODQ3921) 


0.0 


Thyroid Cancer 


0.0 
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064010 




CC from Partial 
Hepatectomy (ODO4309) 
Mets 


0.0 


i nyroiu v>ancer 
A302152 


0.0 


Liver Margin (ODO4309)! 


0.0 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


0.0 , 


Normal Breast 


0.0 


Lung Margin (OD04451- 
02) 


0.0 


Breast Cancer 
(OD04566) 


0.0 


Normal rrostate oj4o-i 


a a 
U.U 


Breast Cancer 
(OD04590-01) 


U.U 


Prostate Cancer 
(OD04410) 


A A 

U.U 


Breast Cancer Mets 
(OD04590-03) 


A A 
U.U 


Prostate Margin 
(OD04410) 


0.0 


Breast Cancer 

Metastasis 

(OD04655-05) 


0.0 


Prostate Cancer 
(OD04720-01) 


0.0 


Breast Cancer 064006 


0.0 


Prostate Margin 
(OD04720-02) 


0.1 


Breast Cancer 1024 


0.3 


Normal Lung 061010 


0.3 


Breast Cancer 
9100266 


0.0 


Lung Met to Muscle 
(OD04286) 


0.0 


Breast Margin 
9100265 


0.0 


Muscle Margin 
(OD04286) 


0.0 


Breast Cancer 
A209073 


0.1 


Lung Malignant Cancer 
(OD03126) 


0.0 


Breast Margin 
A2090734 


0.1 


Lung Margin (OD03126) 


0.0 


Normal Liver 


0.0 


Lung Cancer (OD04404) 


0.1 


Liver Cancer 064003 


0.0 
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Lung Margin (OD04404) 


0.1 


Liver Cancer 1025 


0.0 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1 026 


0.2 


Lung Margin (OD04565) 


0.0 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237- 
01) 


0.0 


Liver Tissue 6004-N 


0.0 


Lung Margin (OD04237- 
02) 


0.0 


Liver Cancer 6005 -T 


0.0 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Margin (ODO4310) 


0.0 


Normal Bladder 


0.4 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


0.1 


i^ung Margin {\jl)\J43Zi ) 


u.u 


Bladder Cancer 
A302173 


-U. 1 


Normal Kidney 


Z3.3 


Bladder Cancer 
(OD047 18-01) 


u.u 


1/ / i r% \l i ir* loot 4 rtt*ono 

iviuney ^a, iNuciear graoe 
2 (OD04338) 


0.7 


Bladder Normal 
Adjacent (OD04718- 
03) 


0.0 


Kidney Margin 
(OD04338) 


9.2 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2(OD04339) 


0.0 


Ovarian Cancer 
064008 


0.2 


Kidney Margin 
(OD04339) 


28.3 


Ovarian Cancer 
(OD04768-07) 


0.0 


Kidney Ca, Clear cell 
type (OD04340) 


0.0 


Ovary Margin 
(OD04768-08) 


0.0 


Kidney Margin 
(OD04340) 


45.4 


Normal Stomach 


100.0 


Kidney Ca, Nuclear grade 
3 (OD04348) 


0.0 


Gastric Cancer 
9060358 


5.3 
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Kidney Margin 
(OD04348) 


18.6 


Stomach Margin 
9060359 


78.5 


Kidney Cancer 
(OD04622-01) 


0.0 


Gastric Cancer 
9060395 


0.3 


Kidney Margin 
(OD04622-03) 


3.4 


Stomach Margin 
9060394 


31.6 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


0.2 


Kidney Margin 
(OD04450-03) 


25.7 


Stomach Margin 
9060396 


29.5 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


4.5 



Table AJE. Panel 4D 



Tissue Name 


Rel. 

Exp.(%) 
Agl361, 
Run 
152953376 


Rel. 
Exp.(%) 
Ag2953, 
Run 
164306345 


Tissue Name 


Rel. 
Exp.(%) 
Agl361, 
Run 
152953376 


Rel. 

Exp.(%) 
Ag2953, , 
Run 
164306345 


Secondary Thl act 


0.0 


0.0 


HUVEC IL-lbeta 


0.0 


0.0 


Secondary Th2 act 


0.0 


0.0 


HUVEC I FN 
gamma 


0.0 


0.0 


Secondary Trl act 


0.0 


0.0 


HUVEC TNF 
alpha + IFN 
gamma 


0.0 


. 0.0 


Secondary Th 1 rest 


0.0 


0.0 


HUVEC TNF 
alpha + IL4 


0.0 


0.0 


Secondary Th2 rest 


0.0 


0.0 


HUVEC IL-11 


0.0 


0.0 


Secondary Trl rest 


0.0 


0.0 


Lung 

Microvascular EC 
none 


0.0 


0.0 


Primary Thl act 


0.0 


0.0 


Lung 

Microvascular EC 
TNFalpha + 1L- 
lbeta 


0.0 


0.0 


Primary Th2 act 


0.0 


0.0 


Microvascular 
Dermal EC none 


0.0 


0.0 


Primary Trl act 


0.0 


0.0 


Microsvasular 
Dermal EC 
TNFalpha + IL- 


0.0 


0.0 
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lbeta 






Primary Thl rest 


0.0 


0.0 


Bronchial 
epithelium 
TNFalpha + 
TL lbeta 


0.0 


0.0 


Primary Th2 rest 


0.0 


0.0 


Small airway 
epithelium none 


0.0 


0.0 


Primary Trl rest 


0.0 


A A 

0.0 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


A A 
U.U 


A A 


CD45RA CD4 
lymphocyte act 


0.0 


A A 

0.0 


Coronery artery 
SMC rest 


A A 
0.0 


A A 
0.0 


CD4jK(J CD4 
lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC TNFalpha + 1 
IL- lbeta J 


0.0 


0.0 


CD8 lymphocyte 
act 


A A 

0.0 


A A 
0.0 


Astrocytes rest 


A A 

0.0 


A A 

0.0 


Secondary CD8 
lymphocyte rest 


0.0 


0.0 


Astrocytes 
TNFalpha + IL- 
lbeta 


0.1 


0.2 


Secondary CD8 
lymphocyte act 


0.0 


0.0 


KU-812 
(Basophil) rest 


0.0 


0.0 


CD4 lymphocyte 
none 


0.0 


0.0 


KU-812 

(Basophil) 

PMA/ionomycin 


0.0 


0.0 


2ry 

Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


0.0 


CCD1106 

(Keratinocytes) 

none 


0.0 


0.0 


LAK cells rest 


0.0 


0.0 


CCD 1106 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


LAK cells IL-2 


0.0 


0.0 


Liver cirrhosis 


0.0 


0.0 


LAK cells IL-2+IL- 
12 


0.0 


0.0 


Lupus kidney 


1.2 


0.8 


LAK cells IL- 
2+IFN gamma 


0.0 


0.0 


NCI-H292 none 


0.2 


0.0 


LAK cells IL-2+ 
IL-18 


0.0 


0.0 


NCI-H292 IL-4 


0.0 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


0.0 


NCI-H292 IL-9 


0.2 


0.0 


NK Cells IL-2 rest i 


0.0 


0.0 


NCI-H292 IL-13 


0.0 


0.0 


Two Way MLR 3 
day 


0.0 


0.0 


NCI-H292 I FN 
gamma 


0.0 


0.0 


Two Way MLR 5 
day 


0.0 


0.0 


HPAEC none 


0.0 


0.0 
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Two Way MLR 7 
day 


0.0 


0.0 


HPAEC TNF 
alpha + IL-1 beta \ 


0.0 


0.0 


PBMC rest 


0.0 


0.0 


Lung fibroblast 
none 


0.0 


0.0 


PBMC PWM 


0.0 


0.2 


Lung fibroblast 
TNF alpha + IL-1 
beta 


0.0 


0.0 


PBMC PHA-L 


0.0 


0.0 


Lung fibroblast 
IL-4 


0.0 


0.0 


Ramos (B cell) 
none 


0.0 


0.0 


Lung fibroblast 
IL-9 


0.0 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


0.0 


Lung fibroblast 
IL-1 3 


0.0 


0.0 


B lymphocytes 
PWM 


0.0 


0.0 


Lung fibroblast 
IFN gamma 


0.0 


0.0 


B lymphocytes 
CD40L and IL-4 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


0.0 


EOL-1 dbcAMP 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


0.0 


0.0 


FOT -1 dbc AMP 
PMA/ionomycin 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 
beta 


0.0 


0.0 


Dendritic cells 
none 


0.0 


0.0 


Dermal fibroblast 

if in gaiiiiiia 


0.0 


0.0 


Dendritic cells LPS 


0.0 


0.0 


j_^ernid.i iiuruuid-isi 
IL-4 


0.0 


0.0 


Dendritic cells anti- 
CD40 


0.0 


0.0 


IBD Colitis 2 


0.0 


0.0 


Monocytes rest 


0.0 


0.0 


IBD Crohn's 


0.0 


0.0 


Monocytes LPS 


0.0 


0.0 


Colon 


0.0 


0.0 


Macrophages rest 


0.0 


0.0 


Lung 


0.0 


0.0 


Macrophages LPS 


0.0 


0.0 


Thymus 


100.0 


100.0 


HUVEC none 


6.6 


0.0 


Kidney 


0.0 


0.2 


HUVEC starved 


0.0 


0.0 







Panel 1.3D Summary: Agl361 Expression of the NOV41a gene is restricted to 
stomach (CT value = 29.9) and kidney (CT value = 32.9) tissue. This observation is consistent 
with the identification of this gene as a sodium/hydrogen ion exchanger because the function 
5 of both of these tissues requires sodium/hydrogen ion exchange activity. The inhibition of the 
NOV41 A protein activity, through the use of antibodies or small molecule drugs, might be of 
use in the treatment of kidney or gastric diseases related to the function of a sodium/hydrogen 
ion exchanger. For example, the activity of this gene may be related to over-production of 
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stomach acid leading to acid reflux disease or peptic ulcer. Results from a second experiment 
with the probe and primer set Ag2953 are not included. The amp plot indicates that there is a 
potential problem in one of the sample wells. 

Panel 2D Summary: Agl361 Consistent with what was observed in Panel 1.3D, 
5 expression of the NOV41a gene in panel 2D is restricted to both normal kidney and stomach 
adjacent to tumor tissue. Interestingly, expression of the gene is absent in 4/4 gastric tumors 
and 10/10 kidney cancers when compared to the normal adjacent tissue controls. Thus, the 
expression of this gene appears to be a consistent trait of the non-neoplastic kidney and 
stomach. Therefore the absence of expression of this gene could be used as a diagnostic 
10 marker for kidney or gastric cancer. In addition, the replacement of this gene, potentially 

through the direct application of the protein or using gene replacement therapy, could be of 
use in the treatment of kidney or gastric cancer. Na+/H-f exchangers have previously been 
implicated in modulation of cellular adhesion and tumor invasion (Refs. 1 and 2). 



binding of the Na— H exchanger NHE1 to ERM proteins regulates the cortical cytoskeleton 
and cell shape independently of H(+) translocation. Mol. Cell 6: 1425-1436. 

The association of actin filaments with the plasma membrane maintains cell shape and 
adhesion. Here, we show that the plasma membrane ion exchanger NHE1 acts as an anchor for 

20 actin filaments to control the integrity of the cortical cytoskeleton. This occurs through a 

previously unrecognized structural link between NHE1 and the actin binding proteins ezrin, 
radixin, and moesin (ERM). NHE1 and ERM proteins associate directly and colocalize in 
lamellipodia. Fibroblasts expressing NHE1 with mutations that disrupt ERM binding, but not 
ion translocation, have impaired organization of focal adhesions and actin stress fibers, and an 

25 irregular cell shape. We propose a structural role for NHE1 in regulating the cortical 
cytoskeleton that is independent of its function as an ion exchanger. 
PMID: 11163215 

2. Reshkin S.J., Bellizzi A., Albarani V., Guerra L., Tommasino M., Paradiso A., 
Casavola V. (2000) Phosphoinositide 3-kinase is involved in the tumor-specific activation of 
30 human breast cancer cell Na(+)/H(+) exchange, motility, and invasion induced by serum 
deprivation. J. Biol. Chem. 275: 5361-5369. 

Whereas the tumor acidic extracellular pH plays a crucial role in the invasive process, 
the mechanism(s) behind this acidification, especially in low nutrient conditions, are unclear. 
The regulation of the Na(+)/H(+) exchanger (NHE) and invasion by serum deprivation were 



References: 



15 



1. Denker S.P., Huang D.C., Orlowski J., Furthmayr H., Barber D.L. (2000) Direct 
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studied in a series of breast epithelial cell lines representing progression from non-tumor to 
highly metastatic cells. Whereas serum deprivation reduced lactate production in all three cells 
lines, it inhibited NHE activity in the non-tumor cells and stimulated it in the tumor cells with 
a larger stimulation in the metastatic cells. The stimulation of NHE in the tumor cell lines was 
5 the result of an increased affinity of the internal H(+) regulatory site of the NHE without 

changes in sodium kinetics or expression. Serum deprivation conferred increased cell motility 
and invasive ability that were abrogated by specific inhibition of the NHE. Inhibition of 
phosphoinositide 3-kinase by overexpression of a dominant-negative mutant or wortmannin 
incubation inhibited NHE activity and invasion in serum replete conditions while potentiating 
10 the serum deprivation-dependent activation of the NHE and invasion. These results indicate 

that the up-regulation of the NHE by a phosphoinositide 3-kinase-dependent mechanism plays 
an essential role in increased tumor cell invasion induced by serum deprivation. 
PMID: 10681510 

Panel 4D Summary: Agl361/Ag2953 Two experiments with the same probe and 
15 primer sets produce results that are in excellent agreement. The NOV4 la transcript is 
expressed in the thymus in Panel 4D (CT = 28.6), but not in Panel 1.3D (CT = 38). The 
NOV41 A gene encodes a putative ion exchange molecule and may therefore be important in 
signal transduction in the thymus. Antibodies against the protein encoded for by the NOV41A 
gene may be used to identify thymic tissue. Additionally, small molecule or antibody 
20 therapeutics designed against this putative ion exchanger could disrupt T cell development in 
the thymus and serve an immunosuppresive function that could be important for tissue 
transplant. 

NOV42a and NOV42b 

Expression of gene NOV42a and variant NOV42b was assessed using the primer-probe 
25 set Ag3002, described in Table AKA. Results of the RTQ-PCR runs are shown in Tables AKB 
and AKC. 



Table AKA. Probe Name Ag3002 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward; 


5' -agaccctccatgtggtcatt-3 ' 


20 


1188 


1147 


Probe ' 


TET- 5 1 - tcacaggaacagctacaaagaaccca- 3 ' - 
TAMRA 3 


26 


1211 


1148 


Reverse 


5 ' -caggaccatctggagaagct-3 ' 


20 


1245 


1149 



1105 




Table AKB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3002, 

KUn 10/7UJ/U4 


Tissue Name 


Keh .Lxp.( u /o) AgJUUZ, 
Run 167905704 


Liver adenocarcinoma 




Kidnev ^fetaH • 


0.0 


Pancreas 


o.o 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAP AN 
2 


0.0 


Renal ca. A498 


0.0 


Adrenal gland 




Renal ra RYF 191 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 


Salivary gland 


3.0 


Renal ca. UO-3 1 


u.u 


Pituitary gland 


0.0 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


4.3 


Brain (whole) 


0.0 


Liver (fetal) 


0.0 


Brain (amygdala) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


I4.l 


Lung 


0.0 


Brain (hippocampus) 


6.7 


Lung (fetal) 


4.0 


Brain (substantia nigra) 


0.0 


Lung ca. (small cell) 
LX-l 


0.0 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 


glio/astro U-l 1 8-MG 


A A 

0.0 


Lung ca. (non-s.cell) 
NCI-H23 


ft ft 


astrocytoma SW1783 


A A 

4.0 


Lung ca. (non-s.cell) 
HOP-62 


ft ft 


neuro*; met SK-N-AS 


A A 

0.0 


Lung ca. (non-s.cl) 
NCI-H522 


j ■ ±- 


astrocytoma SF-539 


A A 

0.0 


Lung ca. (squam.) 
SW 900 


ft ft 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.0 


Mammary gland 


100.0 


glioma U251 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-23 1 


0.0 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) 
T47D 


29.9 


Heart 


15.7 


Breast ca. BT-549 


0.0 
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Skeletal muscle (fetal) 


6.5 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


20.6 


Ovary 


12.5 


Rnnp marrow 


0.0 


Ovarian ca. OVCAR- 

3 


0.0 


TVi \/m 1 1 q 
i iiy ii i k*z> 


13.7 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


71.7 


Ovarian ca. OVCAR- 

5 


21.6 


Lymph node 


24.8 


Ovarian ca. OVCAR- 

o 
5 


15.2 


Colorectal 


3.8 


Ovarian ca. IGROV- ; 
1 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


33.2 


Uterus 


z /.u 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


4.2 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-116 


2.4 


Testis 


7.7 


Colon ca. CaCo-2 


4.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(vJLMJ 5 ooo ) 


0.0 


Melanoma* (met) 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.0 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.0 


Adipose 


13.8 



Table AKC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3002, Run 
164043126 


Tissue Name 


ReL Exp.(%) 
Ag3002, Run 
164043126 


Secondary Thl act 


10.2 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 
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oecoilUaJy 111 ICal 


oo 1 

i 


Lung Microvascular EC 
none 


0.0 


.r riiiidry 1 11 1 d^i 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Priman/ T'h'7 act 
r illllaiy 1 n^- cka 


0.0 


Microvascular Dermal EC 
none 


0.0 


ri liiiciry i 1 i «vi 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


iilTTiary 1111 ICal 


14.2 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


6.2 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0 0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-8 12 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD 1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


CCD 1 1 Uo (Keratinocytes ) 
TNFalnha + IL-lbeta 


0.0 


T A V r>*>11o TT 9 
L/ViV cens 1 Vj-L 


0 0 

\J,\J 


T iver cirrhosis 


0.0 


L,/\JV CeilS IL-ZTi L- 1 Z 


0 0 


T Jinus kidnev 


10.8 


LAK cells IL-2+IFN 
gamma 


23.3 


NCI-H292 none 


10.8 


T A T<T ^p.11c TT -9-1- TT -1 R 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
r^ivi/vionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 I FN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


0.0 


PBMC rest 


30.6 


Lung fibroblast none 


0.0 


PBMC PWM 


24.8 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


13.5 jLung fibroblast IL-4 


0.0 
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"1 



Ramos (B cell) none 


0.0 jLung fibroblast 1L-9 OJO 


Ramos (B cell) 
ionomycin 


0.0 Lung fibroblast IL- 1 3 


0.0 


n Ivmnhnrvte^ PWlVf 


jLung fibroblast 1FN 
100.0 B 

jgamma 


0 0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0 0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


99.3 


Dermal fibroblast 
CCD 1070 1L-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


22.4 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


36.3 


IBD Colitis 2 


0.0 


Monocytes rest 


76.8 


IBD Crohn's 


22.2 


Monocytes LPS 


0.0 


Colon 


67.8 


Macrophages rest 


0.0 


Lung j 22.7 


Macrophages LPS 


0.0 


Thymus 


1 12 - 2 


HUVEC none 


0.0 


Kidney 




HUVEC starved 


0.0 







Panel 1.3D Summary: Ag3002 Significant expression of the NOV42a gene is limited 
to the mammary gland and the spleen (CTs=33-34). Thus, expression of this gene could be 
used to differentiate these samples from other samples on this panel and as a marker for these 
5 tissues. 

Panel 4D Summary: Ag3002 Significant expression of the NOV42a gene is limited to 
activated B cells, an eosinophil cell line, and monocytes (CTs=33-35). Thus, this transcript 
could be used as a marker for phagocytic cell types. 

NOV43 

10 Expression of gene NOV43 was assessed using the primer-probe set Ag2987, 

described in Table ALA. Results of the RTQ-PCR runs are shown in Tables ALB, ALC, ALD 
and ALE. 



Table ALA. Probe Name Ag2987 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ctacctcaccatctgctttctg-3 ' 


22 


860 


1150 


Probe 


TET-5 ' -tttctcaggactgccagctcttgatg-3 ' - 


26 


883 


1151 
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|tamra 






Reverse] 5 ' -tccatatcttgtaggccacact- 3 ' 


22 J916 


1152 



Table ALB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag2987, 
Run 21 1 008970 

J\llll Z(11UU{J7 1 \J 


Tissue Name 


Rel. Exp.(%) Ag2987, 
Run 211008970 


AD 1 Hippo 


31.2 


Control (Path) 3 

Tpmnoral Ctx 


14.7 


AD 2 Hippo 


29.9 


Control (Path) 4 

Tptnnnrfil Ctx 


43.8 


AD 3 Hippo 


37.4 


AD 1 Occipital Ctx 


44.1 


AD 4 Hippo 


10.8 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


87.7 


AD 3 Occipital Ctx 


Zl . 1 


AD 6 Hippo 


85.3 


AD 4 Occipital Ctx 


37.4 


Control 2 Hippo 


14.5 


AD 5 Occipital Ctx 


17.0 


Control 4 Hippo 


13.4 


AD 6 Occipital Ctx 


12.5 


Control (Path) 3 
Hippo 


20.7 


Control 1 Occipital 
Ctx 


9.8 


AD 1 Temporal Ctx | 


49.0 


Control 2 Occipital 
Ctx 


14.6 


AD 2 Temporal Ctx 


35.6 


Control 3 Occipital 
Ctx 


29.7 


AD 3 Temporal Ctx 


29.9 


Control 4 Occipital 
Ctx 


9.4 


AD 4 Temporal Ctx 


31.0 


Control (Path) 1 
Occipital Ctx 


44.4 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


13.2 


AD 5 SupTemporal 

Ctx 


64.2 


Control (Path) 3 
Occipital Ctx 


5.8 


AD 6 Inf Temporal 

Ctx 


69.3 


Control (Path) 4 
Occipital Ctx 


29.9 


AD 6 Sup Temporal 
Ctx 


96.6 


Control 1 Parietal 
Ctx 


19.3 


Control 1 Temporal 

Ctx 


17.0 


Control 2 Parietal 
Ctx 


67.8 


Control 2 Temporal 
Ctx 


10.6 


Control 3 Parietal 
Ctx 


23.7 


Control 3 Temporal 
Ctx 


26.4 


Control (Path) 1 
Parietal Ctx 


23.0 


Control 4 Temporal 
Ctx 


11.0 


Control (Path) 2 
Parietal Ctx 


36.1 


Control (Path) 1 
Temporal Ctx 


35.6 


Control (Path) 3 
Parietal Ctx 


16.2 
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Control (Path) 2 


35.4 


[Control (Path) 4 


45.4 


Temporal Ctx 


jParietal Ctx 



Table ALC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag2987 r 
Run 166232814 


Tissue Name 


ReL Exp.(%) Ag2987, 
Run 166232814 


Liver adenocarcinoma 


5.9 


Kidney (fetal) 


0.0 


Pancreas 


6.4 


Renal ca. 786-0 


13.0 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


1 3.6 


Adrenal gland 


18.7 


Renal ca. RXF 393 


28.1 


Thyroid 


5.6 


Renal ca. ACHN 


0.0 


Salivary gland 


40 0 


Renal ca UO-31 


0.0 


Pituitary gland 


23.5 


Renal ca. TK-10 


5.4 


Brain (fetal) 


67.4 


Liver 


n a 
u.u 


Brain (whole) 


84.7 


Liver (fetal) 


4.8 


Brain (amygdala) 


39.5 


Liver ca. 

(hepatoblast) HepG2 ! 


7.0 


: ; 

Brain (cerebellum) 


100.0 


Lung 


0.0 


Brain (hippocampus) 


21.8 


Lung (fetal) 


2.7 


Brain (substantia nigra) 


16.6 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


96.6 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


39.8 


Lung ca. (s.cell var.) 
SHP-77 


10.3 


Sninal cord 


41.2 


Lung ca. (large 
cell)NCI-H460 


13.0 


slio/astro U87-MG 


5.6 


Lung ca. (non-sm. 
cell) A549 


0.0 


glio/astro U-118-MG 


51.8 


Lung ca. (non-s.cell) 
NCI-H23 


6.4 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


10.4 


neuro*; met SK-N-AS 


8.4 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


4.3 


Lung ca. (squam.) 
SW 900 


34.2 


astrocytoma SNB-75 


10.4 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


28.5 


Mammary gland 


0.0 


glioma U251 


10.2 


Breast ca.* (pl.ef) 
MCF-7 


13.0 


glioma SF-295 


76.8 


Breast ca.* (pl.ef) 


0.0 
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MDA-MB-231 




Heart (fetal) 


5.1 


T47D 


0.0 


riearc 




Rr^a^t ra RT-S40 

I > 1 CCloL l/d. LJ 1 — / i ,7 


15.8 


Skeletal muscle (fetal) 


10.8 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


15.7 


Ovary 


7.3 


Bone marrow 


5.0 


Ovarian ca. OVCAR- 

3 


0.0 


Thymus 


0.0 


Ovarian ca. OVCAR- 
4 


5.8 1 


Spleen 


11.3 


Ovarian ca. OVCAR- 


38.2 


Lymph node 


42.3 


Ovarian ca. OVCAR- 

o 
o 


31.4 


Colorectal 


8.4 


Ovarian ca. IGROV- 

i 
i 


10.2 


Stomach 


10.8 


uvarian ca. ^ascuesj 
SK-OV-3 


54.0 


Small intestine 




uterus 


?S 7 


Colon ca. SW480 


0.0 


Placenta 


11.7 


Colon ca.* 
SWozl)(oW4oU met) 


0.0 


Prostate 


18.9 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met j i j 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


9.1 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 

ti ssuefODCH 866^ 


4.8 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


11.9 


Gastric ca.* (liver met) 
NCI-N87 


31.2 


Melanoma M14 


0.0 


Bladder 


25.2 


Melanoma LOX 
IMVI 


5.2 


Trachea 


10.6 


Melanoma* (met) 
SK-MEL-5 


7.0 


Kidney 


0.0 


Adipose 


27.4 



/ 



Table ALD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2987, Run 
164314632 


Tissue Name 


Rel. Exp.(%) 
Ag2987, Run 
164314632 


Secondary Thl act 


8.1 


HUVEC IL-lbeta 


1.9 


Secondary Th2 act 


10.3 


HUVEC IFN gamma 


17.4 
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Secondary Trl act 


2.2 


HUVEC TNF alpha + IFN j 
gamma 


9.6 


Secondary Thl rest 


8.8 


HUVEC TNF alpha + IL4 


4.1 


Secondary Th2 rest 


10.0 


HUVEC IL-1 1 


9.3 


Secondary Trl rest 


25.0 


Lung Microvascular EC 
none 


31.9 


Primary Th 1 act 


4.0 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


22.8 


Primary Th2 act 


1.9 


Microvascular Dermal EC \ 
none 


25.9 


Primary Trl act 


4.5 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta 


7.3 


Primarv Thl rest 

JL 114 11 CI I Tr Jill 1 V/0 1* 


40.1 


Bronchial epithelium 
TNFalpha + IL1 beta 


33.4 


Primary Th2 rest 


39.5 


Small airway epithelium 
none 


10.6 


Primary Trl rest 


74.7 


Small airway epithelium 
TNFalpha + IL-1 beta 


77.4 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


15.0 


CD45RO CD4 
lymphocyte act 


15.4 


Coronery artery SMC 
TNFalpha + IL-1 beta 


3.0 


CD8 lymphocyte act 


6.1 


Astrocytes rest 


4.6 


Secondary CD8 
lymphocyte rest 


16.3 


Astrocytes TNFalpha + 
IL-1 beta 


0.0 


Secondary CD8 
lymphocyte act 


1.8 


KU-812 (Basophil) rest 


1.7 


CD4 lymphocyte none 


10.0 


KU-812 (Basophil) 
rivi/vioriomycin 


20.9 


O*-./ TU1 /TU1 /TV 1 

zry inl/lnz/lrl anti- 
CD95CH11 


13.7 


llui i uo ^Keraiinocyiesj 
none 


8.4 


LAK cells rest 


33.7 


v_^i^lm iuo ^N.eraiinocyxesj 
TNFalpha + IL-1 beta 


7.9 


LAK cells IL-2 


14.8 


Liver cirrhosis 


18.6 


LAK cells IL-2+IL-12 


14.9 


Lupus kidney 


15.3 


LAK cells IL-2+IFN 
gamma 


28.9 


NCI-H292 none 


15.0 


LAK cells IL-2+ IL-1 8 


14.4 


NCI-H292 IL-4 


4.6 


LAK cells 
PMA/ionomycin 


4.5 


NCI-H292 IL-9 


13.2 


NK Cells IL-2 rest 


11.3 


NCI-H292 IL-13 


4.7 


Two Way MLR 3 day 


32.3 


NCI-H292 IFN gamma 


2.8 


Two Way MLR 5 day 


2.4 


HPAEC none 


8.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


10.2 
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y diviv^ re si 




Lung fibroblast none 




PBMC PWM 


18.8 


Lung fibroblast TNF alpha 
+ 1 beta 


16.3 


PBMC PHA-L 


20.2 


Lung fibroblast IL-4 


65.1 


Ramos (B cell) none 


1.4 


Lung fibroblast IL-9 


55.5 


Ramos (B cell) 
ionomycin 


5.6 


Lung fibroblast IL-13 


57.4 


B lymphocytes PWM 


19.2 


Lung fibroblast IFN 
gamma 


100.0 


B lymphocytes CD40L 
and IL-4 


11.7 


Dermal fibroblast 
CCD 1070 rest 


9.7 


EOL-1 dbcAMP 


59.9 


Dermal fibroblast 
CCD 1070 INr alpha 


29.7 


twL- 1 auCAJVLr 


11.9 


Dermal fibroblast 


3.2 


PMA/ionomvcin 


CCD 1070 TT -1 beta 


Dendritic cells none 


48.6 


ivv^inidi iiuiuuiaM li i>i 

gamma 


11.4 


Dendritic cells LPS 


17.7 


Dermal fibroblast IL-4 


24.3 


Dendritic cells anti- 
CD40 


29.5 


IBD Colitis 2 


6.7 


Monocytes rest 


26.4 


1BD Crohn's 


4.5 


Monocytes LPS 


19.5 


Colon 


20.4 


Macrophages rest 


28.1 


Lung 


14.8 


Macrophages LPS 


8.9 


Thymus 


52.5 


HUVEC none 


9.9 


Kidney 


74.7 


HUVEC starved 


21.2 







Table ALE. Panel CNS 1 



Tissue Name 


Rel. Exp.(%) Ag2987, 
Run 171670053 


Tissue Name 


Rel. Exp.(%) Ag2987, 
Run 171670053 


BA4 Control 


21.3 


BA17PSP 


16.8 


BA4 Control2 


15.2 


BA17PSP2 


4.4 


BA4 

Alzheimer's2 


^27.5 


Sub Nigra Control 


39.8 


BA4 Parkinson's 


100.0 


Sub Nigra Control2 


13.9 


BA4 

Parkinson's2 


66.9 


Sub Nigra 
Alzheimer's2 


34.6 


BA4 

Huntington's 


33.2 


Sub Nigra 
Parkinson's2 


51.4 


BA4 

Huntington's2 


31.4 


Sub Nigra 
Huntington's 


69.3 


BA4 PSP 


3.9 


Sub Nigra 
Huntington's2 


22.7 


BA4 PSP2 


5.1 


Sub Nigra PSP2 


17.2 
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iJ 4 15 V M h il if li , !I ;i 13 ti 



BA4 Depression 


31.6 


Sub Nigra 
Depression 


23.0 


BA4 

Depression2 


23.3 


Sub Nigra 
Depression2 


17.1 


BA7 Control 


39.5 


Glob Palladus 
Control 


23.7 


BA7 Control2 


6.0 


Glob Palladus 
Control2 


6.3 


BA7 

Alzheimer's2 


43.2 


Glob Palladus 
Alzheimer's 


17.6 


BA7 Parkinson's 


33.2 


Glob Palladus 
Alzheimer's2 


20.2 


BA7 

Parkinson's2 


61.1 


Glob Palladus 
Parkinson's 


76.8 


D A H 

Jb>A/ 

Huntington's 


51.4 


Glob Palladus 
Parkin son's2 


,„ — ______ — _ — _ . __ 

31.6 


DAT 

oA/ 

Huntington's2 


69.3 


Glob Palladus PSP 


4.4 


DAT DCD 

M/ rSr 


55.5 


Glob Palladus PSP2 


7.8 


BA7 PSP2 


39.8 


Glob Palladus 
Depression 


19.6 


BA7 Depression 


15.7 


Temp Pole Control 


25.7 


BA9 Control 


27.4 


Temp Pole Control2 


15.5 


BA9 Control2 


28.5 


Temp Pole 
Alzheimer's 


1 1.7 


BA9 Alzheimer's 

______ 


8.8 


Temp Pole 
Alzheimer's2 


16.4 


AIzheimer's2 


42.3 


Temp Pole 
Parkinson's 


34.9 


BA9 Parkinson's 


86.5 


Temp Pole 
Parkinson's2 


38.7 


BA9 

1 dl JV1I15>UI1 z>jL 


49.7 


Temp Pole 
Huntington's 


65.5 


BA9 

I— F 1 1 n 1 1 n erf r\r» ' c 
nuiIllll^LOIl S 


45.1 


Temp Pole PSP 


3.1 


BA9 

Huntin__ton's2 


40.1 


Temp Pole PSP2 


12.2 


BA9 PSP 


20.4 


Temp Pole 
Depression2 


19.1 


BA9 PSP2 


0.0 


Cing Gyr Control 


52.1 


BA9 Depression 


14.9 


Cing Gyr Control2 


14.1 


BA9 

Depression2 


17.7 


Cing Gyr 
Alzheimer's 


13.1 


BA 17 Control 


64.2 


Cing Gyr 
Alzheimer's2 


60.3 
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BA17 Control2 


22.5 


Cing Gyr Parkinson's 


66.4 


BA17 

Alzheimer's2 


52.1 


Cing Gyr 
Parkinson's2 


25.5 


BA17 
Parkinson's 


85.3 


Cing Gyr 
Huntington's 


85.3 


BA17 

Parkinson's2 


94.0 


Cing Gyr 
Huntington's2 


43.2 


BA17 

Huntington's 


70.7 


Cing Gyr PSP 


26.2 


BA17 

Huntington's2 


34.4 


Cing Gyr PSP2 


6.2 


BA17 
Depression 


46.7 


Cing Gyr Depression 


12.9 


BA17 

Depression2 


82.9 


Cing Gyr 
Depression2 


23.8 



CNS_neurodegeneration_vl.O Summary: Ag2987 The NOV43 gene exhibits 
significantly higher expression in the brains of Alzheimer's disease patients than normal 
controls. This is consistant with reports of increased purinoceptor expression in AD. Please see 
5 Panel 1 .3D for discussion of utility of this gene in the central nervous system. 
References: 

Moore D, Iritani S, Chambers J, Emson P. Immunohistochemical localization of the 
P2Y1 purinergic receptor in Alzheimer's disease. Neuroreport 2000 Nov 27;1 l(17):3799-803 
The biological actions of extracellular nucleotides are mediated by two distinct classes 

10 of P2 receptor, P2X and P2Y. The G protein-coupled P2Y receptors comprise five mammalian 
subtypes, P2Y(1-1 1). The P2Y1 subtype is expressed abundantly throughout the human brain 
and is specifically localized to neuronal structures. In the present study, the distribution of the 
P2Y1 receptor was investigated in Alzheimer's disease (AD) brains. In contrast to control 
human brain, the P2Y1 receptor was localized to a number of characteristic AD structures 

15 such as neurofibrillary tangles, neuritic plaques and neuropil threads. Immunoblot analysis 
showed that this specific immunostaining observed over tangles was not a result of cross- 
reactivity between the anti-P2Yl antiserum and abnormal tau protein, the major constituent of 
tangles. The significance of this altered P2Y1 cellular distribution in AD brains is at present 
unclear. 

20 Panel 1.3D Summary: Ag2987 The NOV43 gene, a purinoceptor homolog, exhibits 

highly brain preferential expression in this panel. Purinoceptors found in GDNF sensitive 
sensory neurons mediate nociceptor function. Therefore, agents that block the action of this 
receptor may have utility in treating pain, either as analgesics or in inhibiting the establishment 
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of chronic pain. In addition, adenosine plays a significant neuromodulatory role in brain 
regions such as the hippocampus, cortex, basal ganglia, and thalamus. Thus, this purinoceptor 
is localized in a position to participate with the action of adenosine in these brain regions. The 
NOV43 gene product may also influence Ca2+ mobilization, a function performed by other 
5 purinoceptors. Ca2+ mobilization is an important component of the molecular process leading 
to neurotransmitter release, such as dopamine and glutamate. P2Y receptors have been shown 
to affect the release of dopamine, a critical neurotransmitter deficient in Parkinson's disease. 
P2 receptor agonists are known to induce secretion. Therefore, agents that modulate NOV43 
may be effective treatments for Parkinson's disease via effecting enhanced dopamine release. 

10 Furthermore, glutamate is the main excitatory amino acid neurotransmitter. Glutamate exerts 
excitotoxic neuronal damage and death in a number of pathological conditions, including 
stroke. Therefore, agents that inhibit this gene prodcut are likely to affect glutamate release in 
the brain and the subsequent cytotoxic action of glutamate in these regions. The 
overexpression of this gene in the brains of Alzheimer's disease patients in the 

1 5 CNSneurodegenerationvl .0 panel indicates that antagonists of this receptor may also have 
utility in countering the processes associated with this disease. 
References: 

Liu DM, Katnik C, Stafford M, Adams DJ.P2Y purinoceptor activation mobilizes 
intracellular Ca2+ and induces a membrane current in rat intracardiac neurones. J Physiol 

20 2000 Jul 1 5;526 Pt 2:287-98 

1 . The mobilization of Ca2+ by purinoceptor activation and the relative contributions 
of intra- and extracellular sources of Ca2+ were investigated using microfluorimetric 
measurements of fura-2 loaded in cultured neurones from rat intracardiac ganglia. 2. Reverse 
transcriptase-polymerase chain reaction (RT-PCR) revealed expression of mRNA for the G 

25 protein-coupled P2Y2 and P2Y4 receptors. 3. Brief application of either 300 microM ATP or 
300 microM UTP caused transient increases in [Ca2+]i of 277 +/- 22 nM and 267 +/- 39 nM, 
respectively. Removal of external Ca2+ did not significantly reduce these [Ca2+]i responses. 
4. The order of purinoceptor agonist potency for [Ca2+]i increases was ATP = UTP > 2- 
MeSATP > ADP » adenosine, consistent with the profile for P2Y2 purinoceptors. ATP- and 

30 UTP-induced rises in [Ca2+]i were completely and reversibly blocked by 10 microM PPADS 
(a P2 purinoceptor antagonist) and partially inhibited by 100 microM suramin (a relatively 
non-specific purinoceptor antagonist). 5. In the presence of the endoplasmic reticulum Ca2+- 
ATPase inhibitor cyclopiazonic acid (10 microM) in Ca2+-free media, the [Ca2+]i responses 
evoked by ATP were progressively decreased and abolished. 6. ATP- and UTP-induced 
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[Ca2+]i rises were insensitive to pertussis toxin, caffeine (5 mM) and ryanodine ( 1 0 m icroM) 
but were significantly reduced by U-73122, a phospholipase C (PLC) inhibitor. 7. In fura-2- 
loaded cells, perforated patch whole-cell recordings show that ATP and UTP evoked slow 
outward currents at -60 mV, concomitant with the rise in [Ca2+]i, in approximately 30 % of 
5 rat intracardiac neurones. 8. In conclusion, these results suggest that in r intracardiac neurones, 
ATP binds to P2Y2 purinoceptors to transiently raise [Ca2+]i and activate an outward current. 
The signalling pathway appears to involve a PTX-insensitive G protein coupled to PLC 
generation of IP3 which triggers the release of Ca2+ from a ryanodine-insensitive Ca2+ 
store(s). 

10 Driessen B, Bultmann R, Jurna I, Baldauf J.Depression of C fiber-evoked activity by 

intrathecally administered reactive red 2 in rat thalamic neurons. Brain Res 1998 Jun 
15;796(l-2):284-90 

To investigate the possible role of spinal purinoceptors in nociception, the potent P2- 
purinoceptor antagonist reactive red 2 was studied in rats under urethane anesthesia in which 

15 nociceptive activity was elicited by electrical stimulation of afferent C fibers in the sural nerve 
and recorded from single neurons in the ventrobasal complex of the thalamus. Intrathecal (i.t.) 
application of reactive red 2 (6-200 micrograms) caused a dose-dependent reduction of the 
evoked activity in thalamic neurons. The estimated ED50 was 30 micrograms, and the 
maximum depression of nociceptive activity amounted to about 70% of the control activity at 

20 a dose of 100 micrograms. Morphine, administered i.t. at a maximally effective dose (80 
micrograms), inhibited the evoked nociceptive activity by only up to 55% of the control 
activity. An i.t. co-injection of reactive red 2 (100 micrograms) and morphine (80 micrograms) 
caused a maximum reduction of the evoked thalamic activity by up to 85% of the control 
activity, thus, exceeding significantly the effect elicited by either drug alone. Similarly, i.t. co- 

25 injection of almost equipotent dosages of reactive red 2 (30 micrograms) and morphine (30 
micrograms) caused a maximum reduction of the evoked activity by up to 72% of the control 
activity, which again exceeded significantly the effect of either drug alone. The results suggest 
that in rats reactive red 2 exerts antinociception by blockade of P2 -purinoceptors in the spinal 
cord and, hence, support the idea that ATP may play an important role in spinal transmission 

30 of nociceptive signals. An activation of the spinal opioid system does not seem to contribute to 
the effect of reactive red 2 but might act additive or even synergistically with its 
antinociceptive action. 
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Krugel U, Kittner H, Franke H, Illes P. Stimulation of P2 receptors in the ventral 
tegmental area enhances dopaminergic mechanisms in vivo. Neuropharmacology 2001 
Jun;40(8): 1084-93 

It has been shown that endogenous adenosine 5 '-triphosphate (ATP) as well as its 
5 exogenously applied structural analogue, 2-methylthio ATP (2-MeSATP), facilitate the release 
of dopamine from axon terminals in the rat nucleus accumbens (NAc) by activating ATP- 
sensitive P2 receptors. In the present study, reversed microdialysis of 2-MeSATP (10 microM, 
100 microM and 1 mM), or its microinjection (0.5, 5.0 and 50 pmol) into the ventral tegmental 
area (VTA), dose-dependently increased the local extracellular level of dopamine and the 

10 locomotion in the open field, respectively. These effects were abolished by the P2 -receptor 
antagonist pyridoxalphosphate-6-azophenyl-2',4'-disulfonic acid (PPADS). When applied 
alone, the antagonist decreased the basal dopamine concentration, indicating that endogenous 
ATP controls the somatodendritic release of dopamine. Repeated microinjections of 2- 
MeSATP (5 pmol) once daily for 4 days led to a reproducible locomotor stimulation in the 

15 open field. Conditioned locomotion was induced by re-exposure to the novel environment on 
the seventh day. A challenge with amphetamine (1 mg/kg intraperitoneal^) on the eighth day 
enhanced the locomotor activity in the 2-MeSATP-treated group in the sense of a cross- 
sensitisation, but failed to do so in the control group. Neurons in the VTA were heavily stained 
with antibodies developed against the P2Y(1) subtype of P2 receptors. Taken together, our 

20 data suggest that P2 receptors (probably of the P2Y(1) subtype) are involved in the initiation 
of somatodendritic dopamine release in the VTA and thereby may have a profound influence 
on sensitisation and reward-motivated behaviour. 

Fernandez-Alvarez J, Hillaire-Buys D, Loubatieres-Mariani MM, Gomis R, Petit P. P2 
receptor agonists stimulate insulin release from human pancreatic islets. Pancreas 2001 

25 Jan;22(l):69-71 

Although P2 receptors for adenosine 5'-triphosphate (ATP) and/or adenosine 5'- 
diphosphate (ADP) have been characterized in mammalian pancreatic beta cells, no evidence 
for an insulin-secreting effect of P2 receptor agonists has been reported as yet in humans. The 
present study aimed at investigating whether P2 receptor agonists could stimulate insulin 

30 release in human pancreatic islets obtained from brain-dead organ donors. Experiments were 
performed using different glucose concentrations and insulin was measured by 
radioimmunoassay. When the glucose concentration (8.3 mmol/L) was slightly stimulating for 
insulin release, alpha,beta-methylene ATP (200 micromol/L) and ADPbetaS (50 micromol/L) 
similarly amplified insulin secretion: both compounds induced a threefold increase in insulin 
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response. In the presence of a nonstimulating glucose concentration (3.0 mmol/L), only 
alpha,beta-methylene ATP could induce a significant 1.4-fold increase in insulin release, 
ADPbetaS being completely ineffective. These results give evidence that P2 receptor agonists 
are effective in stimulating insulin release in humans, the effect of the P2Y agonist being 
5 essentially glucose dependent 

Panel 4D Summary: Ag 2987 The NOV43 transcript is expressed in lung fibroblasts 
after treatment with IFNg, IL-4, IL-9 other cytokines (CTs=32). This gene is also expressed in 
small airway epithelium treated with the inflammatory cytokines TNF-a and IL-1. This 
expression profile suggests a role for this transcript in lung inflammation. Low but detectable 

10 expression of this transcript is found also in dermal fibroblasts, primary CD4 T cells, EOL and 
antigen presenting cells. 

This transcript encodes for a PY2receptor like molecule. Expression of this receptor 
has been reported in several cell types including eosinophils (Ref.l) and lung epithelium 
where it has been shown to mediate Cl(-) secretion via an increase in intracellular calcium 

1 5 concentration (ref. 2). Thus, the NOV43 gene product may influence Ca2+ mobilization, a 
funtion performed by other purinoceptors, and therefore lead to activation or secretion 
processes. As suggested by ref.3, the release of nucleotides by damaged cells in inflammation 
can lead to the activation of purinoreceptors and of other cells present in the inflammed 
tissues, including lung epithelium in asthma, COPD, emphysema and the skin in psoriasis or 

20 other skin inflammatory diseases. This release can also result in the activation of antigen 

presenting cells and T cells, which contribute to the perpetuation of the inflammatory process. 
Therefore, modulation of the expression or activity of the protein encoded by the NOV43 gene 
may prevent or reduce the inflammation process in all of these diseases and other autoimmune 
diseases, including as inflammatory bowel diseases. 

25 References: 

Idzko M, Dichmann S, Panther E, Ferrari D, Herouy Y, Virchow C Jr, Luttmann W, Di 
Virgilio F, Norgauer J. 

Functional characterization of P2Y and P2X receptors in human eosinophils. J Cell 
Physiol 2001 Sep;188(3):329-36 

30 Activation of purinoceptor by ATP induces in eosinophils various cell responses 

including calcium transients, actin polymerization, production of reactive oxygen metabolites, 
CD1 lb-expression, and chemotaxis. Here, the effect of ion channel-gated P2X and/or G 
protein-coupled P2Y receptor agonists ATP, ATPgammaS, alpha,beta-meATP, 2-MeSATP, 
BzATP, ADP, CTP, and UTP on the intracellular Ca(2+)-mobilization, actin polymerization, 
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production of reactive oxygen metabolites, CD1 lb expression and chemotaxis of human 
eosinophils were measured and the biological activity was analyzed. Although all tested 
nucleotides were able to induce all these cell responses, the biological activity of the analyzed 
nucleotides were distinct. Agonists of the G protein-coupled P2Y receptors such as 2- 
5 MeSATP, UTP, and ADP have a higher biological activity for production of reactive oxygen 
metabolites, actin polymerization and chemotaxis in comparison to the ion channel-gated P2X 
agonists alphabeta-meATP, BzATP, and CTP. In contrast, P2Y and P2X agonist showed 
similar potencies in respect to intracellular calcium transient and CD1 lb up-regulation. This 
conclusion was further supported by experiments with receptor iso-type antagonist KN62, 

10 EGTA or with the G(i) protein-inactivating pertussis toxin. These findings indicate 

participation of different purinorecptors in the regulation of cell responses in eosinophils. 

Laubinger W, Streubel G, Reiser G. Physiological evidence for a P2Y receptor 
responsive to diadenosine polyphosphates in human lung via Ca(2+) release studies in 
bronchial epithelial cells. Biochem Pharmacol 2001 Mar l;61(5):623-9 

1 5 P2Y(2) receptors that are activated by the extracellular nucleotides ATP or UTP 

mediate CI(-) secretion via an increase in [Ca(2+)](i) (intracellular calcium concentration). 
Therefore, in the lung of patients suffering from cystic fibrosis, inhalation of aerosolized UTP 
offers a way to circumvent the defect in Cl(-) secretion by the cystic fibrosis transmembrane 
conductance regulator. A possible alternative for the relatively unstable UTP in inhalation 

20 therapy is the more resistant diadenosine tetraphosphate (Ap(4)A). In human and rat lung 
membranes, Ap(4)A binds to P2 receptor sites coupled to G proteins. Here, we showed that 
Ap(4)A caused an increase in [Ca(2+)](i) with an EC(50) of 17 microM in human bronchial 
epithelial cells (HBE1). The [Ca(2+)](i) rise evoked by ATP and UTP was completely, but 
that induced by Ap(4)A only partially, caused by release of Ca(2+) from internal stores. 

25 Moreover, the potency of Ap(4)A to mobilize Ca(2+) was lower than that of ATP and UTP 
(EC(50) 1.5 and 1.8 microM, respectively), and the maximal increase in [Ca(2+)](i) was 
considerably smaller than that after ATP or UTP. In accordance with our previous results 
providing evidence for a common binding site for various diadenosine polyphosphates in lung 
membranes, all Ap(n)A analogues tested (n = 3 to 6) caused a comparable [Ca(2+)](i) 

30 increase. Homologous or heterologous prestimulation largely diminished the increase in 

[Ca(2+)](i) found after a second pulse of either UTP or Ap(4)A. Although specific binding 
characteristics and functional responses of Ap(4)A on lung cells are in favor of a distinct 
receptor for Ap(4)A, the cross-talk between UTP and Ap(4)A in HBE1 cells and the only 
slight differences in Ca(2+) mobilization by ATP or UTP and Ap(4)A render it impossible at 
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this point to state unequivocally whether there exists a distinct P2Y receptor specific for 
diadenosine polyphosphates in lung epithelia or whether Ap(4)A activates one of the 
nucleotide receptors already described. 

Di Virgilio F, Chiozzi P, Ferrari D, Falzoni S, Sanz JM, Morelli A, Torboli M, 
5 Bolognesi G, Baricordi OR. Nucleotide receptors: an emerging family of regulatory molecules 
in blood cells. Blood 2001 Feb l;97(3):587-600 

Nucleotides are emerging as an ubiquitous family of extracellular signaling molecules. 
It has been known for many years that adenosine diphosphate is a potent platelet aggregating 
factor, but it is now clear that virtually every circulating cell is responsive to nucleotides. 

10 Effects as different as proliferation or differentiation, chemotaxis, release of cytokines or 

lysosomal constituents, and generation of reactive oxygen or nitrogen species are elicited upon 
stimulation of blood cells with extracellular adenosine triphosphate (ATP). These effects are 
mediated through a specific class of plasma membrane receptors called purinergic P2 
receptors that, according to the molecular structure, are further subdivided into 2 subfamilies: 

15 P2Y and P2X. ATP and possibly other nucleotides are released from damaged cells or secreted 
via nonlytic mechanisms. Thus, during inflammation or vascular damage, nucleotides may 
provide an important mechanism involved in the activation of leukocytes and platelets. 
However, the cell physiology of these receptors is still at its dawn, and the precise function of 
the multiple P2X and P2Y receptor subtypes remains to be understood. 

20 Panel CNS1 Summary: Ag2987 The expression in this panel confirms expression of 

the NOV43 gene in the brain. Please see Panel 1.3D for discussion of utility of this gene in the 
central nervous system. 

NOV44 



Expression of gene NOV44 was assessed using the primer-probe sets Ag2988 and 
25 Ag2989, described in Tables AMA and AMB. Results of the RTQ-PCR runs are shown in 
Table AMC. 



Table AMA. Probe Name Ag2988 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -ctggccaacctatcctttattg-3 ' 


22 


195 


1153 


Probe 


TET-5 ' -tggctcctaaactcattgctgactca-3 1 - 
TAMRA 


26 


238 


1154 


Reverse 


5 ' -agatggttctcccctcatacaa-3 ' 


22 


264 


1155 
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Table AMB. Probe Name Ag2989 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ctggccaacctatcctttat tg- 3 ' 


22 


195 


1156 


Probe 


TET-5 1 -tggctcctaaactcattgctgactca-3 ' - 
TAMRA 


26 


238 


1157 


Reverse! 


5 • -agatggttctcccctcatacaa-3 ' 


22 I 


264 


1158 



Table AMC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2988, Run 
164523397 


Tissue Name 


1 Rel. Exp.(%) 
Ag2988, Run 
164523397 


Secondary Th 1 act 


1 0.0 


HUVEC IL-1 beta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


0.0 


1 Jinp N4icrova ocular FC 
none 


0.0 


Primary Thl act 


0.0 


Lun*? Microvascular EC 
TNFalpha-ML-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-1 beta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-1 beta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-1 beta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-8 12 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 


2ryThl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD 1 106 (Keratinocytes) 
none 


0.0 
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LAK cells rest 


0.0 


CCD I 106 (Keratinocytes) 
TNFalnha + TI -1heta 


0.0 


LAK cells IL-2 


0.0 


T ,iver cirrhosis 


inn n 


LAK cells IL-2+IL-12 


0.0 


1 .unns Icidnpv 




LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 TT -4 


0 0 


LAK cells 

PM AJ i onomy c i n 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0 0 


lsirT-H?90 II -1 3 


u.u 


Two Wav MI R 3 dav 


0 0 


NCLH7Q? TFN onmmn 


u.u 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-l 
beta 


0.0 


rBML rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-l beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lunp fibroblast TT - 1 ^ 


0 0 


B lymphocytes PWM 


5.3 


Lung fibroblast IFN 
gamma 


0 0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-l beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


1BD Colitis 2 


19.9 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


0.0 


Macrophages LPS 


0.0 


Thymus 


10.2 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved j 


24.5 







CNSneurodegenerationvl.O Summary: Ag2988/Ag2989 Expression of the 
NOV44 gene is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 
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Panel 1.3D Summary: Ag2988/Ag2989 Expression of the NOV44 gene is 
low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2988 Significant expression of this gene is detected in a liver 
cirrhosis sample (CT = 32.7). This gene encodes a putative GPCR; therefore, antibodies or 
5 small molecule therapeutics could reduce or inhibit fibrosis that occurs in liver cirrhosis. In 
addition, antibodies to this putative GPCR could also be used for the diagnosis of liver 
cirrhosis. 

NOV45 

Expression of gene NOV45 was assessed using the primer-probe sets Ag2979, 
10 Ag2982, Ag2981 and Ag2984, described in Tables ANA, ANB, ANC and AND. Results of 
the RTQ-PCR runs are shown in Table ANE. 

Table ANA. Probe Name Ag2979 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -tacttcttcgtgggctcctt-3 ' 


20 


827 


1159 


Probe 


TET-5 ' -aaggcagaacctgaagctggttctcc-3 • - 
TAMRA 


26 


862 


1160 


Reverse 


5 1 -cattcacctcagtcgtgtcc-3 ' 


20 


901 


1161 


Table ANB. Probe Name Ag2982 


Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 * -tacttcttcgtgggctcctt-3 1 


20 


827 


1162 


Probe 


TET-5 * -aaggcagaacctgaagctggttctcc-3 1 - 
TAMRA " ; 


26 


862 


1163 


Reverse: 


5 1 - cattcacctcagtcgtgtcc-3 ' 


20 


901 


1164 


Table ANC. Probe Name Ag2981 


Primers; 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward|5 ' -tacttcttcgtgggctcctt-3 ' 


20 


827 


1165 


Probe 


TET-5 ' -aaggcagaacctgaagctggttctcc-3 1 - 1 
TAMRA 


26 


862 


1166 


ReversepT ' - cat tcacc tcagt cgtgtcc - 3 ' 


20 


901 


1167 


Table AND. Probe Name Ag2984 


Primers j 


Sequences \ 


Length 


Start | 
Position j 


SEQ ID NO: 


Forward j 


5 ' -atcctccatcccatcttcaa-3 1 j 


20 


284 ! 


1168 



1125 



Probe j TET ~ 5 ' -cctcagccctgtgatgatgttttcct-3 » 


J26 


307 


1169 


r ° 6 JTAMRA 




Reversejs ■ - cgcttagaaagctcaggct t - 3 ' 


|20 


340 


1170 



Table ANE. Panel 4D 



Tissue Name 


Rel Fxn (%\ 
Ag2982, Run 
158603041 


Tissue Name 


PpI Fvn (°AA 

IXC I . JLAjJ.^ /O) 

Ag2982, Run 
158603041 


Secondary Th 1 act 


0.0 


HUVEC IL-lbeta 


0.0 


SIprnnHarv Th'? apt 




HI IVFP 1 FN (Tjimmn 


n n 
u.u 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 

^cin in 1 ci 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary I h2 rest 


0.0 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
1 N r alpha + 1L- 1 beta 


19.8 


Primary Th2 act 


0.0 


» *" I T^V 1 I™ 1 ^"1 

Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


5.5 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + lLlbeta 


58.2 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


3.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


100.0 


lymphocyte act 


3.6 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD 8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.8 


Secondary CD8 
lymphocyte act 


0.0 


KU-8 12 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 


2ryThl/Th2/Trl anti- 
CD95 CH11 


2.4 


CCD1 106 (Keratinocytes) 
none 


0.3 


LAK cells rest 


0.0 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


5.8 


LAK cells IL-2 


0.0 


Liver cirrhosis 


23.3 


LAK cells IL-2+IL-12 1 


0.0 


Lupus kidney 


2.7 
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LAK cells IL-2+IFN 

LAI 111 1 lei 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2-f TL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 

Pft/T A /i on nm vr i n 

1 IVliYJ 1VJ11U1 1 1 Y \s 1 1 1 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells Tl -2 rest 

1 > 1\ 1 1 l3 1 1^/ X* 1 Vol 


0.0 


NC1-H292 IL-13 


0.0 


Two Wflv Ml R % Hav 


4 8 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


5.7 


HPAbC 1 N r alpha + IL- 1 
beta 


0.7 


PBMC rest 


0.0 


Lung fibroblast none 


u.u 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha j 
+ IL-1 beta 


1.8 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lunp fibroblast IL-13 


0.0 


R Ivmnhnrvtes P W1VT 


3.1 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.7 


Dermal fibroblast 
CCD1070 INr alpha 


4.1 


EOL-1 dbcAMP 

V* \A A /innnmvrin 
it i v i / \ / luiiuin y L/iii 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.4 


Dendritic cells none 


0.0 


Dermal fibroblast IFTvJ 

\S CI 1 1 1 CI 1 lIUlUUluol 11 1 ^1 

gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


0.0 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag2979/Ag2982 Expression of the 
NOV45 gene is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 1.3D Summary: Ag2981/Ag2984 Expression of the NOV45 gene is 
low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2982 Expression of the NOV45 gene is restricted to a few 

samples, with highest expression in small airway epithelium treated with TNF-alpha and IL-1 
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beta (CT=3L2). Significant expression is treated in bronchial epithelium and lung 
microvascular endothelial cells. Thus, expression of this gene could be used as a marker for 
activated epithelium. The expression in lung derived samples suggests that this protein may be 
involved in lung inflammatory disorders, including asthma and chronic obstructive pulmonary 
5 disorder. Results from a second experiment with the probe/primer set Ag2979 are not included 
because the amp plot indicates that there is a potential problem in one of the sample wells. 

NOV46a and NOV46b 

Expression of gene NOV46s and variant NOV46b was assessed using the primer-probe 
sets Ag2990 and Ag2991, described in Tables AOA and AOB. Results of the RTQ-PCR runs 
10 are shown in Tables AOC. Please note that variant NOV46B does not match the probe and 
primer set Ag2990. 



Table AOA. Probe Name Ag2990 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -cggaactgaggagactctttg-3 1 


21 


59 


1171 


Probe 


TET- 5 • -tacaagcagaccttgagcctcacggt-3 1 - 
TAMRA 


26 


81 


1172 


Reverse 


5 ' -gagcacaactgcgtttcct- 3 ■ 


19 


140 


1173 



Table AOB. Probe Name Ag2991 



Primersj Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward|5^j- tggacagggaagtct tattttg-3 1 


22 


742 


1174 


Probe 1 TET ~ 5 ' -tttcctgtccgctcttaacagcagtg-3 ' - 
r ° S | TAMRA 


26 


785 


1175 


Reversej5 1 -agcccacgaagaagtaaatgat-3 1 


22 


819 


1176 



15 

Table AOC. Panel 4D 



Tissue Name 


Rel. 
Exp.(%) 
Ag2990, 
Run 
164524407 


Rel. 
Exp.(%) 
Ag2991, 
Run 
164315033 


Tissue Name 


Rel. 
Exp.(%) 
Ag2990, 
Run 
164524407 


Rel. 

Exp.(%) 
Ag2991, 
Run 
164315033 


Secondary Thl act 


0.0 


0.0 


HUVEC IL-lbeta 


0.0 


0.0 


Secondary Th2 act 


0.0 


0.0 


HUVEC IFN 
gamma 


0.0 


0.0 


Secondary Trl act 


0.0 


0.0 


HUVEC TNF 
alpha + IFN 
gamma 


0.0 


0.0 
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Secondary 1 n J rest 


A A 

u.u 


A A 
U.U 


HUVEC TNF 
alpha + IL4 


0.0 


0.0 


Secondary Th2 rest 


0.0 


0.0 


HUVEC IL-11 


0.0 


0.0 


Secondary Trl rest 


0.0 


0.0 


Lung 

Microvascular EC 
none 


0.0 


0.0 


Primary Thl act 


0.0 


0.0 


Lung 

Microvascular EC j 
TNFalpha + IL- 
lbeta 


3.1 


23.2 


Primary Th2 act 


0.0 


0.0 


Microvascular 
Dermal EC none 


0.0 


0.0 


Primary Trl act 


0.0 


0.0 


Microsvasular 
Dermal EC 
TNFalpha + IL- 
lbeta 


0.0 


11.6 


Primary Thl rest 


0.0 


0.0 


Bronchial 
epithelium 
TNFalpha + 
ILlbeta 


123 


100:0 


Primary Th2 rest 


0.0 


0.0 


Small airway 
epithelium none 


0.0 


12.1 


Primary Trl rest 


A A 

u.u 


A A 
U.U 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


1 1 7 

11./ 


98.6 


CD45RA CD4 
lymphocyte act 


A A 

u.u 


A A 
U.U 


Coronery artery 
SMC rest 


0 0 


0.0 


CJJ4 

lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


0.0 


0.0 


CD8 lymphocyte 
act 


A A 
U.U 


u.u 




0.0 


0.0 


Secondary CD8 
lympnocyxe re si 


0.0 


0.0 


Astrocytes 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


0.0 


KU-812 
(Basophil) rest 


0.0 


0.0 


CD4 lymphocyte 
none 


0.0 


0.0 


KU-812 

(Basophil) 

PMA/ionomycin 


23.8 


5.6 


2ry 

Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


0.0 


CCD1106 

(Keratinocytes) 

none 


0.0 


0.0 


LAK cells rest 


0.0 


0.0 


CCD 1106 

: (Keratinocytes) 

| TNFalpha + IL- 


0.0 


10.7 
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lbeta 






LAK cells IL-2 


0.0 


0.0 


Liver cirrhosis 


100.0 


18.0 


LAK cells IL-2+IL- 
12 


0.0 


0.0 


Lupus kidney 


0.0 


0.0 


LAK cells IL- 
2+IFN gamma 


0.0 


0.0 


NCI-H292 none 


0.0 


0.0 


LAK cells IL-2+ 
IL-18 


0.0 


0.0 


NCI-H292 IL-4 


0.0 . 


3.8 


LAK cells 
PMA/ionomycin 


0.0 


0.0 


NCI-H292 IL-9 


0.0 


0.0 


NK Cells IL-2 rest 


27.5 


0.0 


NCI-H292 IL-13 


0.0 


0.0 


Two Way MLR 3 
day 


0.0 


0.0 


NCI-H292 I FN 
gamma 


0.0 


0.0 


Two Way MLR 5 
day 


2.1 


0.0 


HPAEC none 


0.0 


0.0 


Two Way MLR 7 
day 


0.0 


0.0 


HPAEC TNF 
alpha + IL-1 beta 


0.0 


0.0 


PBMC rest 


0.0 


0.0 


Lung fibroblast 
none 


0.0 


0.0 


PBMC PWM 


0.0 


0.0 


Lung fibroblast 
TNF alpha + IL-1 
beta 


0.0 


2.0 


PRMC PHA-L 


0.0 


0.0 


Lung fibroblast 
IL-4 


0.0 


0.0 


Ramos (B cell) 
none 


0.0 


0.0 


Lung fibroblast 
IL-9 


0.0 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


0.0 


Lung fibroblast 
IL-13 


0.0 


0.0 


B lymphocytes 
PWM 


u.u 


VJ.VJ 


Lung fibroblast 
IFN gamma 


0.0 


0.0 


B lymphocytes 
CD40L and IL-4 


u.u 


ft 0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


0.0 


EOL-1 dbcAMP 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


0.0 


4.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 
beta 


0.0 


0.0 


Dendritic cells 
none 


0.0 


0.0 


Dermal fibroblast 
IFN gamma 


0.0 


0.0 


Dendritic cells LPS 


0.0 


0.0 


Dermal fibroblast 
IL-4 


7.0 


0.0 


Dendritic cells anti- 
CD40 


0.0 


0.0 


IBD Colitis 2 


0.0 


6.2 


Monocytes rest 


0.0 


0.0 


IBD Crohn's 


0.0 


0.0 


Monocytes LPS 


0.0 


0.0 


Colon 


0.0 0.0 
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Macrophages rest 


0.0 


0.0 jLung 


0.0 


0.0 


Macrophages LPS 


0.0 


4.8 jThymus 


0.0 


0.0 


HUVEC none 


0.0 


0.0 jKidney 


0.0 


0.0 


HUVEC starved 


0.0 


o.o 1 





CNS_neurodegeneration_vl.O Summary: Ag2990/Ag2991 Expression of the 
NOV46a gene is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 
Panel 1.3D Summary: Ag2990/Ag2991 Expression of the NOV46a gene is 
5 low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2990/Ag2991 Expression of the NOV46a gene is restricted to 
liver cirrhosis and TNFalpha + ILlbeta treated bronchial and small airway epithelium. This 
expression profile suggests that antibodies or small molecule therapeutics designed with the 
putative protein encoded by this gene could reduce or inhibit fibrosis that occurs in liver 
10 cirrhosis. In addition, antibodies to this putative protein product could also be used for the 

diagnosis of liver cirrhosis. In addition, the expression of this gene in tissues derived from the 
lung suggests that this gene product may be involved in pathological and inflammatory lung 
disorders that include chronic obstructive pulmonary disease, asthma, allergy and emphysema. 
A second experiment with Ag290 shows low/undetectable in all samples on this panel 
1 5 (CTs>35). (Data not shown.) 

NOV46d 

Expression of gene NOV46d was assessed using the primer-probe sets Ag2992 and 
Ag513, described in Tables APA and APB. Results of the RTQ-PCR runs are shown in Tables 
APC and APD. 

20 Table APA. Probe Name Ag2992 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 * -agggctggtcttcctcttct-3 1 


20 


703 


1177 


Probe 


TET-5 ' -cccctcagcattcagggattcctatt-3 ' - 
TAMRA ; 


26 


731 


1178 


Reverse 


5 ■ -agtcatccaaatccttctcgat-3 1 


22 


764 


1179 




Table APB. Probe Name Ag513 


Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -tatctgtggttctctgtgggttca-3 ' 


24 


600 


1180 


Probe 


TET-5 • - 

atccacatatgatcctgacaagcaggaccag-3 1 - 


31 


626 


1181 



1131 




)tamra 1 1 




ReversejS ' - tggtcagcggcatct tctg- 3 ' 




I 659 


1182 



Table APC. Panel 1.1 



Tissue Name 


ReL Exp.(%) Ag513, 
Run 124882567 


Tissue Name 


Rel. Exp.(%) Ag513, 
Run 124882567 


Adrenal gland 


0.0 \] 


Renal ca. UO-31 


0.0 


Bladder 


o ft 


Renal ca RXF 393 


0.0 


Brain (amygdala) 


0.0 1 


Liver 


0.0 


Brain (cerebellum) 


0.0 


Liver (tetal) 


u.u 


Brain (hippocampus) 


0.0 


Liver ca. 

jiepatoblast) Hepuz 


0.0 


Brain (substantia 
nigra) 


0.0 


Lung 


0.0 


Brain (thalamus) 


0.0 


Lung (fetal) 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


Brain (fetal) 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


Brain (whole) 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


glio/astro U-118-MG 


0.0 


Lung ca. (non-s.cl) j 
JNCI-HjZz 


0.0 




astrocytoma SF-539 


0.0 


Lung ca. (non-sm. 
celD A549 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


astrocytoma SW1783 


0.0 


Lung ca. (small cell) 
LX-1 


0.0 


glioma U251 


0.0 


Lung ca. (small cell) 
NCI-H69 


81.8 


glioma SF-295 


0.0 


Lung ca. (squam.) 
SW 900 


0.0 


glioma SNB-19 


0.0 


Lung ca. (squam.) 
NCI-H596 


1.4 


glio/astro U87-MG 


0.0 


Lymph node 


0.0 


neuro*; met SK-N-AS 


0.0 


Spleen 


0.0 


Mammary gland 


0.0 


Thymus 


0.0 


Breast ca. BT-549 


0.0 


Ovary 


0.0 


Breast ca. MDA-N 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Breast ca.* (pl.ef) 
T47D 


0.0 


Ovarian ca. OVCAR- 
3 


0.0 


Breast ca.* (pl.ef) 


0.0 


Ovarian ca. OVCAR- 


0.0 
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it ij ii %l i) J ti n j» 

• • 


a |i™ ' -.. ii li _jL "li li _!l il l.'L : , 


MCF-7 


\ 




Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 ! 


Dvarian ca. OVCAR- 
5 


100.0 


Small intestine 


0.0 


Dvanan ca. OVCAK- 
R 

o 


0.0 


Colorectal 


< 

0.0 


Ovarian ca.* (ascites); 

j> rv u v -> 


0.0 


Colon ca. HT29 


0.0 


Pancreas 


0.0 


Colon ca. CaCo-2 


0.0 


Pancreatic ca. 

PAPAN ? 


0.0 


Colon ca. HCT- 15 


1.5 


Pituitary gland 


A A 


Colon ca. HCT-116 


0.0 


Placenta 


A A 

u.u 


Colon ca. HCC-2998 


0.0 


Prostate 


0.0 


Colon ca. SW480 


0.0 


Prostate ca.* (bone 
meiy ix-j 


• 0.0 


Colon ca.* SW620 
(SW480 met) 


0.0 


Salivary gland 


0.0 


Stomach 


a a 

U.U 




0.0 


Gastric ca. (liver met) 
NCI-N87 


0.0 


Spinal cord 


0.0 


Heart 


0.0 


Testis 


a a 

o.u 


Skeletal muscle (Fetal) 


0.0 


Thyroid 


a a 

u.u 


Skeletal muscle 


0.0 


Uterus 


A A 

u.u 


Endothelial cells 


0.0 


Melanoma Ml 4 


0.0 


Heart (retal) 




Melanoma LOX 
IMVI 


0.0 


Kidney 


0.0 


Melanoma UACC-62 


0.0 


Kidney (fetal) 


0.0 


Melanoma SK-MEL- 
28 


0.0 


Renal ca. 786-0 


0.0 


Melanoma* (met) 
SK-MEL-5 


0.0 


Renal ca. A498 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Renal ca. ACHN 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Renal ca. TK-10 


0.0 








Table APD. Panel 1.2 


Tissue Name 


1 Rel. Exp.(%) Ag513, 
Run 129119406 


Tissue Name 


Rel. Exp.(%) Ag513, 
Run 129119406 


Endothelial cells 


0.0 


Renal ca. 786-0 


0.0 


Heart (Fetal) 


0.0 


Renal ca. A498 


0.0 


Pancreas 


j 0.0 


Renal ca. RXF 393 


0.0 
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us vtrsy ;*n i. i-* ... si 1 1 --i i 11 = 

# • 



Panrreatir ra C APATvJ 

2 


0.0 1 


Renal ca. ACI IN 


0.0 


Adrenal Gland 


0.0 ]l 


Renal ca.UO-31 


0.0 


THvrriiH 
1 1 ly iuiu 


0.0 1 


Renal ca. TK-10 


0.0 


Salivary gland 


0.0 j 


Liver 


0.0 


Pituitary gland 


U.U J. 


L/iver ^ieuu ) 


0.0 


Brain (fetal) 


o.o 1 


Liver ca. 

^nepaLUuidbij ricpvjz, 


0.0 


Brain (whole) 


0.0 


Lung 


0.0 


Brain (amygdala) 


0.0 


Lung (fetal) 


0.0 


Rrain fcerehellum^ 

l J l dill l vvl vUvlluiii y 


0.0 


Lung ca. (small cell) 
LX-1 


0.0 


Rrain ^hinnoramnuO 

1 > 1 ulll ^11 lyyJJWV'Cll 1 i-ioy 


0.0 


Lung ca. (small cell) j 
NCI-H69 


100.0 


T^rnin f thalamus 

1 J 1 Cl 1 1 1 ^11 IdlCil 11 \A<JJ 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


fprphral ("'nrtex 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


^irjinal r*nrd 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 


crlio/astro U87-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


glio/astro U-118-MG 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (squam.) 
SW 900 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 


40.6 


astrocytoma bNB-/D 




ividiiinicii y gicuiu 


0.0 


glioma SNB-19 


0.0 


Rrf*aQt pa * f nl pfi 

MCF-7 


0.0 


glioma U251 


0.0 


Rrpact ra * ( nl f*f^ 

Dl CaM \->Cx . yyji.s^ij 

MDA-MB-231 


0.0 


glioma SF-295 


0.0 

-t ~ r-*^. 


Rrpacf pa * f nl pf\ 

T47D 


0.0 


Heart 


0.0 


Breast ca. BT-549 


0.0 


Skeletal Muscle 


0.0 


Breast ca. MDA-N 


0.0 


Bone marrow 


0.0 


Ovary 


0.0 


Thymus 


0.0 


Ovarian ca. OVCAR- 

3 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 
4 


0.0 


Lymph node 


0.0 


! Ovarian ca. OVCAR- 


91.4 
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■J 




Colorectal Tissue 


0.0 


\J \ <X\\<X\\ Cd. \J V / \ IX - 

8 


0.0 


Stomach 


U.U 


KJ\ (XViciYl tel. IvJlvW V - 1 


0.0 


Small intestine 


0.0 


ovarian ca. ^ascites; 
SK-OV-3 


0.0 


Colon ca. SW480 


U.U 


uterus 


0 0 


Colon ca.* SW620 
(SW480 met) 


0.0 


Placenta 


0.0 . 


Colon ca. HT29 


0.0 


Prostate 


o.o 


Colon ca.HCT-1 16 


0.0 


Prostate ca.* (bone 
metj r\^-D 


0.0 


Colon ca. CaCo-2 


0.0 


Testis 


0.0 


Colon ca. Tissue 
(OD03866) 


59.0 


Melanoma 
Hs688(A).T 


b.o 


Colon ca. HCC-2998 


0.0 


Melanoma* (met) 
Hs688(TT> T 


2.0 


Gastric ca.* (liver met) 
NC1-N87 


0.0 


Melanoma UACC-62 


0.0 


Bladder 


0.0 


Melanoma Ml 4 


0.0 


Trachea 


0.0 


Melanoma LOX 
IMVI 


0.0 


Kidney 


0.0 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney (fetal) 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag2992 Expression of the NOV46d gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 1.1 Summary: Ag513 Expression of the NOV46d gene is limited to two 
5 samples derived from lung cancer and ovarian cancer cell lines (CTs=3 1-32). Thus, expression 
of this gene could be used to differentiate between these sample and other samples on this 
panel and as a marker to detect the presence of lung and ovarian cancers. Furthermore, 
therapeutic modulation of the expression or function of this gene may be effective in the 
treatment of lung and ovarian cancers. 
10 Panel 1.2 Summary: Ag513 Expression of the NOV46d gene is restricted to samples 

derived from lung cancer, ovarian cancer, and colon cancer cell lines (CTs=31-32). This 
expression profile is in agreement with the expression seen in Panel 1.1. Thus, expression of 
this gene could be used to differentiate between these sample and other samples on this panel 
and as a marker to detect the presence of these cancers. Furthermore, therapeutic modulation 
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10 



of the expression or function of this gene may be effective in the treatment of lung, ovarian, 
and colon cancers. 

Panel 1.3D Summary: Ag2992 Expression of the NOV46d gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2992 Expression of the NOV46d gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) 

NOV46c 

Expression of gene NOV46c was assessed using the primer-probe set Ag2985, 
described in Table AQA. 

Table AQA. Probe Name Ag2985 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -tctgtcatcccatctccaaa-3 ' 


20 


285 


1183 


Probe 


TET-5 ' -atcctcattcctgtgatgacctttct-3 ' - 
TAMRA 


26 


305 


1184 


Reverse 


5 ■ -tcatggcactcagaaagctc-3 1 


20 


346 


1185 



15 



CNS_neurodegeneration_vl.O Summary: Ag2985 Expression of the NOV46c gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag2985 Expression of the NOV46c gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) 
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NOV47 

Expression of gene NOV47 was assessed using the primer-probe set Ag2993, 
described in Table ARA. Results of the RTQ-PCR runs are shown in Tables ARB, ARC and 
ARD. 

Table ARA. Probe Name Ag2993 



Primers 


Sequences 


j Length 


1 Start 
| Position 


SEQ ID NO: 


Forward 


5 1 -gagggttactgctttcacagaa- 3 ' 


I 22 


]l61 


1186 


Probe 


TET-5 ' -tgacttcacatgccataatggcactg-3 ' 
TAMRA 


I 26 


j|211 


1187 


Reverse 


5 ' -ctccctgtagatggacttgct-3 ' 


I 21 


I 239 


1188 



Table ARB. CNSneurodegenerationvl.O 



Tissue Name ] Rel. Exp.(%) Ag2993, ) Tissue Name } Rel. Exp.(%) Ag2993, 
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Run 2110U946J 




Pun 21 1009463 


AD 1 Hippo 


ll.9 ! 


control yr d.ii\) j> 

remporal Ctx 


10.8 


AD 2 Hippo 


( 

43.2 


Control (Path) 4 
Temporal Ctx 


55.5 




5.3 


AD 1 Occipital Ctx 


38.4 


AD 4 Hippo 


15.2 


AD 2 Occipital Ctx 
^Missing) 


0.0 


AD 5 Hippo 


100.0 


AD 3 Occipital Ctx 


7.1 


AD 6 Hippo 


33.7 


alj 4 occipital L^lX 




Control 2 Hippo 


25.0 


AD 5 Occipital Ctx 


21.6 


Control 4 Hippo 


18.4 


AD 6 Occipital Ctx 


16.0 


Control (Path) 3 
Hippo 


5.3 


Control 1 Occipital 
Ctx 


2.2 


AH 1 Tpmnoral Ctx 


2.9 


Control 2 Occipital 
Ctx 


39.2 


An? Tpmnnral Ptx 


42.3 


Control 3 Occipital 
Ctx 


16.3 


ADl Tpmnnral Ctx 

/A.1— / — ' 1 C<111LHJ1C11 V-LA ; 


11.9 


Control 4 Occipital 
Ctx 


6.4 




45.4 


Control (Path) 1 
Occipital Ctx 


64.6 


AD 5 Inf Temporal 
Ctx 


86.5 


Control (Path) 2 
Occipital Ctx 


15.8 


AD 5 Sup Temporal 
Ctx 


39.5 


Control (Path) 3 
Occipital Ctx 


3.1 


AD 6 Inf Temporal 
Ctx 


24.8 


Control (Path) 4 
Occipital Ctx 


27.9 


AD 6 Sup Temporal 
Ctx 


30.1 


Control 1 Parietal 
Ctx 


15.1 


Control 1 Temporal 
Ctx 


8.7 


Control 2 Parietal 
Ctx 


31.9 


Control 2 Temporal 
Ctx 


26.6 


Control 3 Parietal 


26.6 


Control 3 Temporal 

Ctx 


13.5 


Control (Path) 1 
Parietal Ctx 


13.3 


Control 3 Temporal 
Ctx 


6.2 


Control (Path) 2 
Parietal Ctx 


45.7 


Control (Path) 1 
Temporal Ctx 


38.4 


Control (Path) 3 
Parietal Ctx 


6.7 


Control (Path) 2 
Temporal Ctx 


34.9 


; Control (Path) 4 
Parietal Ctx 


36.3 
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Table ARC. Panel L3D 



Tissue Name 


Rel. Exp.(%) Ag2993 r 
Run 166230386 


iRel. Exp.(%) Ag2993, 
Tissue Name | Run 166230 386 


Liver adenocarcinoma 


23.3 - jl 


vidney (jetaij 


19 S 

1 Zs .J 


Pancreas 


0.0 ~P 


rCenai ca. /oo-u 




Pancreatic ca. CAP AN 
2 


34.2 1 


Renal ca. A498 


31.4 


Adrenal gland 


0.0 1 


<enal ca. KAr 5y5 


lo.J 


Thyroid 


0.0 


Renal ca. ACHN 


14.3 


Salivary gland 


46.7 


Renal ca. UO-31 


33.2 


Pituitary gland 


5.5 


Renal ca. TK-10 


19.5 


Brain (fetal) 


0.0 


Liver 


4.7 


Brain (whole) 29.1 


Liver (fetal) 


4.0 


Brain (amygdala) 


11/1 

13.4 


Liver ca. 

(hepatoblast) HepG2 


37.4 


Brain (cerebellum) 


3.3 


Lung 


0.0 


Brain (hippocampus) 


22.5 


^ung (fetal) 


9.9 


Brain (substantia nigra) 


18.7 


Lung ca. (small cell) 
LX-1 


26.4 


Brain (thalamus) 


100.0 


Lung ca. (small cell) 
NCI-H69 


31.2 


Cerebral Cortex 


19.6 


Lung ca. (s.cell var.) 
SHP-77 


53.6 


Spinal cord 


29.9 


Lung ca. (large 
cell)NCI-H460 


4.2 


glio/astro U87-MG 


37.1 


Lung ca. (non-sm. 
cell) A549 


i i r 
1 1 .O 


glio/astro U-118-MG 


24.3 


Lung ca. (non-s.cell) 
NCI-H23 


1 D.J 


astrocytoma SW1783 


15.0 


Lung ca. (non-s.cell) 
HOP-62 


ZH-.o 


neuro*; met SK-N-AS 


21.5 


Lung ca. (non-s.cl) 
NCI-H522 


OA 8 


astrocytoma SF-539 


26.1 


Lung ca. (squam.) 
SW 900 


10.7 


astrocytoma SNB-75 


31.6 


Lung ca. (squam.) 
NCI-H596 


30.1 


glioma SNB-19 


25.5 


Mammary gland 


12.8 


glioma U25 1 


7.9 


Breast ca.* (pl.ef) 
MCF-7 


17.4 


glioma SF-295 


3.4 


Breast ca.* (pl.ef) 
MDA-MB-231 


21.8 


Heart (fetal) 


3.3 


Breast ca.* (pl.ef) 
T47D 


15.4 


Heart 


0.0 jBreast ca. BT-549 


16.5 
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Skeletal muscle (fetal) 


3.0 jBreast ca. MDA-N 


13.4 


Skeletal muscle 


21.2 jOvary 


J.O 


Bone marrow 


2.4 


Ovarian ca. OVCAR- 

3 I 


12.0 


Thvmus 




Ovarian ca. OVCAR- 
4 


39.5 


Spleen 




Ovarian ca. OVCAR- 
5 


34.6 


Lymph node 


O.J 


Ovarian ca. OVCAR- 
8 


2.8 


Colorectal 


45.1 


Ovarian ca. IGROV- 
1 


12.4 


Stomach 


6.0 


Ovarian ca.* (ascites) 
SK-OV-3 


3.7 


Small intestine 


19.6 


Uterus 


0.0 


Colon ca. SW480 


60.3 


Placenta 


23.8 

, . 


Colon ca.* 
SW620(SW480 met) 


I J.Z 




5.4 


Colon ca. HT29 


o.u 


Prostate ca.* (bone 
met)PC-3 


24.0 


Colon ca. HCT-116 


13.5 


Testis 


11.8 


Colon ca. CaCo-2 


31.6 


Melanoma 
Hs688(A).T 


5.0 


Colon ca. 


41.2 


Melanoma* (met) 
Hs688(B).T 


7.6 


Colon ca. HCC-2998 


32.5 


Melanoma UACC-62 


25.7 


Gastric ca.* (liver met) 
NCI-N87 


11.3 


Melanoma M14 


12.8 


Bladder 


45.1 


Melanoma LOX 
IMVI 


6.7 


Trachea 


8.5 


Melanoma* (met) 
SK-MEL-5 


30.1 


Kidney 


3.7 


Adipose 


3.2 



Table ARD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2993, Run 
164315034 


Tissue Name 


Rel. Exp.(%) 
Ag2993, Run 
164315034 


Secondary Th 1 act 


8.4 


HUVEC IL-lbeta 


3.8 


Secondary Th2 act 


11.8 


HUVEC 1FN gamma 


4.5 


Secondary Trl act 


14.3 


HUVEC TNF alpha + IFN 
gamma 


4.2 


Secondary Thl rest 


3.3 


HUVEC TNF alpha + IL4 


5.0 


Secondary Th2 rest 


6.0 


HUVEC IL-11 


5.7 
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Secondary Trl rest 


3.8 ' 
i 


Lung Microvascular EC 
none 


3.8 


Primary Th 1 act 


22.7 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


3.3 


Primary Th2 act 


16.0 


Microvascular Dermal EC 
none 


10.2 


Primarv Trl act 

1 1 illicit j All c 


30.4 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


5.5 ' 


PrimarvThl rest 

A 1 lllltll V L 1 1 1 I VJl 


39.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


18.0 


Primary Th2 rest 


31.9 


Small airway epithelium 
none 


5.2 


Primary Trl rest 


20.6 


Small airway epithelium 
TNFalpha + IL-lbeta 


25.9 


CD45RA CD4 
lymphocyte act 


10.0 


Coronery artery SMC rest 


4.5 


CD45RO CD4 
lymphocyte act 


24.1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


1.5 


CD8 lymphocyte act 


17.4 


Astrocytes rest 


A C 

4.D 


Secondary CD8 
lymphocyte rest 


13.0 


Astrocytes TNFalpha + 
IL-lbeta 


5.0 


Secondary CD8 
lymphocyte act 


10.7 


KU-8 1 2 (Basophil) rest 


14.0 


CD4 lymphocyte none 


9.0 


KU-812 (Basophil) 
r iVLAvionornycin 


22.4 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


12.5 


LLUi wo ^iverdiinocy icis^ 
none 


8.7 


LAK cells rest 


13.3 


yj i i wo ^jvci diniuv^y itoy 

TNFalpha + IL-lbeta 


6.1 


I ak cells TT -2 


14.4 


Liver cirrhosis 


6.0 


LAK cells IL-2+1L-12 


13.1 


Lupus kidney 


5.0 


LAK cells IL-2+IFN 

iiCll 1 I 1 1 Id 


15.8 


NCI-H292 none 


25.3 


LAK cells IL-2+ IL-18 


13.8 


NCI-H292 IL-4 


28.3 


LAK cells 

PM A /ion om vein 


3.9 


NCI-H292 IL-9 


31.2 


NK Cells IL-2 rest 


10.7 


NCI-H292 1L-13 


14.9 


Two Way MLR 3 day 


11.7 


NCI-H292 I FN gamma 


11.9 


Two Way MLR 5 day 


6.8 


1 1PAEC none 


4.9 


Two Way MLR 7 day 


6.0 


HPAEC TNF alpha + IL-1 
beta 


4.7 


PBMC rest 


3.3 


Lung fibroblast none 


2.6 


PBMC PWM 


30.4 


Lung fibroblast TNF alpha 
+ IL-1 beta 


1.4 


PBMC PHA-L 


\ 20.9 


Lung fibroblast IL-4 


7.1 
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Ramos (B cell) none 


22. 1 Lung fibroblast IL-9 


6.5 


Ramos (B cell) 
ionomycin 


100.0 


Lung fibroblast IL- 1 3 


4.9 


o iy inpiiuv^ lc^ i vv ivi 


Z> 1 .^t 


Lung fibroblast 1FN 
gamma 


7.1 


B lymphocytes CD40L 
and IL-4 


OA 8 
Z^f -5 


Dermal fibroblast 
CCD 1070 rest 


9.7 


EOL-1 dbcAMP 


5.6 


Dermal fibroblast 
CCD 1070 TNF alpha 


24.1 


EOL-1 dbcAMP 
PMA/ionomycin 


4.5 


Dermal fibroblast 
d^uiu/u oeia 


3.9 


Dendritic cells none 


7.0 


Dermal iiuroDiast iriN 
gamma 


4.0 


Dendritic cells LPS 


5.7 


Dermal fibroblast IL-4 


5.9 


Dendritic cells anti- 
CD40 


8.7 


TBD Colitis 2 


0.2 


Monocytes rest 


3.0 


IBD Crohn's 


0.0 


Monocytes LPS 


1.6 


[Colon 


8.2 


Macrophages rest 


11.3 


Lung 


7.3 


Macrophages LPS 


5.8 


[Thymus 


27.0 


HUVEC none 


6.7 


[Kidney 


23.0 


HUVEC starved 


15.6 







CNS_neurodegeneration_vl.O Summary: Ag2993 This panel does not show 
differential expression of the NOV47 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag2993 Significant expression of the NOV47 gene is 
restricted to the brain, with expression in the thalamus (CT=34.4). This gene also shows strong 
expression in the brain in the previous panel suggesting that this gene product may be involved 
in the normal functioning of the brain. Thus, the protein encoded by this gene may represent a 
10 small molecule target for the treatment of neurologic diseases. 

Panel 4D Summary: Ag2993 The NOV47 gene, a Peptidyl Prolyl Cis-Trans 
Isomerase A homolog, is a novel member of the family of receptors for the widely used 
immunosuppressants cyclosporin A and FK506 (see Wang et al., 2001). The NOV47 gene is 
expressed at moderate levels in many of the tissues in this panel and is expressed at a 
15 somewhat higher level (CT = 30.3) in ionomycin-stimulated Ramos B lymphocytes. 

Therefore, small molecule drugs that antagonzie the activity of the NOV47 gene product may 
be useful as immunosuppressants to reduce or eliminate the symptoms in patients with 
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autoimmune or inflammatory conditions, such as Crohn's disease, ulcerative colitis, multiple 
sclerosis, chronic obstructive pulmonary disease, asthma, emphysema, rheumatoid arthritis, 
lupus erythematosus, or psoriasis. 
References: 

5 Wang HC, Kim K, Bakhtiar R, Germanas JP. Structure-activity studies of ground- and 

transition-state analogue inhibitors of cyclophilin. J Med Chem 2001 Aug 2;44(16):2593-600 
Peptidyl-prolyl isomerases (PPIases) are ubiquitous cellular enzymes that play roles in 
cellular signaling and protein folding. In addition, these proteins are the receptors for the 
widely used immunosuppressants cyclosporin A and FK506. We report the first structure- 

10 activity studies of de novo designed inhibitors of cyclophilin, the cellular target of cyclosporin 
A. Our mechanism-based inhibitors were modeled on the ground- and transition-state 
structures of proline-containing peptides, the natural substrates of the enzyme. Both ground- 
state analogues 1 and transition-state analogues 2 were prepared as single enantiomers from L- 
proline following a "self-reproduction of chirality" procedure. The binding affinities of the 

15 analogues for the active site of cyclophilin were measured by a fluorescence perturbation 

assay. While the transition-state analogues 2 did not display significant avidity for the active 
site (K(d) = 77 microM for 2b), several ground-state analogues bound to the enzyme with low 
micromolar affinity (K(d) = 1 .5 microM for le). These results proclaim that properly designed 
small molecular weight molecules can form strong complexes with cyclophilin and may find 

20 use as probes in cell biology and as therapeutic agents. 

NOV48a 

Expression of gene NOV48a was assessed using the primer-probe set Ag3006, 
described in Table ASA. Results of the RTQ-PCR runs are shown in Tables ASB, ASC, ASD, 
ASE and ASF. 

25 Table ASA. Probe Name Ag3006 



Primers 


Sequences 


Length. 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gggcttggaatagagaaacct-3 * i 


21 


2991 


1189 


Probe 


TET-5 ' -caacttcctcaaagcccaaagccaag-3 * - 
TAMRA 


26 


3037 


1190 


Reverse 


5 ' -gaagccttgagccttgatttat-3 ' 


22 


3069 


1191 



Table ASB. Alcomprehensive panel_vl.O 



Tissue Name 


Rel. Exp.(%) I 
Ag3006, Run 


Tissue Name 


Rel. Exp.(%) 
Ag3006, Run 
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211059882 






110967 COPD-F 


27.2 


1 12427 Match Control 
rsonasis-r 


72.7 


110980 COPD-F 


18.8 


112418 Psoriasis-M 


20.2 


1 lUVOo CUrJJ-M 


zu.u 


1 12723 Match Control 
Psoriasis-M 


19.8 


110977 COPD-M 


71.7 


1 12419 Psoriasis-M 


32.8 


110989 Emphysema- 
F 


46.0 


112424 Match Control ! 
Psoriasis-M 


22.7 


1 10992 Emphysema- 


18.3 


1 12420 Psoriasis-M 


74.2 


110993 Emphysema- 
F 


23.8 


112425 Match Control 
Psoriasis-M 


56.6 


1 10994 Emphysema- 
F 


20.7 


104689 (MF) OA 
Bone-Backus 


55.1 


110995 Emphysema- 
F 


19.9 


104690 (MF) Adj 
"Normal" Bone-Backus 1 


70.7 


1 10996 Emphysema- 
F 


4.3 


104691 (MF) OA 
Synovium-Backus 


42.0 


110997 Asthma-M 


6.4 


104692 (BA) OA 
Cartilage -Backus 


50.7 


111001 Asthma-F 


33.4 


104694 (BA) OA 
Bone-Backus 


32.5 


111002 Asthma-F 


33.0 


104695 (BA) Adj 
"Normal" Bone-Backus 


52.1 


111003 Atopic 
Asthma-F 


jl.O 


104696 (BA) OA 
Synovium-Backus 


25.7 


111004 Atopic 
Asthma-F 




104700 (SS) OA Bone- 
Backus 


10.4 


111005 Atopic 
Asthma-F 


1 1 1 
11.1 


104701 (SS) Adj 
"Normal" Bone-Backus 


42.0 


111006 Atopic 
Asthma-F 


J .o 


104702 (SS) OA 
Synovium-Backus 


71.2 


111417 Allergy-M 


13.7 


1 17093 OA Cartilage 
ivep / 


23.8 


112347 Allergy-M 


6.2 


112672 OA Bone5 


57.8 


112349 Normal Lung- 
F 


3.8 


112673 OA Synovium5 


22.7 


112357 Normal Lung- 
F 


39.8 


112674 OA Synovial 
Fluid cells5 


27.5 


112354 Normal Lung- 
M 


22.1 


1 1 17100 OA Cartilage 
Rep 14 


12.3 


112374 Crohns-F 


36.9 


112756 OA Bone9 


41.2 


112389 Match 
Control Crohns-F 


15.6 


] 1 12757 OA Synovium9 


i 100.0 
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112375 Crohns-F 


20.6 


1 12758 OA Synovial 
Fluid Cells9 


19.2 


112732 Match 
Control Cronns-r 


17.9 


1 17125 RA Cartilage 
Kepz 


25.0 


1 12725 Crohns-M 


6.1 


113492 Bone2 RA 


19.8 


112387 Match 
Control Crohns-M 


14.9 


1 13493 Synovium2 RA j 


7.7 


112378 Crohns-M 


4.8 


113494 Syn Fluid Cells 
RA 


14.5 


112390 Match 
Control Crohns-M 


48.0 


1 13499 Cartilage4RA ! 


24.1 


112726 Crohns-M 


33.0 


113500 Bone4 RA 


25.9 


112731 Match 
Control Crohns-M 


25.2 


113501 Synovium4RA 


15.4 


112380 Ulcer Col-F 


32.3 


113502 Syn Fluid 
Cells4 RA 


7.5 


112734 Match 
Control Ulcer Col-F 


32.5 


1 13495 Cartilage3RA ! 


11.8 


112384 Ulcer Col-F 


35.6 


113496 Bone3 RA 


12.6 


112737 Match 
Control Ulcer Col-F 


6.5 


113497 Synovium3RA 


5.6 


112386 Ulcer Col-F 


17.3 


113498 Syn Fluid 
Cells3 RA 


13.9 


112738 Match 
Control Ulcer Col-F 


11.3 


117106 Normal 
Cartilage Kep/u 


14.2 


112381 Ulcer Col-M 


5.1 


113663 Bone3 Normal 


6.8 


112735 Match 
Control Ulcer Col-M 


41.2 


1 13664 Synovium3 
Normal 


1.1 


112382 Ulcer Col-M 


25.2 


1 1 jodj oyn riuiu 
Cells3 Normal 


3.8 


112394 Match 
Control Ulcer Col-M 


11.2 


117107 Normal 
Cartilage Rep22 


15.0 


112383 Ulcer Col-M 


23.0 


113667 Bone4 Normal 


23.8 


112736 Match 
Control Ulcer Col-M 


13.0 


113668 Synovium4 
Normal 


24.5 


112423 Psoriasis-F 


34.9 


113669 Syn Fluid 
!Cells4 Normal 


37.9 



Table ASC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3006, 
Run 165517770 


Tissue Name 


Rel. Exp.(%) Ag3006, 
Run 165517770 


Liver adenocarcinoma 


4.7 


Kidney (fetal) 


1.2 


Pancreas 


0.7 


Renal ca. 786-0 


0.8 


Pancreatic ca. CAP AN 


1.1 


Renal ca. A498 


2.5 
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Adrenal gland 


1.0 1 


£enal ca. RXF 393 


0.9 


Thyroid 


1.9 1 


Flenal ca. ACHN 


2.2 


Salivary gland 


Z.U i 


Rpnal ca UO-31 


1.7 


Pituitary gland 


7.0 1 


Renal ca. TK-10 


1.7 


Brain (fetal) 


2.2 j] 


^iver 


0.2 


Brain (whole) 


4.0 


Liver (fetal) 


0.7 


t>rain ^cuiiyguaicij 


4.6 


Liver ca. 

(hepatoblast) HepG2 


1.8 


Brain (cerebellum) 


2.1 


Lung 


1.2 


Brain (hippocampus) 


3.6 


Lung (fetal) 


0.9 


Brain (substantia nigra) 


3.5 


Lung ca. (small cell) 
LX-1 


3.0 


Brain (thalamus) 


8.0 


Lung ca. (small cell) 
NCI-H69 


0.6 


Cerebral Cortex 


2.3 


Lung ca. (s.cell var.) 
SHP-77 


2.4 


Spinal cord 


5.2 


Lung ca. (large 
cell)NCI-H460 


6.1 


glio/astro U87-MG 


12 


Lung ca. (non-sm. 
cell) A549 


0.9 


glio/astroU-118-MG 


0 0 

y .y 


Lung ca. (non-s.cell) 
NCI-H23 


2.8 


astrocytoma SW1783 




Lung ca. (non-s.cell) 
HOP-62 


1.1 . 


neuro*; met SK-N-AS 


R 8 


Lung ca. (non-s.cl) 
NCI-H522 


2.1 


astrocytoma SF-539 


2 6 


Lung ca. (squam.) 
SW 900 


1.3 


aSllOCy LDIIla ol>IX->-/~J 


3.7 


Lung ca. (squam.) 
NCLH396 


2.6 


glioma SNB-19 


2.7 


Mammary gland 


2.1 


glioma U251 


15.3 


Breast ca.* (pl.ef) 
MCr-/ 


0.8 


glioma SF-295 


2.2 


Breast ca.* (pl.ef) 
MDA-MB-23 1 


3.1 


nCall ^IClal^ 


0.2 


", Breast ca.* (pl.ef) 
IT47D 


0.7 


Heart 


1.8 


Breast ca. BT-549 


2.7 


Skeletal muscle (fetal) 


4.4 


\ Breast ca. MDA-N 


1.6 


Skeletal muscle 


100.0 


Ovary 


1.2 


Bone marrow 


1.8 


Ovarian ca. OVCAR- 

3 


1.7 


Thymus 


0.6 


Ovarian ca. OVCAR- 
4 


2.2 
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Spleen 


2.5 


Ovarian ca. OVLAR- 


1.7 


Lymph node 


4.7 


Ovarian ca. OVCAR- 

c 
o 


0.6 


Colorectal 


2.9 


Ovarian ca. IGROV- 

i 
i 


0.9 


Stomach 


3.2 


Ovarian * fa<ioitesi 

SK-OV-3 


2.0 


Small intestine 


Q Q 


U LCI Ut> 


3.2- 


Colon ca. SW480 


0.9 


Placenta 


0.8 


Colon ca.* 
SW620(SW480 met) 


1.3 


Prostate 


4.2 


Colon ca. HT29 


0.2 


Prostate ca.* (bone j 

IT1CL) 1 L J 


1.6 


Colon ca. HCT-116 


2.0 


Testis 


6.3 


Colon ca. CaCo-2 


1.2 


Melanoma 
Hs688(A).T 


0.9 


Colon ca. 


0.9 


Melanoma* (met) 
Hs688(B).T 


0.3 


Colon ca. HCC-2998 


2.2 


Melanoma UACC-62 


3.1 


Gastric ca.* (liver met) 
NCI-N87 


3.9 


Melanoma M14 


5.0 


Bladder 


0.8 


Melanoma LOX 
IMVI 


0.7 


Trachea 


2.3 


Melanoma* (met) 
SK-MEL-5 


0.8 


Kidney 


J 4.0 


Adipose 


1.1 



Table ASD. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag3006, Run 
163577592 


Tissue Name 


Rel. Exp.(%) 
Ag3006, Run 
163577592 


Normal Colon 


35.8 


Kidney Margin 
8120608 


9.0 


CC Well to Mod Diff 
(OD03866) 


5.6 


Kidney Cancer 
8120613 


13.2 


CC Margin (OD03866) 


6.2 


Kidney Margin 
8120614 


16.2 


CC Gr.2 rectosigmoid 
(OD03868) 


5.5 


Kidney Cancer 
9010320 


8.7 


CC Margin (OD03868) 


10.8 


Kidney Margin 
9010321 


20.9 


CC Mod Diff (ODO3920) 


9.5 


Normal Uterus 


6.3 


CC Margin (ODO3920) 


19.6 


Uterus Cancer 064011 


| 9.4 
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CC Gr.2 ascend colon 
(OD03921) 


10.7 1 


formal Thyroid 


21.0 


CC Margin (OD03921) 


8.5 ; 


rhyroid Cancer 
364010 


2.5 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


6.1 


Thyroid Cancer 
A302152 


5.6 


1 iver Marein (ODO4309V 


0.9 


Thyroid Margin 
A302153 


8.2 


Colon mets to lung 
(OD04451-01) 


6.8 


Normal Breast 


24.7 


Lung Margin (OD0445 1 - j 
02) 


2.8 


Breast Cancer 
(OD04566) 


65.1 


TSJrvrmal Prostate 6^46-1 


66.9 


Breast Cancer 
(OD04590-01) 


70.2 


Prostate Cancer 
(OD04410) 


16.6 


Breast Cancer Mets 
(OD04590-03) 


68.3 


Prostate Margin 


18.0 


Breast Cancer 

Metastasis 

(OD04655-05) 


100.0 


Prostate Cancer 
(OD04720-01) 


13.7 


Breast Cancer 064006 


14.7 


Prostate Margin 
(OD04720-02) 


25.5 


Breast Cancer 1024 


33.7 


Normal T uncr 061 01 0 


15.1 


Breast Cancer 
9100266 

~ — Bam 


26.6 


Lung Met to Muscle 
(OD04286) 


22.4 


Breast Margin 
9100265 


15.3 


Muscle Margin 
(OD04286) 


52.5 


Breast Cancer 
A209073 


16.7 


Lung Malignant Cancer 
(OD03126) 


9.9 


Breast Margin 


16.3 


Lung Margin (OD03126) 


11.2 


Normal Liver 


ft X 


Lung Cancer (OD04404) j 3.9 


juiver cancer uohuuj 


1 6 


Lung Margin (OD04404) 


1 8.7 


Liver dancer iuzj 


7 1 


Lung Cancer (OD04565) 


6.5 


Liver Cancer 1026 


2.0 


Lung Margin (OD04565) 


6.2 


Liver Cancer 6004-T 


2.3 


Lung Cancer (OD04237- 
01) 


13.3 


Liver Tissue 6004-N 


3.2 


Lung Margin (OD04237- 
02) 


11.4 


Liver Cancer 6005-T 


2.6 


Ocular Mel Met to Liver 
(ODO4310) 


5.2 


Liver Tissue 6005 -N 


0.6 


Liver Margin (ODO4310) 


2.0 


Normal Bladder 


0 7 


Melanoma Mets to Lung 
(OD04321) 


6.3 


Bladder Cancer 1023 


2.2 
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Lung Margin (OD04321) 


11.5 


Bladder Cancer 
A302173 


4.4 


Normal Kidney 


47.6 


Bladder Cancer 
(OD047 18-01) 


10.3 


Kidney Ca, Nuclear grade j 
2 (OD04338) 


27.7 


Bladder Normal 
Adjacent (OD04718- 
03) 


62.0 


Kidney Margin 
(OD04338) 


43.8 


Normal Ovary 


5.0 


Kidney Ca Nuclear grade 1 
1/2 (OD04339) 


12.4 


Ovarian Cancer 
064008 


17.9 


Kidney Margin 
(OD04339) 


46.7 


Ovarian Cancer 
(OD04768-07) 


20.3 


Kidney Ca, Clear cell 
type (OD04340) 


20.2 


Ovary Margin 
(OD04768-08) 


3.6 


Kidney Margin 
(OD04340) 


40.6 


Normal Stomach 


32.5 


Kidney Ca, Nuclear grade 
3 (OD04348) 


4.0 


Gastric Cancer 
9060358 


9.6 


Kidney Margin 
(OD04348) 


30.1 


Stomach Margin 
9060359 


8.6 


Kidney Cancer 
(OD04622-01) 


9.1 


Gastric Cancer 


12.0 


Kidney Margin 
(OD04622-03) 


5.6 


Stomach Margin 
9060394 


14.3 


Kidney Cancer 
(OD04450-01) 


9.1 


Gastric Cancer 
9060397 


17.6 


Kidney Margin 
(OD04450-03) 


24.0 


Stomach Margin 
9060396 


2.6 


Kidney Cancer 8120607 


18.3 


Gastric Cancer 
064005 


20.7 



Table ASE. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Ag3006, Run 
170188143 


Tissue Name 


Rel. Exp.(%) 
Ag3006, Run 
170188143 


Daoy- Medulloblastoma 


8.5 


Ca Ski- Cervical epidermoid ! 
carcinoma (metastasis) 


15.7 


TE671- Medulloblastoma 


7.5 


ES-2- Ovarian clear cell 
carcinoma 


8.3 


D283 Med- 
Medulloblastoma 


31.6 


Ramos- Stimulated with 
PMA/ionomycin 6h 


6.3 


PFSK-1- Primitive 
Neuroectodermal 


21.8 


Ramos- Stimulated with 
PMA/ionomycin 14h 


12.6 


XF-498- CNS 


5.4 


MEG-01- Chronic 


3.5 
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1 


-nyelogenous leukemia 

i illCHL'Jvcn y kjuicxji ) 




SNB-78- (jlioma 




IvdJ I - J3UI KHL lyiUJJIlvJlIICi 


5.8 


SF-268- Glioblastoma 






15.4 


T98G- Glioblastoma 


24.5 


U266- B-cell plasmacytoma 


11.4 


SK-N-SH- 

Neuroblastoma 

(metastasis) 


14.4 


CA46- Burkitt's lymphoma 


LA 


SF-295- Glioblastoma 


8.0 


T x x 1 1 " 1 T~"» 1 1 

RL- non-Hodgkin's B-cell 
lymphoma 


1.3 


Cerebellum 


5.3 


W . _ M Ill 1 

JMl- pre-B-cell lymphoma : 


6.4 


Cerebellum 


3.2 


Jurkat- T cell leukemia 


3.8 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


13.5 


TF-1- Erythroleukem ia 


8.0 


DMS-114- Small cell 
lung cancer 


9.1 


HUT 78- T-cell lymphoma 


7.8 


DMS-79- Small cell lung \ 
cancer 


100.0 


U937- Histiocytic lymphoma 1 


8.3 


NCI-H146- Small cell 
lung cancer 


6.3 


KU-812- Myelogenous 
leukemia 


5.8 


NCI-H526- Small cell 
lung cancer 


20.6 


769-P- Clear cell renal 
carcinoma 


7.7 


NCI-N417- Small cell 
lung cancer 


21.5 


Caki-2- Clear cell renal 
carcinoma 


13.9 


NCI-H82- Small cell 
lung cancer 


1 J.z. 


SW 839- Clear cell renal 
carcinoma 


0.9 


NCI-H157- Squamous 
cell lung cancer 
(metastasis) 


18.2 


G401- Wilms 1 tumor 


3.7 


NCI-H1155- Large cell 
lung cancer 


18.6 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


10.5 


NCI-H1299- Large cell 
lung cancer 


19.2 


CAPAN-1 - Pancreatic 
adenocarcinoma (liver 
metastasis) 


6.3 


NCI-H727- Lung 
carcinoid 


7.2 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


10.3 


NCI-UMC-1 1- Lung 
carcinoid 


12.5 


Dai L J JT clIlL/i CaUL 

adenocarcinoma 


3.7 


LX-1- Small cell lung 
cancer 


10.9 


HPAC- Pancreatic 
adenocarcinoma 


11.0 


Colo-205- Colon cancer 


7.1 


MIA PaCa-2- Pancreatic 
carcinoma 


1.4 


KM 12- Colon cancer 


13.1 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


23.2 


KM20L2- Colon cancer 


1.1 


PANC-1- Pancreatic 


4.7 
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epithelioid ductal carcinoma ; 


\Ip I TJ^I/C /^i-vl^n />nn^pr 

JNUi-ri / 1 o- i^oion cancer 


7 ? 

7.2 ( 


r24- Bladder carcinma 
^transitional cell) 


3.5 


SW-48- Colon 
adenocarcinoma 


4 4 


5637- Bladder carcinoma 


4.7 


SW1116- Colon 
adenocarcinoma 


S 8 1 


HT-1 197- Bladder carcinoma 


1.5 . 


LS 174T- Colon 
adenocarcinoma 


\ 7 


UM-UC-3- Bladder carcinma 
(transitional cell) 


4.8 


SW-948- Colon 
adenocarcinoma 




A204- Rhabdomyosarcoma 


35.4 


SW-480- Colon 
adenocarcinoma 


1 7 


HT-1080- Fibrosarcoma 


9.0 


NCI-SNU-5- Gastric 
carcinoma 


J. 1 


MG-63- Osteosarcoma 


4.3 


KATO III- Gastric 
carcinoma 




SK-LMS-1- Leiomyosarcoma: 
(vulva) 


13.8 


NCI-SNU-16- Gastric 
carcinoma 




SJRH30- Rhabdomyosarcoma; 
(met to bone marrow) 


7.3 


NCI-SNU-1- Gastric 
carcinoma 


ion 


A43 1 - Epidermoid carcinoma 


2.8 


RF-1- Gastric 
adenocarcinoma 


*t . 1 


WM266-4- Melanoma 


5.4 


RF-48- Gastric 
adenocarcinoma 


1 4 


DU 145- Prostate carcinoma 
(brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


inn 


MDA-MB-468- Breast 
adenocarcinoma 


14.5 


NCI-N87- Gastric 
carcinoma 


6.1 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


3.5 


SCC- 15- Squamous cell 
carcinoma of tongue 


0.3 


HelaS3- Cervical 
adenocarcinoma 


6.7 


CAL 27- Squamous cell 
carcinoma of tongue 


5.2 


Table ASF. Panel 4D 


Tissue Name 


Rel. Exp.(%) 
Ag3006, Run 
168033497 


Tissue Name 


Rel. Exp.(%) 
Ag3006, Run 
168033497 


Secondary Thl act 


32.3 


HUVEC IL-lbeta 


9.0 


Secondary Th2 act 


25.5 


HUVEC IFN gamma 


21.9 


Secondary Trl act 


39.0 


HUVEC TNF alpha + IFN 
gamma 


29.9 


Secondary Thl rest 


5.8 


! HUVEC TNF alpha + IL4 


26.4 
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Secondary Th2 rest 


7.8 HUVECIL-11 


1 1.0 


oecoiiticu y iii icm 


15.9 


Lung Microvascular EC 
none 


27.7 


rnmary lni ctci 


38.2 


Lung Microvascular EC 
TOFalpha + IL-1 beta 


28.3 


rrirnary i nz aLi 


33.7 


Microvascular Dermal EC 
none 


30.8 


rrimary in dci 


47.6 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


17.4 


r nmary i n i resi 


3 J.O 


Bronchial epithelium 
TNFalpha + ILlbeta 


23.3 


Primary Th2 rest 


25.7 


Small airway epithelium 
none 


12.3 


Primary Trl rest 


23.7 


Small airway epithelium 
TNFalpha + IL-lbeta 


44.4 


CD45RA CD4 
lymphocyte act 


23.7 


Coronery artery SMC rest 


13.2 


CD45RO CD4 
lymphocyte act 


29.5 


Coronery artery SMC 
TNFalpha + IL-lbeta 


10.4 


CD8 lymphocyte act 


33.7 


Astrocytes rest 


11.5 


Secondary CD8 
lymphocyte rest 


37.6 


Astrocytes TNFalpha + 
IL-lbeta 


6.0 


Secondary CD8 
lymphocyte act 


16.4 


KU-812 (Basophil) rest 


14.5 


CD4 lymphocyte none 


4.8 


KU-812 (Basophil) 
PMA/ionomycin 


22.1 


2ry Thl/Th2/Trl_anti- 
CD95 CH11 


15.3 


CCD 11 06 (Keratinocytes) 
none 


28.1 


LAK cells rest 


11.2 


CCD 11 06 (Keratinocytes) 

"['XI *il.-.l-wi _t_ TT T katq 

1 INralpna + iL-iDeia 


14.8 


L/vis. ceiis il-z 


29.9 


Liver cirrhosis 


2.2 


LA Iv CeilS IL-ZiML- 1 Z 


25.3 


Lupus kidney 


5.8 


LAK cells IL-2+IFN 
gamma 


44.1 


NCI-H292 none 


15.4 


T AF ppIIc TT 9-1-11 18 
L/\rv CC11S 1L-Z i IL-IO 


35.8 


NCI-H292 IL-4 


36.3 


LAK cells 

PM A/ionomvcin 


8.7 


NCI-H292 IL-9 


22.8 


NK Cells IL-2 rest 


23.3 


NCI-H292 IL-13 


12.7 


Two Way MLR 3 day 


33.0 


NCI-H292 IFN gamma 


17.4 


Two Way MLR 5 day 


18.3 


HPAEC none 


22.8 


Two Way MLR 7 day 


17.0 


HPAEC TNF alpha + IL-1 
beta 


15.9 


PBMC rest 


5.0 


Lung fibroblast none 


27.5 


PBMC PWM 


100.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


14.3 
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PBMC PHA-L 


31.2 


Lung fibroblast IL-4 j 


39.0 


Ramos (B cell) none 


30.4 


Lung fibroblast IL-9 j 


26.2 


Ramos (B cell) 
ionomycin 


66.4 


Lung fibroblast IL-13 


27.9 


R Ivmnhnrvtes PWIVt 


96.6 


Lung fibroblast IFN 
gamma 


35.8 


B lymphocytes CD40L 
and TL-4 


32.3 


Dermal fibroblast 
CCD 1070 rest 


31.2 


EOL-1 dbcAMP 


18.7 


Dermal fibroblast 
CCD1070 INr alpha 


49.3 


EOL-1 dbcAMP 
PMA/ionomycin 


15.6 


Dermal fibroblast 
CCD 1070 TT -1 beta 


9.7 


Dendritic cells none 


4.6 


gamma 


9.5 


Dendritic cells LPS 


3.0 


Dermal fibroblast IL-4 


15.7 


Dendritic cells anti- 
CD40 


4.0 


IBD Colitis 2 


6.0 


Monocytes rest 


7.5 


IBD Crohn's 


1.4 


Monocytes LPS 


10.8 


Colon 


25.0 


Macrophages rest 


6.8 


Lung 


7.9 


Macrophages LPS 


3.1 


Thymus 


55.1 . 


HUVEC none 


27.9 


Kidney 


18.4 


HUVEC starved 


35.8 







AI_comprehensive panelvl.O Summary: Ag3006 The NOV48a gene is a novel 
member of the Phospholipase C family that is expressed at moderate to low levels in numerous 
cell types involved in the immune response in health and disease. In addition, the NOV48a 
5 gene is expressed at higher levels (CTs range 29-32) in samples obtained from patients with 
several autoimmune and inflammatory diseases, particularly a subset of samples from 
osteoarthritic synovium and psoriasis. Therefore, small molecule drugs that antagonzie the 
activity of this gene product may be useful as immunosuppressants to reduce or eliminate the 
symptoms in patients with conditions, such as osteoarthritis, psoriasis, Crohn's disease, 

10 ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary disease, asthma, 
emphysema, rheumatoid arthritis, or lupus erythematosus. 

Panel 1.3D Summary: Ag3006 Highest expression of the NOV48a gene, a 
phospholipase C homolog, is seen in skeletal muscle (CT=27.6). Phosphatidylinositol-specific 
phospholipase C (PLC) plays an important role in receptor-mediated signal transduction. In 

15 addition to skeletal muscle, this gene is expressed in heat, liver, adipose, adrenal, thyroid, and 

pancreas. This widespread expression in metabolic tissues suggests that this gene product may 

be involved in cellular regulation of metabolism through interactions with the insulin receptor. 
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A therapeutic modulator to this gene and/or gene-product may be useful in the treatment of 
metabolic diseases that affect skeletal muscle metabolism. This gene and/or gene-product may 
also prove useful in differentiating between fetal and adult forms of skeletal muscle tissue, 
since it is expressed at much higher levels in the adult (CT=27) when compared to expression 
5 in the fetal tissue (CT^32). 

This gene is also expressed at a low level in almost all cancer cell lines in this panel. 
Hence, it is probably required for cell survival and proliferation and therefore, inhibition of 
this gene in cancer can probably be used as therapy. 

In addition, increased expression of PLC delta has been observed in the brains of 
10 Alzheimer's disease patients, indicating a role for this class of enzyme in the disease process. 
Therefore, inhibitors of the NOV48A protein product, by countering this disease associated 
process, may have utility in treating Alzheimer's disease and other neurodegenerative 
disorders. 

References: 

1 5 Tanino H, Shimohama S, Sasaki Y, Sumida Y, Fujimoto S. Increase in phospholipase 

C-deltal protein levels in aluminum-treated rat brains. Biochem Biophys Res Commun 2000 
May 19;271(3):620-5 

The effect of administration of aluminum to rats on the level of three phospholipase C 
(PLC) isozymes (betal, gammal, and deltal) was assessed in a variety of brain tissues. After 

20 exposure to aluminum, a statistically significant increase in malondialdehyde, an index of lipid 
peroxidation, was observed. In addition, there was a significant reduction in the catalytic 
activity of low molecular weight phosphotyrosine phosphatase, which loses its activity during 
oxidative stress. This suggests that oxidative stress is induced in brain tissues exposed to 
aluminum. The protein level of PLC-deltal, but not that of PLC-betal or -gammal, was 

25 significantly increased in brains where oxidative stress had been induced. The total PLC 
activity in aluminum-treated rat brains was significantly higher than that in control brains. 
These results suggest that PLC-deltal protein levels in brain tissues are increased by the 
induction of oxidative stress, giving an explanation for its up-regulation in Alzheimer's 
disease. 

30 Panel 2D Summary: Ag3006 The NOV48A gene is expressed at a low to moderate 

level in most of the tissues on this panel. There is increased expression in ovarian and breast 
cancer compared to normal adjacent tissue. Thus, expression of this gene could potentially be 
used as a diagnostic marker for the presence of cancer. Furthermore, inhibition of this gene in 
ovarian and breast cancer may be useful as a therapeutic treatment. Additionally, there is 
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increased expression in normal kidney samples compared to adjacent tumors. Thus, decreased 
expression of this gene could be used as a diagnostic marker for kidney cancer and therapeutic 
modulation of expression of this gene in tumors may be used to treat these cancers. 

Panel 3D Summary: Ag3006 The NOV48a gene is expressed at a low level in almost 
5 all cancer cell lines in this panel with the highest expression in DMS-79 (CT=29.2l). This 
ubiquitous pattern of expression suggests that this gene product may be required for cell 
survival and proliferation and inhibition of this gene in cancer may therefore be useful as a 
therapy. 

Panel 4D Summary: Ag3006 The NOV48a gene is ubiquitously expressed among the 
10 samples on this panel, suggesting a role for this protein product in inflammation. Please see 
AI comprehensive panelvl.O for further discussion of utility of this gene in inflammation. 
Results from a second experiment with the CG56003-01 gene are not included. The amp plot 
indicates that there were experimental difficulties with this run. 

NOV49 

1 5 Expression of gene NOV49 was assessed using the primer-probe set Ag3003, 

described in Table ATA. Results of the RTQ-PCR runs are shown in Table ATB. 



Table ATA. Probe Name Ag3003 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -catctcgtccaccctacgtt-3 * 


20 


299 


1192 


Probe 


TET- 5 1 -cttcagctgctgttgcactcaaggat-3 1 - 
TAMRA 


26 


339 


1193 


Reverse 


5' -ttcaggaagccatagaaactca-3 » 


22 


366 


1194 



Table ATB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3003, 
Run 166245477 


Tissue Name 


Rel. Exp.(%) Ag3003, 
Run 166245477 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


0.0 


Pancreas 


15.2 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 

2 


0.0 


Renal ca. A498 


0.0 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 


Salivary gland 


17.0 


Renal ca. UO-3 1 


0.0 


Pituitary gland 


0.0 


Renal ca. TK-10 


14.0 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.0 


Liver (fetal) 


0.0 
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Brain (amygdala) 


0.0 


Liver ca. 

^nepaiODiasi^ riepvjz, 


0.0 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain (hippocampus) 


0.0 


Lung (fetal) 


0.0 


Brain (substantia nigra) 


0.0 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


17.7 


Sninal cord 


29.9 


Lung ca. (large 
cell)NCI-H460 


0.0 


plio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 


v\ in/astro 1 1-1 1 R-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


15.5 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 
JNU1-H jVo 


0.0 


glioma SNB-19 


0.0 


Mammary gland 


JZ. 1 


glioma U25l 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


100.0 


glioma SF-295 


0.0 


oreast ca. (pi.eij 
MDA-MB-231 


0.0. 


Heart (fetal) 


0.0 


t>reast ca. jjpi.eij 
T47D 


0.0 


Heart 


n f\ 
u.u 


oreasi ca. d i -D^y 


0 0 


Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 

3 


0.0 


Thymus 


0.0 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 
5 


51.8 


Lymph node 


0.0 


Ovarian ca. OVCAR- 
8 


0.0 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 0.0 
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SK-OV-3 




Small intestine 


0.0 


Uterus 


u.u 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


11.2 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


0.0 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


o.o 


Gastric ca.* (liver met) 
NCI-N87 


31.2 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea j 13.3 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney j 0.0 


Adipose 


0.0 



Panel 1.3D Summary: Ag3003 Expression of the NOV49 gene is restricted to a 
sample derived from a breast cancer cell line (CT=34.7). Thus, expression of this gene could 
be used to differentiate between this sample and other samples on this panel and as a marker to 
detect the presence of breast cancer. Furthermore, therapeutic modulation of the expression or 



5 function of this gene may be effective in the treatment of breast cancer. 

Panel 4D Summary: Ag3003 Expression of the NOV49 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) 

NOV50a 

Expression of gene NOV50a was assessed using the primer-probe set Ag3014, 
10 described in Table AUA. 

Table AUA. Probe Name Ag3014 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -gactgagcgttgccttttct-3 1 


20 


1065 


1195 


Probe 


TET-5 1 -agctacctcccaaagcagcctgacct-3 ' - \ 
TAMRA 


26 


1088 


1196 


Reverse 


5* -acaatccctgcacaacgat-3 ' 


19 


1138 


1197 
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CNSneurodegenerationvl.O Summary: Ag3014 Expression of the NOVSOa gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) The amp plot 
indicates that there is a high probability of a probe failure. 

Panel 1.3D Summary: Ag3014 Expression of the NOVSOa gene is low/undetectable 
5 in all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

Panel 4D Summary: Ag3014 Expression of the NOV50a gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there is a 
high probability of a probe failure. 

10 NOV53 

Expression of gene NOV53 was assessed using the primer-probe set Ag3008, 
described in Table AVA. Results of the RTQ-PCR runs are shown in Tables AVB, AVC and 
AVD. 

Table AVA. Probe Name Ag3008 

15 



PrimersJ Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forwardjs ' - cttaagctgctgcctatgaatg-3 ' 


22 


469 


1198 


Probe 1 TET " 5 ' -atacgggagctacagaccatcatccg-3 * - 


26 


496 


1199 


Reversejs ' -tcacctctactggctgtcttgt-3 1 


22 


524 


1200 



Table AVB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rcl. Exp.(%) Ag3008, 
Run 211010256 


Tissue Name j 


Rel. Exp.(%) Ag3008, 
Run 211010256 


AD 1 Hippo 


14.7 


Control (Path) 3 
Temporal Ctx 


6.2 


AD 2 Hippo 


30.6 


Control (Path) 4 
Temporal Ctx 


35.4 


AD 3 Hippo 


6.3 


AD 1 Occipital Ctx 


22.1 


AD 4 Hippo 


8.0 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


84.1 


AD 3 Occipital Ctx 


4.9 


AD 6 Hippo 


57.0 


AD 4 Occipital Ctx 


22.4 


Control 2 Hippo 


28.9 


AD 5 Occipital Ctx 


72.2 


Control 4 Hippo 


22.1 


AD 6 Occipital Ctx 


21.5 


Control (Path) 3 
Hippo 


4.7 


Control 1 Occipital 
Ctx 


4.7 
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AO 1 Tpmnnra 1 Ptv 




Control 2 Occipital 
Ctx 


64.6 


Afk O Tpmnnral ftv 

/A l^/ Z, 1 CllipUIdl V^lA 




Control 3 Occipital 
Ctx 


10.7 


AiJ j i empordi ix 




Control 4 Occipital | 
Ctx 


12.2 


/vlj i emporai V^IX 


1 / .o 


Control (Path) 1 
Occipital Ctx 


73.7 


AD 5 Inf Temporal 
Ctx 


1UU.U 


Control (Path) 2 
Occipital Ctx 


10.5 


AD 5 Sup Temporal; 
Ctx 




Control (Path) 3 
Occipital Ctx 


3.7 


AD 6 Inf Temporal 
Ctx 




Control (Path) 4 
Occipital Ctx 


9.7 


AD 6 Sup Temporal 
Ctx 




Control 1 Parietal 
Ctx 


5 0 


Control 1 Temporal 
Ctx 




Control 2 Parietal 
Ctx 


42 3 


Control 2 Temporal 
Ctx 


46.3 


Control 3 Parietal 
Ctx 


25.5 


Control 3 Temporal 
Ctx 


17.9 


Control (Path) 1 
Parietal Ctx 


95.9 


Control 3 Temporal 
Ctx 


8.0 


Control (Path) 2 
Parietal Ctx 


33.4 


Control (Path) 1 
Temporal Ctx 


58.2 


Control (Path) 3 
Parietal Ctx 


2.7 


Control (Path) 2 
Temporal Ctx 


33.2 


Control (Path) 4 
Parietal Ctx 


32.8 



Table AVC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3008, 
Run 167927168 


Tissue Name 


Rel. Exp.(%) Ag3008, 
Run 167927168 


Liver adenocarcinoma 


18.9 


Kidney (fetal) 


42.6 


Pancreas 


7.6 


Renal ca. 786-0 


18.4 


Pancreatic ca. CAPAN 

2 


9.4 


Renal ca. A498 


11.3 


Adrenal gland 


10.8 


Renal ca. RXF 393 


5.7 


Thyroid 


8.1 


Renal ca. ACHN 


5.2 


Salivary gland 


5.9 


Renal ca. UO-31 


9.1 I 


Pituitary gland 


11.6 


Renal ca. TK-10 


10.3 


Brain (fetal) 


31.9 


Liver 


6.2 


Brain (whole) 


30.1 


Liver (fetal) 


10.7 


Brain (amygdala) 


17.8 


Liver ca. 

(hepatoblast) HepG2 


17.3 
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Brain (cerebellum) 


253 


Lung 


5.5 


Brain (hippocampus) 


25.7 


Lung (fetal) 


10.2 


DIcllIl ^UUbLcillllcl llli^Itl^ 


24.7 


Lung ca. (small cell) 
LX-1 


21.0 


OltUIl ^llldJcUIlUoy 


14 1 


Lung ca. (small cell) 
NCI-H69 


1.4 


v^ereorai \^uricx 


76 9 


Lung ca. (s.cell var.) 
SHP-77 


94.0 


opinai coro 


1 U.J 


Lung ca. (large 
cell)NCI-H460 


4.7 


guo/asiro U0/-IV10 


Ifk 9 


Lung ca. (non-sm. 
cell) A549 


36.6 


giio/ astro u-i lo-ivivj 


99 4 


Lung ca. (non-s.cell) 
NCI-H23 


12.7 


astrocytoma SW1783 


82.4 


Lung ca. (non-s.cell) 
HOP-62 


19.1 


neuro*; met SK-N-AS 


20.7 


Lung ca. (non-s.cl) 
NCI-H522 


33.2 


astrocytoma SF-539 


30.6 


Lung ca. (squam.) 
SW 900 


16.0 


astrocytoma SNB-75 


33.0 


Lung ca. (squam.) 
NCI-H596 


1.1 


glioma SNB-19 


22.7 


Mammary gland 


13.0 


glioma U251 


100.0 


Breast ca.* (pl.ef) 
MCF-7 


21.8 


glioma SF-295 


38.2 


Breast ca.* (pl.ef) 
MDA-MB-231 


38.7 


Heart (fetal) 


12.5 


Breast ca.* (pl.ef) 
T47D 


21.3 


Heart 


14.7 


Breast ca. BT-549 


15.6 


Skeletal muscle (fetal) 


10.4 


Breast ca. MDA-N 


21.6 


Skeletal muscle 


35.4 


Ovary 


15.3 


1 1 1 ell 1 \J W 


7.4 


Ovarian ca. OVCAR- 

3 


11.6 


1 iiyiuub 


21 6 


Ovarian ca. OVCAR- 
4 


10.2 


Spleen 


9.1 


Ovarian ca. OVCAR- 
5 


12.4 


Lymph node 


16.8 


Ovarian ca. OVCAR- 
8 


8.5 


Colorectal 


15.0 


Ovarian ca. IGROV- 
1 


6.8 


Stomach 


17.8 


Ovarian ca.* (ascites) 
SK-OV-3 


58.2 


Small intestine 


4.7 


Uterus 


8.1 
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Colon ca. SW480 j 


12.5 


Placenta 


1.3 


Colon ca.* 
swozU(bW45U met) 


85.9 


Prostate 


7.2 


Colon ca. HT29 


7.0 


Prostate ca.* (bone 


22.8 


Colon ca. HCT-116 


13.3 


Testis 


0.6 


Colon ca. CaCo-2 


32.8 


Melanoma 
Hs688(A).T 


10.1 


Colon ca. 


9.3 


Melanoma* (met) 
Hs688(B).T 


8.1 


Colon ca. HCC-2998 


19.9 


Melanoma UACC-62 


22.4 


Gastric ca.* (liver met) 
NCI-N87 


10.7 


Melanoma M14 


6.3 


Bladder 


21.6 


Melanoma LOX 
IMVI 


18.2 


Trachea 


9.3 


Melanoma* (met) 
SK-MEL-5 


12.5 


Kidney 


31.2 


Adipose 


23.0 



Table AVD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3008, Run 
164043360 


Tissue Name 


Rel. Exp.(%) 
Ag3008, Run 
164043360 


Secondary Thl act 


21.0 


HUVEC IL-lbeta 


7.5 


Secondary Th2 act 


23.7 


HUVEC IFN gamma 


10.7 


Secondary Trl act 


23.5 


HUVEC TNF alpha + IFN 
gamma 


6.8 


Secondary Thl rest 


8.2 


HUVEC TNF alpha + IL4 


7.5 


Secondary Th2 rest 


8.8 


HUVEC IL-11 


7.7 


Secondary Trl rest 


9.1 


Lung Microvascular EC 
none 


12.0 


Primary Thl act 


24.8 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


8.2 


Primary Th2 act 


21.9 


Microvascular Dermal EC 
none 


17.9 


Primary Trl act 


29.1 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


11.0 


Primary Thl rest 


44.8 


Bronchial epithelium 
TNFalpha + ILlbeta 


16.4 


Primary Th2 rest 


21.3 


Small airway epithelium 
none 


6.0 


Primary Trl rest 


20.4 


Small airway epithelium 
TNFalpha + IL-lbeta 


48.3 


CD45RA CD4 


15.8 


Coronery artery SMC rest 


19.9 
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lymphocyte act 






CD45RO CD4 
lymphocyte act 


30.1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


8.0 


CD8 lymphocyte act 


22.8 


Astrocytes rest 


13.4 


Secondary CD8 
lymphocyte rest 


32.5 


Astrocytes TNFalpha + 
IL-lbeta 


8.9 


Secondary CD8 
lymphocyte act 


15.7 


KU-8 12 (Basophil) rest 


9.5 


CD4 lymphocyte none 


o.o 


KU-8 12 (Basophil) 
PMA/ionomycin 


33.0 


2ry Thl/Th2/Trl_anti- 
CD95 CH11 


17 0 


CCD 1 106 (Keratinocytes) 
none 


12.3 


LAK cells rest 


21.3 


CCD 1106 (Keratinocytes) 

rrrv tt~* "1 1 ■ TT" "11 a. 

TNFalpha + IL-lbeta 


8.6 


T A iy . . 1 L. TT O 

LAK cells lL-z 


19.5 


Liver cirrhosis 


2.1 


T A 11,, T¥ ^ i TT 1 O 

LAK cells 1L-2+IL-12 


17.6 


Lupus kidney 


1.4 


LAK cells IL-2+IFN 
gamma 


28.9 


NCI-H292 none 


19.8 


LAK cells 1L-Z+ 1L- 1 o 


16.2 


NCI-H292 IL-4 


25.9 


LAK cells 
PMA/ionomycin 


23.2 


NCI-H292 1L-9 


27.0 


NK Cells 1L-2 rest 


1 1.0 


NCI-H292 IL-1 3 


10.7 


lwo Way MLK J day 


23.2 


NCI-H292 IFN gamma 


11.8 


Two Way MLR 5 day 


16.3 


HPAEC none 


6.0 


Two Way MLR 7 day 


9 6, 


HPAEC TNF alpha + IL-1 
beta 


9.0 


PBMC rest 


10.7 


Lung fibroblast none 


/.J 


PBMC PWM 


65.1 


Lung fibroblast TNF alpha 
+ IL-1 beta 


6.7 


PBMC PHA-L 


58.6 


Lung fibroblast IL-4 


22.1 


Ramos (B cell) none 


33.0 


Lung fibroblast IL-9 


18.2 


Ramos (B cell) 
ionomycin 


100.0 


Lung fibroblast IL-1 3 


14.7 


R Ivmnhocvtes PWM 


74. / 


Lung fibroblast IFN 
gamma 


23.2 


B lymphocytes CD40L 
and 1L-4 


23.2 


Dermal fibroblast 
CCD 1070 rest 


18.8 


EOL-1 dbcAMP 


12.2 


Dermal fibroblast 
CCD 1070 TNF alpha 


46.7 


EOL-1 dbcAMP 
PMA/ionomycin 


13.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


8.0 


Dendritic cells none 


16.7 


Dermal fibroblast IFN 
gamma 


6.7 


Dendritic cells LPS 


13.0 


Dermal fibroblast IL-4 


12.2 


Dendritic cells anti- 


17.1 


IBD Colitis 2 


2.0 
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CD40 








Monocytes rest 


17.2 


IBD Crohn's 


2.9 


Monocytes LPS 


7.5 


Colon 


15.4 


Macrophages rest 


24.5 


Lung 


14.8 


Macrophages LPS 


11.8 


Thymus 


22.2 


HUVEC none 


9.4 


Kidney 


19.8 


HUVEC starved 


18.8 







CNS_neurodegeneration_vl.O Summary: Ag3008 This panel does not show 
differential expression of the NOV53 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag3008 Highest expression of the NOV53 gene is seen in a 
brain cancer cell line (CT=29). In addition, this gene has low to moderate expression in all the 
cancer cell lines used in this panel. Thus, expression of this gene might be used as a diagnostic 
marker in brain, colon, renal, lung, melanoma and ovarian cancers. 
10 This gene encodes a homolog of uracil phosphoribosyltransferase. This gene has low to 

moderate expression in several endocrine/metabolically-related tissues, including; adipose, 
adrenal, pancreas, liver and skeletal muscle. Therefore, a therapeutic modulator to this gene 
and/or gene-product may prove useful in the treatment of diseases which affect the endocrine 
system. 

15 In addition, this gene shows moderate to low levels in the CNS and may be a small 

molecule target for the treatment of neurologic diseases. 

Panel 4D Summary: Ag3008 The NOV53 gene, a uracil phosphoribosyl-transferase 
homolog is expressed at moderate to low levels in numerous cell types involved in the immune 
response. Higher levels of expression are seen in activated B lymphocytes, represented by 

20 ionomycin-activated Ramos (CT=27.6), and pokeweed mitogen-activated B lymphocytes 

(CT=28.02). Therefore, small molecules that antagonize the function of this gene product may 
be useful as therapeutic drugs to reduce or eliminate the symptoms in patients with 
autoimmune and inflammatory diseases in which B cells play a part in the initiation or 
progression of the disease process, such as lupus erythematosus, Crohn's disease, ulcerative 

25 colitis, multiple sclerosis, chronic obstructive pulmonary disease, asthma, emphysema, 
rheumatoid arthritis, or psoriasis. 

NOV54a 
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Expression of gene NOV54a was assessed using the primer-probe sets Ag3015 and 
Ag3070, described in Tables AWA and AWB. Results of the RTQ-PCR runs are shown in 
Tables AWC, A WD, AWE and AWF. 



Table AWA. Probe Name Ag3015 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gtgctctcactatccacctcaa-3 1 


22 


1515 


1201 


Probe 


TET-5 ' -cacacatccatctcaagaggaacatt-3 1 - 
TAMRA 


26 


1537 


1202 


Reverse 


5 ' -ccatacacttccagctctgact-3 * 


22 


1573 


1203 



5 



Table AWB. Probe Name Ag3070 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -gtgctctcactatccacctcaa-3 1 


22 


1515 


1204 


Probe 


TET-5 ■ -cacacatccatctcaagaggaacatt-3 1 - ■ 
TAMRA 


26 


1537 


1205 


Reverse 


5 ' -ccatacacttccagctctgact-3 ' 


22 


1573 


1206 



Table AWC. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag3015, Run 
211010356 


Rel. Exp.(%) 
Ag3070, Run 
208974108 


Tissue 
Name 


Rel. Exp.(%) 
Ag3015, Run 
211010356 


Rel. Exp.(%) 
Ag3070, Run 
208974108 


AD 1 Hippo 


37.1 


22.5 


Control 
(Path) 3 
Temporal 
Ctx 


11.2 


10.5 


AD 2 Hippo 


56.3 


50.0 


Control 
(Path) 4 
Temporal 
Ctx 


50.0 


29.9 


AD 3 Hippo 


24.8 


22.7 


AD 1 

Occipital 

Ctx 


15.5 


13.6 


AD 4 Hippo 


15.6 


8.5 


AD 2 

Occipital 

Ctx 

(Missing) 


0.0 


0.0 


AD 5 hippo 


84.7 ' 


54.0 


AD 3 

Occipital 

Ctx 


17.7 


9.4 


AD 6 Hippo 


100.0 


66.4 


AD 4 

Occipital 

Ctx 


17.3 


11.3 


Control 2 


37.4 


32.5 


AD 5 


29.7 


39.2 
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Hippo 






Occipital 
Ctx 






Control 4 
Hippo 


33.0 


26.1 


AD 6 

Occipital 

Ctx 


45.4 


13.1 


Control (Path) 
3 Hippo 


15.2 


8.8 


Control 1 
Occipital 
Ctx 


9.7 


8.2 


AD 1 Temporal 
Ctx 


31.2 


17.4 


Control 2 
Occipital 
Ctx 


42.0 


30.4 


AD 2 Temporal 
Ctx 


50.3 


38.2 


Control 3 
Occipital 
Ctx 


15.5 


14.5 


AD 3 Temporal 

Ctx 


17.6 


14.8 


Control 4 
Occipital 
Ctx 


13.8 


9.4 


AD 4 Temporal 

Ctx 


36.1 


23.3 


Control 
(Path) 1 
Occipital 
Ctx 


92.7 


67.8 


AD 5 Inf 


87.1 


69.7 


Control 
(Path) 2 
Occinital 
Ctx 


17.7 


11.3 


AD 5 

SupTemporal 
Ctx 


87.1 


53.2 


Control 
(Path) 3 
Occipital 
Ctx 


7.9 


8.9 


AD 6 Inf 
Temporal Ctx 


60.7 


72.7 


Control 
(Path) 4 
Occipital 
Ctx 


21.8 


14.5 


AD 6 Sup 
Temporal Ctx 


55.9 


35.8 


Control 1 
Parietal Ctx 


16.5 


10.4 


Control 1 
Temporal Ctx 


1 < o 


o.U 


Control 2 
Parietal Ctx 


77 Q 




Control 2 
Temporal Ctx 


56.6 


28.9 


Control 3 
Parietal Ctx 


20.3 


19.3 


Control 3 
Temporal Ctx 


28.3 


17.8 


Control 
(Path) 1 
Parietal Ctx 


76.3 


100.0 


Control 4 
Temporal Ctx 


21.5 


16.7 


Control 
(Path) 2 
Parietal Ctx 


36.1 


21.2 


Control (Path) 
1 Temporal Ctx 


82.4 


70.2 


Control 
(Path) 3 
Parietal Ctx 


7.1 


9.3 
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Control (Path) 
2 Temporal Ctx 


49.7 


jControl 
38.7 (Path) 4 

(Parietal Ctx 


41.8 


30.6 



Table A WD. Panel 1.3D 



Tissue Name 


Rel. Exp.(%)l 
Ag3015, Run 
167927212 


Rel. Exp.(%) 
Ag3070, Run 
167985243 


Tissue Name 


Rel. Exp.(%) 
Ag3015, Run 
167927212 


Rel. Exp.(%) 
Ag3070, Run 
167985243 


Liver 

adenocarcinoma 


23.7 


21.3 


Kidney (fetal) 


56.3 


64.6 


Pancreas 


7.4 


7.2 


Renal ca. 786- 
0 


2.3 


2.9 


Pancreatic ca. 
CAPAN 2 


4.5 


2.4 


Renal ca. 
A498 


13.9 


11.7 


Adrenal gland 


6.5 


4.9 


Renal ca. RXF | 
393 


14.1 


25.7 


Thyroid 


14.6 


14.6 


Renal ca. 
ACHN 


13.9 


19.8 


Salivary gland 


13.3 


1 .4 


Renal ca. UO- 
31 


7.4 


17.8 


r liuiiary gianu 


o.u 




Renal ca. TK- 
10 


1? 6 


14 0 


Brain (fetal) 


25.3 


14.6 


Liver 


13.8 


2.4 


Brain (whole) 


21.6 


11.1 


Liver (fetal) 


5.7 


4.9 


Brain (amygdala) 


28.1 


32.3 


Liver ca. 

(hepatoblast) 

HepG2 


3.7 


4.3 


Brain (cerebellum) 


17.9 


17.2 


Lung 


23.7 


7.6 


Brain 

(hippocampus) 


16.8 


16.5 


Lung (fetal) 


20.6 


33.0 


Brain (substantia 
nigra) 


18.9 


30.6 


Lung ca. 
(small cell) 
LX-1 


8.8 


15.2 


Brain (thalamus) 


9.1 


2.8 


Lung ca. 
(small cell) 
NCI-H69 


3.5 


10.4 


Cerebral Cortex 


42.6 


36.9 


Lung ca. 
(s.cell var.) 
SHP-77 


13.1 


17.3 


Spinal cord 


18.9 


18.4 


Lung ca. (large 
cell)NCI-H460 


4.7 


2.2 


glio/astro U87-MG 


19.8 


37.6 


Lung ca. (non- 
sm. cell) A549 


13.1 


10.2 


glio/astro U-118- 
MG 


11.9 


18.3 


Lung ca. (non- 
s.cell) NCI- 


5.5 


18.7 
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H23 


— — _ — _ — ,.„,.,„ — — „ 


astrocytoma 
SW1783 


5.6 


10.9 


Lung ca. (non- 
s.cell) HOP-62 


7.5 


9.3 


neuro*; met SK-N- 
AS 


3.4 


7.1 


Lung ca. (non- 
s.cl) NCI- 
H522 


18.8 


34.6 


astrocytoma SF- 
539 


6.4 


10.4 


Lung ca. 
(squam.) SW 
900 


2.0 


4.8 


astrocytoma SNB- 
75 


12.9 


10.1 


Lung ca. 
(squam.) NCI- 
H596 


7.6 


8.2 


glioma SNB-19 


13.3 


16.0 


Mammary 
gland 


19.5 

—-^^ 


3.8 


glioma U251 


24.5 


36.9 


Breast ca.* 
(pl.ef) MCF-7 


0.9 


2.6 


glioma SF-295 


13.4 


18.2 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


8.0 


9.7 


Heart (fetal) 


100.0 


100.0 


Breast ca.* 
(pl.ef) T47D 


26.8 


40.6 


Heart 


12.4 


14.0 


Breast ca. BT- 
549 


5.1 


. 6.1 


Skeletal muscle 
(fetal) 


c i i 
51.1 


,67.8 


Breast ca. 
MDA-N 


11.0 


16.2 


Skeletal muscle 


11.4 


14.1 


Ovary 


50.0 53.2 


Bone marrow 


28.9 


1 1.1 


Ovarian ca. 
OVCAR-3 


8.3 


3.3 


Thymus 


53.2 


56.3 


Ovarian ca. 
OVCAR-4 


11.0 


18.7 


Spleen 


1A ft 


30.6 


Ovarian ca. 
OVCAR-5 


20.2 


38.4 


Lymph node 


60.7 


47.6 


Ovarian ca. 
OVCAR-8 


5.0 


8.1 


Colorectal 


17.1 


15.5 


Ovarian ca. 
IGROV-1 


4.7 


7.9 


Stomach 


8.9 


9.7 


Ovarian ca.* 
(ascites) SK- 
OV-3 


8.5 


14.1 


Small intestine 


11.9 


12.8 


Uterus 


25.5 


26.2 


Colon ca. SW480 


17.9 


16.4 


Placenta 


3.4 


5.4 


Colon ca.* 

SW620(SW480 

met) 


21.9 


16.0 


Prostate 


14.3 


13.5 


Colon ca. HT29 


2.4 


2.3 


Prostate ca.* 
(bone met)PC- 


14.6 


28.3 
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Colon ca. HCT- 
116 


3.4 


6.8 


Testis 


6.6 


10.3 


Colon ca. CaCo-2 


11.7 


16.0 


Melanoma 
Hs688(A).T 


12.9 


12.4 


Colon ca. 
tissue(OD03866) 


9.9 


18.0 


Melanoma* 
(met) 

Hs688(B).T 


10.7 


16.6 


Colon ca. HCC- 
2998 


15.0 


17.8 


Melanoma 
UACC-62 


7.3 


10.4 


Gastric ca.* (liver 
met) NCI-N87 


3.1 


4.9 


Melanoma 
M14 


3.4 


7.4 


Bladder 


10.5 


5.8 


Melanoma 
LOX IMVI 


14.1 


15.7 


Trachea 


13.2 


10.9 


Melanoma* 
(met) SK- 
MEL-5 


2.5 


2.8 


Kidney 


26.1 


15.0 


Adipose 


34.2 


33.0 



Table AWE. Panel 2.2 



Tissue Name 


Rel. Exp.(%) 
Ag3070, Run 
173800588 


Tissue Name 


Rel. Exp.(%) 
Ag3070, Run 
173800588 


Normal Colon 


25.0 


Kidney Margin 
(OD04348) 


75.8 


Colon cancer 
(OD06064) 


74.2 


Kidney malignant 
cancer (OD06204B) 


5.0 


Colon Margin 
(OD06064) 


36.3 


Kidney normal adjacent 
tissue (OD06204E) 


11.9 


Colon cancer 
(OD06159) 


4.9 


Kidney Cancer 
(OD04450-01) 


44.4 


Colon Margin 
(OD06159) 


17.4 


Kidney Margin 
(OD04450-03) 


15.8 


Colon cancer 
(OD06297-04) 


1.4 


Kidney Cancer 8120613 


4.0 


Colon Margin 
(OD06297-015) 


18.9 


Kidney Margin 
8120614 


37.4 


CC Gr.2 ascend colon 
(OD03921) 


6.5 


Kidney Cancer 9010320 


19.3 


CC Margin (OD03921) 


16.5 


Kidney Margin 
9010321 


16.6 


Colon cancer metastasis 
(OD06104) 


12.7 


Kidney Cancer 8120607 


33.2 


Lung Margin 
(OD06104) 


34.6 


Kidney Margin 
8120608 


18.4 
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Colon mets to lung 
(OD04451-01) 


8.1 


Normal Uterus 


76.3 


Lung Margin 
(OD04451-02) 


52.5 


Uterine Cancer 06401 1 


12.8 


Normal Prostate 


19.1 


Normal Thyroid 


11.8 


Prostate Cancer 
(OD04410) 


0.0 


Thyroid Cancer 064010 


7.2 


Prostate Margin 
(OD04410) 


13.2 


Thyroid Cancer 
A302152 


37.4 


Normal Ovary 


60.7 


Thyroid Margin 
A302153 


9.7 


Ovarian cancer 
(OD06283-03) 


14.8 


Normal Breast 


100.0 


Ovarian Margin 
(OD06283-07) 


24. 1 


Breast Cancer 
(OD04566) 


13.4 


Ovarian Cancer 064008 


26.6 


Breast Cancer 1024 


34.6 


Ovarian cancer 
(OD06145) 




Breast Cancer 
(OD04590-01) 


13.3 


Ovarian Margin 
(OD06145) 




Breast Cancer Mets 
(OD04590-03) 


40.6 


v/varian cancer 
(OD06455-03) 


0.0 


Breast Cancer 
Metastasis (OD04655- 
05) 


52.9 


Ovarian Margin 
(OD06455-07) 


22.4 


Breast Cancer 064006 


29.3 


Normal Lung _ 


36.9 


Breast Cancer 9100266 


28.7 


Invasive poor diff. lung 
adeno (ODO4945-01 


19 1 


Breast Margin 9100265 


35.6 


Lung Margin 
(ODO4945-03) 


44.1 


Breast Cancer A209073 


10.7 


Lung Malignant Cancer 
(OD03126) 


18.0 


Breast Margin 
A2090734 


19.5 


Lung Margin 
(OD03126) 


18.3 


Breast cancer 
(OD06083) 


30.4 


Lung Cancer 
(OD05014A) 


22.5 


Breast cancer node 
metastasis (OD06083) 


59.5 


Lung Margin 

(UIJIDU 1 4r> ) 


79.6 


Normal Liver 


19.1 


Lung cancer (OD06081) 


9.5 


Liver Cancer 1 026 


17.3 \ 


Lung Margin 
(OD06081) 


17.7 


Liver Cancer 1 025 


17.4 


Lung Cancer 
(OD04237-01) 


4.1 


Liver Cancer 6004-T 


27.0 


Lung Margin 
(OD04237-02) 


54.7 


Liver Tissue 6004-N 


8.7 


Ocular Melanoma 


14.3 


Liver Cancer 6005-T 


48.6 
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Metastasis 








Ocular Melanoma 
Margin (Liver) 


9.7 


Liver Tissue 6005-N 


49.3 


Melanoma Metastasis 


4.4 


Liver Cancer 064003 


23.2 


Melanoma Margin 
(Lung) 


60.3 


Normal Bladder 


18.7 


Normal Kidney 


18.6 


Bladder Cancer 1023 


15.3 


Kidney Ca, Nuclear 
grade2(OD04338) 


47 6 


Bladder Cancer 
A302173 


1 5 1 


Kidney Margin 
(OD04338) 


ZO.D 


INOlIIiai olUiIlcU-«Il 




Kidney Ca Nuclear 
grade 1/2 (OD04339) 


49 7 


Gastric Cancer 9060397 


9.1 


Kidney Margin 
(OD04339) ! 


16.4 


Stomach Margin 
9060396 


33.0 


Kidney Ca, Clear cell 
type (OD04340) 


11.9 


Gastric Cancer 9060395 


29.9 


Kidney Margin 
(OD04340) 


19.1 


Stomach Margin 
9060394 


65.5 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


7.3 


Gastric Cancer 064005 


19.5 



Table AWF. Panel 4D 



Tissue Name 


Rel. 

Exp.(%) 
Ag3015, 
Run 
164043871 | 


Rel. 

Exp.(%) 
Ag3070, 
Run 
164525657 


Tissue Name 


Rel. 

Exp.(%) 
Ag3015, 
Run 
164043871 


Rel. 
Exp.(%) 
Ag3070, 

Run 
164525657 


Secondary Th 1 act 


35.4 


35.4 


HUVEC IL-lbeta 


2.6 


1.2 


Secondary Th2 act 


27.9 


21.6 


HUVEC I FN 
gamma 


9.7 


10.5 


Secondary Trl act 


28.9 


33.7 


HUVEC TNF 
alpha + IFN 
gamma 


8.1 


8.1 


Secondary Th 1 rest 


21.3 


21.5 


HUVEC TNF 
alpha + IL4 


4.8 


7.4 


Secondary Th2 rest 


24.0 


21.2 


HUVEC IL-11 


4.9 


5.6 


Secondary Trl rest 


26.6 


26.6 


Lung 

Microvascular EC ; 
none 


6.9 


9.2 


Primary Thl act 


23.5 


19.6 


Lung 

Microvascular EC 
TNFalpha + IL- 
lbeta 


6.7 


7.5 


Primary Th2 act 


18.3 


20.7 


Microvascular 


12.4 


14.8 
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Dermal EC none 






Primary Trl act 


23.8 


27.5 


Microsvasular 
Dermal EC 
TNFalpha + IL- 1 
lbeta 


5.5 


7.0 


Primary Thl rest 


48.6 


54.0 


Ftronchial 

I- J 1 \J\ IvlllUi 

epithelium 
TNFalpha + 
ILlbeta 


5.1 


4.8 


Primary Th2 rest 


33.0 


28.7 


Small airway 
epithelium none 


2.0 


2.1 


Primary Trl rest 


26.1 


28.1 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


3.4 


3.0 


CD45RA CD4 
lymphocyte act 


12.7 


14.8 


Coronery artery 
SMC rest 


8.4 


7.1 


CD45RO CD4 
lymphocyte act 


28.9 


32.3 


Coronery artery 
SMC TNFalpha + 
IL- lbeta 


4.1 


7.3 


CD8 lymphocyte 
act 


27.7 


42.0 


Astrocytes rest 


5.3 


7.6 


Secondary CD8 
lymphocyte rest 


29.1 


35.8 


Astrocytes 
TNFalpha + IL- 
lbeta 


3.8 


3.2 


Secondary CD8 
lymphocyte act 


38.2 


41.8 


KU-812 
(Basophil) rest 


4.0 


4.5 


CD4 lymphocyte 
none 


0.0 


18.0 


KU-812 

(Basophil) 

PMA/ionomycin 


7.1 


10.4 

• 


2ry 

Thl/Th2/Trl anti- 
CD95 CH11 


26.8 


29.7 


CCD1106 

(Keratinocytes) 

none 


8.3 


6.8 


T A fC pp!1<i rp<5t 

Ly/vrv v. v 1 1 . t itji 


50.3 


71.2 


CCD1106 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


4.2 


2.1 


LAK cells IL-2 


30.1 


35.6 


Liver cirrhosis 


3.2 


3.0" 


LAK cells IL-2+IL- 
12 


16.6 


29.3 


Lupus kidney 


3.1 


3.3 


LAK cells IL- 
2+IFN gamma 


43.8 


44.8 


NCI-H292 none 


7.5 

^ — , — . yT 


5.7 


LAK cells IL-2+ 
IL-18 


26.8 


50.0 


NCI-H292 IL-4 


6.5 


8.3 


LAK cells 
PMA/ionomycin 


23.8 


24.8 


NCI-H292 IL-9 


6.0 


8.2 


NK Cells IL-2 rest 


31.4 


35.1 


NCI-H292 IL-13 


3.7 


4.1 
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Two Way MLR 3 
day 


32.1 


50.3 


NCI-H292 I FN 
gamma 


3.9 


4.5 


Two Way MLR 5 
day 


20.7 


26.2 


HPAEC none 


9.8 


10.2 


Two Way MLR 7 
day 


17.1 


25.9 


HPAEC TNF 
alpha + IL-1 beta 


3.8 


4.5 


PBMC rest 


16.2 


19.1 


Lung fibroblast 
none 


20.0 


23.3 


PBMC PWM 


55.1 


58.6 


Lung fibroblast 
TNF alpha + IL-1 
beta 


7.6 


8.0 


PRIV/If^ PU A T 
rDlViL^ r ri/v-L 


J7.0 




Lung fibroblast 
IL-4 


19 9 


24.7 


Ramos (B cell) 
none 


4.8 


5.2 


Lung fibroblast 
IL-9 


15.6 


19.2 


Ramos (B cell) 
ionomycin 


18.9 


25.3 


Lung fibroblast 
IL-1 3 


15.1 


22.7 


B lymphocytes 
PWM 


73.7 


77.9 


Lung fibroblast 
IFN gamma 


22.8 


20.7 


B lymphocytes 
CD40L and IL-4 


25.2 


37.4 


Dermal fibroblast 
CCD 1070 rest 


16.4 


17.6 


EOL-1 dbcAMP 


11.7 


10.5 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


56.6 


67.4 


JbOL- 1 auc/vivir 
PMA/ionomycin 


14.7 


24.3 


Dermal fibroblast 
CCD 1070 IL-1 
beta 


7.4 


9.6 


Dendritic cells 
none 


89.5 


100.0 


Dermal fibroblast 
j r in gamma 


13.1 


12.0 


Dendritic cells LPS 


38.2 


55.9 


L/ernidi iiuiuuic*i>i 
IL-4 


24.1 


33.7 


Dendritic cells anti- 
CD40 


100.0 


94.0 


IBD Colitis 2 


0.9 


1.4 


Monocytes rest 


49.3 


72.2 


IBD Crohn's 


1.4 


1.1 


Monocytes LPS 


15.4 


14.7 


Colon 


10.0 


10.7 


Macrophages rest 


80.7 


87.1 


Lung 


16.5 


15.0 


Macrophages LPS 


24.0 


30.1 


Thymus 


10.3 


11.0 


HUVEC none 


"6.9 


7.9 


Kidney 


34.6 


50.3 


HUVEC starved 


12.3 


15.3 







CNS_neurodegeneration_vl.O Summary: Ag3015/Ag2070 This panel does not show 
differential expression of the NOV54a gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system. 
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Panel 1.3D Summary: Ag3015/Ag2070 Two experiments with the same probe and 
primer set produce results that are in excellent agreement, with highest expression in fetal 
heart (CTs=29-30). This expression is higher than the expression seen in adult heart (CTs=32- 
33). Thus, expression of this gene could be used to differentiate between the two sources of 
5 this tissue. This gene is expressed in other metabolic tissues including adipose, adrenal, liver, 
pancreas, skeletal muscle and thyroid. This gene encodes a homoiog of protein phosphatase 
2C (PP2C), which has been linked to the regulation of hormone-sensitive lipase, the rate- 
limiting enzyme in adipose tissue lipolysis (Eur J Biochem 1987 Oct 15;168(2):399-405). 
PP2C may also play a role in controlling insulin signaling. Therefore, a therapeutic modulator 
10 of this gene and/or gene-product may prove useful in the treatment of diseases affecting the 
endocrine system. 

In addition, protein phosphatase 2C plays a role in dopamine and serotonin signaling. 
Specifically, PP2C counters the action of these neurotransmitters on DARPP-32. These 
neurotransmitter systems are the primary targets of drugs that treat schizophrenia and 
15 depression. Therefore, agents that inhibit this gene product may have utility in treating these 
disorders. 

References: 

Desdouits F, Siciliano JC, Nairn AC, Greengard P, Girault JA. Dephosphorylation of 
Ser-137 in DARPP-32 by protein phosphatases 2A and 2C: different roles in vitro and in 

20 striatonigral neurons. Biochem J 1998 Feb 15;330 ( Pt 1):21 1-6 

DARPP-32 (dopamine- and cAMP-regulated phosphoprotein, Mr=32000) is highly 
expressed in striatonigral neurons in which its phosphorylation is regulated by several 
neurotransmitters including dopamine and glutamate. DARPP-32 becomes a potent inhibitor 
of protein phosphatase 1 when it is phosphorylated on Thr-34 by cAMP- or cGMP-dependent 

25 protein kinases. DARPP-32 is also phosphorylated on Ser-137 by protein kinase CK1 (CK1), 
in vitro and in vivo. This phosphorylation has an important regulatory role since it inhibits the 
dephosphorylation of Thr-34 by calcineurin in vitro and in striatonigral neurons. Here, we 
show that DARPP-32 phosphorylated by CK1 is a substrate in vitro for protein phosphatases 
2A and 2C, but not protein phosphatase 1 or calcineurin. However, in substantia nigra slices, 

30 dephosphorylation of Ser-137 was markedly sensitive to decreased temperature, and not 
detectably affected by the presence of okadaic acid under conditions in which 
dephosphorylation of Thr-34 by protein phosphatase 2 A was inhibited. These results suggest 
that, in neurons, phospho-Ser-137-DARPP-32 is dephosphorylated by protein phosphatase 2C, 
but not 2A. Thus, DARPP-32 appears to be a component of a regulatory cascade of 
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phosphatases in which dephosphorylation of Ser-136 by protein phosphatase 2C facilitates 
dephosphorylation of Thr-34 by calcineurin, removing the cyclic nucleotide-induced inhibition 
of protein phosphatase 1. 

Overall, expression of this gene is appears to be associated with normal tissues over 
5 cancer cell lines. Thus, expression of this gene could be used to differentiate between normal 
and malignant tissues and potentially as a treatment for cancer. 

Panel 2.2 Summary: Ag3070 As seen in the previous panel, the NOV54a gene shows 
greater expression in normal tissues than in samples derived from malignant tissue. Thus, 
expression of this gene may be useful in distinguishing the two types of tissue. 

10 Panel 4D Summary: Ag3015/Ag2070 Two experiments with the same probe and 

primer set produce results that are in excellent agreement. The NOV54a gene, a protein 
phosphatase 2C homolog is expressed by T lymphocytes prepared under a number of 
conditions at moderate levels and is expressed at higher levels in treated and untreated 
dendritic cells, monocytes, and macrophages. Dendritic cells and macrophages are powerful 

15 antigen-presenting cells (APC) whose function is pivotal in the initiation and maintenance of 
normal immune responses. Autoimmunity and inflammation may also be reduced by 
suppression of this function. Therefore, small molecule drugs that antagonize the function of 
this gene product may reduce or eliminate the symptoms in patients with several types of 
autoimmune and inflammatory diseases, such as lupus erythematosus, Crohn's disease, 

20 ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary disease, asthma, 
emphysema, rheumatoid arthritis, or psoriasis. 

NOV55 

Expression of gene NOV55 was assessed using the primer-probe set Ag3024, 
described in Table AX A. Results of the RTQ-PCR runs are shown in Tables AXB, AXC, 
25 AXD, AXE, AXF, AXG and AXH. 

Table AXA. Probe Name Ag3024 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5' -ggtgggctgctataacttgact-3 1 


22 


1033 


1207 


Probe 


TET- 5 1 - tgaaagaaacaccatcctgttgcaga-3 * - j 
TAMRA 


26 


1069 


1208 


Reverse 


5 ' -tgttcttcaggttgttctttgc-3 ■ 


22 


1097 


1209 
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Table AXB. AI_comprehensive panel_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag3024, Run 
248122026 


Tissue Name 


Rel. Exp.(%) 
Ag3024, Run 
248122026 


110967 COPD-F 


10.6 


11 2427 Match Control 1 
Psoriasis-F 


37.1 


110980 COPD-F 


9.5 


112418 Psoriasis-M 


6.1 


1 10968 COPD-M 


10.8 


112723 Match Control 
Psoriasis-M 


0.0 


1 10977 COPD-M 


16.7 


112419 Psoriasis-M ! 


16.6 


110989 Emphysema- 
F 


44.8 


112424 Match Control : 
Psoriasis-M 


17.8 


1 10992 Emphysema- 
F 


3.8 


112420 Psoriasis-M 


100.0 


1 1 0993 Emphysema- 
F 


2.6 


1 12425 Match Control ; 
Psoriasis-M 


26.2 


110994 Emphysema- 
F 


1.0 


104689 (MF) OA 
Bone-Backus 


0.0 


110995 Emphysema- 
F 


9.0 


104690 (MF) Adj 
"Normal" Bone-Backus J 


0.4 


1 10996 Emphysema- 
F 


1.3 


104691 (MF) OA 
Synovium-Backus 


3.0 


110997 Asthma-M 


1.0 


104692 (BA) OA 
Cartilage-Backus 


1.0 


111001 Asthma-F 


33.4 


104694 (B A) OA 
Bone-Backus 


0.0 


111002 Asthma-F 


37.9 


104695 (BA) Adj 
"Normal" Bone-Backus 


0.6 


111003 Atopic 
Asthma-F 


41 .5 


104696 (BA) OA 
Synovium-Backus 


0.5 


111004 Atopic 
Asthma-F 


21 .2 


104700 (SS) OA Bone- 
Backus 


0.2 


111005 Atopic 
Asthma-F 


39.0 


104701 (SS) Adj 
"Normal" Bone-Backus 


A 1 

0.3 


111006 Atopic 
Asthma-F 


5.5 


104702 (SS) OA 
Synovium-Backus 




111417 Allergy-M 


26.6 


117093 OA Cartilage 
Rep7 


42.0 


112347 Allergy-M 


8.6 


112672 OA Bone5 


18.4 


112349 Normal Lung- 
F 


9.2 


112673 OA Synoviums 


8.0 


112357 Normal Lung- 
F 


0.6 


112674 OA Synovial 
Fluid cells5 


2.4 


1 12354 Normal Lung- 
M 


1.3 


117100 OA Cartilage 
Rep 14 


5.5 
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II 2374 Crohns-F 


4.0 


H2756 OA Bone9 


3.5 


ll 23 89 Match 
Control Crohns-F 


1.3 


ll 2757 OA Synovium9 


2.1 


H2375 Crohns-F 


4.5 


1 12758 OA Synovial 
Fluid Cells9 


11.9 


H2732 Match 
Control Crohns-F 


0.0 


1 1 71 25 RA Cartilage 
Rep2 


2.5 


H2725 Crohns-M 


9.9 


1 13492 Bone2RA 


1.7 


H2387 Match 
Control Crohns-M 


1 5.3 


113493 Synovium2RA 


0.0 


H2378 Crohns-M 


9.7 


113494 Syn Fluid Cells: 
RA 


1.6 


H2390 Match 
Control Crohns-M 


23.7 


113499 Cartilage4RA 


0.4 


H2726 Crohns-M 


2.9 


1 13500 Bone4RA 


1.3 


H2731 Match 
Control Crohns-M 


1 .4 


113501 Synovium4 RA 


0.0 


ll 2380 Ulcer Col-F 


8.7 


113502 Syn Fluid 
Cells4 RA 


0.2 


H2734 Match 
Control Ulcer Col-F 


0.7 


113495 Cartilage3 RA 


0.2 


H2384 Ulcer Col-F 


25.3 


113496 Bone3 RA 


0.6 | 


ll 2737 Match 
Control Ulcer Col-F 


3.0 


113497 Synovium3RA 


0.9 


H2386 Ulcer Col-F 


3.3 


113498 Syn Fluid 
Cells3 RA 


0.6 


H2738 Match 
Control Ulcer Col-F 


0.6 


117106 Normal 
Cartilage Rep20 


14.1 


H2381 Ulcer Col-M 


1.8 


113663 Bone3 Normal 


8.5 


H2735 Match 
Control Ulcer Col-M 


32.8 


1 13664 Synovium3 
Normal 


4.0 


H2382 Ulcer Col-M 


1.5 


1 13665 Syn Fluid 
Cells3 Normal 


5.7 


H2394 Match 
Control Ulcer Col-M 


1.8 


117107 Normal 
Cartilage Rep22 


13.8 


H2383 Ulcer Col-M 


6.3 


1 13667 Bone4 Normal 


17.2 


H2736 Match 
Control Ulcer Col-M 


l.l 


113668 Synovium4 
Normal 


14.7 


1 12423 Psoriasis-F 


12.7 


113669 Syn Fluid 
Cells4 Normal 


7.6 



Table AXC. CNS_neurodegeneration_vl.O 



Tissue Name 


ReL Exp.(%) 


ReL Exp.(%) 


Tissue 


Rel. Exp.(%) 


Rel. Exp.(%) 


Ag3024, Run 


Ag3024, Run j 


Name 


Ag3024, Run 


Ag3024, Run 
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1 flitf i':?'r: b .f'ii.j;_ p ,. njt 3.'* op 






211011006 


233677302 




211011006 


233677302 


AD 1 Hippo 


11.7 


15.4 


Control 

Temporal 
Ctx 


1.9 


1.9 


AD 2 Hippo 


23.3 


32.3 


Control 

fPntM A 
\\ clUI ) H 

Temporal 
Ctx 


25.2 


41.2 


AD 3 Hippo 


10.5 


13.6 


AD 1 

Occipital 
Ctx 


7.7 


11.7 


AD 4 Hippo 


6.2 


9.9 


r\YJ Z 

Occipital 
Ctx 

(Missing) 


0.0 


0.0 ■ 


AD 5 hippo 


77.9 


80.7 


AD 3 

Occipital 

Ctx 


2.9 


4.5 


AD 6 Hippo 


46.0 


45.7 


AD 4 

Occipital 

Ctx 


16.6 


27.2 


Control 2 
Hippo 


28.7 


53.6 


AD 5 

Occipital 

Ctx 


10.9 


20.3 


Control 4 
Hippo 


2.8 


2.4 


AD 6 

Occipital 

Ctx 


61.1 


51.4 


Control (Path) 
3 Hippo 


4.5 


3.8 


Control 1 
Occipital 
Ctx 


1.3 


1.1 


AD 1 Temporal 
Ctx 


4.6 


7.0 


Control 2 
Occipital 
Ctx 


52.1 


72.7 


AD 2 Temporal 
Ctx 


16.8 


26.1 


Control % 
Occipital 
Ctx 


9.9 


13.1 


AD 3 Temporal 
Ctx 


2.9 


3.8 


Control 4 
Occipital 
Ctx 


1.6 


1.4 


AD 4 Temporal 
Ctx 


15.8 


19.1 


Control 
(Path) 1 
Occipital 
Ctx 


81.2 


87.1 


AD 5 Inf 
Temporal Ctx 


47.6 


94.6 


Control 
(Path) 2 
Occipital 
Ctx 


12.2 


16.5 
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AD 5 

SupTemporal 
Ctx 


42.9 
• 


46.0 


Control 
(Path) 3 
Occipital 
Ctx 


0.6 


0.6 


AD 6 Inf 
Temporal Ctx 


22.8 


29.9 


Control 
(Path) 4 
Occipital 
Ctx 


11.0 


14.3 


AD 6 Sup 
Temporal Ctx 


24.0 


30.1 


Control 1 
Parietal Ctx \ 


3.5 


4.0 


Control 1 
Temporal Ctx 


2.3 


2.8 


Control 2 
Parietal Ctx 


16.3 


16.8 


Control 2 
Temporal Ctx 


35.8 


48.0 


Control 3 
Parietal Ctx 


17.2 


26.2 


Control 3 
Temporal Ctx 


13.1 


14.5 


Control 
(Path) 1 
Parietal Ctx 


100.0 


100.0 


Control 4 
Temporal Ctx 


4.4 


4.2 


Control 
(Path) 2 
Parietal Ctx 


29.3 


38.7 


Control (Path) 
1 Temporal Ctx 


56.3 


68.8 


Control 
(Path) 3 
Parietal Ctx 


1.1 


2.4 


Control (Path) 
2 Temporal Ctx 


47.6 


54.0 


Control 
(Path) 4 
Parietal Ctx 


42.0 


56.6 



Table AXD. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3024, 
Run 165517896 


Tissue Name 


Rel. Exp.(%) Ag3024, 
Run 165517896 


Liver adenocarcinoma 


0.4 


Kidney (fetal) 


0.2 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


2.4 


Adrenal gland 


2.6 


Renal ca. RXF 393 


0.0 


Thyroid 


0.7 


Renal ca. ACHN 


0.0 


Salivary gland 


0.0 


Renal ca. UO-3 1 


0.0 


Pituitary gland 


2.0 


Renal ca. TK-10 


0.0 


Brain (fetal) 


11.0 


Liver 


0.7 


Brain (whole) 


54.7 


Liver (fetal) 


0.0 


Brain (amygdala) 


27.2 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


12.9 


Lung 


0.0 


Brain (hippocampus) 


36.3 


Lung (fetal) 


0.0 


Brain (substantia nigra) 


5.7 


Lung ca. (small cell) 


0.0 
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LX-1 




Brain (thalamus) 


64.2 


Lung ca. (small cell) 
NCT-H69 


0.0 


Cerebral Cortex 


100.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


3.8 


Lung ca. (large 
cell)NCI-H460 


0 0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


0 7 


glio/astro U-l 1 8-MG 


14.7 


Lung ca. (non-s.cell) 
NCI-H23 


0 0 


astrocytoma SW1783 


1.7 


Lung ca. (non-s.cell) 
HOP-62 


0.6 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


2.1 


actmpvtnmfl QTsJ R _7 S 
doll KJK^y LVJl 1 It* 01M-> / J 


6 0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.6 


Mammary gland 


0.4 


glUJIllct U/.J 1 


0 0 


Breast ca.* (pl.ef) 
MCF-7 


11.0 


glioma. jr-z,7J 


0 0 


Breast ca.* (pl.ef) 
MDA-MB-23 1 


0.9 


Heart (fetaH 


0.0 


Breast ca.* (pl.ef) 
T47D 


2.3 


Heart 


0.6 


Breast ca. BT-549 


1.0 


Skeletal muscle (fetal) 


1.9 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.4 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 

3 


0.0 


Thymus 


n n 
U.U 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.7 


Ovarian ca. OVCAR- 
5 


0.2 


Lymph node 


0.0 


Ovarian ca. OVCAR- 
8 


0.0 


Colorectal 


0.8 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


6.9 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


1.4 


Uterus 


2.2 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


0.4 
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Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


1.7 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T * 


0.0 


Colon ca. 
tissue(UUiJiooo) 


1.0 


Melanoma* (met) 

naOOO^D ). 1 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


0.2 


Gastric ca.* (liver met) 
NCI-N87 


0.4 


Melanoma M14 


0.0 


Bladder 


0.7 


Melanoma LOX 
IMVI 


0.0 


Trachea 


2.4 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.6 


Adipose 


0.0 



Table AXE. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag3024, Run 
163577593 


Tissue Name 


Rel. Exp.(%) 
Ag3024, Run 
163577593 


Normal Colon 


4.7 


Kidney Margin 
8120608 


1.3 


CC Well to Mod Diff 
(OD03866) 


5.0 


Kidney Cancer 
8120613 


100.0 


CC Margin (OD03866) j 


3.1 


Kidney Margin 
8120614 


5.3 


CC Gr.2 rectosigmoid 
(OD03868) 


0.0 


Kidney Cancer 
9010320 


0.8 


CC Margin (OD03868) 


0.3 


Kidney Margin 
9010321 


6.3 


CC Mod Diff (ODO3920) 


0.0 


Normal Uterus 


0.7 


CC Margin (ODO3920) 


0.5 


Uterus Cancer 06401 1 


9.4 


CC Gr.2 ascend colon 
(OD03921) 


3.1 


Normal Thyroid 


5.7 


CC Margin (OD03921) 


2.1 


Thyroid Cancer 
064010 


0.3 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


1.8 


Thyroid Cancer 
A302152 


2.0 


Liver Margin (ODO4309) 


0.0 


Thyroid Margin 
A302153 


23.5 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


7.9 


Lung Margin (OD0445 1 - 
02) 


0.0 


Breast Cancer 
(OD04566) 


7.4 
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Normal Prostate 6546-1 


3.7 


Breast Cancer 1 0 „ 
(OD045 90-01) | 


Prostate Cancer 
(OD04410) 


5.2 


Breast Cancer Mets , , 
(OD04590-03) J 


Prostate Margin 


5.4 


Breast Cancer 

Metastasis 

(OD04655-05) 


3.8 


Prostate Cancer 
(OD04720-01) 


1 1 9 

1 1.7 


Breast Cancer 064006 


7.1 


Prostate Margin 
(OD04720-02) 


6 4 


Breast Cancer 1024 


6.2 


Normal Lung 061010 


0.0 


Breast Cancer 
9100266 


27.7 


Lung Met to Muscle 


0.6 


Breast Margin 
9100265 


4.3 


Muscle Margin 
(OD04286) 


1.0 


Breast Cancer 
A209073 


30.8 


Lung Malignant Cancer 
(OD03126) 


0.0 


Breast Margin 
A7090734 


3.0 


Lung Margin (UIJU3 Izoj ; 


1 A 

1 .0 


Normal Liver 


1.3 


Lung Cancer (OD04404) 


0.0 


Liver Cancer 064003 


A A 


Lung Margin (OD04404) 


1.0 


Liver Cancer 1025 


0.0 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


0.0 


Lung Margin (OD04565) 


0.0 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237- 
01) j 


8.4 


Liver Tissue 6004-N 


1.1 


Lung Margin (OD04237- 
02) 


0.0 


T ivpr Panrpr ^OOS-T 
LjlVCI l^dliCCI \J\jyJJ~ 1 


0.0 


Ocular Mel Met to Liver 
(ODO4310) 


2.5 




0.0 


Liver Margin (OD043 10) 


0.0 


Normal Bladder 


2.8 


Melanoma Mets to Lung 
(OD04321) 


0.4 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


0.5 


Bladder Cancer 
A302173 


2.2 


Normal Kidney 


23.7 


Bladder Cancer 
(OD047 18-01) 


0.7 


Kidney Ca, Nuclear grade 
2 (OD04338) 


1.2 


Bladder Normal 
Adjacent (OD04718- 
03) 


0.4 


Kidney Margin 
(OD04338) 


10.6 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


0.0 


Ovarian Cancer 
064008 


1.4 


Kidney Margin 
(OD04339) 


7.7 


Ovarian Cancer 
(OD04768-07) 


0.0 
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Kidney Ca, Clear cell 
type (OD04340) 




Ovary Margin 
(OD04768-08) 


3.3 


Kidney Margin 
(OD04340) 


1 7 0 


I r\ m~\ r A 1 r\ \~\~\ '\ c Vi 
INUiilldl C5 LVJlIlcHwlI 


17.8 


Kidney Ca, Nuclear grade 
3 (OD04348) 




Gastric Cancer 
9060358 


1.0 


Kidney Margin 
(OD04348) 


J . 1 


Stomach Margin 
9060359 


13 6 


Kidney Cancer 
(OD04622-01) 


0.0 


Gastric Cancer 
9060395 


7.5 


Kidney Margin 
(OD04622-03) 


2.8 


Stomach Margin 
9060394 


4.1 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


13.4 


Kidney Margin 
(OD04450-03) 


17.8 


Stomach Margin 
9060396 


6.7 


Kidney Cancer 8120607 


1.7 


Gastric Cancer 
064005 


1.2 



Table AXF. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Ag3024, Run 
164886426 


Tissue Name 


ReL Exp.(%) 
Ag3024, Run 
164886426 


Daoy- Medulloblastoma 


1.6 


Ca Ski- Cervical epidermoid | 
carcinoma (metastasis) 


0.0 


TE671- Medulloblastoma 


0.0 


ES-2- Ovarian clear cell 
carcinoma 


0.3 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498- CNS 


0.6 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.6 


SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


0.4 


Daudi- Burkitt's lymphoma 


0.0 


T98G- Glioblastoma 


14.1 


U266- B-cell plasmacytoma 


0.0 


SK-N-SH- 

Neuroblastoma 

(metastasis) 


0.2 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.4 


RL- non-Hodgkin's B-cell 
lymphoma 


0.0 


Cerebellum 


15.3 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


5.4 


Jurkat- T cell leukemia 


0.2 


NCI-H292- 


0.0 


TF-1 - Erythroleukemia 


0.0 
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Mucoepidermoid lung 
carcinoma 








DMS-1 14- Small cell 
lung cancer 


0 0 

VJ.KJ 


HT IT 78- T-cell Ivmnhoma 


0.0 


DMS-79- Small cell lung 
cancer 


D .J 


T TQ17- Vlictinr'vtip lvmnhntna 


0.0 


NCI-H146- Small cell 1 
lung cancer 


0.0 


KU-8 1 2- Myelogenous 
leukemia 


0.0 


NCI-H526- Small cell 
lung cancer 


57.4 


769-P- Clear cell renal 
carcinoma 


0.0 


NCI-N417- Small cell 
lung cancer 


14.4 


Caki-2- Clear cell renal 
carcinoma 


32.8 


NCI-H82- Small cell 
lung cancer 


0.0 


SW 839- Clear cell renal 
carcinoma 


0.2 


NCI-H157- Squamous 
cell lung cancer 
(metastasis) 


0.0 


G401- Wilms' tumor 


0.0 


NCI-HI 155- Large cell \ 
lung cancer 


2.9 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


0.0 


NCI-H1299- Large cell 
lung cancer 


0.3 


CAPAN-1 - Pancreatic 
adenocarcinoma (liver 
metastasis) 


0.0 


NCI-H727- Lung 
carcinoid 


1J.7 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


i 5 


NCI-UMC-ll-Lung 
carcinoid 


1 7 
1 .Z 


BxPC-3- Pancreatic 
adenocarcinoma 


0 0 


LX-1- Small cell lung 
cancer 


u.u 


HP AC- Pancreatic 
adenocarcinoma 


0.3 




0 0 


MIA PaCa-2- Pancreatic 
carcinoma 


0.0 


is-ivi i z- i^oion cancer 


U.U 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


0.0 


IXlVlZ.UL'Z. V_^vJHJII C-d.IlV-'CI 


0 0 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


0.8 


Mr^T 1-4716- Pnlnn ranrpr 




T24- Bladder carcinma 
(transitional cell) 


0.0 


SW-48- Colon 
adenocarcinoma 


0.0 


5637- Bladder carcinoma 


0.0 


SW1116- Colon 
adenocarcinoma 


0.0 


HT-1 197- Bladder carcinoma 


0.0 


LS 174T- Colon 
adenocarcinoma 


0.0 


UM-UC-3- Bladder carcinma 
(transitional cell) 


0.0 


SW-948- Colon 
adenocarc i noma 


0.0 


A204- Rhabdomyosarcoma 


0.0 


SW-480- Colon 
adenocarcinoma 


1.1 


HT-1 080- Fibrosarcoma 


0.3 
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NCI-SNU-5- Gastric 
carcinoma 


U.U 


1 VI vj — VJ_> WolCUoul \j\J\Y\CL 


0 0 


KATO III- Gastric 
carcinoma 


u.u 


SK-LMS-1- Leiomyosarcoma; 
(vulva) 




NCI-SNU-16- Gastric 
carcinoma 


n a 


SJRH30- Rhabdomyosarcoma; 
(met to bone marrow) 




NCI-SNU-1- Gastric 
carcinoma 


u.u 


/v+j>i- j^piuenrioiu carciiioiiia 


0 0 


RF-1- Gastric 
adenocarcinoma 


u.u 


W 1V1Z,DD _, 4— ivicidnouici 


0 0 


RF-48- Gastric 
adenocarcinoma 


u.u 


DU 145- Prostate carcinoma 
(brain metastasis) 


0 0 

u.u 


MKN-45- Gastric 
carcinoma 


0.0 


MDA-MB-468- Breast 
adenocarcinoma 


1.3 


NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.2 


SCC-15- Squamous cell 
carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous cell 
carcinoma of tongue 


0.0 



Table AXG. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3024, Run 
162427416 


Tissue Name 


Rel. Exp.(%) 
Ag3024, Run 
162427416 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


31.4 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


14.9 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 | 


8.8 


Secondary Th2 rest 


4.4 


HUVEC IL-11 


11.4 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


10.1 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


8.7 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


11.3 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


3.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


5.3 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


3.7 



1183 



1 




Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


66.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


21.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


10.4 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0 0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 5 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


CCD 1 106 (Keratinocytes) 
i in r aipna -r i 1 ueia 


0.0 


lak cells 1L-Z 


U.U 


Liver cirrhosis 


1 7 7 
1 / . / 


LAK cells IL-Z+IL-IZ 


U.U 


Lupus kidney 


D .O 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


3.2 


LAK Cells IL-z-r 1L- 1 o 


U.U 






LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


10.4 


jnk i^eiis il-z rest 


U.U 


XhPT T-JOQO TT 11 


0 0 


i wo way iviLK j uay 


U.U 


IN Ll-nZVZ TrlN gdmnid 


0 0 
\j .yj 


Two Way MLR 5 day 


0.0 


HPAEC none 


6.3 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-l 
beta 


3.2 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-l beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


4 7 


T nnp fibroblast IL-1^ 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 . 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-l beta 


4.1 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 


6.1 
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Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


8.7 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


7.7 


Monocytes LPS 


0.0 


Colon 


24.1 


Macrophages rest 


0.0 


Lung 


14.5 


Macrophages LPS 


5.3 


Thymus 


100.0 


HUVEC none 


24.7 


Kidney 


0.0 


HUVEC starved 


11.8 







Table AXH. Panel CNS 1 



Tissue Name 


ReL Exp.(%) Ag3024, 
Run 171oV45Jo 


Tissue Name 


ReL Exp.(%) Ag3024, 

p lin AHA AOislS 


BA4 Control 


38.2 


BA17PSP 


27.7 


BA4 Control2 


73.2 


BA17 PSP2 


10.7 


BA4 

Alzheimer's2 


7.7 


Sub Nigra Control 


11.7 


BA4 Parkinson's 


* 60.7 


Sub Nigra Control2 


20.4 


BA4 

Parkinson's2 


85.3 


Sub Nigra 
Alzheimer's2 


6.4 


BA4 

Huntington's 


44.1 


Sub Nigra 
Parkinson's2 


29.5 


BA4 

Huntington's2 


8.7 


Sub Nigra 
Huntington's 


28.9 


BA4 PSP 


7.7 


Sub Nigra 
Huntington's2 


20.7 


BA4 PSP2 


39.5 


Sub Nigra PSP2 


1.5 


BA4 Depression 


2.1 


Sub Nigra 
Depression 


0.8 


BA4 

Depression2 


5.8 


Sub Nigra 
Depression2 


4.7 


BA7 Control 


66.0 


Glob Palladus 
Control 


3.9 


BA7 Control2 


52.5 


Glob Palladus 
Control2 


6.9 


BA7 

AIzheimer f s2 


8.8 


Glob Palladus 
Alzheimer's 


5.5 


BA7 Parkinson's 


20.4 


Glob Palladus 
Alzheimer's2 


2.8 


BA7 

Parkinson's2 


64.6 


Glob Palladus 
Parkinson's 


48.6 


BA7 

Huntington's 


55.1 


Glob Palladus 
Parkinson's2 


7.1 
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BA7 

Huntington's2 


54.7 


Glob Paliadus PSP 


3.2 


BA7 PSP 


57.8 


Glob Paliadus PSP2 


3.0 


BA7 PSP2 


26.4 


Glob Paliadus 
Depression 


2.1 


BA7 Depression i 


7.7 


Temp Pole Control 


14.8 


BA9 Control 


31.4 


Temp Pole Control2 


57.0 


BA9 Controlz 


lUU.O 


Temp Pole 
Alzheimer's 


O. 1 


B A9 Alzheimer s 


A O 

4.8 


Temp Pole 
Alzheimer's2 




BA9 

Alzheimer's2 


1 A C 

14.5 


Temp Pole 
Parkinson's 


ZJ.Z 


BA9 Parkinson s 


31.6 


Temp Pole 
Parkinson's2 


jU. 1 


BA9 

Parkinson f s2 


56.3 


Temp Pole 
Huntington's 


41,8 


BA9 

Huntington's 


53.2 


Temp Pole PSP 


3.5 


BA9 

Huntington's2 


14 3 


Temn Pole PSP2 


3.3 


BA9 PSP 


17.7 


Temp Pole 
Depression2 


1.9 


BA9 PSP2 


5.1 


Cing Gyr Control 


67.8 


BA9 Depression 


9.0 


Cing Gyr Control2 


38.4 


BA9 

Depression2 


7.4 


Cing Gyr 
Alzheimer's 


7.4 


BA1 7 Control 


51.8 


Cing Gyr 
Alzheimer's2 


12.2 


BA17 Control2 


54.0 


Cing Gyr Parkinson's 


15.4 


BA17 

Alzheimer's2 


Z. / 


Cing Gyr 
Parkinson's2 


39 5 


BA17 
Parkinson's 


Z3.Z 


Cing Gyr 
Huntington's 


48 0 


BA17 

Parkinson's2 


53 6 


Cing Gyr 
Huntington's2 


14.2 


BA17 

Huntington's 


41.2 


Cing Gyr PSP 


13.9 


BA17 

Huntington's2 


9.5 


Cing Gyr PSP2 


5.0 


BA17 
Depression 


6.0 


Cing Gyr Depression 


5.7 


BA17 

Depression2 


14.4 


Cing Gyr 
Depression2 


9.7 
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AI_comprehensive panel vl.O Summary: Ag3024 The NOV55 gene is found at low 
but significant levels in lung tissue from COPD, emphysema and asthma patients. This 
expression is consistent with panel 4D which shows expression in small airway epithelium. 
Therefore, this gene could be a marker or a target for lung inflammatory diseases. 
5 CNSneurodegenerationvl.O Summary: Ag3024 Results of two experiments with 

the same probe and primer set confirm expression of the NOV55 gene in the brain. Please see 
Panel 1 .3D for discussion of utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag3024 Expression of the NOV55 gene, a heparin sulfate 
proteoglycan homolog, is highly brain preferential and suggests a role for this gene product in 

10 CNS processes. Heparin sulfate proteoglycans (HSPGs)are a component of amyloid plaques in 
Alzheimer's disease. The interaction of apoE with HSPGs has also been implicated in the 
pathogenesis of Alzheimer's disease and may play a role in neuronal repair. apoE has an 
HSPG-binding site highly complementary to heparan sulfates rich in N- and O-sulfo groups. 
Therefore, enzymes that influence the structure of HSPGs, such as the putative protein product 

15 of the NOV55 gene, may influence protein agregation and the functional processes underlying 
Alzheimer's disease. Thus, agents that target and modulate the activity of this gene product 
may be effective in the treatment of neurodegenerative diseases including Alzheimer's disease. 
This gene is also expressed in breast and brain cancer cell lines at low but significant levels. 
Therefore, the expression of this gene could be of use as a marker for breast and brain cancer. 

20 In addition, therapeutic inhibition of the activity of the product of this gene, through the use of 
antibodies or small molecule drugs, may be useful in the therapy of brain and breast cancer. 
References: 

Libeu CP, Lund-Katz S, Phillips MC, Wehrli S, Hernaiz MJ, Capila I, Linhardt RJ, 
Raffai RL, Newhouse YM, Zhou F, Weisgraber KH. New insights into the heparan sulfate 
25 proteoglycan-binding activity of apolipoprotein E. J Biol Chem 2001 Oct 19;276(42):39138- 
44 

Defective binding of apolipoprotein E (apoE) to heparan sulfate proteoglycans 
(HSPGs) is associated with increased risk of atherosclerosis due to inefficient clearance of 
lipoprotein remnants by the liver. The interaction of apoE with HSPGs has also been 
30 implicated in the pathogenesis of Alzheimer's disease and may play a role in neuronal repair. 
To identify which residues in the heparin-binding site of apoE and which structural elements 
of heparan sulfate interact, we used a variety of approaches, including glycosaminoglycan 
specificity assays, (13)C nuclear magnetic resonance, and heparin affinity chromatography. 
The formation of the high affinity complex required Arg-142, Lys-143, Arg-145, Lys-146, and 
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Arg-147 from apoE and N- and 6-O-sulfo groups of the glucosamine units from the heparin 
fragment. As shown by molecular modeling, using a high affinity binding octasaccharide 
fragment of heparin, these findings are consistent with a binding mode in which five 
saccharide residues of fully sulfated heparan sulfate lie in a shallow groove of the alpha-helix 
5 that contains the HSPG-binding site (helix 4 of the four-helix bundle of the 22-kDa fragment). 
This groove is lined with residues Arg-136, Ser-139, His-140, Arg-142, Lys-143, Arg-145, 
Lys-146, and Arg-147. In the model, all of these residues make direct contact with either the 
2-O-sulfo groups of the iduronic acid monosaccharides or the N- and 6-O-sulfo groups of the 
glucosamine sulfate monosaccharides. This model indicates that apoE has an HSPG-binding 

10 site highly complementary to heparan sulfate rich in N- and O-sulfo groups such as that found 
in the liver and the brain. 

Inoue S. Basement membrane and beta amyloid fibrillogenesis in Alzheimer's disease. 
Int Rev Cytol 200 1 ;2 1 0: 1 2 1 -6 1 

, High-resolution ultrastructural and immunohistochemical studies revealed that in situ 

15 beta amyloid fibrils of Alzheimer's disease were made up of a core consisting of a solid 

column of amyloid P component (AP) and associated chondroitin sulfate proteoglycan, and a 
heparan sulfate proteoglycan surface layer with externally associated fine filaments of beta 
protein. The main body of beta amyloid fibrils closely resembled that of microfibrils. 
Abundant microfibrils were reported to be present at the basement membrane of capillaries 

20 with "leaky" blood-urine or blood-air barriers. Similarly, abundant microfibril-like beta 
amyloid fibrils are formed at the microvascular basement membrane in cerebrovascular 
amyloid angiopathy with altered blood-brain barrier. Since AP is an indispensable major 
component of microfibrils and microfibril-like structures, the formation of microfibrils may 
depend on, among other factors, the availability of AP. Thus, in beta amyloid fibrillogenesis 

25 fibrils may be built around AP which continuously leaks out from circulation into vascular 
basement membrane, and beta amyloid fibrils may be regarded as pathologically altered 
basement membrane-associated microfibrils. With no source of AP around them, senile plaque 
fibrils may also be derived from perivascular amyloid. 

Panel 2D Summary: Ag3024 The NOV55 gene is expressed at low but significant 

30 levels in most of the samples on this panel, with highest expression in a kidney cancer sample 
(CT=30.6). Significant levels of expression are also seen in samples derived from breast and 
gastric cancer samples. 

Therefore, expression of this gene could be of use as a marker for breast and gastric 
cancer. In addition, therapeutic inhibition of the activity of the product of this gene, through 
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the use of antibodies or small molecule drugs, may be useful in the therapy of breast and 
gastric cancer. 

Panel 3D Summary: Ag3024 The NOV55 gene is expressed at low but significant 
levels in cell lines from a renal carcinoma, colon cancer, glioblastoma and three lung cancer 
5 lines. Thus, this gene could be a marker as well as a target for inhibition in these cancers. 

Panel 4D Summary: Ag3024 The NOV55 gene, a heparin Sulfate 6-Sulfotransferase 
3 homolog, is expressed at low but significant levels in thymus and small airway epithelium 
treated with TNFalpha + IL-lbeta (CTs=34). Thus, the NOV55 gene product may be a marker 
for thymus or activated small airway epithelium. 
10 Panel CNS__1 Summary: Ag3024 Expression of the NOV55 gene in this panel 

confirms the presence of this gene product in the brainl. Please see Panel 1.3D for discussion 
of utility of this gene in the central nervous system. 

NOV56a and NOV56b 

Expression of gene NOV56a and variant NOV56b was assessed using the primer-probe 
1 5 sets Ag3027 and Agl 169, described in Tables AYA and AYB. Results of the RTQ-PCR runs 
are shown in Tables AYC, AYD, AYE, AYF, AYG, AYH and AYI. 

Table AYA. Probe Name Ag3027 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forwards 


5 1 -aaaaccagatttggagttcgtt-3 ' ] 


22 


355 


1210 


Probe 


TET-5 ' -cttgaaatgtcctcaccacaactgat-3 1 - 
TAMRA 


26 


377 


1211 


Reverse 


5 ' -tccagatagatggtggaatcag- 3 1 


22 


425 


1212 


Table AYB. Probe Name Agl 169 


Primers: 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -aaaaccagatttggagttcgtt-3 1 


22 


355 


1213 


Probe 


TET-5 ' -cttgaaatgtcctcaccacaactgat-3 1 - 
TAMRA ■ 


26 


377 


1214 


Reverse 


5 ' -tccagatagatggtggaatcag- 3 ' 


22 


425 


1215 



20 



Table AYC. General_screening_panel_vl.5 



Tissue Name 


Rel. Exp.(%) Ag3027, 
Run 228714682 


Tissue Name 


Rel. Exp.(%) Ag3027, 
Run 228714682 


Adipose 


2.3 


Renal ca. TK-10 


10.4 


Melanoma* 


1.4 


Bladder 


11.0 
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Hs688(A).T 








Melanoma* 
Hs688(B).T 


1.0 


Gastric ca. (liver met.) 
NCI-N87 


0.6 


Melanoma* Ml 4 


0.0 


Gastric ca. KATO III 


7.4 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


43.8 


Melanoma* SK- 

MbLO 


1.9 


Colon ca. SW480 


6.2 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.* (SW480 
meij j> w ozu 


9.9 


Testis Pool 


0.7 


Colon ca. HT29 


2.0 


Prostate ca.* (bone 
met) PC-3 


0.7 


Colon ca. HCT-116 


0.5 


Prostate Pool 


3.8 


Colon ca. CaCo-2 


73.2 


Placenta 


0.0 


Colon cancer tissue 


21.8 


Uterus Pool 


0.4 


Colon ca. SW1116 


0.7 


Ovarian ca. 
OVCAR-3 


u.u 


C^r\\r\rt rr\ C^r\\r\ OO^ 

v^oion ca. i^oiu-zuj 




Ovarian ca. SK-OV- 
3 




i^oion ca. ow-^o 


KJ.J 


Ovarian ca. 
OVCAR-4 


0.8 


Colon Pool 


2.6 


Ovarian ca. 
OVCAR-5 


11.0 


Small Intestine Pool 


0.8 


Ovarian ca. IGROV- 
1 


6.9 


Stomach Pool 


2.0 


Ovarian ca. 
OVCAR-8 


1.1 


Bone Marrow Pool 


0.0 


Ovary 


0.8 


Fetal Heart 


1.4 


oreast ca. ivicr - / 


u.z 


i lean rooi 


U.J 


Breast ca. MDA- 

IVlxJ-Z j 1 


0.5 


Lymph Node Pool 


1.4 


Breast ca. BT 549 


0.5 


Fetal Skeletal Muscle 


1.1 


Breast ca. T47D 


0.4 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Breast Pool 


4.9 


Thymus Pool 


0.3 


Trachea 


0.3 


CNS cancer (glio/astro) 
U87-MG 


4.7 


Lung 


0.0 


CNS cancer (glio/astro) 
U-118-MG 


1.6 


Fetal Lung 


2.0 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.5 


Lung ca. LX-1 


100.0 


CNS cancer (astro) 


4.1 



1190 







SNB-75 




Lung ca. NCI-HI 46 


0.3 


CNS cancer (glio) 

IMNt>- i y 


13.8 


Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF- 

one 


3.6 




Lung ca. A549 


1 A 


Brain (Amygdala) Pool 


u.z 


Lung ca. fNCI-ri3ZO 


i n 
1 .u 


Brain (cerebellum) 


n i 

U. I 


Lung ca. NLI-Hzj 


z.o 


rsrain (jetaij 


u.z 


Lung ca. NCI-H460 


1.2 


Brain (Hippocampus) 
Pool 


0.3 


Lune ca HOP-62 


0.2 


Cerebral Cortex Pool 


0.5 


Lung ca. NCI-H522 


0.7 


Brain (Substantia nigra) 
Pool 


0.0 


Liver 


0.0 


Brain (Thalamus) Pool 


0.4 


Fetal Liver 


3.2 


Brain (whole) 


0.6 


Liver ca. HepG2 


0.5 


Spinal Cord Pool 


0.3 


Kidney Pool 


1.8 


Adrenal Gland 


1.1 


Fetal Kidney 


29.9 


Pituitary gland Pool 


0.3 


Renal ca. 786-0 


1.8 


Salivary Gland 


0.2 


Renal ca. A498 


0.7 


Thyroid (female) 


0.1 


Renal ca. ACHN 


3.2 


Pancreatic ca. CAPAN2 


3.1 


Renal ca. UO-31 


3.2 


Pancreas Pool 


4.4 



Table AYD. Panel 1.2 



Tissue Name 


Rel. Exp.(%) 
Agll69, Run 
129128191 


Rel. Exp.(%) | 
Agll69, Run! 
129656838 


Tissue Name 


Rel. Exp.(%) 
Agll69, Run 
129128191 


Rel. Exp.(%) 
Agll69, Run 
129656838 


Endothelial cells 


0.0 


24.3 


Renal ca. 786- 
0 


0.2 


10.5 


Heart (Fetal) 


0.0 


57.0 


Renal ca. A498 




10.9 


Pancreas 


0.0 


0.0 


Renal ca. RXF 
393 


0.9 


14.9 


Pancreatic ca. 
CAPAN 2 


0.2 


0.0 


Renal ca. 
ACHN 


1.5 


22.8 


Adrenal Gland 


0.0 


7.0 


Renal ca. UO- 
31 


5.6 


12.3 


Thyroid 


0.1 


0.1 


Renal ca. TK- 
10 


11.3 


1.7 


Salivary gland 


0.1 


6.2 


Liver 


0.0 


12.9 


Pituitary gland 


0.0 


0.3 


Liver (fetal) 


0.0 


1.4 


Brain (fetal) 


0.0 


0.0 


Liver ca. 

(hepatoblast) 

HepG2 


0.2 


4.5 
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Brain (whole) 


0.0 


4.5 


Lung 0.0 


14.9 


Brain 

(amygdala) 


0.0 


6.3 


Lung (fetal) 


0.0 


22.2 


Brain 

(cerebellum) 


0.0 


0.3 


Lung ca. 
( small cein 
LX-1 


27.0 


74.2 


Brain 

(hippocampus) 


0.0 


14.4 


Lung ca. 

( smal I pp! W 

NCI-H69 


0.0 


0.0 


Brain (thalamus) 


0.0 


1.2 


T i in (i pa (<z pp-11 

var.) SHP-77 


0.0 


0.0 


Cerebral Cortex 


0.0 


15.5 


T liner cz\ 

cell)NCI-H460 


0.0 


0.0 


Spinal cord 


0.0 


22.4 


Lung ca. (non- 

<:m ppIH AS4Q 


0.9 


0.0 


glio/astro U87- 
MG 


0.2 


0.0 


Lung ca. (non- 
s.cell) NCI- 


0.3 


0.0 


<j\\r\/c\<ztm I I- 

lil l\Jf doit \J V-J 

118-MG 


0.0 


0.0 


Lung ca. (non- 
o ce\W HOP-62 


0.1 


0.6 


SW1783 


0.0 


0.0 


Lung ca. (non- 


0.1 


0.1 


neuro*; met SK- 
N-AS 


0.0 


0.0 


Lung ca. 
(squam.) SW 
900 


6.3 


0.0 


astrocytoma SF- 
539 


0.0 


0.0 


Lung ca. 
fsauam ^ NCI- 
H596 


0.0 


0.0 


astrocytoma 
SNB-75 


0.0 


0.0 


Mammarv 

xv j. cxi ill 1 1 tit y 

gland 


100.0 


100.0 


glioma SNB-19 


0.4 


0.6 


Breast ca.* 
(pl.ef) MCF-7 


0.0 


4.0 


glioma U251 


0.1 


1.7 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


0.0 


0.0 


glioma SF-295 


0.5 


0.0 


Breast ca.* (pi. 
ef) T47D 


0 ft 


1.4 


Heart 


0.0 


58.2 


Breast ca. BT- 
549 


0.0 


0.1 


Skeletal Muscle 


0.0 


9.3 


Breast ca. 
MDA-N 


0.0 


0.0 


Bone marrow 


0.0 


0.0 


Ovary 


0.0 


14.0 


Thymus 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


5.2 


Spleen 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


15.3 
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Lymph node 


0.0 


3.0 


Ovarian ca. 
OVCAR-5 


50.7 


15.7 


Colorectal 
Tissue 


0.1 


0.0 


Ovarian ca. 
OVCAR-8 


1.5 


0.1 


Stomach 


8.4 


23.0 


Ovarian ca. 
IGROV-1 


43.5 


17.0 


Small intestine 


0.2 


10.0 


Ovarian ca. 
(ascites) SK- 
OV-3 


13.5 


17.0 


Colon ca. 
SW480 


0.6 


0.0 


Uterus 


0.2 


7.7 


Colon ca.* 
SW620 (SW480 
met) 


7.5 


0.4 


Placenta 


0.0 


75.8 


Colon ca. HT29 


0.0 


0.0 


Prostate 


12.1 


13.7 


Colon ca. HCT- 
116 


. 0.0 


0.0 


Prostate ca.* 
(bone met) PC- 
3 


0.0 


10.4 


Colon ca. CaCo- 
2 


34.4 


11.8 


Testis 


0.6 


0.2 


Colon ca. Tissue 
(OD03866) 


11.4 


15.9 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. HCC- 
2998 


2.0 


0.0 


Melanoma* 
(met) 

Hs688(B).T 


0.2 


0.0 


Gastric ca.* 
(liver met) NCI- 
N87 


0.0 


0.0 


Melanoma 


0.0 


0.0 


Bladder 


5.3 


11.3 


Melanoma 
M14 


0.0 


0.0 


Trachea 


0.0 


2.6 


Melanoma 
LOX IMVI 


0.0 


0.0 


Kidney 


4.5 


14.0 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Kidney (fetal) 


14.3 


31.2 







Table AYE. Panel 1.3D 



Tissue Name 

^ — — « 


Rel. Exp.(%) 
Agll69, Run 
165518394 


Rel. Exp.(%) 
Ag3027, Run; 
165519993 


Tissue Name 


Rel. Exp.(%) 
Agll69, Run 
165518394 


Rel. Exp.(%) 
Ag3027, Run 
165519993 


Liver 

adenocarcinoma 


0.5 


0.4 


Kidney (fetal) 


0.7 


0.0 


Pancreas 


0.0 


0.0 


Renal ca. 786- 
0 


0.5 


0.6 
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.i 



Pancreatic ca. 
CAPAN 2 


n 7 


ft 4. 


Renal ca. 
A498 


0.9 


1.6 


Adrenal gland 


0.0 


0.0 


Renal ca. RXF 
393 


0.0 


0.9 


Thyroid 


0.0 


0.0 


Renal ca. 
ACHN 


0.0 


0.0 


Salivary gland 


0.0 


0.0 


Renal ca. UO- 
31 


0.0 


0.0 


Pituitary gland 


0.0 


0.0 


Renal ca. TK- 
10 


2.1 


0.4 


Brain (fetal) 


0.0 


0.0 


Liver 


0.0 


0.0 


Brain (whole) 


0.0 


0.0 


Liver (fetal) 


0.0 


0.0 


Brain (amygdala) 


0.0 


0.0 


Liver ca. 

(hepatoblast) 

HepG2 


0.0 


0.0 


Brain (cerebellum) 


0.0 


0.0 


Lung 


0.0 


0.0 


Brain 

(hippocampus) 


0.0 


0.4 


Lung (fetal) 


0.0 


0.0 


Brain (substantia 
nigra) 


0.3 


0.0 


Lung ca. 
(small cell) 
LX-1 


21.5 


12.0 

■ — — — — - 


Brain (thalamus) 


0.0 


0.0 


Lung ca. 
(small cell) 
NCI-H69 


0.0 


0.0 


Cerebral Cortex 


0.0 


0.5 


Lung ca. 
(s.cell var.) 
SHP-77 


0.0 


0.0 


Spinal cord 


0.0 


0.3 


Lung ca. (large 
cell)NCI-H460 


0.0 


1.3 


1 - / . T TOT * Jf 

glio/astro U87-MG 


0.0 


0.3 


Lung ca. (non- 
sm. cell) A549 


u.u 


U.U 


glio/astro U-l 18- 
MG 


0.0 


03 


Lung ca. (non- 
sxell) NCI- 
H23 


0.7 


0.3 


astrocytoma 
SW1783 


0.0 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


0.0 


0.0 


neuro*; met SK-N- 

A C 

Ab 


0.0 


0.0 


Lung ca. (non- 
s.cl) NCI- 
H522 


0.0 


0.0 


astrocytoma SF- 
539 


0.3 


0.0 


Lung ca. 
(squam.) SW 
900 


2.8 


0.0 


astrocytoma SNB- 
75 


1.9 


0.9 


Lung ca. 
(squam.) NCI- 
H596 


0.0 


0.0 


glioma SNB-19 


0.0 


0.4 


Mammary 
gland 


100.0 


100.0 
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glioma U251 


0.4 


0.4 


Breast ca.* 
(pl.ef) MCF-7 i 


0.5 


0.0 


glioma SF-295 


0.0 


0.0 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


0.0 


0.3 


Heart (fetal) 


0.0 


0.0 


Breast ca.* 
(pl.ef) T47D 


0.0 


0.0 


Heart 


0.0 


0.0 


Breast ca. BT- 
549 


0.3 


0.0 


Skeletal muscle 
(fetal) 


0.0 


0.0 


Breast ca. 
MDA-N 


0.0 


0.0 


Skeletal muscle 


0.0 


0.0 


Ovary 


0.0 


0.0 


Rnnp marrow 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


0.0 


Thymus 


0.9 


0.0 


Ovarian ca. 
OVCAR-4 


0.4 


0.0 




0.7 


0.0 


Ovarian ca. 
OVCAR-5 


0.9 


1.4 


Lymph node 


0.0 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


0.0 




0 9 


0.2 


Ovarian ca. 
fGROV-1 


1.0 


2.3 


Stomach 


1.0 


0.6 


Ovarian ca.* 
(ascites) SK- 
OV-3 


1.4 


1.8 


Small intestine 


0.7 


0.0 


Uterus 


0.5 


0.0 


Colon ca. SW480 


0.8 


0.4 


Placenta 


0.0 


0.0 


Colon ca.* 

SW620(SW480 

met) 


0.6 


0.4 


Prostate 


0.3 


0.5 


Colon ca. HT29 


0.0 


0.8 


Prostate ca.* 
(bone met)PC- 

3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.9 


0.3 


Testis 


0.8 


1.2 


Colon ca. CaCo-2 

i 


6.7 


8.1 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03866) 


2.9 


2.0 


Melanoma* 
(met) 

Hs688(B).T 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.4 


0.3 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 
met) NCI-N87 


0.3 


0.0 


Melanoma 
M14 


0.0 


0.0 


Bladder 


1.7 


0.4 


Melanoma 
LOX 1MVI 


0.0 


0.0 
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Trachea 


0.0 


Melanoma* 
0.0 (met) SK- 
|MEL-5 


0.0 


0.0 


Kidney 


2.7 


1.3 (Adipose 


0.7 


0.0 



Table AYR Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag3027, Run 
163577594 


Tissue Name 


Rel. Exp.(%) 
Ag3027, Run 
163577594 


Normal Colon 


14.6 


Kidney Margin 
8120608 


5.9 


CC Well to Mod Diff 
(OD03866) 


15.1 


Kidney Cancer 
8120613 


34.4 


CC Margin (OD03866) j 


0.0 


Kidney Margin 
8120614 


5.7 


CC Gr.2 rectosigmoid 
(OD03868) 


15.5 


Kidney Cancer 
9010320 


10.5 


CC Margin (OD03868) 


0.7 


Kidney Margin 
9010321 


25.0 


CC Mod Diff (ODO3920)! 


64.2 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


2.2 


Uterus Cancer 06401 1 


18.4 


(OD03921) 


11.2 


Normal Thyroid 


1.6 


CC Margin (OD03 921) 


1.8 


Thyroid Cancer 
064010 


1.6 


CC from Partial 
Henatectomv CODO4309"> 
Mets 


16.0 


Thyroid Cancer 
A302152 


0.0 


Liver Margin (ODO4309) 


0.3 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


6.8 


Normal Breast 


28.7 


Lung Margin (OD04451- 
02) 


0.0 


Breast Cancer 
(OD045 66) 


0.5 


Normal Prostate 6546-1 


14.9 


Breast Cancer 
(OD04590-01) 


0.0 


Prostate Cancer 
(OD04410) 


29.7 


Breast Cancer Mets 
(OD04590-03) 


3.7 


Prostate Margin 
(OD04410) 


19.1 


Breast Cancer 

Metastasis 

(OD04655-05) 


1.7 


Prostate Cancer 
(OD04720-01) 


4.5 


Breast Cancer 064006 


0.4 


Prostate Margin 
(OD04720-02) 


3.0 


Breast Cancer 1 024 


55.1 
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1, Si ft '.7 r*Tt t S" 1 



1 r* 



Normal Lune 061010 


0.5 


Breast Cancer 
9100266 


1.9 


Lung Met to Muscle 
(OD04286) 


0.0 


Breast Margin 
9101)265 


8.2 


Muscle Margin 
(OD04286) 


0.9 


Breast Cancer 
A209073 


18.3 


Lung Malignant Cancer 
(OD03126) 


11.0 


Breast Margin 
A2090734 


34.6 


Lung Margin (OD03 126) 


1.1 


Normal Liver 


U.U 


Lung Cancer (OD04404) 


1.8 


Liver Cancer 064003 


15.0 


Lung Margin (OD04404) 


0.4 


Liver Cancer 1025 


0.0 


Lung Cancer (OD04565) 


0.5 


Liver Cancer 1 026 


0.0 


Lung Margin (OD04565) 


2.1 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237- 
01) 


3.3 


Liver Tissue 6004-N 


1.1 


Lung Margin (OD04237- , 
02) 


0.0 


Liver Cancer 6005-T 


2.6 


Ocular Mel Met to Liver 
(ODO4310) 


1.6 


Liver Tissue 6005-N 


0.0 


Liver Margin (ODO4310) 


0.0 


Normal Bladder 


13.5 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


32.5 


Lung Margin (OD04321) 


0.0 


Bladder Cancer 
A302173 


2.6 


Normal Kidney 


14.6 


Bladder Cancer 
(OD047 18-01) 


0.0 


Kidney Ca, Nuclear grade 


3.3 


Bladder Normal 
Adjacent (OD04718- 
03) 


1.0 


Kidney Margin 
(OD04338) 


3.1 


Normal Ovary 


n ft 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


*7 O 

/.y 


Ovarian Cancer 
064008 


1 R 
1 .o 


Kidney Margin 
(OD04339) 


Z 1 .u 


Ovarian Cancer 
(OD04768-07) 


ft ft 
\}.K) 


Kidney Ca, Clear cell 
type (OD04340) 


3.4 


Ovary Margin 
(OD04768-08) 


0.5 


Kidney Margin 
(OD04340) 


61.6 


Normal Stomach 


45.7 


Kidney Ca, Nuclear grade 
3 (OD04348) 


0.4 


Gastric Cancer 
9060358 


0.0 


Kidney Margin 
(OD04348) 


84.1 


Stomach Margin 
9060359 


8.2 


Kidney Cancer 
(OD04622-01) 


- - jGastric Cancer 
J9060395 


6.4 
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Kidney Margin 
(OD04622-03) 


1.4 


Stomach Margin 
9060394 


12.9 


Kidney Cancer 
(OD04450-01) 


39.2 


Gastric Cancer 
9060397 


42.9 


Kidney Margin 
(OD04450-03) 


51.8 


Stomach Margin 
9060396 


31.4 


Kidney Cancer 8120607 


4.0 


Gastric Cancer 
064005 


100.0 



Table AYG. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Agll69, Run 
164038616 


Tissue Name 


Rel. Exp.(%) 
Agll69, Run 
164038616 


1 JdKjy I V I CU Li 1 1U Lf ICl^lWI 1 1 CI 


1.9 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


0.0 


TE671- Medulloblastoma 


0.0 


ES-2- Ovarian clear cell 
carcinoma 


0.0 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498- CNS 


0.0 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.0 


SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


0.0 


Daudi- Burkitt's lymphoma 


0.0 


T98G- Glioblastoma 


0.0 


U266- B-cell plasmacytoma j 


0.0 


SK-N-SH- 

Neuroblastoma 

(metastasis) 


0.0 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.0 


RL- non-Hodgkin's B-cell 
lymphoma 


0.0 


Cerebellum 


0.0 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


0.0 


Jurkat- T cell leukemia 


0.0 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


1.8 


TF-1- Erythroleukemia 


0.0 


DMS-114- Small cell 
lung cancer 


0.0 


HUT 78- T-cell lymphoma 


0.0 


DMS-79- Small cell lung 
cancer 


100.0 


U937- Histiocytic lymphoma 


0.0 


NCI-H146- Small cell 
lung cancer 


0.0 


KU-812- Myelogenous 
leukemia 


0.0 


NCI-H526- Small cell 
lung cancer 


2.6 


769-P- Clear cell renal 
carcinoma 


1.0 
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NCI-N417- Small cell 
lung cancer 


0.0 


Caki-2- Clear cell renal 
carcinoma 


2.3 


NCI-H82- Small cell 
lung cancer 


0.0 


SW 839- Clear cell renal 
carcinoma 


3.7 


NCI-H157- Squamous 
cell lung cancer 
(metastasis) 


0.0 


G401- Wilms' tumor 


0.0 


NCI-H1155- Large cell 
lung cancer 


0.0 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


0.0 


NCI-H1299- Large cell 
lung cancer 


0.0 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


3.5 


NCI-H727- Lung 
carcinoid 


u.u 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


Z.J 


NCI-UMC-ll-Lung 
carcinoid 


1 .7 


BxPC-3- Pancreatic 
adenocarcinoma 


1 .z 


LX-1- Small cell lung 
cancer 


zU.o 


HP AC- Pancreatic 
adenocarcinoma 


J.J 


Colo-205- Colon cancer 


n n 
0.0 


MIA PaCa-2- Pancreatic 
carcinoma 


a n 
U.U 


KM 12- Colon cancer 


1Z.3 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


Z.O 


KMzULz- Colon cancer 


U.U 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


fi fi 

u.u 


jNi_.i-.ri / 10- c^oion cancer 


n n 
U.U 


T24- Bladder carcinma 
(transitional cell) 


fi fi 
u.u 


SW-48- Colon 
adenocarcinoma 


o.u 

, „„ „ .. .. ,„„ 


joj /- r>iaaaer carcinoma 


fi fi 
u.u 


SW1116- Colon 
adenocarcinoma 


Z.O 


i i i-i IV/- o lauaer carcinoma 


fi fi 


LS 174T- Colon 
adenocarcinoma 


Z.D.D 


UM-UC-3- Bladder carcinma 
(transitional cell) 


fi fi 
u.u 


SW-948- Colon 
adenocarcinoma 


in 7 


A On/1 D V\ar\f\f\YY\\rr\c£\i~r*r\m 

/\zuh- ivnaDUUinyubcirL'Uiiia 


fi fi 

u.u 


SW-480- Colon 
adenocarcinoma 


fi fi 
u.u 


UT.1 fiRfi_ Fihrncarfnma 


1 fi 


NCI-SNU-5- Gastric 
carcinoma 


1.0 


lVTir-f*^- O^tposarcoma 

1VAVJ vy jHwUOUI 


0.0 


KATO III- Gastric 
carcinoma 


0.0 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


0.0 


NCI-SNU-16- Gastric 
carcinoma 


0.0 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


2.0 


NCI-SNU-1- Gastric 
carcinoma 


29.7 


A431- Epidermoid carcinoma 


0.0 


RF-1- Gastric 
adenocarcinoma 


0.0 


WM266-4- Melanoma 


0.0 
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RF-48- Gastric 
adenocarcinoma 


0 0 


DU 145- Prostate carcinoma j 
(brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


3.5 


MDA-MB-468- Breast 
adenocarcinoma 


0.0 


NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue ; 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 


SCC-15- Squamous cell 
carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous cell 
carcinoma of tongue 


0.0 



Table AYH. Panel 4D 



Tissue 
Name 


Rel. 
Exp.(%) ! 
Agll69, ; 
Run 
139591349 


Rel. 

Exp.(%) j 
Agll69, 
Run 
145735616 


Rel. 
Exp.(%) 
Ag3027, 
Run 
162426723 


Tissue 
Name 


Rel. 
Exp.(%) 
Agll69, 
Run 
139591349 


Rel. ! 
Exp.(%) | 
Agll69, ] 
Run 
1457356161 


Rel. 

Exp.(%) 
Ag3027, 
Run 
162426723 


Secondary i 
Thl act 


0.0 


0.0 


0.0 


HUVEC 
IL-lbeta 


0.0 


0.0 


0.0 


Secondary ! 
Th2 act 


0.0 


0.0 


0.0 


HUVEC 

IFN 

gamma 


0.0 


0.0 

! 


0.7 


Secondary 
Trl act 


0.0 


0.0 


0.0 


HUVEC 
TNF alpha 
+ IFN 
gamma 


0.0 


0.0 


0.0 


Secondary ; 
Thl rest 


0.0 


0.0 


0.0 


HUVEC 
TNF alpha 
+ IL4 


0.4 


0.0 


0.0 


Secondary 
Th2 rest 


0.0 


0.0 


0.0 


HUVEC 
IL-11 


0.0 


0.0 


0.0 


Secondary 
Trl rest 


0.0 


0.0 


0.0 


Lung 

Micro vase 
ular EC 
none 


0.4 


0.0 


0.0 


Primary 
Thl act 


0.0 


0.0 


0.0 


Lung 

Micro vase 
ular EC 
TNFalpha 
+ IL-lbeta 


0.9 


0.0 


0.0 


Primary 
Th2 act 


0.0 


0.0 


0.0 


Microvasc 
ular 

Dermal EC 
none 


0.5 


0.0 


3.1 
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Primary 
Trl act 


0.0 


0.0 


0.0 


Microsvas 
ular 

Dermal EC 

IT n 1 1-\ h o 

i lNraipna 
+ IL-lbeta 


0.0 


0.0 


0.0 


Primary 
Thl rest 


0.0 


0.0 


0.0 


Bronchial 

epithelium 

TNFalpha 

i 1 1 v I DC Id 


1.0 


0.0 


5.1 


Primary 
Th2 rest 


0.0 


0.0 


0.0 


Small 
airway 
epithelium 
none 


0.0 


0.0 


2.0 


Primary 
Trl rest 


0.0 


0.0 


0.0 


Small 
airway 
epithelium 
TNFalpha 
+ 1JL- 1 beta 


0.0 


1.3 


1.9 


CD45RA 
CD4 

lympnocyx 
e act 


0.0 


0.0 


0.0 


Coronery 
artery 
SMC rest 


0.0 


0.0 


0.0 


CD45RO 
CD4 

lymphocyt 
e act 


n n 
u.u 


u.u 


u.u 


Coronery 
artery 

TNFalpha 
+ IL-lbeta 


U.U 


u.u 


u.u 


CD8 

\y IIipiltJL<y L 

e act 


u.u 


u.u 


u.u 


Astrocytes 
rest 


u.u 


u.u 


u.u 


Secondary 
CD8 

l\/m nhnrvt 
ly 1 \ iy)ii\j\*>y l 

e rest 


0.0 


0.0 


0.0 


Astrocytes 
TNFalpha 
+ IL-lbeta 


1.4 


3.1 


0.6 


Secondary 
CD8 

lymphocyt 
e act 


0.0 


0.0 


0.0 


KU-812 

(Basophil) 

rest 


0.0 


0.0 


0.0 


CD4 

lymphocyt 
e none 


0.0 


0.0 


0.0 


KU-812 

fRasnnhin < 

^UCIlJ W L/lll 1 1 

PMA/iono 
mycin 


0.0 


0.0 


0.0 


2ry 

Thl/Th2/Ti 
rl anti- 
CD95 
CHll 


0.0 j 


0.0 


0.0 


CCD 1106 
(Keratinoc 
ytes) none 


0.0 


0.0 


0.0 


LAK cells | 
rest 


0.0 


0.0 


0.0 


CCD 1106 
(Keratinoc 


0.0 


0.0 


0.0 
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;t nnypni p ... in; i.'^ 1 " 











ytes) 

TNFalpha 
+ IL-lbeta 








LAK cells I 
IL-2 


0.0 


0.0 


0.0 


Liver 
cirrhosis 


2.1 


1.3 


3.2 


LAK cells 

IL-2+IL- : 

12 


0.0 


0.0 


0.0 


Lupus 
kidney 


5.3 


3.7 


4.8 


LAK cells 
IL-2+IFN 1 
gamma 


0.0 


0.0 


0.0 


NCI-H292 
none 


0.9 


0.0 


2.2 


LAK cells i 

IL-2+IL- 

18 


0.0 


0.0 


0.0 


NCI-H292 
IL-4 


0.0 


1.2 


0.7 


LAK cells | 

PMA/iono 

mycin 


0.0 


0.0 


0.0 


NCI-H292 
IL-9 


0.3 


0.0 


0.9 


NK Cells 
IL-2 rest 


0.0 


0.0 


0.0 


NCI-H292 
IL-13 


0.0 


0.0 


0.7 


Two Way ; 
MLR 3 
day 


0.0 


0.0 


0.0 


NCI-H292 

IFN 

gamma 


0.0 


3.7 


0.0 


Two Way 
MLR 5 
day 


0.0 


0.0 


0.0 


HPAEC 
none 


36.1 


0.0 


1.1 


i w \j vv ay 

MLR 7 
day 


0.0 


0.0 


1.1 


HPAEC 
TNF alpha 
+ IL-1 beta 


0.0 


0.0 


0.0 


PBMC rest 


0.0 


0.0 


0.0 


Lune 

fibroblast 

none 


0.0 


0.0 


0.0 


PBMC 
PWM 


0.0 


1.4 


0.0 


Lune 
fibroblast 
TNF alpha 
+ IL-1 beta 


0.0 


0.0 


0.0 


PBMC 
PHA-L 


0.5 


0.0 


1.0 


Lung 

fibroblast 

IL-4 


0.0 


0.0 


0.0 


Ramos (B 
cell) none 


0.0 


1.8 


0.0 


Lung 

fibroblast 

IL-9 


0.0 


0.0 


2.0 


Ramos (B 
cell) 

ionomycin 


0.5 


0.0 


0.0 


Lung 

fibroblast 

IL-13 


0.0 


0.0 


1.0 


B 

lymphocyt 
es PWM 


0.0 


0.0 


1.3 


Lung 
fibroblast 
IFN 
gamma 


0.5 


0.0 


0.0 




B 

lymphocyt 
es CD40L 
and IL-4 


0.0 


0.0 


0.0 


Dermal 
fibroblast 
CCD 1070 
rest 


0.0 


0.0 


0.0 


EOL-1 
dbcAMP 


0.0 


0.0 


0.0 


Dermal 
fibroblast 
CCD 1070 
TNF alpha 


0.0 


0.0 


0.0 


EOL-1 
dbcAMP 
PMA/iono 
mycin 


0.0 


0.0 , 


0.0 


Dermal 
fibroblast 
CCD 1070 
IL-1 beta 


0.3 


0.0 


0.0 


Dendritic 
cells none 


2.1 


3.5 


1.1 


Dermal 
fibroblast 

ir in 

gamma 


0.0 


0.0 


0.0 


Dendritic ; 
cells LPS 


0.0 


0.0 


0.0 


L/vJI 1 1 lcll 

fibroblast 
IL-4 


0.0 


0.0 


0.0 


Dendritic 
cells anti- 

WVIIJ til 1 I 1 

CD40 


0.0 


0.0 


0.0 


IBD 

Colitis 2 


1.3 


3.6 


2.4 


Monocytes 
rest 


0.0 


0.0 


0.0 


IBD 

Crohn's 


0.4 


A A 
0.0 


1 .U 


Monocytes 
LPS 


0.0 


0.0 


A A 

u.u 


L-Oion 


9. 1 
o. 1 


32.3 


56.6 


Macropha 
ges rest 


0.0 


0.0 


4.8 


Lung 


0.5 


0.0 


0.0 


Macropha 
ges LPS 


0.0 


0.0 


1.8 


Thymus 


100.0 


100.0 


100.0 


HUVEC 
none 


0.0 


0.0 


0.0 


Kidney 


0.9 


0.0 


0.0 


HUVEC 
starved 


0.0 


0.0 


1.3 









Table AYI. Panel 5 Islet 



Tissue Name 


Rel. Exp.(%) 
Ag3027, Run 
225051163 


Tissue Name 


Rel. Exp.(%) 
Ag3027, Run 
225051163 


97457 Patient- 
02go_adipose 


0.0 


94709_Donor 2 AM - A adipose 


9.5 


97476_Patient- 
07sk skeletal muscle 


3.2 


94710_Donor 2 AM - B adipose 


0.0 


97477_Patient- 
07ut uterus 


15.0 


9471 I Donor 2 AM - C adipose 


5.3 
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97478_Patient- 
07plj>lacenta 


5.4 


9471 2_Donor 2 AD - A adipose 


11.8 


99167_Bayer Patient 1 


0.0 


94713JDonor 2 AD - B_adipose 


8.7 


97482 Patient- 
08ut_uterus 


0.0 


94714 Donor 2 AD - Cadipose 


Q 1 

y.j 


97483_Patient- 
08pl_placenta 


4.2 


94742_Donor 3 U - 
AJVlesenchymal Stem Cells 


U.U 


97486_Patient- 
09sk skeletal muscle 


0.0 


94743_Donor 3 U - 

B Mesenchymal Stem Cells 


a A 
U.U 


97487 Patient- 
09ut_uterus 


0.0 


94730 Donor 3 AM - A adipose 


O A 
5.U 


97488 Patient- 
09plj>lacenta 


0.0 


9473 1 Donor 3 AM - Badipose 


A A 

U.U 


97492 Patient- 
1 Out uterus 


0.0 


94732_Donor 3 AM - C adipose 


O 1 
5.1 


97493 Patient- 
1 Opl jlacenta 


0.0 


94733_Donor 3 AD - A adipose 


A Cm 

4.9 


97495 Patient- 
11 go adipose 


8.1 


94734 Donor 3 AD - B adipose 


13.3 


97496 Patient- 

1 lsk skeletal muscle 


0.0 


94735 Donor 3 AD - C adipose 


5.1 


97497 Patient- 
1 lututerus 


0.0 


77138 Li ver_HepG2untreated 


A A 

0.0 


97498 Patient- 
1 lpl_placenta 


0.0 


73556_Heart_Cardiac stromal 
cells (primary) 


O.U 


97500_Patient- 
12go adipose 


3.5 


81735_Small Intestine 


22.7 


97501 _Patient- 
12sk_skeletal muscle 


0.0 


72409_Kidney_Proximal 
Convoluted Tubule 


100.0 


Q7S09 Pntipnt- 

12ut_uterus 


0.0 


82685_Small intestine_Duodenum 


6.1 


97503JPatient- 
12pl placenta 


0.0 


90650 AdrenalAdrenocortical 
adenoma 


0.0 


94721_Donor2U- 
A Mesenchymal Stem 
Cells 


0.0 


72410_Kidney_HRCE 


74.7 


94722_Donor 2 U - 
B Mesenchymal Stem 
Cells 


0.0 


72411_Kidney_HRE 


18.2 


94723_Donor 2 U - 
C Mesenchymal Stem 
Cells 


0.0 


73139 UterusJJterine smooth 
muscle cells 


0.0 



General_screening_panel_vl.5 Summary : Ag3027 Expression of the NOV56a gene 
is highest in a sample derived from a lung cancer (CT-30.5). Significant expression is also 
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seen in samples derived from colon cancer and ovarian cancer. Thus, expression of this gene 
could be used to differentiate between this sample and other samples on this panel and as a 
marker to detect the presence of lung, colon, and breast cancers. Furthermore, therapeutic 
modulation of the expression or function of this gene may be effective in the treatment of lung, 
5 colon and breast cancers. 

While expression of this gene is seen predominantly in cancer cell lines, significant 
expression is also seen in fetal kidney (CT=31.8). Furthermore, expression is higher in fetal 
kidney than in adult kidney (CT=35.8). Thus, expression of this gene could be used to 
differentiate between adult and fetal kidney. In addition, the expression in fetal kidney 
10 suggests that this gene product may be involved in the development of the kidney. Therefore, 
therapeutic modulation of the expression or function of this gene may be useful in treating 
disease of the kidney. 

Panel 1.2 Summary: Agl 169 Results from one experiment, Run 129128191, with the 
NOV56a gene are in agreement with Results in Panel L3D and 
15 General_screening_panel_vl.5. A second run, Run 129656838, produces disparate results. 

Panel 1.3D Summary: Agl 169/Ag3027 Two experiments with the same probe and 
primer both show highest expression of the gene NOV56a in the mammary gland (CTs=31). 
Low, but significant levels of expression are also seen in a lung cancer cell line. Thus, 
expression of this gene may be used to differentiate between these samples and other samples 
20 on this panel. 

Panel 2D Summary: Ag3027 Highest expression of the NOV56a gene is seen in a 
gastric cancer. Significant expression is also seen in breast cancer, colon cancer and normal 
kidney. Thus, expression of this gene could be used to differentiate between these samples and 
other samples on this panel and as a marker for the presence of breast, colon and kidney 

25 cancers. A second experiment with the probe/primer set Agl 169 is not included. The amp plot 
indicates that there were experimental difficulties with this run. 

Panel 3D Summary: Agl 169 Expression of the NOV56a gene is restricted to samples 
derived from lung and gastric cancer cell lines (CTs=33-35). Thus, expression of this gene 
could be used to differentiate between this sample and other samples on this panel and as a 

30 marker to detect the presence of lung and gastric cancers. Furthermore, therapeutic modulation 
of the expression or function of this gene may be effective in the treatment of lung and gastric 
cancers. 

Panel 4D Summary: Agl 169/Ag3027 Two experiments with the same probe and 
primer set show expression of the NOV56a gene limited to the thymus (CTs=32-33). Thus, 
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expression of this gene could be used as a marker for thymic tissue. Furthermore, this 
restricted expression suggests that this gene product may play an important role in T cell 
development. Therefore, small molecule therapeutics, or antibody therapeutics designed 
against the protein encoded for by this gene could be utilized to modulate immune function (T 
5 cell development) and be important for organ transplant, AIDS treatment or post 
chemotherapy immune reconstitution. 

Panel 5 Islet Summary: Ag3027 Expression of the NOV56a gene is restricted to a 
sample derived from the kidney (CT=34.9). This expression is consistent with expression in 
Panel 1 .3D. Thus, expression of this gene could be used as a marker for kidney tissue. 

10 NOV57 

Expression of gene NOV57 was assessed using the primer-probe sets Ag303 1 , 
Agl301b and Agl415, described in Tables AZA, AZB and AZC. Results of the RTQ-PCR 
runs are shown in Tables AZD, AZE, AZF and AZG. 

Table AZA. Probe Name Ag3031 



Primers • 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward! 


5 ' -aaaaggtgatgtctggagcat-3 ' 


21 


616 


1216. 


Probe 


TET-5 ' -tgtatgtcatgctctgtgccagccta-3 ' - 
TAMRA 


26 


648 


1217 


Reverse 


5 1 -gatgtctgtgtcgtcaaaagg-3 ■ 


21 


674 


1218 



15 

Table AZB. Probe Name Agl301b 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -aaaaggtgatgtctggagcat-3 ' 


21 


616 


1219 


Probe 


TET-5 ' -tgtatgtcatgctctgtgccagccta-3 1 - ; 
TAMRA 


26 


648 


1220 


Reverse 


5 1 -gatgtctgtgtcgtcaaaagg-3 1 


21 


674 


1221 



Table AZC. Probe Name Agl415 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -aaaaggtgatgtctggagcat-3 1 


21 


616 


1222 


Probe 


TET- 5 * -tgtatgtcatgctctgtgccagccta-3 ' - 
TAMRA 


26 


648 


1223 


Reverse 


5 ' -gatgtctgtgtcgtcaaaagg-3 1 


21 


674 


1224 
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Table AZD. CNSneurodegenerationvl.O 



Tissue Name 


Rel. Exp.(%) 
Ag3031,Run 
211011868 


Rel. Exp.(%) 
Ag3031,Run 
225437445 


Tissue 
Name 


Rel. Exp.(%) 
Ag3031,Run 
211011868 


Rel. Exp.(%) 
Ag3031, Run 
225437445 


AD 1 Hippo 


23.8 


46.7 


Control 
(Path) 3 
Temporal 
Ctx 


6.0 


10.2 


AD 2 Hippo 


21.2 


21.9 


Control 
(Path) 4 
Temporal 
Ctx 


26.4 


28.1 


AD 3 Hippo 


14.2 


11.6 


AD 1 

Occipital 

Ctx 


31.9 


41.8 


f\VJ *-* rlippo 


7 A 


1 n s 


AD 2 

Occipital 

Ctx 

(Missing) 


0 0 


0.0 


AD 5 hippo 


94.0 


94.0 


AD 3 

Occipital 

Ctx 


17.9 


17.4 


AD 6 Hippo 


53.6 


61.6 


AD 4 

Occipital 

Ctx 


12.9 


17.3 


Hippo 


12.7 


17.0 


AD 5 

Occipital 

Ctx 


35.1 


20.9 


Hippo 


14.6 


15.2 


AD 6 

Occipital 

Ctx 


20.9 


27.5 


Control (Tatrri 

V^-VJilLl \J\ dill J 

3 Hippo 


22.2 


7.2 


Control 1 
Occipital 
Ctx 


4.9 


4.9 


AD 1 Temporal 
Ctx 


42.3 


42.9 


Control 2 
Occipital 
Ctx 


46.7 


32.1 


AD 2 Temporal 
Ctx 


78.5 


29.9 


Control 3 
Occipital 
Ctx 


17.8 


19.2 


AD 3 Temporal 
Ctx 


17.9 


20.4 


Control 4 
Occipital 
Ctx 


13.2 


12.1 


AD 4 Temporal 
Ctx 


30.1 


31.2 


Control 
(Path) 1 
Occipital 
Ctx 


49.3 


59.9 
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AD 5 Inf 
Temporal Ctx 


100.0 


94.6 


Control 
(Path) 2 
Occipital 
Ctx 


13.6 


8.8 


AD 5 

SupTemporal 
Ctx 


59.9 


52.5 


Control 
(Path) 3 
Occipital 
Ctx 


6.1 


4.7 


AD 6 Inf 
Temporal Ctx 


75.3 


95.9 


Control 
(Path) 4 
Occipital 
Ctx 


31.2 


21.2 


AD 6 Sup 
Temporal Ctx 


94.6 


100.0 


Control 1 
Parietal Ctx 


10.3 


11.1 


control 1 
Temporal Ctx 


8.5 


7.3 


Control 2 
Parietal Ctx 


56.6 


60.7 


Control 2 
i cmporai 


16.3 


18.8 


Control 3 
Parietal Ctx 


15.4 


15.0 


Control 3 
Temporal Ctx 


13.0 


15.3 


Control 
(Path) 1 
Parietal Ctx 


34.2 


41.8 


Control 4 
Temporal Ctx 


12.8 


16.6 


Control 
(Path) 2 
Parietal Ctx 


15.4 


20.3 


Control (Path) 
1 Temporal Ctx 


26.8 


31.0 


Control 
(Path) 3 
Parietal Ctx 


8.4 


7.0 


Control (Path) 
2 Temporal Ctx 


25.5 


17.8 


Control 
(Path) 4 
Parietal Ctx 


44.1 


31.6 



Table AZE. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Agl301b, 

Run 
165528224 


Rel. Exp.(%) 
Ag3031, Run 
167961982 


Tissue Name 


Rel. Exp.(%) 
Agl301b, 

Run 
165528224 


Rel. Exp.(%) 
Ag3031, Run 
167961982 


Liver 

adenocarcinoma 


18.4 


32.3 


Kidney (fetal) 


10.7 


66.0 


Pancreas 


4.7 


11.9 


Renal ca. 786- 
0 


15.3 


25.5 


Pancreatic ca. 
CAPAN 2 


4.9 


4.9 


Renal ca. 
A498 


11.9 


12.5 


Adrenal gland 


23.0 


11.0 


Renal ca. RXF 

393 


14.9 


18.0 


Thyroid 


7.1 


8.5 


Renal ca. 
ACHN 


8.0 


10.5 
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^aliv^rv crl fi n H 
oaii vai y j^iciiivj 


10 1 

1 I/. 1 


1 3 7 


Renal ca. UO- 
31 


6 0 


1 1 

1.1 


Pitnitarv tilanH 


1 R 2 


9 2 


Renal ca. TK- 
10 


7.4 


18.6 


Brain (fetal) 


24.0 


42.3 


Liver 


11.9 


17.8 


Brain (whole) 


43.8 


19.6 


Liver (fetal) 


20.6 


14.7 


Brain (amygdala) 


20.4 


9.6 


Liver ca. 

(hepatoblast) 

HepG2 


22.2 


28.7 


Brain (cerebellum) 


50.7 


50.3 


Lung 


17.0 


22.8 


Brain 

^HHJJ-JVJC'ClliI|Jl4o ^ 


18.7 


8.1 


Lung (fetal) 


10.4 


26.4 


Brain (substantia 
nigra) 


8.4 


8.9 


Lung ca. 
(small cell) 


9.1 


17.4 


Rrciin (\ Ki <i 1 om i ic\ 
Dldlli ^llldlClJllUi)^ 




1 ft ft 

» 


Lung ca. 

icmcill <t* 1 1 \ 
^MI 1U1 1 ) 

NCI-H69 


1 <; 
1 


4 ft 


V_*t/1 t^L/i dl V—^UI it A 


S 3 




Lung ca. 

^z>. L/t/ii vai 

SHP-77 


18 4 


57 4 


Spinal cord 


9.6 


7.0 


T lino r*?i 

(large 

cell)NCI-H460 


50.3 


4.2 


glio/astro U87-MG 


12.1 


16.5 


T liner c f nnr* — 
L^Ullg ta. ^IHJIl 

sm.cell) A549 


8.0 


28.1 


glio/astro U-118- 
MG 


15.5 


13.4 


Lung ca. (non- 
s celH NCI- 
H23 


10.2 


18.2 


astrocytoma 
SW1783 


6.9 


11.1 


Lung ca. (non- 
s.cell) HOP-62 


20.3 


33.0 


neuro*; met SK-N- 
AS 


11.4 


10.9 


Lung ca. (non- 
s.cl) NCI- 
H522 


9.7 


23.5 


astrocytoma SF- 
539 


12.5 


25.5 


Lung ca. 
(squam.) SW 
900 


4.0 


8.5 


astrocytoma SNB- 
75 


10.9 


18.9 


Lung ca. 
(squam.) NCI- 
H596 


1.6 


4.8 


glioma SNB-19 


32.1 


25.3 


Mammary 
gland 


36.9 


22.5 


glioma U251 


100.0 


100.0 


Breast ca.* 
(pl.ef) MCF-7 


5.4 


6.7 


glioma SF-295 


18.4 


46.0 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


16.7 


7.9 
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1 1 11 Ifnip 



Heart (fetal) 


2.3 


6.2 


Breast ca.* 
(pl.ef) T47D 


13.2 


46.7 


Heart 


8.4 


8.2 


Breast ca. BT- 
549 


9.9 


6.6 


Skeletal muscle 
(fetal) 


6.3 


24.1 


Breast ca. 
MDA-N 


1.6 


11.9 


Skeletal muscle 


20.2 


7.9 


Ovary 


3.0 


5.1 


doi ie mdrrow 


9 1 £ 

zi.o 


Z D.Z 


Ovarian ca. 
OVCAR-3 




1 1 A 

iz.U 


Thymus 


18.2 


38.4 


Ovarian ca. 
OVCAR-4 


4.7 


8.8 


opieen 


ZO.O 


14. U 


Ovarian ca. 
OVCAR-5 


o.3 


42.3 


Lymph node 


42.6 


23.7 


Ovarian ca. 
OVCAR-8 


2.8 


0.9 


v^oioreciai 


1 o.o. 


1 A O 


Ovarian ca. 
IGROV-1 


Z.o 


1 1 .7 


Stomach 


37.4 


5.9 


Ovarian ca.* 
(ascites) SK- 
OV-3 


10.0 


29.9 


Small intestine 


36.6 


14.2 


Uterus 


38.4 


17.4 


Colon ca. SW480 , 


3.7 


4.5 


Placenta 


11.1 


9.4 


Colon ca.* 

SW620(SW480 

met) 


3.8 


24.0 


Prostate 


18.6 


8.5 


Colon ca. HT29 


1.6 


4.6 


Prostate ca.* 
(bone met)PC- 
3 


6.9 


14.1 


Colon ca. HCT- 
116 


3.0 


9.9 


Testis 


45.1 


26.6 


Colon ca. CaCo-2 


2.1 • 


5.1 


Melanoma 
Hs688(A).T 


0.8 


1.2 


Colon ca. 
tissue(OD03866) 


5.4 


3.6 


Melanoma* 
(met) 

Hs688(B).T 


3.8 


3.4 


Colon ca. HCC- 
2998 


6.2 


15.6 


Melanoma 
UACC-62 




8 0 


Gastric ca.* (liver 
met) NCI-N87 


20.0 


13.3 


Melanoma 
M14 


28.5 


9.9 


Bladder 


8.4 


12.2 


Melanoma j 
LOX IMVI 


0.7 


8.1 


Trachea 


13.6 


6.4 


Melanoma* 
(met) SK- 
MEL-5 


2.2 


2.9 


Kidney 


10.4 23.8 


Adipose 


8.1 


12.3 
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Table AZF. Panel 2.2 



Tissue Name 


Kel. JLxp.( /o) 
Agl301b, Run 
173859869 


Tissue Name 


Kel. H,xp.(Vo) 
Agl301b, Run 
173859869 


Normal Colon 


39.2 


Kidney Margin 
(OD04348) 


79.0 


Colon cancer 
(OD06064) 


6.3 


Kidney malignant 
cancer fOD06204B"> 


6.0 


Colon Margin 
(OD06064) 


13.0 


Kidney normal adjacent 
tissue (OD06204E > > 


15.4 


Colon cancer 
(OD06159) 


0.0 


Kidney Cancer 
(OD04450-0H 


24.5 


Colon Margin 
(OD06159) 


27.0 


Kidney Margin 
(OD04450-03 1 


23.7 


Colon cancer 
(0006297-04^ 


2.5 


Kidney Cancer 


3.0 


Colon Margin 
fOD06297-015i 


39.8 


Kidney Margin 
8170614 


38.2 


CC Gr.2 ascend colon 


3.8 


Kidney Cancer 
901 0^70 


8.0 


CC Margin (OD03921) 


4.4 


ICiHnpv lVTarcrin 

9010321 


10.8 


Colon cancer metastasis 

WUIIVVI UIvIUljIUjI J 

(OD06104) 


14.3 


8120607 


15.2 


Lung Margin 
(OD06104) 


17.2 


KiHnev TVfarpin 
8120608 


3.8 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Uterus 


33.7 


Lung Margin 
(OD04451-02) 


10.2 


Uterine Cancer 06401 1 


33.9 


Normal Prostate 


30.6 


Normal Thyroid 


2.9 


Prostate Cancer 
(OD04410) 


7.6 


Thyroid Cancer 064010 


4.4 


Prostate Margin 
(OD04410) 


25.5 


Thyroid Cancer 
A302152 


29.9 


Normal Ovary 


25.0 


Thyroid Margin 
A302153 


7.3 


Ovarian cancer 
(OD06283-03) 


8.1 


Normal Breast 


37.6 


Ovarian Margin 
(OD06283-07) 


34.4 


Breast Cancer 
(OD04566) 


21.6 


Ovarian Cancer 064008 


46.3 


Breast Cancer 1024 


100.0 


Ovarian cancer 
(OD06145) 


34.4 


Breast Cancer 
(OD04590-01) 


25.2 


Ovarian Margin 
(OD06145) 


52.9 


Breast Cancer Mets 
(OD04590-03) 


35.6 
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Ovarian cancer 
(OD06455-03) 


11.6 


Breast Cancer 
Metastasis (OD04655- 
05) 


45.1 


Ovarian Margin 
(OD06455-07) 


18.8 


Breast Cancer 064006 


21.2 


Normal Lung 


22.8 


Breast Cancer 9100266 


26.6 


Invasive poor diff. lung 
adeno (ODO4945-01 


1 8 7 

J. o. / 


Rrf*«Qt K/farcrin 01007/^ 

DlCdbl IVidlglll 7 1UUZOJ 




Lung Margin 
(ODO4945-03) 


1 ^ 9 

1J.7 


DICclDl V^-dJlCCr AZU"U ID 


O. 1 


Lung Malignant Cancer 
(OD03126) 


11.7 


Breast Margin 
A2090734 


35.8 


Lung Margin 
(OD03126) 


12.2 


Breast cancer 
(OD06083) 


49.3 


Lung Cancer 
(OD05014A) 


9.0 


Breast cancer node 
metastasis (OD06083) 


25.9 


Lung Margin 
(OD05014B) 


35.4 


Normal Liver 


36.3 


Lung cancer (OD06081) 


23.7 


Liver Cancer 1 026 


2.5 


Lung Margin 
(OD06081) 




reiver cancer i \)Lj 




Lung Cancer 
(OD04237-01) 


13.3 


Liver Cancer 6004-T 


30.1 


Lung Margin 
(OD04237-02) 


28.5 


Liver Tissue 6004-N 


27.7 


Ocular Melanoma 
Metastasis 


11.5 


Liver Cancer 6005-T 


6.7 


Ocular Melanoma 
Margin (Liver) 


27.0 


Liver Tissue 6005-N 


17.4 


Melanoma Metastasis 


21.6 


Liver Cancer 064003 


32.3 


Melanoma Margin 
(Lung) 


14.7 


Normal Bladder 


13.2 


Normal Kidney 


17.8 


Bladder Cancer 1023 


23.0 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


50.3 


Bladder Cancer 
A302173 


90 O 


Kidney Margin 
(OD04338) 


24.3 


Normal ^tom?ir*fi 

livl 111 CI I O IUI 1 1 Cl^ 1 1 


oy.D 


Kidney Ca Nuclear 
grade 1/2 (OD04339) 


62.0 


Gastric Cancer 
9060397 


5.7 


Kidney Margin 
(OD04339) 


16.3 


Stomach Margin 
9060396 


17.7 


Kidney Ca, Clear cell 
type (OD04340) 


7.3 


Gastric Cancer 
9060395 


19.6 


Kidney Margin 
(OD04340) 


10.7 


Stomach Margin 
9060394 


42.6 


Kidney Ca, Nuclear 


4.6 


Gastric Cancer 064005 


7.5 
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jgrade 3 (OD04348) 



Table AZG. Panel 4D 



Tissue 
Name 


Rel. 

; Ex P .(%) 

j Agl301b, 

Run 
138983163 


Rel. 
Exp.(%) 
Agl415, 
Run 
138642033 


Rel. 

j Exp.(%) 
Ag3031, 
Run 
162426783 


Tissue 
Name 


Rel. 
Exp.(%) 
Agl301b, 

Run 
138983163 


Rel. 
Exp.(%) 
Agl415, 
Run 
138642033 


Rel. 
Exp.(%) 
Ag3031, 
Run 
162426783 


Secondary 
Thl act 


17.0 


25.3 


13.8 


HUVEC 
IL-lbeta 


11.5 


13.0 


11.3 


Secondary 
Th2act 


22.7 


18.9 


20.0 


HUVEC 

IFN 

gamma 


28.3 


24.7 


\ 20.2 

i 


Secondary 
Trl act 


32.3 


26.4 


20.0 


HUVEC 
TNF alpha 
+ IFN 
gamma 


3.2 


6.6 


12.3 


Secondary 
Thl rest 


22.2 


20.3 


12.0 


HUVEC 
TNF alpha 
+ IL4 


11.7 


9.7 


9.7 


Secondary 
Th2 rest 


42.6 


37.9 


21 0 


HUVEC 
IL-11 


1 n 1 


o.Z 


1 1 A 
J 1 .0 


Secondary 
Trl rest 


27.9 


27.5 


26.8 


Lung 
Microvasc 
ular EC 
none 


33.9 


28.7 


25.2 


Primary 
Thl act 


33.4 


29.9 

1 


19.1 


Lung 
Microvasc 
ular EC 
TNFalpha 
+ IL-lbeta \ 


23.2 


19.1 


24.8 


Primary 
1 nz act 


28.1 


41.2 


13.3 


Microvasc 
ular 

Dermal EC 
none 


41.8 


45.1 


25.7 


Primary 
Trl act 


49.0 


52.5 


20.9 


Microsvas 
ular 

Dermal EC 
TNFalpha 
+ IL-lbeta 


27.5 


40.3 


22.4 


Primary 
Thl rest 


80.7 


87.1 


80.7 


Bronchial 
epithelium 
TNFalpha 
-MLlbeta 


- — *-" " 
20.3 


32.3 


22.1 


Primary 
Th2 rest 


67.4 


68.8 


64.2 


Small 

airway 

epithelium 


7.3 


3.3 


4.8 
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none 








Primary 
Trl rest 


46.0 


50.3 


54.7 


Small 
airway 
epithelium 
TNFalpha 
+ IL-1 beta 

1 ±1—/ 1 l/lslci 


34.4 


35.1 


26.8 


CD45RA 
CD4 

lvmohocvt 
e act 


12.9 


8.0 


16.5 


Coronery 
artery 
SMC rest 


7.9 


7.9 


12.3 


CD45RO 
CD4 

lymphocyt 
e act 


33.4 


44 8 


22 4 


Coronery 

artery 

SMC 

TNFalpha 
+ IL-lbeta 


7 6 


10 7 

IV./ 


R 1 

O. 1 


CD8 

Ivmnhocvt 
e act 


22.8 


21 0 

■L- > - V / 


1 8 4 


Astrocytes 
rest 


7 0 


1 U.J 


1 7 7 


Secondary 
CD8 

Ivmnhnrvt 

e rest 


22.2 


25.5 


25.0 


Astrocytes 
TNFalpha 
+ IL-lbeta 


8.7 


5.8 


13.3 


Secondary 
CD8 

lymphocyt 
e act 


19.3 


21.8 


22.5 


KU-812 

(Basophil) 

rest 


40.1 


35.4 


48.0 


CD4 

lymphocyt 
e none 


41.5 


42.3 


34.6 


KU-812 
(Basophil) 
PMA/iono ! 
mycin 


57.8 


61.1 


78.5 


2ry 

Thl/Th2/T 
rl anti- 
CD95 
CH11 


65.1 


54.7 


41.2 


CCD 1106 
ytes) none 


3 8 


8 0 


ft ft 


LAK cells 
rest 


28.7 j 


37.1 


32.3 


CCD1106 1 

(Keratinoc 

ytes) 

TNFalpha 
+ IL-lbeta 


27.4 


24.5 


5.3 


LAK cells 
IL-2 


38.7 1 


49.0 


36.3 


Liver 
cirrhosis 


17.0 


9.4 


5.0 


LAK cells 

IL-2+IL- 

12 


26.8 


27.4 


26.8 


Lupus 
kidney 


24 A 


23.8 


9.5 


LAK cells j 
IL-2+IFN : 
gamma j 


43.5 


45.4 


42.9 


NCI-H292 
none 


38.7 


49.3 


45.4 
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LAK cells 
IL-2+ IL- 
18 


26.2 


25.2 


35.6 


NCI-H292 

II A 


58.6 


51.4 


46.7 


LAK cells 
PMA/iono 
mycin 


8.3 


8.6 


3.3 


NCI-H292 

TT O 


563 


46.0 


54.3 


NK Cells 
IL-2 rest 


28.7 


35.8 


32.8 


NCI-H292 
IL-13 


30.4 


31.9 


23.3 


Two Way 
MLR 3 
day 


42.3 


49.0 


46.3 


NCI-H292 

IFN 

gamma 


16.7 


28.3 


29.1 


Two Way 
MLR 5 
day 


17.9 


16.2 


13.5 


HPAEC 
none 


15.7 


26.4 


19.1 


Two Way 
MLR 7 
day 


14.7 


12.6 


14.4 


HPAEC 
TNF alpha 
+ IL-1 beta 


23.8 


32.5 


27.9 


PBMC rest 


21.9 


29.9 


19.2 


Lune 

fibroblast 

none 


13.0 


11.0 


11.9 


PBMC 
PWM 


66.9 


53.2 


50.3 


Luns 
fibroblast 
TNF alpha 
+ IL-1 beta 


8.7 


7.2 


13.2 


PBMC 
PHA-L 


35.6 


46.7 


23.3 


Lung 

fibroblast 

IL-4 


5.6 


12.0 


10.5 


Ramos (B | 
cell) none 


• 23.5 


33.2 


16.5 


Lung 

fibroblast 

IL-9 


8.5 


7.4 


15.3 


Ramos (B 
cell) 

ionomycin j 


53.2 


53.2 ; 


49.3 


Lung 

fibroblast 

IL-13 


20.3 


16.3 


8.9 


B 

lymphocyt j 
es PWM 


29.9 j 


36.3 


34.2 


Lung 
fibroblast 
IFN 
gamma 


11.1 


10.4 


13.0 


B 

lymphocyt j 
esCD40L 
and IL-4 ! 


jj.Z 


jO.O 


JJ.O 


Dermal 
fibroblast : 
CCD 1070 
rest 


-IT ft 

47.0 


1 1 .3 


13.8 


EOL-1 
dbcAMP | 


18.9 1 


14.8 


12.8 


Dermal 
fibroblast 
CCD1070 
TNF alpha 


45.7 


53.6 


55.1 


EOL-1 

dbcAMP 

5 MA/iono 


30.4 


29.1 


25.5 


Dermal 
fibroblast 
CCD 1070 


16.8 


17.3 


15.4 
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t, rro t p rii .1 ;y ... m ^tn? 



mycin 








IL-1 beta 






Dendritic 
cells none 


14.9 


10.0 


13.7 


Dermal 
fibroblast 
IFN 
gamma 


5.8 


8.8 


6.5 


cells LPS 


15.7 


5.8 


8.7 


Dermal 

fibroblast 

IL-4 


17.6 


20.0 


13.6 


Dendritic 
cells anti- 
CD40 


12.7 


17.7 


15.2 


IBD 

Colitis 2 


2.0 


1.3 


1.1 


Monocytes 
rest 


35.6 


27.5 


36 f> 


IBD 

Crohn's 


3.0 


1.4 


1 A 


Monocytes^ 

I TIC 1 

LPS 


34.6 


43.8 


25.0 


Colon 


33.0 


26.2 


34.6 


Macropha 
ges rest 


19.2 | 


16.6 


17.2 


Lung 


6.8 


13.5 


7.5 


Macropha 
ges LPS 


17.9 


16.2 


6.7 


Thymus 


100.0 


100.0 j 


55.5 


HUVEC 
none 


9.8 


13.2 


17.3 


Kidney 


87.1 


79.0 


100.0 


HUVEC 
starved 


27.2 


27.5 


28.3 









CNS_neurodegeneration_vl.O Summary: Ag3031 Two experiments with the same 
probe and primer set produce results that are in excellent agreement. The NOV57 gene, a 
kinase homolog, is expressed more highly in the temporal cortex of brains from Alzheimer's 
disease patients than in the temporal cortex of normal brains unaffected by Alzheimer's 
disease. Kinases have been shown to play a role in the pathogenesis of Alzheimer's disease. 
The dysregulation of this kinase, NOV57, indicates an active role for this pathway in disease 
pathogenesis. Thus, inhibitors of this gene product, by modulating this pathway, may have 
utility in the treatment of Alzheimer's disease and other neurodegenerative diseases. 

References: 

Morishima Y, Gotoh Y, Zieg J, Barrett T, Takano H, Flavell R, Davis RJ, Shirasaki Y, 
Greenberg ME. Beta-amyloid induces neuronal apoptosis via a mechanism that involves the c- 
Jun N-terminal kinase pathway and the induction of Fas ligand. J Neurosci 2001 Oct 
l;21(19):7551-60 

Elevated levels of beta- Amyloid (Abeta) are present in the brains of individuals with 
either the sporadic or familial form of Alzheimer's disease (AD), and the deposition of Abeta 
within the senile plaques that are a hallmark of AD is thought to be a primary cause of the 
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cognitive dysfunction that occurs in AD. Recent evidence suggests that Abeta induces 
neuronal apoptosis in the brain and in primary neuronal cultures, and that this Abeta-induced 
neuronal death may be responsible in part for the cognitive decline found in AD patients. In 
this study we have characterized one mechanism by which Abeta induces neuronal death. We 
found that in cortical neurons exposed to Abeta, activated c-Jun N-terminal kinase (JNK) is 
required for the phosphorylation and activation of the c-Jun transcription factor, which in turn 
stimulates the transcription of several key target genes, including the death inducer Fas ligand. 
The binding of Fas ligand to its receptor Fas then induces a cascade of events that lead to 
caspase activation and ultimately cell death. By analyzing the effects of mutations in each of 
the components of the JNK-c-Jun-Fas ligand-Fas pathway, we demonstrate that this pathway 
plays a critical role in mediating Abeta-induced death of cultured neurons. These findings 
raise the possibility that the JNK pathway may also contribute to Abeta-dependent death in 
AD patients 

Panel 1.3D Summary: Agl301b/Ag3031 Two experiments with the same probe and 
primer set produce results that are in excellent agreement, with highest expression of the 
NOV57 gene in a brain cancer cell line (CTs=29-30). Overall, this gene is expressed at 
moderate to low levels in all the samples in this panel. 

This gene has low to moderate expression in several endocrine/metabolic-related 
tissues, including adipose, pancreas, liver, skeletal muscle and thyroid. Thus, a therapeutic 
modulator to this gene and/or gene-product may be useful in the treatment of diseases which 
affect the endocrine system. 

Panel 2.2 Summary: Agl301b The NOV57 gene is expressed in breast cancer at a 
moderate level. It is also expressed at a higher level in normal gastric, prostate and colon 
tissues compared to the adjacent tumors. Hence, inhibition of this drug might be used for 
treatment of breast cancer. It could also be used as a diagnostic marker for gastric, prostate and 
colon cancers. 

Panel 4D Summary: Agl301b/Agl415/Ag3031 Three experiments with the same 
probe and primer sets produce results that are in excellent agreement, with highest expression 
of the NOV57 gene in the thymus and kidney. This gene is also expressed at higher levels in 
resting Thl and Th2 lymphocytes than in activated Thl and Th2 lymphocytes. Therefore, 
small molecule agonists of the gene product may be useful as therapeutics to reduce the 
activation of Thl and Th2 cells and thus reduce symptoms in patients with autoimmune and 
inflammatory diseases, such as Crohn's disease, ulcerative colitis, multiple sclerosis, chronic 
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obstructive pulmonary disease, asthma, emphysema, rheumatoid arthritis, lupus 
erythematosus, or psoriasis. 



NOV58a and NOV58b: Gap Junction Beta-5 (connexin) 

Expression of gene NOV58a and variant NOV58b was assessed using the primer-probe 
set Ag2914, described in Table BAA. 



Table BAA. Probe Name Ag2914 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 * -aacactgtggactgcttcatct-3 * 


22 


517 


1225 


Probe 


TET- 5 1 -ccaaacccactgagaagacgatcttca- 
3 1 -TAMRA 


27 


539 


1226 


Reverse 


5 ' -atacacaagcatgaggtgatga-3 ' 


22 


578 


1227 



CNS__neurodegeneration_vl.O Summary: Ag2914 The amp plot indicates that there 
are experimental difficulties with this run (data not shown). 

Panel 13D Summary: Ag2914 Expression of this gene is low/undetectable (CTs >35) 
across all of the samples on this panel (data not shown). 

Panel 2D Summary: Ag2914 Expression of this gene is low/undetectable (CTs >35) 
across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag2914 The amp plot indicates that there are experimental 
difficulties with this run (data not shown). 

BB. CG56633-01: TRANSLATION INITIATION FACTOR 5 

Expression of gene CG56633-01 was assessed using the primer-probe set Ag2900, 
described in Table BBA. Results of the RTQ-PCR runs are shown in Tables BBB, BBC, BBD 
and BBE. 



Table BBA. Probe Name Ag2900 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gctaagttccttgatgcttctg-3 * 


22 


184 


1228 


Probe 


TET- 5 1 -caaaacttgattaccgtcgatgtgca-3 1 - 
TAMRA 


26 


209 


1229 


Reverse 


5 * -ccaccagaatgtcaaagagtgt-3 1 


22 


238 


1230 



Table BBB. CNS_neurodegeneration_vl.O 



Tissue Name 



ReL Exp.(%) Ag2900, j Tissue Name [ ReL Exp.(%) Ag2900, 
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Run 206485415 


AD 1 Hippo 


12.9 


control atn ) j 
Temporal Ctx 


7.0 


AJJ z Hippo 


33.9 


Control (Path) 4 
Temporal Ctx 


18.6 


AD 3 Hippo 


7.4 


AD 1 Occipital Ctx 


14.5 


AD 4 Hippo 


11.2 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


68.8 


AD 3 Occipital Ctx 


8.0 


AD 6 Hinno 


0 1 .0 


AD 4 Occipital Ctx 


f 16.6 


Control 2 Hippo 


36.1 


AD 5 Occipital Ctx j 36.3 


Control 4 Hippo 


21.0 


AD 6 Occipital Ctx 


25.7 


Control (Path) 3 
Hippo 


7.7 


Control 1 Occipital 
Ctx 


6.8 


AD 1 Temporal Ctx 


19.6 


Control 2 Occipital 
Ctx 


65.5 


AD 2 Temporal Ctx 


27.5 


Control 3 Occipital 
Ctx 


14.6 


AD 3 Temporal Ctx 


6.2 


Control 4 Occipital 
Ctx 


12.3 


AD 4 Temporal Ctx 


17.6 


Control (Path) 1 
Occipital Ctx 


100.0 


AD 5 Inf Temporal 
Ctx 


59.0 


Control (Path) 2 
Occipital Ctx 


6.5 


AD 5 Sup Temporal 
ctx 


42.0 


Control (Path) 3 
Occipital Ctx 


5.1 


AD 6 Inf Temporal 
Ctx 


65.1 

[ 


Control (Path) 4 
Occipital Ctx 


13.2 


AD 6 Sup Temporal 
ctx 


4g 2 JControl 1 Parietal 
jCtx 


8.0 


Control 1 Temporal 

LA 


g q JControl 2 Parietal 

jCtx 


24.5 


Control 2 Temporal ; 
Ctx 


52.9 ' 
< 


Control 3 Parietal 

Ctx 


18.3 


Control 3 Temporal 
Ctx 


18.2 | 


Control (Path) 1 
^arietal Ctx 


86.5 


Control 3 Temporal 
Ctx 


9.0 Jj 


Control (Path) 2 
^arietal Ctx 


17.4 


Control (Path) 1 
Temporal Ctx 


57.0 : j 


Control (Path) 3 j 
^arietal Ctx 


9.4 


Control (Path) 2 
Temporal Ctx 


38.2 j| 


:ontrol (Path) 4 
'arietal Ctx 


31.6 
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Table BBC. Panel 1.3D 



Tissue Name 


Kei. J^xp.( /o) Agz^UU, 
Run 159996755 


Tissue Name 


| Rel. Exp.(%) Ag2900, 


Liver adenocarcinoma 


22.7 


Kidney ffetaH 


14 1 


Pancreas 


1.8 


j Renal ca. 786-0 


15.1 


2 


2.8 


Renal ca. A498 


23.7 


Adrenal gland 


22.1 


Renal ca RXF ^Q^ 


9 7 


Thyroid 


3.6 


Renal ca. ACHN 


3.0 


^ n 1 1 \/ z\ tt\/ ct\arii~\ 
oailvcliy glallU 


z.V 


Renal ca. UO-31 


6.7 


Pituitary gland 


5.8 


Renal ca. TK-10 


3.6 


Brain (fetal) 


10.7 


Liver 


6.0 


Brain (whole) 


12.3 


Liver (fetal) 


12.5 


Brain (amygdala) 


8.4 


Liver ca. 

(hepatoblast) HepG2 


11.8 


Brain (cerebellum) 


9.3 


Lung 


15.0 


Brain (hippocampus) 


61.6 


Lung (fetal) 


6.9 


Brain (substantia nigra) 


5.1 


Lung ca. (small cell) 
LX-1 


11.1 


Brain (thalamus) 


10.4 


Lung ca. (small cell) 
NCI-H69 


12.3 


Cerebral Cortex 


22.4 


Lung ca. (s.cell var.) 
SHP-77 


30.4 


Spinal cord 


4.7 


Lung ca. (large 
celI)NCI-H460 


28.3 


glio/astro U87-MG 


26.1 


Lung ca. (non-sm. 
cell) A549 


15.7 


elio/astro U- 1 1 8-Mfi 


Q4 0 


Lung ca. (non-s.cell) 
NCI-H23 




astrocytoma SW1783 




Lung ca. (non-s.cell) 
HOP-62 


A O 

4.z 


neuro** met SK-N-AS 


_J VJ.VI 


Lung ca. (non-s.cl) 
NCI-H522 




astrocytoma 




Lung ca. (squam.) 
SW 900 


Q A 

o.4 


astrocytoma SNB-75 


10.9 


Lung ca. (squam.) 
NCI-H596 


2.8 


glioma SNB-19 


10.2 


Mammary gland 


10.1 


glioma U251 


3 3 


Breast ca.* (pLef) 
MCF-7 


21.8 


glioma SF-295 


2.9 


Breast ca.* (pl.ef) 
V1DA-MB-231 


100.0 


Heart (fetal) 


2.1 ;| 


Breast ca.* (pl.ef) 
f47D 


5.3 


Heart 


3.1 fBreast ca. BT-549 


40.9 
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Skeletal muscle (fetal) 


3.0 


Breast ca. MDA-N 


12.0 


Skeletal muscle 


2.1 


Ovary 


3.0 


Bone marrow 


8.1 


Ovarian ca. OVCAR- 
3 


7 3 


Thymus 


4.0 


Ovarian ca. OVCAR- 
4 


0 9 


Spleen 


5.3 


Ovarian ca. OVCAR- 

5 


5.9 


Lymph node 


4.0 


Ovarian ca. OVCAR- 
8 


8.3 


Colorectal 


7.6 


Ovarian ca. IGROV- 
1 


2.1 


Stomach 


2.7 


Ovarian ca.* (ascites) 
SK-OV-3 


11.3 


oiuaii iiucsLine 


5.7 


Uterus 


3.0 


Colon ca. SW480 


11.0 


Placenta 


12.6 


Colon ca.* 

o w uzuyo w tow mei ) 


7.2 jprostate 


4.3 


Colon ca. HT29 


y j jProstate ca.* (bone 
|met)PC-3 


24.7 


Colon ca. HCT-116 


23.0 (Testis 


7.7 


Colon ca. CaCo-2 


. _ _ jMelanoma 
}Hs688(A).T 


11.3 


Colon ca. 
tissue(OD03866) 


12© jMelanoma* (met) 
|Hs688(B).T 


5.3 


Colon ca. HCC-2998 


33.4 jMelanoma UACC-62 


4.2 


Gastric ca.* (liver met) 
NCI-N87 


21.9 jMelanoma M 14 


4.4 


Bladder 


11.8 | 


Melanoma LOX 
IMVI 


28.7 


Trachea 


10.2 


Vlelanoma* (met) 
SK-MEL-5 


17.9 


Kidney 


4.3 


Adipose ; 


13.1 



Table BBD. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2900, Run 
159996787 


Tissue Name 


Rel. Exp.(%) 
A g 2900, Run 
159996787 


Normal Colon 


71.2 


Kidney Margin 
8120608 


3.0 


CC Well to Mod Diff 
(OD03866) 


24.5 


Kidney Cancer 
8120613 


2.0 


CC Margin (ODQ3866) 


20.6 


Kidney Margin 
8120614 


2.5 
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CC Gr.2 rectosigmoid 
(OD03868) 


47.3 


Kidney Cancer 
9010320 


7.1 


CC Margin (OD03868) 


6.0 


Kidney Margin 
9010321 


5.0 - 


CC Mod Diff (ODO3920) 


39.2 


Normal Uterus 


5.3 


CC Margin (ODO3920) 


21.0 


Uterus Cancer 06401 1 


16.5 


CC Gr.2 ascend colon 
(OD03921) 


69.3 


Normal Thyroid 


8.2 


CC Margin (OD03921) 


16.0 


Thyroid Cancer 
064010 


15.0 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


41.8 


Thyroid Cancer 
A302152 


7.7 


Liver Margin (ODO4309) 


40.6 


Thyroid Margin 
A302153 


1 1.0 


Colon mets to lung 
(OD04451-01) 


14.3 


Normal Breast 


12.7 


Lung Margin (OD0445 1 - 
02) 


15.0 


Breast Cancer 
(OD04566) 


21.3 


Normal Prostate 6546-1 


11.7 


Breast Cancer 
(OD04590-01) 


63.7 


Prostate Cancer 
(OD04410) 


40.1 


Breast Cancer Mets 
(OD04590-03) 


48.0 


Prostate Margin 
(OD04410) 


32.3 


Breast Cancer 

Metastasis 

(OD04655-05) 


32.1 


Prostate Cancer 
(OD04720-01) 


26.4 


Breast Cancer 064006 


21.8 


Prostate Margin 
(OD04720-02) 


35.8 


Breast Cancer 1 024 


6.3 


Normal 1 una 061010 

1 i Ul 111CI1 L/ LI 1 1 LI Uu lulu 


46 7 


Breast Cancer 
9100266 


19 0 


Lung Met to Muscle 
(OD04286) 


34.6 


Breast Margin 
9100265 


14.4 


Muscle Margin 
(OD04286) 


12.4 


Breast Cancer 
A209073 


37.1 


Lung Malignant Cancer 
(OD03126) 


18.8 


Breast Margin 
A2090734 


14.4 


Lung Margin (OD03126) 


16.7 


Normal Liver 


14.1 


Lung Cancer (OD04404) 


20.9 


Liver Cancer 064003 


20.2 


Lung Margin (OD04404) 


15.3 


Liver Cancer 1 025 


7.8 


Lung Cancer (OD04565) 


15.8 


Liver Cancer 1026 


5.6 


Lung Margin (OD04565) 


10.3 


Liver Cancer 6004-T 


6.0 


Lung Cancer (OD04237- 
01) 


29.1 


Liver Tissue 6004-N 


6.6 
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Lung Margin (OD04237- 
02) 


29.5 


Liver Cancer 6005-T 

i 


6.6 


Ocular Mel Met to Liver 
(ODO4310) 


17.0 


Liver Tissue 6005-N 


6.7 


Liver Margin (ODO4310) 


23.8 


Normal Bladder 


54.7 


Melanoma Mets to Lung 
(OD04321) 


16.7 


Bladder Cancer 1023 


7.6 


Lung Margin (OD04321) 


20.6 


Bladder Cancer 
A302173 


23.2 


Normal Kidney 


16.6 


Bladder Cancer 
(OD047 18-01) 


100.0 


Kidney Ca, Nuclear grade 
2 (OD04338) 


15.8 


Bladder Normal 
Adjacent (OD04718- 
03) 


30.4 


Kidney Margin 
(OD04338) 


11.5 


Normal Ovary 


2.3 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


10.2 


Ovarian Cancer 
064008 


30.6 


Kidney Margin 
(OD04339) 


15.9 


Ovarian Cancer 
(OD04768-07) 


37.6 


Kidney Ca, Clear cell 
type (OD04340) 


28.7 


Ovary Margin 
(OD04768-08) 


14.8 


Kidney Margin 
(OD04340) 


15.5 


Normal Stomach 


17.9 


Kidney Ca, Nuclear grade 
3 (OD04348) 


12.3 


Gastric Cancer 
9060358 


10.0 


Kidney Margin 
(OD04348) 


16.0 


Stomach Margin 
9060359 


23.0 


Kidney Cancer 
(OD04622-01) 


12.5 


Gastric Cancer 
9060395 


35.8 


Kidney Margin 
(OD04622-03) 


3.6 


Stomach Margin 
9060394 


27.5 


Kidney Cancer 
(OD04450-01) 


9.5 


Gastric Cancer 
9060397 


66.0 


Kidney Margin 
(OD04450-03) 


13.0 


Stomach Margin 
9060396 


18.0 


Kidney Cancer 8120607 


3.1 


Gastric Cancer 
064005 


62.0 



Table BBE. Panel 4D 





Rel. Exp.(%) 




Rel. Exp.(%) 


Tissue Name 


Ag2900, Run 


Tissue Name 


Ag2900, Run 




159996820 




159996820 


Secondary Thl act 


68.3 


HUVEC IL-lbeta 


10.5 


Secondary Th2 act 


67.8 


HUVEC 1FN gamma 


15.5 
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Secondary Trl act 


73.2 


1 HUVEC TNF alpha + IFN 

gul 1 1 1 lid 


16.0 


Secondary Thl rest 


8.2 


HUVEC TNF alpha + IL4 


18.8 


ocv^vji iucii y i iiz, rest 


1 o.o 


inn t r -1 tt 1 1 

HUVEC IL-l l 


13.1 


Secondary Trl rest 


10.1 


Lung Microvascular EC 
none 


18.8 


Primary Thl act 


75.8 


jLung Microvascular EC 
TNFalpha + IL-l beta 


25.2 


Primary Th2 act 


60.7 


; Microvascular Dermal EC 
none 


30.8 


Primary Trl act 


74.7 


Microsvasular Dermal EC 
TNFalpha 4- IL-l beta 


27.9 


Primary Thl rest 


42.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


16.8 


Primary Th2 rest 


29.3 


Small airway epithelium 
none 


11:7 


Primary Trl rest 


16.3 


Small airway epithelium 
TNFalpha + IL-l beta 


79.0 


LU^jKA LLJ4 

lymphocyte act 


34.4 


Coronery artery SMC rest 


19.2 


lymphocyte act 


53.6 


Coronery artery SMC 
TNFalpha + IL-l beta 


7.6 


y^iJc* lympnocyte act 


35.8 


Astrocytes rest 


8.6 


Secondary CD8 
lympnocyte rest 


39.0 


Astrocytes TNFalpha + 
IL-l beta 


7.2 


occuiiuary \^lJ(S 
lymphocyte act 


33.9 


KU-812 (Basophil) rest 


7.8 


CD4 lymphocyte none 


6.2 


KU-8 12 (Basophil) 

PA/T A /innnirw/r 1 In 


40.6 


2ry Thl/Th2/Trl anti- 
CD95CH11 


11.7 


y^y-yVj 1 1 up ^jveraiinocytes ) 
none 


30.4 


LAK cells rest 


25.2 


^vl^i i uu ^xvcicHiiiocyiesy 
TNFalpha + IL-l beta 


13.6 


LAK cells IL-2 


23.8 


Liver cirrhosis 


3.9 


LAK cells IL-2+IL-12 


22.5 


Lupus kidney 


1.3 


LAK cells IL-2+1FN 
gamma 


36.3 


NCI-H292 none 


87.1 


LAK cells IL-2+ IL-18 


29.7 ; 


NCI-H292 IL-4 


91.4 


LAK cells 
PMA/ionomycin 


47.0 ] 


NCI-H292 IL-9 


92.0 


NK Cells IL-2 rest 


13.7 ] 


^CI-H292 IL-l 3 


39.8 


Two Way MLR 3 day 


15.7 1 


MCI-H292 IFN gamma 


47.0 


Two Way MLR 5 day 


15.6 1 


-IPAEC none 


18.0 


Two Way MLR 7 day 


12.6 j 


1PAEC TNF alpha + IL-l 
)eta 


26.1 
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PBMC rest 


7.5 


I 1 1 n or ft nT*r»nl Q C f nnnp 

i-^uiig iiunjuiasi none 


zz.z 


PBMC PWM 


100.0 


Lung fibroblast TNF alpha 

4- IT -1 he>tn 


16.8 


PBMC PHA-L 


42.6 


Lung fibroblast IL-4 


62.4 


ivamos ^t> cenj none 


1 — 7 / \ 

17.0 


I C 1 11 4 W T <^ 

Lung fibroblast IL-9 


39.5 


Ramos (B cell) 
lonomycin 


54.0 


Lung fibroblast IL-13 


35.4 


B lymphocytes PWM 


73.7 


Lung fibroblast IFN 
gamma 


70.2 


t> lympnocytes CJJ4UL 
and IL-4 


12.9 


Dermal fibroblast 
CCD 1070 rest 


49.3 


EOL-1 dbcAMP 


9.5 


Dermal fibroblast 
^\_-uiu/u iiNr aipna 


92.7 


EOL-1 dbcAMP 
PMA/ionomycin 


22.8 


i^ermai iiDroDiast 
CCD1070IL-1 beta 


29.5 


Dendritic cells none 


13.7 


Dermal fibroblast IFN 
gamma 


27.7 


Dendritic cells LPS 


15.0 


Dermal fibroblast IL-4 


41.5 


Dendritic cells anti- 
CD40 


19.1 


IBD Colitis 2 


1.1 


Monocytes rest 


10.7 


IBD Crohn's 


2.6 


Monocytes LPS 


4.3 


Colon 


14.8 


Macrophages rest 


26.1 |Lung 


21.5 


Macrophages LPS 


15.3 (Thymus 


25.0 


HUVEC none 


30.8 Kidney 


37.4 


HUVEC starved 


34.6 





CNS_neurodegeneration_vl.O Summary: Ag2900 This panel confirms the 
expression of this gene at low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's diseased 
postmortem brains and those of non-demented controls in this experiment. Please see Panel 
1.3D for a discussion of the potential utility of this gene in treatment of central nervous system 
disorders. 

Panel 1.3D Summary: Ag2900 The CG56633-01 gene is expressed at moderate levels 
in the cancer cell lines in this panel, with highest expression in a breast cancer cell line 
(CT=27). Expression of this gene could potentially be used as a diagnostic marker of cell 
proliferation and hence as a diagnostic marker for cancer. 

This gene also has moderate levels of expression in adipose, liver, heart, skeletal 
muscle, adrenal, pituitary, thyroid and pancreas. Therefore, therapeutic modulation of this 
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gene product may be a treatment for endocrine and metabolic diseases, including obesity and 
Types 1 and 2 diabetes. 

Overall, this gene, a translation initiation factor homolog, exhibits brain-preferential 
expression, particularly in the hippocampus, a structure critical for learning and memory. The 
processes of learning and memory are subject to regulation by mechanisms of translational and 
transcriptional control, including the regulation elongation factor phosphorylation by the 
memory-mediating NMDA receptor. The hippocampus-preferential expression of this gene 
suggests that it plays a role in translationally-mediated learning and memory processes. 
Therefore, agents that modulate the activity and function of this gene product may have utility 
in treating CNS disorders involving memory deficits, including Alzheimer's disease and aging. 

References: 

Scheetz AJ, Nairn AC, Constantine-Paton M. N-methyl-D-aspartate receptor activation 
and visual activity induce elongation factor-2 phosphorylation in amphibian tecta: a role for N- 
methyl-D-aspartate receptors in controlling protein synthesis. Proc Natl Acad Sci U S A 1997 
Dec 23 ;94(26): 14770-5 

N-methyl-D-aspartate receptor (NMDAR) activation has been implicated in forms of 
synaptic plasticity involving long-term changes in neuronal structure, function, or protein 
expression. Transcriptional alterations have been correlated with NMDAR-mediated synaptic 
plasticity, but the problem of rapidly targeting new proteins to particular synapses is unsolved. 
One potential solution is synapse-specific protein translation, which is suggested by dendritic 
localization of numerous transcripts and subsynaptic polyribosomes. We report here a 
mechanism by which NMDAR activation at synapses may control this protein synthetic 
machinery. In intact tadpole tecta, NMDAR activation leads to phosphorylation of a subset of 
proteins, one of which we now identify as the eukaryotic translation elongation factor 2 
(eEF2). Phosphorylation of eEF2 halts protein synthesis and may prepare cells to translate a 
new set of mRNAs. We show that NMDAR activation-induced eEF2 phosphorylation is 
widespread in tadpole tecta. In contrast, in adult tecta, where synaptic plasticity is reduced, 
this phosphorylation is restricted to short dendritic regions that process binocular information. 
Biochemical and anatomical evidence shows that this NMDAR activation-induced eEF2 
phosphorylation is localized to subsynaptic sites. Moreover, eEF2 phosphorylation is induced 
by visual stimulation, and NMDAR blockade before stimulation eliminates this effect. Thus, 
NMDAR activation, which is known to mediate synaptic changes in the developing frog, 
could produce local postsynaptic alterations in protein synthesis by inducing eEF2 
phosphorylation. 
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Panel 2D Summary: Ag2900 The CG56633-01 gene is expressed at increased levels 
in colon, breast and bladder cancers compared to the normal adjacent tissue samples. 
Therefore, expression of this gene could be of use as a marker for these cancers. 

Panel 4D Summary: Ag2900 The CG56633-01 gene is expressed in a number of 
5 preparations of activated T lymphocytes at levels greater than in resting T cells. Therefore, 

small molecule antagonists of the CG56633-01 gene product may reduce T cell activation and 
thus reduce or eliminate the symptoms in patients with autoimmune arid inflammatory 
diseases, such as Crohn's disease, ulcerative colitis, multiple sclerosis, chronic obstructive 
pulmonary disease, asthma, emphysema, rheumatoid arthritis, lupus erythematosus, or 
10 psoriasis. 

NOV60a and NOV60b 

Expression of gene NOV60a and variant NOV60b was assessed using the primer-probe 
sets Ag041b and Ag41, described in Tables BCA and BCB. Results of the RTQ-PCR runs are 
shown in Tables BCC, BCD, BCE, BCF, BCG, BCH and BCI. 

1 5 Table BCA. Probe Name Ag041b 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward; 


^L\-9 ta gtaggtgcgcgtggtcat-3 1 


21 


486 


1231 


Probe 


TET-5 1 -accatagccgggcagcgcatg-3 1 -TAMRA 


21 


455 


1232 


Reverse; 


5 ■ -caacggagacaactgcttcaac-3 ' 


22 


431 


1233 



Table BCB. Probe Name Ag41 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -gtaggtgcgcgtggtcatg-3 1 


19 


485 


1234 


Probe | 


TET-5 ' -ccatgcgctgcccggctatg-3 ' -TAMRA 


20 


454 


1235 


Reverse: 


5 1 -octacaacggagacaactgcttc-3 ' 


23 


427 


1236 



Table BCC. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag041b, 
Run 206231412 


Tissue Name 


Rel. Exp.(%) Ag041b, 
Run 206231412 


AD 1 Hippo 


20.7 


Control (Path) 3 
Temporal Ctx 


9.7 


AD 2 Hippo 


44.1 


Control (Path) 4 
Temporal Ctx 


70.2 


AD 3 Hippo 


9.5 


AD 1 Occipital Ctx 


17.9 


AD 4 Hippo 


17.4 


AD 2 Occipital Ctx 


0.0 
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f(Missing) 




AL> 3 Hippo 


97.9 jAD 3 Occipital Ctx 


5.8 


AD 6 Hippo 


65.1 JAD 4 Occipital Ctx 


39.5 


Control 2 Hippo 


66.0 |AD 5 Occipital Ctx 


76.8 


Control 4 Hippo 


18.4 jAD 6 Occipital Ctx 


20.0 


Control (Path) 3 
Hippo 


11.3 


Control 1 Occipital 
Ctx 


5.3 


AD 1 Temporal Ctx 


11.3 


Control 2 Occipital 

Ctx 


82.4 


AD 2 Temporal Ctx 


54.3 


Control 3 Occipital 

LA. 


36.9 


AD 3 Temporal Ctx 


9.5 


Control 4 Occipital 

LA. 


12.1 


AD 4 Temporal Ctx 


39.8 


Control (Path) 1 


95.3 


AD 5 Inf Temporal 
Ctx 


80.7 


Control (Path) 2 


33.4 


AD 5 Sup Temporal 
Ctx 


57.8 


Control (Path) 3 


4.4 


AD 6 Inf Temporal 
Ctx 


52.1 


Control (Path) 4 
Occipital Ctx 


53.6 


AD 6 Sup Temporal 
Ctx 


54.3 


Control 1 Parietal 
Ctx 


9.3 


Control 1 Temporal 
Ctx 


13.8 


Control 2 Parietal 
Ctx 


49.0 


Control 2 Temporal 
Ctx 


66.0 


Control 3 Parietal 
Ctx 


34.6 


Control 3 Temporal 
Ctx 


36.3 


Control (Path) 1 
Parietal Ctx 


100.0 


Control 3 Temporal 
Ctx 


19.8 


Control (Path) 2 
Parietal Ctx 


44.4 


Control (Path) 1 
Temporal Ctx 


85.3 


Control (Path) 3 
Parietal Ctx 


6.9 


Control (Path) 2 
Temporal Ctx 


73.7 


Control (Path) 4 
Parietal Ctx 


72.2 



Table BCD. Panel 1 



Tissue Name 


Rel. Exp.(%) 
Ag41, Run 
97804013 


Rel. Exp.(%) 
Ag41, Run 
97807227 


Tissue Name 


Rel. Exp.(%) 
Ag41, Run 
97804013 


Rel. Exp.(%) 
Ag41, Run 
97807227 


Endothelial cells 


0.0 


0.2 


Renal ca. 786-0 


0.0 


0.4 


Endothelial cells 
(treated) 


0.0 


0.3 


Renal ca. A498 


11.6 


32.8 


Pancreas 


0.1 


4.2 


Renal ca. RXF 


3.6 


1.0 
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Pancreatic ca. 
CAPAN 2 


0.0 


0.5 


Renal ca. 
ACHN 


0.0 




Adrenal gland 


1.2 


15.3 


Renal ca. UO- 
i 1 

D I 


0.0 


0.9 


Thyroid 


0.0 


4.4 


Kenai ca. 1 K- 
10 


0.0 


_ 

0.2 


kJdllVdljr L^lttllU 


U. 1 


O.J 


Liver 


0.9 


8.3 


Pituitary gland 


0.2 


6.0 


Liver (fetal) 


0.1 


3.3 


Brain (fetal) 


0.0 


3.7 


Liver ca. 
(hepatoblast) 

I I /"i -""V 

Hepu2 


0.2 


5.6 


Brain (whole) 


0.0 


35.4 


Lung 


0.0 


1.8 


Brain (amygdala) 


6.4 


60.7 


Lung (fetal) 


0.0 


1.1 


Brain 

(cerebellum) 


4.2 


25.5 


Lung ca. (small 
cell) LX-1 


0 0 

yj . \j 




Brain 

(hippocampus) 


6.1 


54.0 


Lung ca. (small 
cell) NCI-H69 


0 0 


ft 1 


Brain (substantia 
nigra) 


3.4 


35.6 


Lung ca. (s.cell 
var.) SHP-77 


0.0 




Brain (thalamus) 


11.1 


100.0 


Lung ca. (large 
cell)NCI-H460 


1 5 9 


? 7 
z. / 


Brain 

(hypothalamus) 


0.2 


6.3 


Lung ca. (non- 
sm. cell) A549 


0.0 


1.4 


Spinal cord 


4.1 


14.0 


Lung ca. (non- 
s.cell) NCI-H23 


5.7 


14.7 


glio/astro U87- 
MG 


0 0 


u.u 


Lung ca. (non- 
s.cell) HOP-62 \ 


U.U 


0.4 


glio/astro U-118- 
MG 


0 ft 


0 9 


Lung ca. (non- 
s.cl) NCI-H522 


a i 
U.Z 


O.J 


astroc vtom a 
SW1783 


0.0 


0.4 


Lung ca. 
(squam.) SW 
900 


0.3 


6.9 


neuro*; met SK- 
N-AS ' 


0.0 


0.3 


Lung ca. 
(squam.) NCI- 
H596 


0.0 


0.4 


astrocytoma SF- 
539 


0.0 


0.8 


Mammary 
gland 


100.0 


10.5 


astrocytoma SNB- 
75 


0.3 


4.7 


Breast ca.* 
(pl.ef) MCF-7 


0.0 


0.0 


glioma SNB-19 


0.3 


2.3 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


0.0 


0.2 


glioma U251 


7.6 


0.5 


Breast ca.* (pi. 
sf) T47D 


0.8 


7.4 


glioma SF-295 


0.0 


0.8 ] 


Breast ca. BT- 


00 i 


0.1 
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1 




549 






Heart 


6.8 


49.0 


Breast ca. 
MDA-N 


0.0 


0.0 


Skeletal muscle 


0.5 


o.u 


Ovary 


0.9 


13.0 


Bone marrow 


0.0 


1.0 


Ovarian ca. 
UVLAR-3 


0.0 


1.0 


Thymus 


0 0 


0.9 


Ovarian ca. 
OVCAR-4 


0.0 


0.4 


Spleen 


0.0 


2.5 


Ovarian ca. 
OVCAR-5 


0.4 


7.1 


Lymph node 


0.0 


1.1 


Ovarian ca. 
OVCAR-8 


0.0 


0.2 


Colon (ascending) 


0.0 


1.7 


Ovarian ca. 
IGROV-l 


0.0 


0.5 


Stomach 


4.6 


3.2 


Ovarian ca. 
(ascites) SK- 
OV-3 


3.8 


0.7 


Small intestine 


0.1 


3.4 


Uterus 


0.7 


11.7 


Colon ca. SW480 


0.0 


0.3 


Placenta 


0.9 


9.9 


Colon ca.* 
SW620 (SW480 
met) 


0.0 


0.6 


Prostate 


0.3 


7.7 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met) PC- 
3 


0.0 


0.1 


Colon ca. HCT- 
116 


0.0 


0.8 


Testis 


0.1 


3.0 


Colon ca. CaCo-2 


0.0 


0.5 


Melanoma 
Hs688(A).T 


0.0 


0.9 


Colon ca. HCT-15 


0.0 


0.4 


Melanoma* 
(met) 

Hs688(B).T 


0.0 


2.3 


Colon ca. HCC- 
2998 


0.0 


0.0 


Melanoma 
UACC-62 


0.0 


0.3 


Gastric ca. * (liver 
met) NCI-N87 


0.0 


0.3 


Melanoma M14 


0.0 


0.1 


Bladder 


0.0 


2.5 


Vlelanoma 
LOX IMVI 


0.0 


0.0 


Trachea 


0.0 


2.0 i 


vlelanoma* 
;met) SK-MEL- 
5 


0.0 


0.1 


Kidney 


1.0 


11.6 


Vlelanoma SK- 
V1EL-28 


0.1 


1.5 


Kidney (fetal) 


0.0 


3.4 




„, „.,. 




ii ji ii • 
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Table BCE. Panel 1.1 



Tissue Name 


ReL Exp.(%) Ag041b, 
rvun luyooov*)/ 


Tissue Name 


Rel. Exp.(%) Ag041b, 
Run 109666937 


Adrenal gland 


11.5 


Renal ca. UO-31 


1.3 


Bladder 


2.4 


Renal ca. RXF 393 


0.4 


Brain (amygdala) 


25.2 


Liver 


9.9 


Brain (cerebellum) 


57.4 


Liver (fetal) 


2.9 


Brain fhinnocamnn*;^ 

JJ1 Mill \111L/UWUL1I11UUJ 1 


77 4 


Liver ca. 

(hepatoblast) HepG2 


2.7 


Brain (substantia 
nigra) 


^7 A 
J / 


Lung 


1.6 


Brain (thalamus) 


47.6 


Lung (fetal) 


3.3 


Cerebral Cortex 


100.0 


Lung ca. (non-s.cell) 
HOP-62 


0.3 


Brain (fetal) 


5.5 


Lung ca. (large 
cell)NCI-H460 


1.4 


Brain (whole) 


71.2 


Lung ca. (non-s.cell) 
NCI-H23 


6.2 


glio/astroU-118-MG 


0.2 


Lung ca. (non-s.cl) 
NCI-H522 


9.2 


astrocytoma SF-539 


1.1 


Lung ca. (non-sm. 
cell) A549 


1.7 


astrocvtoma SNR-7S 




Lung ca. (s.cell var.) 
SHP-77 


0.2 


astrocytoma SW1783 


0.6 


Lung ca. (small cell) 
LX-1 


1 4 


glioma U251 


0.2 


Lung ca. (small cell) 
NCI-H69 


0.6 


glioma SF-295 


1.2 


Lung ca. (squam.) 
SW 900 


5.9 


glioma SNB-1 9 j 


2.8 


Lung ca. (squam.) 
NCI-H596 


0.5 


glio/astro U87-MG ; 


0.0 


Lymph node j 


1.4 


neuro*; met SK-N-AS 


0.2 


Spleen 


2.5 


Mammary gland 


6.3 


Thymus 


0.5 


Breast ca. B I -549 


0.0 


Ovary 


6.7 


Breast ca. MDA-N 


0.0 


Ovarian ca. IGROV- 
1 


0.5 


Breast ca.* (pl.ef) 
T47D 


8.0 


Ovarian ca. 
OVCAR-3 


1.1 


Breast ca.* (pl.ef) 
MCF-7 


o.o ( 

< 


Ovarian ca. 
OVCAR-4 


0.2 


Breast ca.* (pl.ef) 
MDA-MB-231 


: 


Ovarian ca. 
3VCAR-5 


9.8 


Small intestine j 


6.7 < 


Dvarian ca. 


0.0 
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OVCAR-R 




Colorectal 


0.5 


Ovarian ca.* 
(ascites^ SK-OV-^ 


0.7 


Colon ca. HT29 


0.0 


Pancreas 


11.5 


Colon ca. CaCo-2 


0.9 


Pancreatic ca. 
CAPAN ? 


0.4 


Colon ca. HCT-15 


0.1 


Pituitary gland 


6.6 




f\ A 


Placenta 


9.5 


Colon ra Hrr.?QQR 


U.U 


Prostate 


7.0 


Colon ca. SW480 


0.1 


Prostate ca.* (bone 


0.0 


Colon ca.* SW620 
(SW480 met) 


1.3 


Salivary gland 


6.2 


Stomach 


8.0 


T rq r» Yl t\ 


1.5 


Gastric ca. (liver met) 
NCI-N87 


0.3 


Spinal cord 


14.3 


Heart 


92.7 


Testis 


3.1 


oKeiciai muscie ^retaij 


4.9 


Thyroid 


6.1 


Skeletal muscle 


16.8 


Uterus 


4.7 


Endothelial cells 


1.0 


Melanoma Ml 4 


0.0 


Heart (Fetal) 


33.9 


Melanoma LOX 
IMVI 


U.U 


Kidney 


16.8 


Melanoma UACC- 
62 


0 1 
U. 1 


Kidney (fetal) 


3.1 


Melanoma SK-MEL- 
28 


0.7 


Renal ca. 786-0 


0.2 


Melanoma* (met) 
SK-MEL-5 


0.0 


Renal ca. A498 


30.8 


Melanoma 
Hs688(A).T 


1.1 


Renal ca. ACHN 


i 


Melanoma* (met) 
Hs688(B).T 


1.5 


Renal ca. TK-10 


0.2 





Table BCF. Panel 1.3D 



Tissue Name 


ReL Exp.(%) 
Ag041b, Run 
150010102 


Tissue Name 


Rel. 

Exp.(%) Ag041b, 
Run 150010102 


Liver 
adenocarcinoma 


0.0 


Kidney (fetal) 


0.5 


Pancreas 


0.6 


Renal ca. 786-0 


0.0 


Pancreatic ca. 
CAPAN 2 


0.1 


Renal ca. A498 


11.0 


Adrenal gland 


2.0 


Renal ca. RXF 


0.1 
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Thyroid 


1.0 


Renal ca. ACHN 


0.2 


Salivary gland 


0.6 


Renal ca. UO-31 


0.3 


Pitniturv olanrl 


U.o 


rvenai ca. ijv-iu 


u.u 


Brain (fetal) 


0.5 


Liver 


1.0 


Brain (whole) 


25.0 


Liver (fetal) 


0.8 


Brain (amygdala) 


25.3 


Liver ca. 
(hepatoblast) HepG2 


0.6 


Brain 
(cerebellum) 


2.7 


Lung 


0.6 


Brain 
(hippocampus) 


100.0 


Lung (fetal) 


0.9 


Brain (substantia 

nigra) 


3.5 


Lung ca. (small 
cell) LX-1 


0.1 


Brain (thalamus) 


20.7 


Lung ca. (small 
cell) NCI-H69 


0.2 


Cerebral Cortex 


63.3 


Lung ca. (s.cell 
var.) SHP-77 


0.1 


Spinal cord 


3.6 


Lung ca. (large 
celI)NCI-H460 


0.1 


glio/astro U87- 

MG 


0.0 


Lung ca. (non- 
sm. cell) A549 


0.2 


glio/astro U-118- 

MG 


0.1 


Lung ca. (non- 
s.cell) NCI-H23 


3.4 


astrocytoma 
SW1783 


0.1 


Lung ca. (non- 
s.cell) HOP-62 


0.1 


neuro*; met SK- 

N-AS 


0.0 


Lung ca. (non- 
s.cl) NCI-H522 


1.2 


astrocytoma SF- 

539 


0.1 


Lung ca. 
(squam.) SW 900 


0.5 


astrocytoma 
SNB-75 


0.8 


Lung ca. 
(squam.) NCI-H596 


0.0 


glioma SNB-19 


0.4 


Mammary gland 


1.2 


glioma U251 


0.1 


Breast ca.* 
(pl.ef) MCF-7 


0.0 


glioma SF-295 


0.4 


Breast ca.* 
(pl.ef) MDA-MB-231 


0.1 


Heart (fetal) 


21.9 


Breast ca.* 
(pl.ef) T47D 


1.0 


Heart 


3.7 


Breast ca. BT- i 

549 


0.0 


Skeletal muscle 

(fetal) 


10.7 


Breast ca. MDA- 

N 


0.0 


Skeletal muscle 


0.2 


Ovary 


8.6 


Bone marrow 


0.2 


Ovarian ca. 


0.1 
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OVCAR-3 




Thymus 

^ — . . 


0.1 


Ovarian ca. 
OVCAR-4 


0 1 


Spleen 


1.7 


Ovarian ca. 
OVCAR-5 


0.8 


Lymph node 


0.4 


Ovarian ca. 
OVCAR-8 


0.0 


Colorectal 


1.7 


Ovarian ca. 
IGROV-1 


0.1 


Stomach 


1.5 


Ovarian ca.* 
(ascites) SK-OV-3 


0.1 


Small intestine 


1.6 


Uterus 


1.9 


Colon ca. 

SW480 


0.2 


Plappntfi 

-L Jl K\\~>\s 1 1 lU 


1 7 


Colon ca.* 
SW620(SW480 met) 


0.2 


Prncf^tf* 




Colon ca. HT29 


0.0 


Prostate ca.* 
(bone met)PC-3 




Colon ca. HCT- 

116 


0.1 


A toll J 


U.o 


Colon ca. CaCo- 

2 


0.1 


Melanoma 
Hs688(A).T 




Colon ca. 
tissue(OD03866) 


0.3 


Melanoma* 
(met) Hs688(B).T 


0 7 


Colon ca. HLC- 

2998 


0.0 


Melanoma 
UACC-62 


0.0 


Gastric ca.* 
(liver met) NCI-N87 


0.0 


Melanoma Ml 4 \ 


0.0 


Bladder 


0.6 


Melanoma LOX \ 

IMVI 


0.0 


Trachea 


0.7 


Melanoma* 
(met) SK-MEL-5 


0.0 


Kidney 


0.7 Adipose 


1.6 



Table BCG. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag041b, Run 
157096248 


Rel. Exp.(%) 
Ag41, Run 
157938256 


Tissue Name 


Rel. Exp.(%) 
Ag041b, Run 
157096248 


Rel. Exp.(%) 
Ag41, Run 
157938256 


Normal Colon 


15.7 


21.5 


Kidney 
Margin 
8120608 


9.2 


12.9 


CC Well to Mod 
Diff (OD03866) 


4.4 


2.9 


Kidney Cancer 
8120613 


4.9 


7.2 


CC Margin 
(OD03866) 


6.3 


7.3 


Kidney 
Margin 


14.4 


16.6 



1234 










8120614 




CC Gr.2 

rectosigmoid 

(OD03868) 


3.3 


2.3 


rviuncy ^diiL/Ci 

9010320 


63.7 


. 69.3 


CC Marein 
(OD03868) 


8.2 


8.6 


Kidney 
Margin 
9010321 


33.0 


42.3 


CC Mod Diff 
(ODO3920) 


4.9 


6.9 


Normal Uterus 


8.5 


14.9 


CC Margin 
(ODO3920) 


8 8 


1 1 7 
11*/ 


Uterus Cancer 
064011 


16.2 


14.3 


CC Gr.2 ascend 
colon 

(OD03921) 


2.4 


1.9 


Normal 
i nyroiQ 


5.7 


8.4 


CC Margin 
(OD03921) 


3.4 


4.1 


Thyroid 
Cancer 064010 


8.4 


12.1 


CC from Partial 
Hepatectomy 
(ODO4309) Mets 


2.1 


7.1 


Thyroid 

Cancer 

A302152 


8.5 


9.9 


1— ilrvl 1V1CU u,lll 

(ODO4309) 


15.1 


24.3 


Thyroid 

Margin 

A302153 


9.2 


12.6 


Colon mets to 
lung (OD04451- 

oi) 


0.8 


1.2 


Normal Breast ; 


24.3 


42.3 


Lung Margin 
(OD04451-02) 


2.3 


"} 1 


Breast Cancer 
(OD04566) 1 


1.8 


2.1 


Normal Prostate 
6546-1 


9.7 


9 0 


Breast Cancer 
(OD04590-01); 


5.2 


3.9 


Prostate Cancer 
(OD04410) 


13.6 


12.4 


Breast Cancer j 
Mets 

(OD04590-03) j 


8.5 


9.9 


Prostate Margin 
(OD04410) 


18.3 


19.3 


Breast Cancer 
Metastasis 
(OD04655-05) ; 


8.8 


11.7 


Prostate Cancer 
(OD04720-01) 


19.6 


28.1 


Breast Cancer 
064006 


10.3 


12.9 


Prostate Margin j 
(0004790-0?^ 


33.4 


36.9 


Breast Cancer 

1 (\1A 
1 UZ4 


15.4 


16.2 


Normal Lung 
061010 


7.2 


10.9 


Breast Cancer j 
9100266 


3.8 


5.7 


Lung Met to 

Muscle 

(OD04286) 


0.7 


1.2 


Breast Margin i 
9100265 


4.0 


6.3 


Muscle Margin 
(OD04286) 


7.9 


• 9.7 


Breast Cancer 
A209073 


4.4 


7.2 


Lung Malignant 


3.0 3.7 


Breast Margin j 


8.2 


8.4 
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Cancer 
(OD03126) 






A2090734 






Lung Margin 
(OD03 1 26) 


8.2 


12.9 


Normal Liver 


21.2 


20.3 


Lung Cancer 
(OD04404) 


6.3 


9.9 


Liver Cancer 
064003 


1.8 


1.4 


Lung Margin 
(OD04404) 


14.8 


15.5 


Liver Cancer 
1025 


25,5 


35.8 


Lung Cancer 
(OD04565) 


12.0 


21.6 


Liver Cancer 
1026 


23.3 


25.5 


Lung Margin 
(OD04565) 


7.5 


5.7 


Liver Cancer 
6004-T 


64.6 


60.3 


Lung Cancer 
(OD04237-01) 


3.9 


4.5 


Liver Tissue 
6004-N 


19.8 


23.0 


Lung Margin 
(OD04237-02) 


o.u 


ft Q 


Liver Cancer 
6005-T 


ii.Z 


35.8 


Ocular Mel Met 
to Liver 
(ODO4310) 


4.0 


4.9 


Liver Tissue 


17.4 


22.8 


Liver Margin 
(ODO4310) 


11.7 


17.7 


Normal 
Bladder 


6.3 


5.3 


Melanoma Mets 
to Lung 
(OD04321) 


0.3 


0.3 


Bladder 
Cancer 1023 


1.1 


2.3 


Lung Margin 
(OD04321) 


9.7 


15.5 


Bladder 

Cancer 

A302173 


0.3 


1.0 


Normal Kidney 


18.4 


28.1 


Bladder 
Cancer 

(OD047 18-01) 


3.7 


3.7 


Kidney Ca, 
Nuclear grade 2 
(OD04338) 


6.9 


6.7 


Bladder 
Normal 
Adjacent 
(OD047 18-03)! 


15.4 


15.4 


Kidney Margin 
(OD04338) 


11.1 


21.2 


Normal Ovary 


15.4 


19.8 


Kidney Ca 
Nuclear grade 
1/2 (OD04339) 


13.5 


14.3 


Ovarian 
Cancer 064008; 


22.7 


32.3 


Kidney Margin 
(OD04339) 


30.4 


39.0 


Ovarian 
Cancer 

(OD04768-07) 


100.0 


100.0 


Kidney Ca, Clear 
cell type 
(OD04340) 


9.7 


10.1 


Ovary Margin j 
(OD04768-08) | 


25.5 


25.7 


Kidney Margin 
(OD04340) 


31.2 


33.0 


Normal 
Stomach 


13.9 


16.7 



Kidney Ca, 
Nuclear grade 3 
(OD04348) 


3.1 


3.3 


Gastric Cancer 
9UoU3:>o 


3.5 


3.9 


Kidney Margin 
(OD04348) 


8.4 


12.7 


Stomach 

Margin 

9060359 


3.0 


3.7 


Kidney Cancer 
(OD04622-01) 


3.0 




Gastric Cancer 
9060395 


6.7 


6.3 


jviuiiey iviargiii 
(OD04622-03) 


4.5 


4.7 


Stomach 

Margin 

9060394 


6.1 


7.8 


Kidney Cancer 
(OD04450-01) 


1.6 


0.7 


Gastric Cancer 
9060397 


9.2 


13.8 


Kidney Margin 
(OD04450-03) 


9.9 


13.9 


Stomach 

Margin 

9060396 


3.3 


3.0 


Kidney Cancer 
8120607 


2.4 


1.8 


Gastric Cancer 
064005 


4.7 


5.0 



Table BCH. Panel 3D 



Tissue Name 


Rel. 
Exp.(%) 
Ag041b, 
Run 
156897045 


Rel. 
Exp.(%) 
Ag41, Run 
157938257 


Tissue Name 


Rel. 

Exp.(%) 
Ag041b, 
Run 
156897045 


Rel. 

Exp.(%) 
Ag41, Run 
157938257 


Daoy- 

Medulioblastoma 


0.4 


0.2 


Ca Ski- Cervical 
epidermoid carcinoma 
(metastasis) 


0.0 


0.0 


TE671- 

Medulloblastoma 


0.3 


0.2 


ES-2- Ovarian clear 
cell carcinoma 


0.0 


0.0 


D283 Med- 
Medulloblastoma 


0.1 


0.3 


Ramos- Stimulated 
with PMA/ionomycin 
6h 


0.0 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.1 


0.5 


Ramos- Stimulated 
with PMA/ionomycin 
14h 


0.0 


0.0 


XF-498- CNS 


1.8 


2.7 


MEG-01- Chronic 
myelogenous 
leukemia 
(megokaryoblast) 


0.0 


0.3 


SNB-78- Glioma 


0.0 


0.0 


Raji- Burkitt's 
lymphoma 


0.0 


0.0 


SF-268- 
Glioblastoma 


0.4 


0.7 


Daudi- Burkitt's 
lymphoma 


0.3 


0.0 


T98G- 

Glioblastoma 


1.9 


2.0 


U266- B-cell 
plasmacytoma 


0.0 


0.0 
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SK-N-SH- 

Neuroblastoma 

(metastasis) 


2.3 


2.9 


CA46- Burkitt's 
lymphoma 


0.1 


0.0 


SF-295- 
Glioblastoma 


0.5 


1.1 


RL- non-Hodgkin's 
B-cell lymphoma 


0.0 


0.0 


Cerebellum 


30.8 


55.9 


JM1- pre-B-cell 
lymphoma 


0.0 


0.0 


Cerebellum 


323 


60.3 


Jurkat- T cell 
leukemia 


0.0 


0.0 


NCI-H292- 
Mucoepidermoid 
lung carcinoma 


2.1 


1.7 


I r-l- 

Erythroleukemia 


0.0 


0.1 


DMS-114- Small 
cell lung cancer 


6.0 


6.4 


HUT 78- T-cell 
lymphoma 


0.0 


0.0 


DMS-79- Small 
cell lung cancer 


100.0 


100.0 


U937- Histiocytic 
lymphoma 


0.2 


■ 0.0 


NCI-H146- Small 
cell lung cancer 


0.0 


0.3 


KU-812- 

Myelogenous 

leukemia 


0.8 


0.8 


NCI-H526- Small 
cell lung cancer 


2.2 


3.8 


769-P- Clear cell 
renal carcinoma 


0.5 


0.1 


NCI-N417- Small 
cell lung cancer 


2.7 


4.5 


Caki-2- Clear cell 
renal carcinoma 


1.4 


1.4 


NCI-H82- Small 
cell lung cancer 


0.1 


0.3 


SW 839- Clear cell 
renal carcinoma 


0.0 


0.0 


NCI-H157- 
Squamous cell 
lung cancer 
(metastasis) 


0.0 


0.0 


G401- Wilms' tumor j 


0.1 


0.0 


NCI-H1155- 
Large cell lung 
cancer 


2.1 


2.6 


Hs766T- Pancreatic 
carcinoma (LN 
metastasis) 


1.1 


1.2 


NCI-H1299- 
Large cell lung 
cancer 


0.1 


0.3 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


7.5 


13.0 


NCI-H727- Lung 
carcinoid 


0.0 


0.0 


SU86.86- Pancreatic 
carcinoma (liver 
metastasis) 


1.9 


2.9 


NCI-UMC-11- 
Lung carcinoid 


0.1 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


5.0 


7.0 


LX-1- Small cell 
lung cancer 


1.3 


1.4 


HPAC- Pancreatic 
adenocarcinoma 


0.4 


0.4 


Colo-205- Colon 
cancer 


0.0 


0.0 


MIA PaCa-2- 
Pancreatic carcinoma 


3.6 


4.0 


KM 12- Colon 
cancer 


0.0 


0.0 


CFPAC-1- Pancreatic 
ductal 

adenocarcinoma 


1.6 


2.0 
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KM?fH ?_ Colon 

cancer 


0.0 


0.1 


PANC-1- Pancreatic 
epithelioid ductal 
carcinoma 


20.3 


22.1 


cancer 


0.1 


0.0 


T24- Bladder 
carcinma (transitional 
cell) 


0.4 


0.4 


SW-48- Colon 
adenocarcinoma 


0.1 


0.0 


5637- Bladder 
carcinoma 


1.2 


0.7 


SW1116- Colon 
adenocarcinoma 


0.0 


0.0 


HT-1 197- Bladder 
carcinoma 


0.0 


0.2 


LS 174T- Colon 
adenocarcinoma 


0.3 


0.1 


UM-UC-3- Bladder 
carcinma (transitional 
cell) 


0.1 


0.0 


SW-948- Colon 
adenocarcinoma 


0.0 


0.0 


A204- 

Rhabdomyosarcoma 


2.3 


3.5 


SW-480- Colon 
adenocarcinoma 


u.u 


u.u 


HT-1 080- 
Fibrosarcoma 


U.Z 


A A 
0.4 


NCI-SNU-5- 
Gastric carcinoma 


U.D 


A 

U.O 


MG-63- 
Osteosarcoma 


9.3 


20.9 


carcinoma 


0.4 


0.4 


SK-LMS-1- 

Leiomyosarcoma 

(vulva) 


0.1 


0.4 


INV-^l-olN U-IO- 

Gastric carcinoma 


0.1 


0.0 


SJRH30- 

Rhabdomyosarcoma 
(met to bone marrow) 


0.0 


0.0 


NCI-SNU-1- 
Gastric carcinoma 


u.z 


u.u 


A43 1 - Epidermoid 
carcinoma 


A A 


A '} 

U.3 


RF-1- Gastric 
adenocarcinoma 


0.3 


0.0 


WM266-4- 
Melanoma 


0.6 


0.5 


RF-48- Gastric 
adenocarcinoma 


0.0 


0.0 


DU 145- Prostate 
carcinoma (brain 
metastasis) 


0.0 


0.0 


MKN-45- Gastric 
carcinoma 


0.7 


0.4 


MDA-MB-468- 
Breast 

adenocarcinoma 


26.4 


30.4 


NCI-N87- Gastric 
carcinoma 


0.2 


0.1 


SCC-4- Squamous 
cell carcinoma of 
tongue 


0.2 


0.0 


OVCAR-5- 
Ovarian carcinoma 


1.7 


1.7 


SCC-9- Squamous 
cell carcinoma of 
tongue 


0.2 


0.2 


RL95-2- Uterine 
carcinoma 


5.2 


9.1 


SCC-15- Squamous 
cell carcinoma of 
tongue 


0.6 


0.3 


HelaS3- Cervical 
adenocarcinoma 


0.0 


0.0 


CAL 27- Squamous 
cell carcinoma of 
tongue 


0.0 


0.1 



1239 




Table BCI. Panel 4D 



Tissue Name 


Rel. Exp.(%) 

AgU41D, KUI1 

146087302 


l issue iidme 


Rel. Exp.(%) 
Ao041h Run 

146087302 


Secondary Th 1 act 


0 1 




1.6 


Secondary Th2 act 


0.9 


HUVEC IFN gamma 


11.0 


Secondary Trl act 


0.9 


HUVEC TNF alpha + IFN 
gamma 


5.5 


Secondary Thl rest 


1.2 


HUVEC TNF alpha + IL4 


2.9 


Secondary Th2 rest 


2.7 


HUVEC IL-11 


5:8 


Secondary Trl rest 


3.0 


Lung Microvascular EC 
none 


2.7 


Primary Thl act 


0.9 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0i8 


Primary Th2 act 


1.6 


Microvascular Dermal EC 
none 


2.4 


Primary Trl act 


2.6 


Microsvasular Dermal EC 
TNFalpha + 1L- 1 beta 


2.0 


Primarv Thl rest 

X 1 1111 Ul J ± 1 1 A 1 V 


2.7 


Bronchial epithelium 
TNFalpha + IL1 beta 


7.5 


Primary Th2 rest 


2.7 


Small airway epithelium 
none 


2.3 


Primary Trl rest 


0.8 


Small airway epithelium 
TNFalpha + IL-lbeta 


6.2 


CD45RA CD4 
lymphocyte act 


2.4 


Coronery artery SMC rest 


6.6 


CD45RO CD4 
lymphocyte act 


3.9 


Coronery artery SMC 

r f iv TT^ 1 1 i TT "1 1 _ j 

TNFalpha + IL-lbeta 


10.9 


CD8 lymphocyte act 


0.7 


Astrocytes rest 


1 A O 
lU.Z 


Secondary CD8 
lymphocyte rest 


2.0 


Astrocytes TNFalpha + 
IL-lbeta 


13.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


2.3 


CD4 lymphocyte none 


1.9 


KU-812 (Basophil) 
PMA/ionomycin 


2.5 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD 1 106 (Keratinocytes) 
none 


2.2 


LAK cells rest 


3.3 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.8 


LAK cells IL-2 


1.3 


Liver cirrhosis 


3.9 


LAK cells IL-2+IL-12 


1.3 


Lupus kidney 


5.6 


LAK cells IL-2+IFN 
gamma 


2.2 


NCI-H292 none 


5.3 
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LAK cells IL-2+IL- 18 


0.8 


NC1-H292 IL-4 


6.2 


LAK cells 

PM A/ionomyc i n 


1.3 


NCI-H292 IL-9 


4.9 


NK Cells IL-2 rest 


5.1 


NC1-H292 IL-13 


9.2 


Two Way MLR 3 day 


1.8 


NCI-H292 IFN gamma 


7.1 


Two Way MLR 5 day 


0.2 


HPAEC none 


3.6 


Two Way MLR 7 day 


0.4 


HPAFC TNF alnha + 11 -1 
beta 


1.1 


r\t>iviL^ rest 


n q 




74 2 


PBMC PWM 


1.6 


Lung fibroblast TNF alpha; 

-4- IT -1 hetn 


33.4 


PBMC PHA-L 


0.3 


Lung fibroblast IL-4 


100.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


62.9 


Ramos (B cell) 
ionomycin 


1.3 


Lung fibroblast IL-13 


51.4 


B lymphocytes PWM 


0.8 


Lung fibroblast IFN 
gamma 


84.1 


B lymphocytes CD40L 
and IL-4 


0.4 


Dermal fibroblast 
CCD 1070 rest 


6.7 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
rrni 070 tkif ainha 


6.7 


CfYI 1 ,4U« A AAT> 
cUlv- 1 aDC/VlVLr 

PM A/ionomyc in 


0.0 


i^ennai iiuruuidai 
CCD 1070 IL-1 beta 


15.3 


Dendritic cells none 


12.2 


Dermal fibroblast IFN 
gamma 


40.6 


Dendritic cells LPS 


7.7 


Dermal fibroblast IL-4 


83.5 


Dendritic cells anti- 
CD40 


12.2 


IBD Colitis 2 


3.9 


Monocytes rest 


6.3 


IBD Crohn's 


4.5 


Monocytes LPS 


0.7 


Colon 


20.4 


Macrophages rest 


6.7 


Lung 


11.5 


Macrophages LPS 


0.9 


Thymus 


41.5 


HUVEC none 


5.3 


Kidney 


4.1 


HUVEC starved 


6.3 







CNS_neurodegeneration_vl.O Summary: Ag041b This panel does not show 
differential expression of the NOV60a gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system. 

Panel 1 Summary: Ag41 Two experiments with the same probe and primer set 
produce results that are in reasonable agreement, with highest expression of the NOV60a gene 
in the mammary gland and the brain. Overall, this gene appears to express in normal tissues at 
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higher levels than in cancer cell lines. This gene encodes a lynxl homology Lynxl is an 
endogenous toxin-like modulator of nicotinic acetylcholine receptors in the mammalian CNS. 
Activation of nicotinic receptors is associated with positive effect on schizophrenia and 
alzheimer's disease. Therefore, agents that block Ag41 action in the CNS are likely to have 
5 utility in the treatment of these, and related, disorders. 

This gene also has high levels of expression in pancreas, adrenal, thyroid, pituitary, 
heart, skeletal muscle and liver. Therefore, therapeutic modulation of this gene product may be 
a treatment for endocrine and metabolic diseases, including obesity and Types 1 and 2 
diabetes. Please note that two additional experiments with the same probe and primer set show 
10 low/undetectable levels of expression (CTs>35). (Data not shown.) The results indicate that 
there is a possibility of a probe failure. 

References: 

Miwa JM, Ibanez-Tallon I, Crabtree GW, Sanchez R, Sali A, Role LW, Heintz N. 
lynxl, an endogenous toxin-like modulator of nicotinic acetylcholine receptors in the 

15 mammalian CNS. Neuron 1999 May;23(l):105-14 

Elapid snake venom neurotoxins exert their effects through high-affinity interactions 
with specific neurotransmitter receptors. A novel murine gene, lynxl, is highly expressed in 
the brain and contains the cysteine-rich motif characteristic of this class of neurotoxins. 
Primary sequence and gene structure analyses reveal an evolutionary relationship between 

20 lynxl and the Ly-6/neurotoxin gene family, lynxl is expressed in large projection neurons in 
the hippocampus, cortex, and cerebellum. In cerebellar neurons, lynxl protein is localized to a 
specific subdomain including the soma and proximal dendrites, lynxl binding to brain sections 
correlates with the distribution of nAChRs, and application of lynxl to Xenopus oocytes 
expressing nAChRs results in an increase in acetylcholine-evoked macroscopic currents. 

25 These results identify lynxl as a novel protein modulator for nAChRs in vitro, which could 
have important implications in the regulation of cholinergic function in vivo. 

Panel 1.1 Summary: Ag041b The NOV60a gene is expressed in most cell lines and 
normal tissues with a significantly higher level of expression in normal brain and heart 
compared to cancer cell lines on this panel. The results in this panel are consistent with 

30 expression in Panel 1. Please see Panel 1 for further discussion of utility of this gene in 
metabolic and ens diseases and cancer. 

Panel 1,3D Summary: Ag041b Highest expression of the NOV60a gene is seen in the 
brain. Overall, this gene is expressed in most cell lines and normal tissues with a significantly 
higher level of expression in heart in addition to brain when compared to cancer cell lines on 
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this panel. Please see Panel 1 for discussion of utility of this gene in the central nervous 
system. 

Among metabolic tissues, this gene has a low level of expression in adipose, adult and 
fetal liver, adrenal, pituitary, fetal skeletal muscle, fetal and adult heart, thyroid and pancreas. 
5 Therefore, modulation of this gene product may be a treatment for endocrine and metabolic 
diseases, including obesity and Types 1 and 2 diabetes. In addition, this gene differentially 
expressed in fetal (CT value = 29) vs adult skeletal muscle (CT value = 35) and may be useful 
for the identification of the two sources of this tissue. 

Panel 2D Summary: Ag041b/Ag41 The expression of the NOV60a gene was assessed 
10 in two independent runs on this panel with good concordance between runs. This protein is a 
good diagnostic marker and target in ovarian, renal and liver cancer as the cancer expressed 
this gene at a higher level than the normal adjacent tissue. 

Panel 3D Summary: Ag041b/Ag41 Two experiments show expression of the 
NOV60a gene in cell lines derived from brain, lung, ovarian, renal, pancreatic, breast and 
15 osteosarcoma. Therefore, expression of this gene could be used as a diagnostic marker for the 
presence of these cancers. Furthermore, therapeutic modulation of the expression or function 
of this gene may be effective in the treatment of brain, lung, ovarian, renal, pancreatic, breast 
and osteosarcoma cancers. 

Panel 4D Summary: Ag041b The NOV60a gene, a lynxl homolog is expressed at 
20 moderate levels in untreated lung fibroblasts, lung fibroblasts activated with IL-4, IL-9 or I FN 
gamma, and dermal fibroblasts activated with IL-4 (CTs=30). Therefore, small molecules or 
therapeutic antibodies that antagonize the function of the NOV60a gene product may be useful 
to reduce or eliminate the symptoms in patients with chronic obstructive pulmonary disease, 
asthma, emphysema, or psoriasis. 

25 NOV61 : Adlican-like 

Expression of gene NOV61 was assessed using the primer-probe sets Ag2933, Ag3370 
and Ag3837, described in Tables BDA, BDB and BDC. 

Table BDA. Probe Name Ag2933 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -caccaccactaagccagaac-3 ' 


20 


4011 


1237 


Probe 


TET-5 ■ -ttctcagtccaagaacatctcaaatatgt-' 
3 ' - T AMR A 


29 


4031 


1238 


Reverse 


5 1 -ggattccccatgtaattcaag-3 ■ 


21 


4083 


1239 



1243 




Table BDB. Probe Name Ag3370 



Primers; 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -agctggattcttccaaacaga-3 ' 


21 


1327 


1240 


Probe 


TET-5 ■ -tcacatgtatacatgctgccaaatgg-3 1 - 
TAMRA 


26 


1375 


1241 


Reverse 


5 1 -acctttgggatggaaagagtt-3 ' 


21 


1401 


1242 



Table BDC. Probe Name Ag3837 



Primers 


Sequences 


| Length ; 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -acgagcttgaggatgtggat-3 1 


}2 0 


3725 


1243 


Probe 


TET-5 ' -ttttgtcctctgtgacagtctccaca-3 ' 
TAMRA 




3758 


1244 


Reverse 


5' -gcttcttcctggtgaaatgg-3 • 


i 2 ° 


3784 


1245 



5 CNS_neurodegeneration_vl.O Summary: Ag2933/Ag3370 Expression of this gene 

is low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General_screening_panel_vl.4 Summary: Ag3370/Ag3837 The amp plots suggest 
that there were experimental difficulties with these runs (data not shown). 

Panel 1.3D Summary: Ag2933 Expression of this gene is low/undetectable (CTs > 
10 35) across all of the samples on this panel (data not shown). 

Panel 2D Summary: Ag2933 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag2933/Ag3370 Expression of this gene is low/undetectable 
(CTs > 35) across all of the samples on this panel (data not shown). 

15 BE. CG56781-01: NEUROPSIN PRECURSOR 

Expression of gene CG56781-01 was assessed using the primer-probe sets Ag3019 and 
Ag4966, described in Tables BEA and BEB. 



Table BEA. Probe Name Ag3019 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 * -aggatctgagcctgtgttcag-3 1 




714 


1246 


Probe 


TET- 5 ' -cggagacccgctgtctacactaacgt-3 ' - j 
TAMRA j 


26 


739 


1247 


Reverse 


5 1 -ttcaatccactccaggtagtca-3 ' 


22 


768 


1248 



20 Table BEB. Probe Name Ag4966 

jpjri^r^£" "' "~~ " Sequences " '" [ Length | ^~ Start ] sierra NO : 










Position 




Forward 


5 1 -ccaccctcttcctcagag-3 ' 


18 


7 


1249 


Probe 


TET-5 ' -caccctgtgcaatccagccgtg-3 ' - 
TAMRA 


22 


44 


1250 


Reverse 


5 1 -acacctgcccacgctc-3 ' 


16 


89 


1251 



CNS_neurodegeneration_vl.O Summary: Ag3019 The amp plot suggests that there 
were experimental difficulties with this run in one sample (data not shown). Given the lack of 
expression of this gene on the other panels the expression detected in the occipital cortex is 
5 likely artifactual. 

Panel 1.3D Summary: Ag3019 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 

Panel 4.1D Summary: Ag3019 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 
10 Panel 4D Summary: Ag3019 Expression of this gene is low/undetectable (CTs > 35) 

across all of the samples on this panel (data not shown). 

Panel 5 Islet Summary: Ag3019 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 

NOV63 

15 Expression of gene NOV63 was assessed using the primer-probe sets Ag2261 and 

Ag3035, described in Tables BFA and BFB. Results of the RTQ-PCR runs are shown in 
Tables BFC, BFD, BFE and BFF. 



Table BFA. Probe Name Ag2261 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5' -ggatgactcgcctagcttct-3 ' 


20 


882 


1252 


Probe 


TET-5 ' -gccgtaggtgccaccgtgagaag-3 1 - 
TAMRA 


23 


935 


1253 


Reverse 


5 ' -agcagatgctctcgcagtt-3 1 


19 


958 


1254 



20 Table BFB. Probe Name Ag3035 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -acagcagcaagttcgtcaag-3 ' 


20 


527 


1255 


Probe 


TET-5 ■ -agacggtcaagcaaggatctgcgag-3 1 - 
TAMRA 


25 


559 


1256 


Reverse 


5 • -cacgaggttgttgtggaagt-3 * 


20 


593 


1257 



1245 




Table BFC. Panel 1.3D 



Tissue 
Name 


Rel. 

Exp.(%) 
Aa2261 

Run 

150631675 


Rel. 
Exp.(%) 
A&2261 

Run 
152887692 


Rel. 
Exp.(%) 
Ae3035, 

Run 
167597764 


Tissue 
Name 


Rel. 
Exp.(%) 
Ag2261, 
Run 
150631675 


Rel. 
Exp.(%) 
Ag2261, 

Run 
152887692 


Rel. 
Exp.(%) 
Ag3035, 

Run 
167597764 


Liver 

adenocarci j 
noma 


22.4 


19.6 


71.2 


Kidney 
jetal) 


2.1 


0.0 


2.7 


Pancreas i 


3.9 


2.5 


2.8 


Renal ca. 
786-0 


a ft 
u.u 


u.u 


0 0 


Pancreatic j 
ca. 

CAP AN 2 ; 


5.3 


3.5 


9.5 


Renal ca. 
A498 


10.2 


5.3 


9.2 


Adrenal 
gland 


2.1 


0.6 


2.0 


Renal ca. 
RXF 393 


0.0 


0.0 


0.0 


Thyroid ! 


7.0 


9.8 


3.9 


Renal ca. 
ACHN 


0.0 


2.2 


0.0 


Salivary 
gland 


1.9 


2.1 


4.2 


Renal ca. 
UO-31 


u.u 


ft ft 

U.U 


u.u 


Pituitary 
gland 


1.0 


2.2 


6.7 


Renal ca. 
TK-10 


0.0 


0.0 


0.0 


Brain 
(fetal) 


6.8 


4.9 


10.8 


Liver 


0.0 


0.0 


0.0 


Brain 
(whole) 


4.8 


3.0 


1.4 


Liver 
(fetal) 


7.6 


0.0 


0.0 


Brain 

) 


4.6 


5.3 


1.5 


Liver ca. 
(hepatobla 
st) HepG2 


0.0 


0.0 


0.0 


Brain 

( pprfhpllu 

m) 


1.6 


1.6 


2.0 


Lung 


14.3 


15.8 


9.2 


Brain 

rhinnpicflm 

pus) 


7.5 


1 1.3 


0.6 


Lung 
(fetal) 


15.1 


15.4 


7.4 


Brain 

(substantia 

nigra) 


1.2 


2.6 


1.3 


Lung ca. 
(small cell) 
LX-1 


1.6 


0.0 


0.0 


Brain 
(thalamus) 


2.5 


1.7 


2.6 


Lung ca. 
(small cell) 
NCI-H69 


29.5 


19.1 


31.2 


Cerebral 
Cortex 


0.0 


i o.o 


5.0 


Lung ca. 
(s.cell var.) 
SHP-77 


11.0 


5.1 


37.4 


Spinal 
cord 


1.7 


2.1 


2.7 


Lung ca. 

(large 

cell)NCI- 


0.0 


0.0 


0.0 
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\ 











H460 








glio/astro 
U87-MG 


0 0 


0 0 


0 0 
v/.w 


Lung ca. 

( X~\f~\V\ — C tTl 

1 1HJ11 olll. 

cell) A549 


0 0 


1 7 

1 .z. 




gllU/ doll U 

U-118-MG 


55.1 


50.3 


42.9 


Lung ca. 
(non- 
s.cell) 
NCI-H23 


0.0 


1.3 


0.8 


astrocytom 
aSW1783 


0.0 


7.5 


0.0 


Lung cd. 
(non- 
sxell) 
HOP-62 


0.0 


1.7 


0.0 


neuro*; 
met SK-N- 
AS 


0.0 


0.0 


0.7 


Lung ca. 

(non-s.cl) 

NCI-H522 


8.0 


8.3 


7.3 


astrocytom 
a SF-539 


1.9 


4.7 


9.9 


Lung ca. 
(squam.) 
SW 900 


4.0 


0.0 


1.8 


astrocytom 
a SNB-75 


2.0 


4.9 


6.9 


Lung ca. 
(squam.) 
NCI-H596 


15.8 


10.2 


58.2 


glioma 
SNB-19 


6.7 


2.4 


3.7 


Mammary 
gland 


7.2 


4.1 


4.4 


glioma 
U251 


2.1 


4.5 


6.8 


t>reast ca. 

(pl.ef) 

MCF-7 


1.7 


3.4 


7.3 


glioma SF- 
295 


10.0 


0.6 


4.6 


Breast ca.* 
(pl.ef) 
MDA- 
MB-231 


23.2 


19.6 


19.2 


Heart 
(fetal) 


11.1 


9.9 


38.2 


Breast ca.* 

(pl.ef) 

T47D 


4.3 


5.8 


21.8 


Heart 


4.9 


6.0 


15.2 


Breast ca. 
BT-549 


0.0 


4.2 


2.2 


Skeletal 

muscle 

(fetal) 


100.0 


100.0 


85.3 


Breast ca. 
MDA-N 


0.0 


0.0 


0.0 . 


Skeletal 
muscle 


5.5 


8.4 


39.8 


Ovary 


3.6 


3.1 


8.1 


Bone 
marrow 


0.0 


0.0 


0.7 


Ovarian 
ca. 

OVCAR-3 


1.1 


1.0 


5.6 


Thymus 


10.0 


3.9 


^6.4 


Ovarian 
ca. 

OVCAR-4 


0.0 


0.0 


0.7 


Spleen 


3.8 


4.2 


1.6 


Ovarian 


0.0 


0.0 


11.5 
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ca. 

OVCAR-5 








Lymph 
node 


5.0 


1.1 


1.4 


Ovarian 
ca. 

OVCAR-8 


1.3 


4.3 


4.1 


Colorectal 


3.4 


5.4 


6.8 


Ovarian 
ca. 

IGROV-1 


0.0 


0.0 


8.1 


O IvJl 1 1 ClV^l 1 




1 5 4 

1 .*-r 


3 1 


Ovarian 
ca.* 

(ascites) 
SK-OV-3 


7 S 


In 0 


ivu.v 


Small 
intestine 


1 S Q 


1 8 7 
1 o. / 


z,._> 


yj ICI U5> 


1 7 R 


I J . 1 


0 0 


Colon ca. 
SW480 


94 ^ 




1 1 fx 

1 l.D 


1 IclCCIllct 


4 




? 1 


Colon ca.* 
SW620(S 
W480 met) 


0.0 


0.0 


2.1 


Prostate 


3.6 


5.3 


0.6 


Colon ca. 
HT29 


0.0 


0.0 


0.0 


Prostate 
ca.* (bone 
met)PC-3 


1.7 


1.5 


6.1 


Colon ca. 
HCT-116 


3.8 


0.6 


3.3 


Testis 


21.9 


14.6 


1.6 


Colon ca. 
CaCo-2 


0.0 


0.8 


0.3 


Melanoma 
Hs688(A). 
T 


3.1 


4.7 


1.4 


Colon ca. 
tissue(OD 
03866) 


2.3 


0.0 


1.6 


Melanoma 
* (met) 
Hs688(B). 
T 


0.4 


1.3 


0.0 


Colon ca. 
HCC-2998 


0.0 


0.0 


1.6 


Melanoma 
UACC-62 


0.0 


0.0 


0.0 


Gastric 
ca.* (liver 
met) NCI- 
N87 


16.7 


1 A A 

i4.y 


1 C 1 

15.3 


Melanoma 
M14 


O.U 


u.u 


A A 

U.U 


Bladder 


16 


3.2 


3.0 


Melanoma 

LOX 

IMVI 


0.0 


0.0 


0.0 


Trachea 


24.3 


33.7 


5.7 


Melanoma 
* (met) 
SK-MEL- ! 

5 ] 


0.0 


2.0 


0.7 


Kidney 


0.0 


0.0 


0.0 


Adipose 


6.7 


7.2 


21.2 
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Table BFD. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2261, Run 
150811744 


Rel. Exp.(%) 
Ag2261,Run 
152887693 


Tissue Name 


Rel. Exp.(%) 
Ag2261, Run 
150811744 


Rel Exd ( °/o> 
Ag2261, Run 
152887693 


Normal Colon 


19.1 


19.8 


Kidney 
Margin 
8120608 


2.4 


0.0 


CC Well to Mod 
Diff(OD03866) 


0.0 


5.8 


Kidnev Cancer 
8120613 


14.6 


7.3 


CC Margin 
(OD03866) 


19.5 


12.5 


K idnev 
Margin 
8120614 


4.8 


1.5 


CC Gr.2 

rectosigmoid 

(OD03868) 


3.8 


1.4 


Kidney Cancer 
9010320 


0.0 


0.0 


CC Margin 
(OD03868) 


2.6 


' 5.1 


Kidney 
Margin 
9010321 


0.0 


0.0 


CC Mod Diff 
(ODO3920) 


6.0 


2.9 


Normal Uterus 


9.7 


2.8 


CC Margin 
(0003920} 

1 \j § 


23.8 


6.4 


Uterus Cancer 
06401 1 


85.9 


41.5 


CC Gr.2 ascend 
colon 

(OD03921) 


9.3 


2.2 


Normal 
Thyroid 


15.2 


7.3 


CC Margin 
(OD03921) 


16.8 


11.7 


Thyroid 

Cancer 

064010 


0.0 


3.0 


CC from Partial 
Hepatectomy 
(ODO4309) 
Mets 


2.4 


0.0 


Thyroid 

Cancer 

A302152 


1.9 


1.2 


Liver Margin 
(ODO4309) 


2.6 


0.0 


Thyroid 

Margin 

A302153 


2.6 


2.8 


Colon mets to 
lung (OD04451- 
01) 


7.9 


4.5 


Normal Breast 


16.2 


2.7 


Lung Margin 
(OD04451-02) 


1 1.3 


12.9 


Breast Cancer 
(OD04566) 


78.5 


29.7 


Normal Prostate 
6546-1 


6.3 


2.6 


Breast Cancer 
(OD045 90-01) 


37.6 


23.8 


Prostate Cancer 
(OD04410) 


17.8 


7.3 


Breast Cancer 
Mets 

(OD04590-03) 


100.0 


24.5 


Prostate Margin 
(OD04410) 


10.7 


7.4 


Breast Cancer 
Metastasis 


94.0 


45.4 
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(OD04655-05) 






Prostate Cancer 
(OD04720-01) 


4.7 


4.4 


Breast Cancer 
064006 


25.7 


24.8 


Prostate Margin 
(OD04720-02) 


13.9 


5.6 


Breast Cancer 
1024 


23.2 


7.1 


Normal Lung 

A/C 1 A 1 A 


36.6 


14.3 


Breast Cancer 
9100266 


33.0 


7.5 


Lung Met to 

Muscle 

(OD04286) 


1.0 


0.0 


Breast Margin 
9100265 


7.6 


7.6 


Muscle Margin 
(OD04286) 


31.0 


38.2 


Breast Cancer 
A209073 


13.9 


0.9 


Lung Malignant 
Cancer 


81.8 


100.0 


Breast Margin 
A2090734 


2.5 


0.0 


Lung Margin 

\\JLJ\JD 1 Z.\J J 


35.8 


18.2 


Normal Liver 


0.0 


0.0 


Lung Cancer 


57.0 


39.5 


Liver Cancer 
U04UU3 


0.0 


0.0 


Lung Margin 


9.4 


11.8 


Liver Cancer 
1U25 


4.8 


1.7 


Lung Cancer 
(OD04565) 


37.1 


42.0 


Liver Cancer 
1026 


7.1 


0.0 


Lung Margin 
(OD04565) 


22.7 


9.3 


Liver Cancer 
6004-T 


4.8 


0.0 


Lung Cancer 
(OD04237-01) ! 


5.3 


6.4 


Liver Tissue 
6004-N 


4.4 


1.8 


Lung Margin 
(OD04237-02) 


78.5 


32.8 


Liver Cancer 
6005-T 


0.0 


6.0 


Ocular Mel Met 
to Liver 
(ujj<J4 5 1 u) 


0.0 


0.0 


Liver Tissue 
6005-N 


0.0 


1.8 


Liver Margin 
(ODO4310) 


2.4 


0.0 


Normal 
Bladder 


2.4 


3.0 


Melanoma Mets 
to Lung 
(OD04321) 


13.0 


0.0 


Bladder 
Cancer 1023 


8.5 


4.9 


Lung Margin 
(OD04321) 


96.6 


50.0 


Bladder 

Cancer 

A302173 


17.0 


11.8 


Normal Kidney 


0.0 


0.0 

■ 


Bladder 
Cancer 

(OD047 18-01) 


10.0 


5.7 


Kidney Ca, 
Nuclear grade 2 
(OD04338) 


0.0 


0.0 

1 


Bladder 
Normal 
Adjacent 
;OD047 18-03) 


19.3 


27.5 
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Kidney Margin 
(OD04338) 


4.0 


4.6 


Normal Ovary 


13.6 


12.4 


Kidney Ca 
Nuclear grade 
1/2 (OD04339) 


0.0 


3.3 


Ovarian 

Cancer 

064008 


37.9 


2.1 


Kidney Margin 
(OD04339) 


18.7 


0.0 


Ovarian 
Cancer 

(OD04768-07) 


18.4 


3.7 


Kidney Ca, Clear 
cell type 
(OD04340) 


8.8 


11.7 


Ovary Margin 
(OD04768-08) 


28.3 


12.2 


Kidney Margin 
(OD04340) 


0.0 


2.0 


Normal 
Stomach 


48.3 


17 3 


Kidney Ca, 
Nuclear grade 3 
(OD04348) 


3.5 


4.0 


Gastric Cancer 
9060358 


0.0 


0.0 


Kidney Margin 
(OD04348) 


2.0 


1.7 


Stomach 

Margin 

9060359 


■ 

9.9 


3.0 


Kidney Cancer 
(OD04622-01) 


9.3 


0.0 


Gastric Cancer 
9060395 


20 7 


10 4 


Kidney Margin 
(OD04622-03) 


0.0 


6.3 


Stomach 

Margin 

9060394 


10.0 


12.2 


Kidney Cancer 
(OD04450-01) 


0.0 


0.0 


Gastric Cancer 
9060397 


8.7 


1.5 


Kidney Margin 
(OD04450-03) 


0.0 


0.0 


Stomach 

Margin 

9060396 


7.5 


6.2 


Kidney Cancer 
8120607 


0.0 


0.7 


Gastric Cancer! 
064005 


10.7 | 


4.8 



Table BFE. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag3035, Run 
190944495 


Tissue Name 


Rel. Exp.(%) 
Ag3035, Run 
190944495 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


1.7 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.8 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.2 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.6 


Secondary Th2 rest 


0.0 


HUVEC IL-1 1 


1.1 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


2.7 


Primary Thl act 


0.0 


Lung Microvascular EC 


0.6 
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T\lt?nlv,Ur. 1 TT IL^ 

1 in r alpha + IE- 1 beta 




Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


3.8 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
1 Nr alpha + 1L-1 beta 


1.2 


Primary Thl rest 


0.0 


Bronchial epithelium 
1 JNr aipna + 1E1 beta 


3.7 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


L9 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


4.0 


Ly-tJ IN. /A K^UH 

lymphocyte act 


0.0 


Coronery artery SMC rest 


0.2 


i~*T\A ^T>f~\ f~^T\A 

lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


2.4 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


1.3 


oeconaary k^uo 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


2.1 


2ry Thl/Th2/Trl anti- 
CD95CH11 


0.0 


CCD 1 106 (Keratinocytes) 
none 


22.2 


LAK cells rest 


0.0 


CCD 1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


18.8 


LAK cells IL-2 


0.0 


Liver cirrhosis 


0.7 


LAK cells IL-2+IL- 12 


0.0 


NCI-H292 none 


0.4 


LAK cells IL-2+IFN 
gamma 


0.0 | NCI-H292 IL-4 


1.5 


LAK cells IL-2+ IL-18 


0.0 j NCI-H292 IL-9 


2.0 


LAK cells 
PMA/ionomycin 


11.0 


NCI-H292 IL-13 


1.4 


NK Cells IL-2 rest 


0.0 


NCI-H292 IFN gamma 


1.5 


Two Way MLR 3 day 


0.0 


HPAEC none 


3.1 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


0.5 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


6.2 


PBMC rest 


0.0 


Lung fibroblast TNF alpha! 
+ IL-1 beta 


2.1 


PBMC PWM 


0.5 


Lung fibroblast IL-4 


4.2 j 


PBMC PHA-L 


0.4 


Lung fibroblast IL-9 


8.3 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-13 


4.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN 
gamma 


8.1 
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B lymphocytes PWM 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.4 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD1 070 TNF alnha 


0.9 


EOL-1 dbcAMP 


0.0 


normal fiHrohla^t 

CCD 1070 IL-1 beta 


2.9 


EOL-1 dbcAMP 
PMA/ionomycin 


1.0 


Dermal fibroblast IFN 
gamma 


5.8 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


17.2 


Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


4.8 


Dendritic cells anti- 
CD40 


0.0 


Neutrophils TNFa+LPS 


1.0 


Monocytes rest 


0.0 


Neutrophils rest 


2.2 


Monocytes LPS 


0.6 


Colon 


2.6 


Macrophages rest 


0.0 


Lung 


8.8 


Macrophages LPS 


0.0 


Thymus 


17.1 


HUVEC none 


2.4 


Kidney 


100.0 


HUVEC starved 


8.8 







Table BFF. Panel 4D 



Tissue Name 


Rel. 

Exp.(%) 
Ag2261, 
Run 
152887762 


Rel. 
Exp.(%) 
Ag3035, 
Run 
165242424 


Tissue Name 


Rel. 

Exp.(%) 
Ag2261, 
Run 
152887762 


Rel. 
Exp.(%) 
Ag3035, 
Run 
165242424 


Secondary Thl act 


0.0 


2.1 


HUVEC IL-1 beta 


0.0 


1.7 


Secondary Th2 act 


0.0 


0.0 


HUVEC IFN 
gamma 


3.7 


11.5 


Secondary Trl act 


0.0 


4.2 


HUVEC TNF 
alpha + IFN 
gamma 


0.0 


3.1 


Secondary Th 1 rest 


0.0 


0.0 


HUVEC TNF 
alpha + IL4 


4.3 


5.1 


Secondary Th2 rest 


0.0 


2.3 


HUVEC IL-1 1 


4.0 


11.2 


Secondary Trl rest 


0.0 


0.0 


Lung 
Microvascular EC 
none 


7.2 


8.1 


Primary Thl act 


0.0 


0.0 


Lung 
Microvascular EC 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


Primary Th2 act 


0.0 


0.0 


Microvascular 
Dermal EC none 


8.4 


14.5 


Primary Trl act 


0.0 


0.0 


Microsvasular 
Dermal EC 


0.0 


2.2 
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n li'n 









TNFalpha + IL- 
lbeta 






Primary Thl rest 


0.0 


0.0 


Bronchial 
epithelium 
TNFalpha + 
ILlbeta 


0.0 


16.3 


Primary Th2 rest 


0.0 


0.0 


Small airway 
epithelium none 


5.9 


18.8 


Primarv TY1 rp<;t 

1 1 1 1 1 1 ill j 111 ILol 


0 0 

VJ.VJ 


yj.u 


Small airway 
epithelium 
TNFalpha + IL- 
Ibeta 




jo.O 


CD45RA CD4 
lymphocyte act 




ft ft 


Coronery artery 
SMC rest 


u.u 


*> ft 


lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


0.0 


0.0 


CD8 lymphocyte 
act 


0 0 


n n 

u.u 


/\sirocyies rest 


"X 'X 


1 'X ^ 
1 J.D 


Secondary CD8 

lvmnhorvtp rf»<;t 


0.0 


0.7 


Astrocytes 
TNFalpha + IL- 
lbeta 


0.0 


8.6 


Secondary CD8 
lymphocyte act 


1.6 


0.0 


KU-812 
(Basophil) rest 


0.0 


0.0 


CD4 lymphocyte 
none 


0.0 


0.0 


KU-812 
(Basophil) 
PMA/ionomycin 


0.0 


9.7 


2ry 

Th1/Th2/Trl anti- 
CD95 CH11 


0.0 


1.4 


CCD 1106 
(Keratinocytes) 
none 


47.3 


100.0 


LAK cells rest 


3.5 


0.0 


CCD 1106 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


9.0 


53.6 


LAK cells IL-2 


0.0 


0.0 


Liver cirrhosis 


32.8 


9.4 


T A \S ... 11 TT <"» ■ TT 

LAK cells IL-2+IL- 
12 


0.0 


0.0 


Lupus kidney 


0.0 


1.6 


T A iy 1 1 „ TT 

LAK cells IL- 
2+IFN Pamma 


0.0 


4.0 


NCT-H292 none 


3.8 


3.4 


LAK cells IL-2+ 
IL-18 


0.0 


0.0 


NCI-H292IL-4 


8.0 


19.5 


LAK cells 
PMA/ionomycin 


26.1 


50.7 


NCI-H292 IL-9 


0.0 


4.2 


NK Cells IL-2 rest 


0.0 


0.0 


NCI-H292IL-13 


13.8 


7.0 


Two Way MLR 3 
day 


0.0 


0.0 


NCI-H292 IFN j 
gamma 


16.2 


5.7 


Two Way MLR 5 


0.0 


0.0 


HPAEC none j 


6.7 


30.1 
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day 










Two Way MLR 7 
day 


0.0 


0.0 


HPAECTNF 
alpha + IL-1 beta 


n n 
u.w 


u.u 


PBMC rest 


0.0 


0.0 


Lung fibroblast 
none 


/ -O 




PBMC PWM 


0.0 


0.0 


Lung fibroblast 
TNF alpha + IL-1 
beta 


3.1 


6.3 


PBMC PHA-L 


0.0 


0.0 


Lung fibroblast 
IL-4 


4.3 


34.2 


Ramos (B cell) 
none 


0.0 


0.0 


Lung fibroblast 
IL-9 


iz. / 


Z /.J 


Ramos (B cell) 
ionomycin 


0.0 


0.0 


Lung fibroblast 
IL-1 3 


6.8 


19.9 


B lymphocytes 
PWM 


n n 


ft ft 


Lung fibroblast 
IFN gamma 


30.4 


51.1 


B lymphocytes 
CD40L and IL-4 


^ 1 

D . 1 


n ft 


Dermal fibroblast 
CCD 1070 rest 


0.0 


2.8 


EOL-1 dbcAMP 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


5.2 


19.6 


EOL-1 dbcAMP 
PMA/ionomycin 


3.5 


2.7 


Dermal fibroblast 
CCD1070 IL-1 
beta 


0.0 


2.0 


juenariiic ceus 
none 


0.0 


0.0 


Dermal fibroblast 
IFN gamma 


28.5 


32.1 


Dendritic cells LPS 


0.0 


0.0 


Dermal fibroblast 
IL-4 


42.9 


91.4 


Dendritic cells anti- 
CD40 


0.0 


0.0 


IBD Colitis 2 


2.2 


5.5 


Monocytes rest 


0.0 


0.0 


IBD Crohn's 


3.1 


9.6 


Monocytes LPS 


0.0 


0.0 


Colon 


100.0 


58.6 


Macrophages rest 


0.0 


0.0 


Lung 


36.3 


26.1 


Macrophages LPS 


0.0 


0.0 


Thymus 


0.0 


0.0 


HUVEC none 


0.0 


17.7 


Kidney 


4.0 


33.0 


HUVEC starved 


17.4 


51.1 







Panel 1.3D Summary: Ag2261 The NOV63 gene is expressed at moderate levels in a 
number of metabolic tissues, with highest overall expression seen in fetal skeletal muscle 
(CTs=30.4-31.8). The higher levels of expression in fetal skeletal muscle when compared to 
5 adult skeletal muscle suggests that the protein product encoded by the 8809101 0 EXT gene 
may be useful in treating muscular dystrophy, Lesch-Nyhan syndrome, myasthenia gravis and 
other conditions that result in weak or dystrophic muscle. This gene is also expressed in 
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adipose, thyroid and heart. Since biologic cross-talk between adipose and thyroid is a 
component of some forms of obesity, this gene product may be a protein therapeutic for the 
treatment of metabolic disease, including obesity and Type 2 diabetes. 

Ag3035 This probe/primer set recognizes a distinct portion of this gene and shows a 
5 distinctive expression pattern when compared to Ag2261 . This observation may indicate that 
the probe/primer sets can distinguish splice variants of this gene. In contrast to the results 
obtained with Ag 2261, expression of this gene is highest in an ovarian cancer cell line (CT = 
30.6). As is the case for Ag2261, expression of this gene using Ag3035 also shows relatively 
high levels in fetal skeletal muscle. However, in addition, Ag3035 shows increased levels of 

10 this gene in adult skeletal muscle as well as in adult and fetal heart. Most other expression is 
similar using both probe/primer sets. Please see Ag2261 for additional information. 

Panel 2D Summary: Ag2261 The expression of this gene was assessed in two 
independent runs on panel 2D. This is consistently expressed in samples of breast cancer, 
uterine cancer and lung cancer when compared to their respective normal adjacent tissue 

1 5 controls. Thus, the expression of this gene could be used to distinguish breast cancer, lung 
cancer or uterine cancer from their normal tissues. Moreover, therapeutic modulation of this 
gene, through the use of small molecule drugs, antibodies or protein therapeutics might be of 
use in the treatment of breast, lung or uterine cancer. 

Panel 4. ID Summary: Ag3035 This probe/primer set recognizes a distinct portion of 

20 this gene and shows a distinctive expression pattern when compared to Ag2261 in Panel 4D. 
This observation may indicate that the probe/primer sets can distinguish splice variants of this 
gene. In contrast to the results obtained with Ag2261, expression of this gene is highest in 
kidney (CT = 30.6). Most other expression is similar using both probe/primer sets. The 
NOV63 gene, a WNT-14 homolog is also expressed at moderate to low levels in several 

25 unstimulated or cytokine-activated keratinocyte and lung and dermal fibroblast preparations 
(CT range 29-34). Thus, the NOV63 gene may be useful as a protein therapeutic that reduces 
or eliminates the symptoms of chronic obstructive pulmonary disease, asthma, emphysema, or 
psoriasis. In addition, due to its known effects on development of vertebrate joints, the protein 
encoded by the NOV63 gene may also reduce or eliminate the symptoms of osetoarthritis (See 

30 Hartmann and Tabin, 2001). 
References: 

Christine Hartmann and Clifford J. Tabin Wnt-14 Plays a Pivotal Role in Inducing 
Synovial Joint Formation in the Developing Appendicular Skeleton Cell, Vol 104, 341-351, 
February 2001 
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The long bones of the vertebrate appendicular skeleton arise from initially continuous 
condensations of mesenchymal cells that subsequently segment and cavitate to form discrete 
elements separated by synovial joints. Little is known, however, about the molecular 
mechanisms of joint formation. We present evidence that Wnt-14 plays a central role in 
5 initiating synovial joint formation in the chick limb. Wnt-14 is expressed in joint-forming 

regions prior to the segmentation of the cartilage elements, and local misexpression of Wnt-14 
induces morphological and molecular changes characteristic of the first steps of joint 
formation. Induction of an ectopic joint-like region by Wnt-14 suppresses the formation of the 
immediately adjacent endogenous joint, potentially providing insight into the spacing of joints. 

10 Panel 4D Summary: Ag2261 The NOV63 transcript is expressed at low levels in 

colon (CT=33.5). Low but significant levels of expression are also found in the lung, 
keratinocytes and dermal fibroblast. Thus, this transcript could be used as a marker for thymic, 
lung and skin tissues. The putative Wnt-14 molecule encoded by this transcript may play an 
important role in the normal homeostasis of these tissues. Therapeutics designed with the 

15 protein encoded by this transcript could be important for maintaining or restoring normal 
function to these organs during inflammation. 

NOV64 

Expression of gene NOV64 was assessed using the primer-probe set Ag3043, 
described in Table BGA. Results of the RTQ-PCR runs are shown in Tables BGB, 
20 BGC and BGD. 

Table BGA. Probe Name Ag3043 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -cctgtatgaggaagtcgatgag-3 1 


22 


868 


1258 


Probe 


TET-5 1 -aggtcattcacgtcccctctcctg-3 ■ - 
TAMRA 


24 


900 


1259 


Reverse 


5 1 -gatacgagtccgtcttcctttc-3 1 


22 


932 


1260 



Table BGB. CNSneurodegenerationvl.O 



Tissue Name 


Rel. Exp.(%) Ag3043, 
Run 211012232 


Tissue Name 


Rel. Exp.(%) Ag3043, 
Run 211012232 


AD 1 Hippo 


23.0 


Control (Path) 3 
Temporal Ctx 


10.2 


AD 2 Hippo 


32.8 


Control (Path) 4 
Temporal Ctx 


28.3 


AD 3 Hippo 


12.3 


AD 1 Occipital Ctx 


20.2 
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AD 4 Hippo | 11.2 


AD 2 Occipital Ctx 

flVf i^inc*^ 

l tTlliJJlIlg 1 


0.0 


AD 5 Hippo 


88.9 


AD 3 Occipital Ctx 


13.1 


AD 6 Hippo 


73.2 


AD 4 Occipital Ctx 


26.2 


Control 2 Hippo 


31.6 


AD 5 Occipital Ctx 


47.6 


Control 4 Hippo 


15.0 


AD 6 Occipital Ctx 


27.5 


Control (Path) 3 
Hippo 


1 1 1 

u. i 


Control 1 Occipital 
Ctx 


8.7 


AD 1 Temnnral Ctx 


3? 3 


Control 2 Occipital 
Ctx 


57.0 


AD ? Temnnral Prv 


44 4 


Control 3 Occipital 
Ctx 


25.5 


/a i y j i empurai ^ ix 


1^4 


Control 4 Occipital 
Ctx 


1 1 9 


ay i j *t i ciiipordi v^ix 


7Q 1 


Control (Path) 1 
Occipital Ctx 


\JO .0 


AD 5 Inf Temporal 
Ctx 


1 uu.u 


Control (Path) 2 
Occipital Ctx 


14 1 


AD 5 Sup Temporal 
Ctx 


*h4 ^ 


Control (Path) 3 
Occipital Ctx 


6 7 


AD 6 Inf Temporal 

Ctx 


79 7 


Control (Path) 4 
Occipital Ctx 


17 3 


AD 6 Sup Temporal 
Ctx 




' Control 1 Parietal 
Ctx 


12.9 


Control 1 Temporal 
Ctx 


12.0 


Control 2 Parietal 
Ctx 


47.0 


Control 2 Temporal 
Ctx 


37.1 


Control 3 Parietal 
Ctx 


19.6 


Control 3 Temporal 
Ctx 


21.0 


Control (Path) 1 
Parietal Ctx 


62.4 


Control 3 Temporal 
Ctx 


13.5 


Control (Path) 2 j 
Parietal Ctx 


24.8 


Control (Path) 1 j 
Temporal Ctx 


56.6 


Control (Path) 3 
Parietal Ctx 


6.0 


Control (Path) 2 
Temporal Ctx 


40.3 


Control (Path) 4 
Parietal Ctx 


46.3 



Table BGC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3043, 
Run 167963717 


Tissue Name 


Rel. Exp.(%) Ag3043, 
Run 167963717 


Liver adenocarcinoma 


79.0 


Kidney (fetal) 


21.6 


Pancreas 


4.3 


Renal ca. 786-0 


19.2 | 


Pancreatic ca. CAPAN 
2 


16.3 


Renal ca. A498 


18.4 
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Adrenal gland 


5.9 


Renal ca. RXF 393 


31.2 


Thyroid 


2.8 


Renal ca. ACHN 


19.2 


Salivary gland 


2.2 


Renal ca. UO-31 


46.0 


I^lLUILdly gldlllJ 


~ ~ 


Renal ca TK-1 0 

1\ tl IClI \^H. J. XV 1 \f 


18 2 


Brain (fetal) 


7.4 


Liver 


3.5 


Brain (whole) 


9.7 


Liver (retal) 


11/1 

1 1 .4 


Brain (amygdala) 


6.9 


Liver ca. 
(hepatoblast) HepG2 


18.8 


Brain (cerebellum) 


12.9 


Lung 


3.9 


Brain (hippocampus) 


6.6 


Lung (fetal) 


13.0 


Rr?iiTi ^QiiH<5tantia nityfyii 

U 1 Cl 1 1 1 ^oUUoldl I lid 1 1 1 g, 1 CI } 


8.2 


Lung ca. (small cell) 
LX-1 


19.5 




4 6 


Lung ca. (small cell) 
NCI-H69 


1 1.0 




10 5 


Lung ca. (s.cell var.) 
SHP-77 


55.5 




5 7 


Lung ca. (large 
cell)NCI-H460 


3.3 


gllVJf CI .3 LI \J W O / 1VIVJ 


40.1 


Lung ca. (non-sm. 
cell) A549 


25.2 


a I in/astro T 1-1 1 8-MG 

g I l\J/ CI J 11 U I lO 1VIVJ 


50.3 


Lung ca. (non-s.cell) 
NCI-H23 


15.0 


astrocytoma SW1783 


29.1 


Lung ca. (non-s.cell) 
HOP-62 


13.2 


neuro*; met SK-N-AS 


11.7 


Lung ca. (non-s.cl) 
NCI-H522 


12.0 


astrocytoma SF-539 


23.0 


Lung ca. (squam.) 
SW 900 


27.9 


astrocytoma SNB-75 


44.4 


Lung ca. (squam.) 
NC1-H596 


22.4 


glioma SNB-19 


1 C ft 

15.9 


Mammary gland 


4.0 


glioma U251 


17.3 


Breast ca.* (pl.ef) 
MCF-7 


23.3 


glioma SF-295 


28.9 


Breast ca.* (pl.ef) 
MDA-MB-231 


46.7 


Heart (fetal) 


20.7 


Breast ca.* (pl.ef) 
T47D 


21.5 


Heart 


8.2 


Breast ca. BT-549 


15.2 


Skeletal muscle (fetal) 


29.3 


Breast ca. MDA-N 


42.9 


Skeletal muscle 


37.4 


Ovary 


6.1 


Bone marrow 


4.9 


Ovarian ca. OVCAR- 
3 


10.4 


Thymus 


11.0 


Ovarian ca. OVCAR- 
4 


19.5 


Spleen 


7.0 


Ovarian ca. OVCAR- 


39.5 
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5 




Lymph node 


19.3 


Ovarian ca. OVCAR- 

o 
O 


6.3 


Colorectal 


12.9 


Ovarian ca. IGROV- 
1 


6.5 


Stomach 


3.7 


Ovarian ca.* (ascites) 
SK-OV-3 


100.0 


Small intestine 


5.4 


Uterus 


^ Q 
J.O 


Colon ca. SW480 


14.2 


Placenta 


3.8 


Colon ca.* 
SW620(SW480 met) 


45.4 


Prostate 


2.9 


Colon ca. HT29 


12.7 


Prostate ca.* (bone 
met)PC-3 


40.9 


Colon ca. HCT-116 


25.9 


Testis 


2.8 


Colon ca. CaCo-2 


25.2 


Melanoma 
Hs688(A).T 


6.8 


Colon ca. 
tissue(OD03866) 


15.4 


Melanoma* (met) 

tt zr oo/n\ T 1 

Hsooo(Jd). 1 


7.7 


Colon ca. HCC-2998 


13.3 


Melanoma UACC-62 


17.3 


Gastric ca.* (liver met) 
NCI-N87 


10.4 


Melanoma M14 


11.0 


Bladder 


13.8 


Melanoma LOX 
IMVI 


43.5 


Trachea 


3.0 


Melanoma* (met) 
SK-MEL-5 


27.5 


Kidney 


9.5 


Adipose 


7.3 



Table BGD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3043, Run 
164315037 


Tissue Name 


Rel. Exp.(%) 
Ag3043, Run 
164315037 


Secondary Th 1 act 


38.4 


HUVEC IL-lbeta 


20.6 


Secondary Th2 act 


38.7 


HUVEC IFN gamma 


22.5 


Secondary Trl act 


40.3 


HUVEC TNF alpha + IFN 
gamma 


28.7 


Secondary Th 1 rest 


10.2 


HUVEC TNF alpha + IL4 


40.6 


Secondary Th2 rest 


18.2 


HUVEC IL-11 


18.4 


Secondary Trl rest 


16.3 


Lung Microvascular EC 
none 


23.3 


Primary Thl act 


47.6 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


26.1 


Primary Th2 act 


34.6 


Microvascular Dermal EC 
none 


47.0 


Primary Trl act 


46.3 


Microsvasular Dermal EC 


28.5 
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TNFalpha + IL-lbeta 




Primfirv Th1 rf*<it 


55 9 


Bronchial epithelium 
TNFalpha + ILlbeta 


54.7 


Primary Th2 rest 


28.5 


Small airway epithelium 
none 


29.7 


Primary Trl rest 


33.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


89.5 


CD45RA CD4 
lymphocyte act 


31.4 


Coronery artery SMC rest 


33.0 


CD45RO CD4 
lymphocyte act 


39.8 


Coronery artery SMC 
TNFalpha + IL-lbeta 


14.2 


CD8 lymphocyte act 


47.0 


Astrocytes rest 


10.6 


Secondary CD8 
lymphocyte rest 




Astrocytes TNFalpha + 
IL-lbeta 


k 9 


Secondary CD8 
lymphocyte act 


25.3 


KU-812 (Basophil) rest 


20.4 


CD4 lymphocyte none 


9.5 


KU-8 12 (Basophil) 
PM A/i onomy c in 


49.7 


2ry Thl/Th2/Trl anti- 
CD95CH11 


21.2 


CCD 1 106 (Keratinocytes) 
none 


47.3 


LAK cells rest 


43.8 


CCD1 106 (Keratinocytes) 
i JNraipna • tl- i oeia 


25.3 


L//\IV cens TL-Z 




Liver cirrnosis 


A 2 


T A V <-p>I1c TT 9-4-TT 19 


18. A 


1 i ir\i i c L/"ii^i^O"\/ 

TvUpus Kiuney 




LAK cells IL-2+IFN 

ctck \m tn a 
gcllllllla 


52.5 


NCI-H292 none 


46.7 


T AK rp11« TT -9-1- TT -1 R 

1 v/A IN. L-C lib 1 1 v Z. i lL"IO 


49 0 


NIPI-1 1909 TT -d 




LAK cells 

P1V/T A /\ f\v\ r\ tn \/r* i m 
1 IVLrY/ ILJH U 1 1 l_y 1 1 1 


15.6 


NCI-H292 IL-9 


70.7 


lMRT rVHc TT -9 r**Qt 


90 1 


XTPT-T4909 TT -1 1 




i wo w d.y iviLriv tidy 


9Q 1 


XTPT-TT909 I PNJ a^mma 




Two Way MLR 5 day 


23.5 


HPAEC none 


17.3 


Two Way MLR 7 day 


16.2 


HPAEC TNF alpha + 1L-1 
beta 


29.5 


PBMC rest 


10.9 


Lung fibroblast none 


20.2 


PBMC PWM 


91.4 


Lung fibroblast TNF alpha 
+ 1L-1 beta 


17.8 


PBMC PHA-L 


43.5 


Lung fibroblast IL-4 


40.9 


Ramos (B cell) none \ 


50.3 


Lung fibroblast IL-9 


40.6 


Ramos (B cell) 
ionomycin 


100.0 


Lung fibroblast IL-13 


20.9 


B lymphocytes PWM 


94.6 


Lung fibroblast IFN 
gamma 


41.8 


B lymphocytes CD40L 
and IL-4 


39.0 


Dermal fibroblast 
CCD1070 rest 


56.3 
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EOL-1 dbcAMP 


24.7 


Dermal fibroblast 
LLJJiu/u i iNr aipna 


84.7 

M — . — — — 


r^/^I 1 J t A A All 

EOL-1 dbcAMP 

P1VT A /innnmvfiti 


40.9 


Dermal fibroblast 
CCD 1070 IL-1 beta 


25.0 


Dendritic cells none 


34.2 


Dermal fibroblast IFN 
gamma 


29.3 


Dendritic cells LPS 


38.7 


Dermal fibroblast IL-4 


42.6 


Dendritic cells anti- 
CD40 


48.0 


IBD Colitis 2 


2.0 


Monocytes rest 


17.9 


IBD Crohn's 


1.8 


Monocytes LPS 


19.1 


Colon 


17.3 


Macrophages rest 


40.6 


Lung 


16.8 


Macrophages LPS 


18.4 


Thymus 


17.7 


HUVEC none 


37.4 


Kidney 


29.1 


HUVEC starved 


65.1 







CNS_neurodegeneration_vl.O Summary: Ag3043 There is an association with a 
statistical confidence of 0.1 between increased expression of the NOV64 gene in the temporal 
cortex and Alzheimer's disease. This gene encodes a homolog of dipeptidyl peptidase, which 
5 belongs to a known class of markers of T cell activation in Multiple Sclerosis. This indicates 
that inhibitors of this gene product may have utility in treatment of this disease. A dipeptidyl 
peptidase is also dysregulated in Huntington's disease. Our finding of increased expression of 
this gene product in the temporal cortex of Alzheimer's disease patients indicates that there 
may be a wider utility of inhibitors of the protein encoded by this gene, including the treatment 
10 of neurodegenerative diseases such as Huntington's and Alzheimer's, as well as Multiple 
Sclerosis. 

References: 

Khoury SJ, Guttmann CR, Orav EJ, Kikinis R, Jolesz FA, Weiner HL. Changes in 
activated T cells in the blood correlate with disease activity in multiple sclerosis. Arch Neurol 
15 2000 Aug;57(8): 11 83-9 

OBJECTIVE: To determine whether changes in activation markers on peripheral blood 
T cells correlate with disease activity in patients with multiple sclerosis. DESIGN: In a 
prospective longitudinal study during 1 year, we analyzed the change in percentage of 
activated T lymphocytes in the peripheral blood of 40 patients with multiple sclerosis in 
20 relation to clinical findings and changes on brain magnetic resonance imaging (MR1) scans. 

The patients underwent repeated imaging of the brain (mean number of MRIs for each patient, 
22) at the time blood samples were obtained as well as at monthly neurological examinations, 
and at the time of scoring on the Kurtzke Expanded Disability Status Scale (EDSS) and 
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ambulation index scale. RESULTS: A change in the percentage of cells expressing the 
activation markers interleukin 2 receptor (CD25), class II major histocompatibility complex 
(MHC) (13) or surface dipeptidyl peptidase (CD26) correlated significantly with a change in 
lesion volume or a change in number of gadolinium-enhancing lesions as detected on MRI. 
5 Changes in CD25( +) cells and in CD4(+) cells expressing class II MHC also correlated with 
changes in disability as measured by EDSS in patients with relapsing-remitting disease, and 
changes in CD4(+)CD25(+) cells correlated with the occurrence of attacks in patients with 
relapsing-remitting disease. These correlations are dependent on measurement of changes 
between time points sampled at 1- or 2-week intervals. CONCLUSION: There is a linkage 
10 between peripheral T-lymphocyte activation as measured by cell surface markers and disease 
activity in patients with multiple sclerosis. 

Mantle D, Falkous G, Ishiura S, Perry RH, Perry EK. Comparison of cathepsin 
protease activities in brain tissue from normal cases and cases with Alzheimer's disease, Lewy 
body dementia, Parkinson's disease and Huntington's disease. J Neurol Sci 1995 Jul;131(l):65- 



Recent evidence, based upon immunocytochemical and histochemical analysis of brain 
cortical tissue from alzheimer's disease patients, has suggested that altered activity and/or 
distribution of the lysosomal proteases cathepsins B and D may be implicated in the abnormal 
protein processing pathway resulting in formation of the neurotoxic amyloid A4 peptide, 



biochemical assay techniques using conventional or specially synthesised (corresponding to 
protein cleavage points of relevant to A4 peptide formation) fluorogenic substrates, the levels 
of activity of the lysosomal proteases cathepsins B, D, H and L, and dipeptidyl 
aminopeptidases I and II in frontal cortex (grey/white matter) from control and Alzheimer's 

25 disease patients. For comparative purposes, activity levels of the above enzymes were also 
determined in frontal cortex tissue from cases with Lewy body dementia and Parkinson's 
disease, and in caudate tissue from control and Huntington's disease cases. There was no 
significant difference in activity for any protease types in tissue from control cases and cases 
with Alzheimer's disease, Lewy body dementia or Parkinson's disease, with the exception of 

30 reduced dipeptidyl aminopeptidase II activity in Lewy body dementia and Parkinson's cases. 
We have therefore been unable to confirm a potential role for lysosomal cathepsins in the 
characteristic neurodegeneration associated with Alzheimer's disease; however the finding of 
significant increases in activity of dipeptidyl aminopeptidase II, cathepsin H and cathepsin D 
specifically in cases with Huntington's disease is of particular note. We therefore suggest the 
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20 characteristic of this neurodegenerative disorder. We have therefore compared, via 
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potential role of the latter enzymes in the pathogenesis of Huntington's disease requires further 
investigation 

Panel 1.3D Summary: Ag3043 Highest expression of the NOV 64 gene is seen in an 
ovarian cancer cell line (CT>26.2). This gene is expressed at moderate levels in all the cancer 
5 cell lines in this panel. Thus, this is a potential target for small molecule inhibitor drugs in 
cancer. 

This gene also has moderate levels of expression in pancreas, adrenal, thyroid, 
pituitary, heart, skeletal muscle, liver and adipose. Therefore, this gene product may be a small 
molecule target for the treatment of metabolic and endocrine diseases, including obesity and 
10 Types 1 and 2 diabetes. 

In addition, this gene is expressed in the central nervous system. Please see 
CNS_neurodegeneration vl .0 for discussion of utility of this gene in the central nervous 
system. 

Panel 4D Summary: Ag3043 The NOV64 gene is expressed in a number of cells and 
15 tissues of immunological importance, especially in activated B cells, T cells, dendritic cells, 
and activated lung and skin fibroblasts. Therefore, small molecule antagonists that block the 
function of the NOV64 gene product may reduce or eliminate the symptoms of a wide range 
of autoimmune and inflammatory diseases, including Crohn's disease, ulcerative colitis, 
multiple sclerosis, chronic obstructive pulmonary disease, asthma, emphysema, rheumatoid 
20 arthritis, lupus erythematosus, or psoriasis. 

NOV65a and NOV65b 

Expression of gene NOV65a and variant NOV65b was assessed using the primer-probe 
sets Ag3020 and Ag2968, described in Tables BHA and BHB. Results of the RTQ-PCR runs 
are shown in Tables BHC, BHD, BHE, BHF and BHG. 

25 Table BHA. Probe Name Ag3020 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -ggaatcacccacattctgaat-3 1 


21 


193 


1261 


Probe 


TET-5 ' -cgtttacactggccccgaattctaca-3 ' - : 
TAMRA 


26 


231 


1262 


Reverse 


5 1 -cctctacacccaggtactggat-3 ' 


22 


268 


1263 


Table BHB. Probe Name Ag2968 


Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -ggaatcacccacattctgaat-3 ' 




193 


1264 


Probe 


TET-5 ' -cgtttacactggccccgaattctaca-3 1 - 


26 


231 


1265 



1264 



Table BHC. General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) Ag3020, 

JKUn ZzlVVo074 


Tissue Name 


Rel. Exp.(%) Ag3020, 
Run 221998694 


Adipose | 


0.4 


Renal ca. TK- 10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.1 


Melanoma* 


0.0 


vjasinc ca. ^nver inei.j 
NCI-N87 


0.0 


Melanoma* M 14 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


0.2 


Squamous cell 
carcinoma SCC-4 


0.0 


i^oion ca. 

met) SW620 


0.0 


Testis Pool 


A A 

0.0 


j^oion ca. ri i Z7 


0 0 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.2 


Colon ca. CaCo-2 


0.0 


Placenta 


0.1 


Colon cancer tissue 


31 .0 


Uterus Pool 


0.0 


Colon ca. SW1 1 16 


A A 
0.0 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


100.0 


Ovarian ca. SK-OV- 

D 


0.3 


Colon ca. SW-48 


5.7 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


0.0 


Ovarian ca. 
OVCAR-5 


A 1 

0.1 


Qrvioll Tnlpclinr> 0/-v^-\ 1 

oman iniesune jtooi 


0.0 


Ovarian ca. IGROV- 
1 


18.9 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.1 


Fetal Heart 


12.7 


Breast ca. MCF-7 


0.0 


Heart Pool 


4.9 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


0.1 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


23.3 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


20.0 


Breast ca. MDA-N 


0.0 


| Spleen Pool 


0.0 


Breast Pool 


0.0 


Thymus Pool 


0.0 
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Trachea 


0.3 


CNS cancer (glio/astro) 
U87-MG 


A A 

0.0 


Lung 


0.0 


CNS cancer (glio/astro) 
U-118-MG 


A 1 

0.1 


Fetal Lung 


0.2 


CNS cancer 
(neuro;met) SK-N-AS 


A A 


T lino r>Q Wn Md.17 


0 0 


CNS cancer (astro) SF- 
539 


0.7 


Lung ca. LX-1 


12.7 


CNS cancer (astro) 
SNB-75 


0.1 


Lung ca. NCI-H146 


0.1 


CNS cancer (glio) 
SNB-19 


15.9 


Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF- 
295 


0.5 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


0.1 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


0.0 


Lung ca. NCI-H23 


0.0 


Brain (fetal) 


0.1 


Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) 

rDOi 


0.1 


T ^ ^ t IAD £L O 

Lung ca. HOF-oz 


a i 


Cerebral Cortex Pool 


U. 1 


Lung ca. NCI-H522 


0.0 


r>rain ^oUDSianna nigra j 
Pool 


0.1 


Liver 


0.0 


Brain (Thalamus) Pool 


0.1 


Fetal Liver 


0.1 


Brain (whole) 


0.1 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


0.0 


Kidney Pool 


0.1 


! Adrenal Gland 


0.0 


Fetal Kidney 


0.1 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


1 Salivary Gland 


0.1 


Renal ca. A498 


0.0 


| Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


[Pancreatic ca. CAPAN2 


0.4 


Renal ca. UO-31 


0.0 


] Pancreas Pool 


0.0 



Table BHD. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Ag2968, Run 
166220058 


Rel. Exp.(%)| 
Ag3020, Run; 
167819114 


Tissue Name 


Rel. Exp.(%) 
Ag2968, Run! 
166220058 


Rel. Exp.(%) 
Ag3020, Run 
167819114 


Liver 
adenocarcinoma j 


0.0 


0.1 


Kidney (fetal) 


0.0 


0.3 


Pancreas 


0.0 


0.0 


Renal ca. 786- 
0 


0.0 


0.0 


Pancreatic ca. 
CAPAN 2 


0.1 


0.2 


Renal ca. 
A498 


0.0 


0.0 


Adrenal gland 


0.0 


0.0 


Renal ca. RXF 
393 


0.0 


0.0 
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Thyroid 


0.3 


1.6 


Renal ca. 
ACHN 


0.0 


0.0 


Salivary gland 


0.8 


0.6 


Renal ca. UO- 


0.0 


0.0 


Pituitary gland 


0.0 


0.0 


Renal ca. TK- 

i n 
i \j 


0.0 


0.0 


Brain (fetal) 


0.0 


0.1 


Liver 


0.0 


0.0 


Brain (whole) 


0.4 


1.0 


Liver (fetal) 


0.5 


0.0 


Brain (amygdala) 


0.2 


1.0 


Liver ca. 
(hepatoblast) 
HepG2 


0.0 


0.0 


Brain (cerebellum) 


0.0 


0.0 


Lung 


0.0 


0.5 


Brain 
(hippocampus) 


0.0 


0.0 


Lung (fetal) 


0.0 


0.0 


Brain (substantia 
nigra) 


0.2 


0.1 


Lung ca. 
(small cell) 
LX-1 


10.7 

_ 


16.2 


Brain (thalamus) 


0.0 


0.0 


Lung ca. 
(small cell) 
NCI-H69 


0.0 


0.5 


Cerebral Cortex 


0.4 


0.1 


Lung ca. 
(s.cell var.) 
SHP-77 


0.0 


0.0 


Spinal cord 


0.0 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


0.0 


giio/asiro uo/-lvkj 


u.u 


u.u 


Lung ca. (non- 
sm. cell) A549 


0 0 

\J.\J 


0 2 


glio/astro U-l 18- 

1V1VJ 


0.0 


0.1 


Lung ca. (non- 
s.cell) NCI- 
H23 


0.0 


0.0 


astrocytoma 
SW1783 


0.0 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


0.1 


0.1 


neuro*; met SK-N- 

/A o 


0.0 


0.0 


Lung ca. (non- 
s.cl) NCI- 
H522 


0.0 


0.0 


doinjuy luiuct or - 

539 


12.1 


_ 

8.4 


Lung ca. 
(squam.) SW 
900 


0.0 


0.0 


astrocytoma SNB- 
75 


0.0 


0.3 


Lung ca. 
(squam.) NCI- 
H596 


0.1 


0.4 


glioma SNB-19 


0.0 


0.0 


Mammary 
gland 


0.2 


0.2 


glioma U251 


0.4 


0.0 


Breast ca.* 
(pl.ef) MCF-7 


0.0 


0.0 


glioma SF-295 


0.3 


ft ~ 1 Breast ca.* 
| (pl.ef) MDA- 


0.0 


0.0 
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1 


MB-231 




Heart (fetal) 


7.4 


26.8 


Breast ca.* 
(pl.ef) T47D 


0.0 


A A 

0.0 


Heart 


29.9 


35.4 


Breast ca. BT- 
549 


A A 

0.0 


A A 

0.0 


Skeletal muscle 
(fetal) 


10.8 


33.9 


Breast ca. 
MDA-N 


0.0 


A A 
0.0 


Skeletal muscle 


100.0 


100.0 


Ovary 


0.0 


0.1 


Bone marrow 


0.1 


0.6 


Ovarian ca. 
OVCAR-3 


0.0 


0.0 


Thymus 


0.1 


0.1 


Ovarian ra 

V Cll lC4.il \^d. 

OVCAR-4 


0.0 


0.0 


Spleen 


0.0 


0.0 


Ovarian ca. j 
OVCAR-5 


0.0 


0.6 


Lymph node 


0.0 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


0.0 


Colorectal 


0.2 


0.2 


Ovarian ca. 
IGROV-1 


26.2 


26.2 


Stomach 


0.0 


0.0 


wvariciii Ks<\. 

(ascites) SK- 
OV-3 


0.2 


1.0 


Small intestine ] 


0.0 j 0.0 


Uterus 


0.0 


0.0 


Colon ca. bW4oU \ 


0.0 j 0.2 


PI C\C*F*\^\\C\ 
I luLCllLa 


1 0 


0.0 


Colon ca.* 
SW620(SW480 
met) 


1.6 


6.1 


Prostate 


0.2 


0.1 


Prklrm m HT70 


0.0 


0.0 


Prostate ca.* 
fbone mef>PC- 
3 


0.0 


0.0 


I^UIOI 1 Ca. iiV^ 1 - 

116 


0.0 


0.0 


Testis 


0.2 


0.2 


Colon ca. CaCo-2 


0.0 


0.0 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03866) 


21.9 


30.6 


Melanoma* 

(met) 
Hs688(B).T 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.0 


0.5 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 
met) NCI-N87 


0.0 


0.0 


Melanoma 
M14 


0.0 


0.0 


Bladder 


0.1 


0.0 


Melanoma 
LOX IMVI 


0.0 


0.0 


Trachea 


0.8 


0.6 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Kidney 


1 00 


0.0 


Adipose 


1.1 


1.7 
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Table BHE. Panel 3D 



Tissue Name 


ReL Exp.(%) 
Ag2968, Run 
170188142 


Tissue Name 


Rel. Exp.(%) 
Ag2968, Run 
170188142 


Daoy- Medulloblastoma 


0.0 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


0.0 


TE671- Medulloblastoma 


2.2 


ES-2- Ovarian clear cell 
carcinoma 


0.0 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 




0 0 


JvlbCj-U 1 - Chronic 
TYivploppnoiis leukemia 
(megokaryoblast) 


0.0 


SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


0.0 


Daudi- Burkitt's lymphoma = 


\J.\J 


T98G- Glioblastoma 


0.0 


U266- B-cell plasmacytoma 


0.0 


SK-N-SH- 
Neuroblastoma 
(metastasis) 


0.0 


CA46- Burkitt's lymphoma 1 


0.0 


SF-295- Glioblastoma 


0.0 


RL- non-Hodgkin's B-cell \ 
lymphoma 


0.0 


Cerebellum 


0.0 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


0.0 


Jurkat- T cell leukemia 


0.0 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


0.0 


TF-1- Erythroleukemia 


0.0 


DMS-114- Small cell 
lung cancer 


0.0 


HUT 78- T-cell lymphoma 


0.0 


DMS-79- Small cell lung 
cancer 


0.0 


U937- Histiocytic lymphoma 


0.0 


NCI-H146- Small cell 
lung cancer 


0.0 


Ku-oii- iviyeiogenous 
leukemia 


0.0 


NCI-H526- Small cell 
lung cancer 


0.0 


769-P- Clear cell renal 
carcinoma 


0.0 


NCI-N417- Small cell 
lung cancer 


0.0 


Caki-2- Clear cell renal 
carcinoma 


0.0 


NCI-H82- Small cell 
lung cancer 


0.0 


SW 839- Clear cell renal 
carcinoma 


0.0 


NCT-H157- Squamous 
cell lung cancer 
(metastasis) 


0.0 


G401- Wilms' tumor 


0.0 


NCI-HI 155- Large cell 


0.0 


Hs766T- Pancreatic 


0.0 
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lung cancer 


carcinoma (LN metastasis) } 


NCI-H1299- Large cell 
lung cancer 


0.0 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


0.0 


NCI-H727- Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


0.0 


NCI-UMC-ll-Lung 
carcinoid 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


0.0 


LX-1- Small cell lung 
cancer 


4.4 


HPAC- Pancreatic 
adenocarcinoma 


0.0 


Colo-210- Colon cancer 


100.0 


MIA PaCa-2- Pancreatic 
carcinoma 


0.0 


KM 12- Colon cancer 


0.0 


CFPAC-1- Pancreatic ductal j 
adenocarcinoma 


0.0 


KM20L2- Colon cancer 


0.0 


PANC- 1 - Pancreatic 
epithelioid ductal carcinoma 


0.0 


NCI-H716- Colon cancer 


0.0 


T24- Bladder carcinma j 
(transitional cell) 


0.0 


SW-48- Colon 
adenocarc inoma 


4.3 


5637- Bladder carcinoma 


0.0 


SW1116- Colon 
adenocarcinoma 


0.0 


HT-1 197- Bladder carcinoma 


0.0 


LS174T- Colon 
adenocarcinoma 


10.1 


UM-UC-3- Bladder carcinma 
(transitional cell) 


0.0 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


0.0 


SW-480- Colon 
adenocarcinoma 


0.0 


HT-1 080- Fibrosarcoma 


0.0 


NCI-SNU-5- Gastric 
carcinoma 


0.0 


MG-63- Osteosarcoma 


0.0 


KATO III- Gastric 
carcinoma 


1.2 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


0.0 


NCI-SNU-16- Gastric 
carcinoma 


0.0 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


0.7 


NCI-SNU-1- Gastric 
carcinoma 


0.1 


A431- Epidermoid carcinoma 


0.0 


RF-1- Gastric 
adenocarcinoma 


0.0 


WM266-4- Melanoma 


0.0 


RF-48- Gastric 
adenocarcinoma 


0.0 


DU 145- Prostate carcinoma 
(brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


0.0 


MDA-MB-468- Breast 
adenocarcinoma 


0.0 


NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 
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RL95-2- Uterine 
carcinoma 


0.0 


SCC- 1 5- Squamous cell 
carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous cell 
carcinoma of tongue 


0.0 



Table BHF. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3020, Run 
164528102 


Tissue Name 


Rel. Exp.(%) 

A /rlA^A Dun 

Ag^uzu, Jtvun 
164528102 


Secondary Thl act 


0.0 


HUVEC lL-lbeta 


U.U 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-l 1 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primarv Thl act 
jt i iiiidi y iiii uvi 


0.0 


Lung Microvascular EC 
TNFalpha + IL-l beta 


0.0 


Primarv Th2 act 

M. m. Illicit Jf A. 1 l^w C* \*s X. 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primarv Trl act 

X 1 1111(41 J 111 M V I 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-l beta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primarv Th2 rest 

A 1 1111U1 J X * lAm* k W <J %> 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
1 fNraipna ^ i-L.- i oeia 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-l beta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


65.1 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-l beta 


21.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


10.7 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ryThl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


13.3 


CCD 1 106 (Keratinocytes) 
TNFalpha + IL-l beta 


0.0 
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LAK cells IL-2 


0.0 


Liver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


I AK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PM A/ionomvcin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


8.0 


NCI-H292 IL-13 


0.0 


Two Wav Ml R ^ Hav 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day \ 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day ! 


10.6 


ri i /\ iii i iNr aipna • ii^-i ; 
beta 


11.2 


PBML rest 


U.U 


i^ung iiDroDiasi none 


0 0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ 1L- 1 oeia 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


9.9 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 

L.L/L'lu/U dipilcl 


0.0 


h(JL-l ClDCAMr 
P1V1 A /innnmvrin 


0.0 


Ti a **»"Vi o 1 ti r\ ire~\ r* 1 q ct 

uermdi nurouidoi 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


23.8 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


44.4 


Macrophages rest 


0.0 


Lung 


26.6 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 






Table BHG, Panel 5D 


Tissue Name 


Rel. Exp.(%) 
Ag3020, Run 
172171108 


Tissue Name 


Rel. Exp.(%) 
Ag3020, Run 
172171108 
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97457_Patient- 
02go_adipose 


0.2 


94709_Donor 2 AM - A adipose 


0.0 


97476_Patient- 
07sk_skeletal muscle 


8.4 


94710_Donor 2 AM - B adipose 


0.0 


97477_Patient- 
07ututerus ! 


0.0 


9471 1 J3onor 2 AM - C adipose 


0.0 


97478__Patient- 
07plj>lacenta 


1.3 


94712JDonor 2 AD - A adipose 


3.0 


97481_Patient- 
08sk_skeletal muscle 


12.6 


94713_Donor 2 AD - B adipose \ 


0.0 


97482_Patient- 
08ut_uterus 


0.0 


94714_Donor 2 AD - C adipose 


0.0 


97483_Patient- 
08pl_placenta 


0.0 


94742 Donor 3 U - 
A Mesenchymal Stem Cells 


0.0 


97486_Patient- 
09sk skeletal muscle 


12.9 


94743_Donor 3 U - 
B Mesenchymal Stem Cells 


0.0 


97487_Patient- 
09ut_uterus 


0.7 


94730_Donor 3 AM - A adipose 


0.0 


97488_Patient- 
09pl_placenta 


0.2 


94731_Donor 3 AM - B adipose : 


0.0 


97492_Patient- 
lOututerus 


0.2 


94732_Donor 3 AM - C adipose 


0.0 


97493_Patient- 
1 Opl_placenta 


1.1 


94733_Donor 3 AD - A adipose 


0.0 


97495_Patient- 
1 lgoadipose 


0.0 


94734_Donor 3 AD - B adipose 


0.0 


97496 Patient- 
1 1 sk skeletal muscle 


53.6 


94735_Donor 3 AD - C adipose 


0.0 


97497_Patient- 
1 1 ututerus 


0.3 


77 1 3 8_Liver_HepG2untreated 


0.0 


97498_Patient- 
1 lpl_placenta 


2.3 


73556_Heart_Cardiac stromal 
cells (primary) 


0.0 


97500_Patient- 
1 2go_adipose 


0.4 


81735_Small Intestine 


0.0 


9750 l_Patient- 
12sk_skeletal muscle 


100.0 


72409_Kidney_Proximal 
Convoluted Tubule 


0.0 


97502_Patient- 
1 2ut_uterus 


0.0 


82685 Small intestine Duodenum 


0.0 


97503_Patient- 
1 2pl jlacenta 


1.3 


90650_Adrenal_Adrenocortical 
adenoma 


0.2 


94721_Donor2U- 
A Mesenchymal Stem 
Cells 


0.0 


72410_Kidney_HRCE 


0.3 


94722_Donor 2 U - 
B Mesenchymal Stem 
Cells 


0.0 


72411_Kidney_HRE 


0.0 
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94723_Donor 2 U - 
C Mesenchymal Stem 
Cells 


0.0 


73 1 39__Uterus Uterine smooth 
muscle cells 


0.0 



General screening_panel_vl. 4 Summary: Ag3020 The NOV65a gene is expressed 
in brain, colon, lung and ovarian cancer cell lines with highest expression in a colon cancer 
cell line Colo-205 (CT=24.37). This suggests that this gene can be used as a diagnostic marker 
5 for these types of cancer . Furthermore, inhibition of the protein using small molecule drugs 
could potentially be useful for the treatment of brain, colon, lung and ovarian cancer. 

In addition, this gene has low expression in adipose and high expression in adult and 
fetal heart and skeletal muscle. Thus, this protein phosphatase may be a small molecule target 
for the treatment of obesity, Type 2 diabetes and cardiac and skeletal muscle disease. 

10 Panel 1.3D Summary: Ag2968/Ag3020 Results from two experiments using identical 

probe/primer sets are in excellent agreement. Expression of the NOV65a gene is highest in 
adult skeletal muscle (CTs = 26-28). Significant but somewhat lower expression is also seen in 
fetal skeletal muscle and adult/fetal heart. Thus, expression of this gene may be used to 
distinguish these samples from the other samples on this panel. 

15 This gene is also expressed in brain, colon, lung and ovarian cancer cell lines, 

consistent with Genera l_screening_panel_v 1.4. This suggests that this gene can be used as a 
diagnostic marker for these types of cancer and inhibition of the protein using small molecule 
drugs can be used for the treatment of brain, colon, lung and ovarian cancer. 

Panel 3D Summary: Ag2968 Expression of the NOV65a gene is highest in colon 

20 cancer cell line Colo-205 (CT = 25.6). In addition, significant expression of this gene is seen 
in two other colon cancer cell lines. Thus, expression of this gene may be used to distinguish 
these colon cancer cell lines from the other samples on this panel. Moreover, therapeutic 
modulation of the activity of this gene or its protein product, using small molecules, antibodies 
or protein therapeutics, may be of benefit in the treatment of colon cancer. 

25 Panel 4D Summary: Ag3020 Expression of the NOV65a gene is highest in a liver 

cirrhosis sample (CT = 33.3). Furthermore, expression of this gene is not detected in normal 
liver in Panels 1.3D or 1.4, suggesting that its expression is unique to liver cirrhosis. This gene 
encodes a putative protein phosphatase; therefore, antibodies or small molecule therapeutics 
could reduce or inhibit fibrosis that occurs in liver cirrhosis. In addition, antibodies to this 

30 protein could also be used for the diagnosis of liver cirrhosis. Low levels of expression are 
also seen in colon and resting astrocytes. 
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Ag2968 Expression of this gene is low/undetectable (CTs > 35) across all of the 
samples on this panel (data not shown). 

Panel 5D Summary: Ag3020 Expression of the NOV65a gene is primarily restricted 
to samples from skeletal muscle. This specific expression is in agreement with the results in 
5 Panels 1.3D and 1.4. Thus, expression of this gene could be used to differentiate between 
skeletal muscle and other samples on this panel, and as a marker of skeletal muscle. Results 
from one experiment with the probe and primer set Ag2968 are not included. The amp plot 
indicates that there were experimental difficulties with this run. 

NOV66 

1 o Expression of gene NO V66 was assessed using the primer-probe set Ag29 13, 

described in Table BIA. Results of the RTQ-PCR runs are shown in Tables BIB, B1C and 
BID. 

Table BIA. Probe Name Ag2913 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward! 


5 » -tttgtggcttgatggcttt-3 ' 


19 


956 


1267 


Probe 


TET- 5 1 -ttcctttccgcatttcctatgtgaat-3 * -: 
TAMRA 


26 


977 


1268 


Reverse 


5 * - ttccagttaaaggcataacgaa-3 * 


22 


1012 


1269 



1 5 Table BIB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag2913, 
Run 157366466 


Tissue Name 


Rel. Exp.(%) Ag2913, 
Run 157366466 


Liver adenocarcinoma 


1.2 


Kidney (fetal) 


4.3 


Pancreas 


0.4 


Renal ca. 786-0 


0.6 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


0.6 


Adrenal gland 


2.3 


Renal ca. RXF 393 


0.2 


Thyroid 


0.7 


Renal ca. ACHN 


0.0 


Salivary gland 


8.3 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.3 


Renal ca. TK-10 


0.0 


Brain (fetal) 


26.4 


Liver 


2.0 


Brain (whole) 


6.4 


Liver (fetal) 


3.5 


Brain (amygdala) 


18.4 


Liver ca. 
(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


8.6 


Lung 


9.2 


Brain (hippocampus) 


100.0 


Lung (fetal) 


4.7 


Brain (substantia nigra) 


2.6 


Lung ca. (small cell) 


0.6 
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LX-1 




Brain (thalamus) 




Lung ca. (small cell) 
NCI-H69 


2.1 


Cerebral Cortex 


J I .Z 


Lung ca. (s.cell var.) 
SHP-77 


3.8 


Spinal cord 


i n 


Lung ca. (large 
cell)NCI-H460 


1.5 


1 * _ / A. 1 TO'7 A 

glio/astro U87-M(j 


1 .Z 


Lung ca. (non-sm. 
cell) A549 


2.2 


glio/astro U-l lo-Mu 




Lung ca. (non-s.cell) 
NCI-H23 


7.2 


astrocytoma SW 1783 ! 


2.8 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


8.2 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


2.5 


Lung ca. (squam.) 
SW 900 1 


0.9 




0.7 


Lung ca. (squam.) 
NCI-H596 


0.8 


glioma SNB-19 


1.6 


Mammary gland 


5.7 


glioma U251 


0.4 


Breast ca.* (pl.ef) 
MCF-7 


2.2 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


3.1 


Heart (fetal) 


0.1 


Breast ca.* (pl.ef) 
T47D 


1.4 


Heart 


0.4 


Breast ca. BT-549 


2.6 


Skeletal muscle (fetal) 


0.4 


Breast ca. MDA-N 


1.8 


Skeletal muscle 


0.5 


Ovary 


0.0 


Bone marrow 


O.D 


Ovarian ca. OVCAR- 
3 


0.9 


Thymus 


Z.O 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


7.7 


Ovarian ca. OVCAR- 
5 


0.0 


Lymph node 


9.1 


Ovarian ca. OVCAR- 

o 
o 


1.0 


Colorectal 


8.4 


Ovarian ca. TGROV- 
1 


0.0 


Stomach 


0.9 


Ovarian ca.* (ascites) 
SK-OV-3 


0.9 


Small intestine 


11.2 


Uterus 


1.2 


Colon ca. SW480 


0.0 


Placenta 


1.7 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


1.9 
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Colon ca. HT29 


0.0 


Prostate ca.* (bone | 


0.0 


Colon ca.HCT-1 16 


0.1 


Testis 


2.8 


Colon ca. CaCo-2 


7.7 


Melanoma 
Hs688(A).T 


0.2 


Colon ca. 

1 1 Z> j> LI ^ v_y L-/ vy o \t v./ j 


1.1 


Melanoma* (met) 
Hs688(B).T 


0.8 


Colon ca. HCC-2998 


7.9 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


9.2 


Melanoma M 1 4 


0.7 


Bladder 


6.4 


Melanoma LOX 
IMV1 


1.5 


Trachea 


1.4 


Melanoma* (met) 
SK-MEL-5 


1.3 


Kidney 


1.4 


) Adipose 


2.3 



Table BIC. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2913, Run 
157366467 


Tissue Name 


Rel. Exp.(%) 
Ag2913, Run 
157366467 


Normal Colon 


32.5 


Kidney Margin 
8120608 


0.0 


CC Well to Mod Diff 
(OD03866) 


11.2 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


8.7 


Kidney Margin 
8120614 


0.0 


CC Gr.2 rectosigmoid 
(OD03868) 


13.5 


Kidney Cancer 
9010320 


0.0 


CC Margin (OD03868) 


9.5 


Kidney Margin 
9010321 


0.0 


CC Mod Diff (OD03 920) 


35.4 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


57.4 


Uterus Cancer 064011 


1.6 


CC Gr.2 ascend colon 
(OD03921) 


30.8 


Normal Thyroid 


11.8 


CC Margin (OD03921) 


6.4 


Thyroid Cancer 
064010 


0.3 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


0.0 


Thyroid Cancer 
A302152 


59.0 


Liver Margin (ODO4309) 


5.3 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


0.0 


Lung Margin (OD04451- 
02) 


0.4 


Breast Cancer 
(OD04566) 


0.0 
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Normal Prostate 6546-1 ■ 


1.6 


Breast Cancer 
(OD04590-01) 


0.0 


Prostate Cancer 
(OD04410) 


21.8 


Breast Cancer Mets 
(OD04590-03) 


0.4 


Prostate Margin 
(OD04410) 


19.6 


Breast Cancer 

Metastasis 
(OD04655-05) 


1.3 


Prostate Cancer 
(OD04720-01) 


16.2 


Breast Cancer 064006 


2.4 


Prostate Margin 
(OD04720-02) 


18.9 


Breast Cancer 1 024 


0.0 


Normal Lung 061010 i 


73.7 


Breast Cancer 

V 1 UUZOO 


10.7 

— — — 


Lung Met to Muscle ! 
fOD04286"> 

l l-J W ii.U V/ J 


4.0 

^ M 


Breast Margin 
9100265 


2.0 


IVlti^^'IC lVIa.1 JJ111 

(OD04286) 


0.0 


Breast Cancer 
A209073 


2.4 


(OD03126) 


0.0 


Breast Manrin 
A2090734 


1.9 


T una N/I'iroin ^DDO^I?^ 


2 1 


Normal Liver 


2.6 


Lung Cancer (OD04404) 


0.4 


Liver Cancer 064003 


4.5 


Lung Margin (OD04404) ; 


0.0 


Liver Cancer 1025 


n n 
u.u 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


0 0 

VJ.U 


Lung Margin (OD04565) 


2.5 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237- 
01) 


5.2 


Liver Tissue 6004-N 


1.5 


Lung Margin (OD04237- 
02) 


1.9 


Liver Cancer 6005-T 


0.0 


Ocular Mel Met to Liver 
(ODO4310) 


0.4 


Liver Tissue 6005-N 


0.0 


Liver Margin (OD043 10); 2.7 


Normal Bladder 


18.6 


Melanoma Mets to Lung 
(OD04321) 


0.7 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


0.0 


Bladder Cancer 
A302173 


12.6 


Normal Kidney 


0.0 


Bladder Cancer 
(OD047 18-01) 


6.3 


Kidney Ca, Nuclear grade 
2 (OD04338) 


0.0 


Bladder Normal 
Adjacent (OD04718- 
03) 


7.3 


Kidney Margin 
(OD04338) 


0.0 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


0.0 


Ovarian Cancer 
064008 


0.0 


Kidney Margin 
(OD04339) 


0.5 


Ovarian Cancer 
(OD04768-07) 


0.0 
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Kidney Ca, Clear cell 
type (OD04340) 


0.0 


Ovary Margin 
(OD04768-08) 


0.0 


Kidney Margin 
(OD04340) 


0.0 


Normal Stomach 


0.0 


Kidney Ca, Nuclear grade 
3 (OD04348) 


0.0 


Gastric Cancer 
9060358 


2.0 


Kidney Margin 
(OD04348) 


1.0 


Stomach Margin 
9060359 


13.1 


Kidney Cancer 
(OD04622-01) 


0.0 


Gastric Cancer 
9060395 


11.2 


Kidney Margin 
(OD04622-03) 


0.0 


Stomach Margin 
9060394 


15.9 


Kidney Cancer 
(OD04450-01) 


0.5 


Gastric Cancer 
9060397 


41.2 


Kidney Margin 
(OD04450-03) 


0.0 


Stomach Margin 
9060396 


5.6 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


100.0 



Table BID. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2913, Run 
157366468 


Tissue Name 


Rel. Exp.(%) 
Ag2913, Run 
157366468 


Secondary Thl act 


14.9 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


27.7 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


47.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Th 1 rest 


18.2 


HUVEC TNF alpha + IL4 


0.9 


Secondary Th2 rest 


19.6 


HUVEC IL-11 


1.3 


Secondary Trl rest 


13.9 


Lung Microvascular EC 
none 


9.2 


Primary Thl act 


61.6 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


9.1 


Primary Th2 act 


56.3 


Microvascular Dermal EC 
none 


10.4 


Primary Trl act 


41.5 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


7.8 


Primary Thl rest 


100.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


0.2 


Primary Th2 rest 


46.0 


Small airway epithelium 
none 


0.5 


Primary Trl rest 


16.8 


Small airway epithelium 
TNFalpha + IL-lbeta 


2.6 


CD45RA CD4 
lymphocyte act 


9.9 


Coronery artery SMC rest 


0.1 
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CD45RO CD4 | 
lymphocyte act j 


16.7 


Coronery artery SMC j 
TNFalpha+ IL-lbeta j 


0.0 


CD8 lymphocyte act j 


15.5 


Astrocytes rest j 0.6 


Secondary CD8 
lymphocyte rest 


17.9 


Astrocytes TNFalpha + IL- 
lbeta 


0.5 


Secondary CD8 
lympnocyie aci 


16.6 


KU-8 12 (Basophil) rest 


4.6 


CD4 lymphocyte none 1 


16.2 


KU-8 12 (Basophil) 
PMA/ionomycin 


18.0 

. 


zry ini/inz/iri ami- 
CD95CH11 


29.9 


CC1J1 lUo (Reratinocyiesj 
none 


7.2 


LAK cells rest 


28.3 


CCD1 106 CKeratinocvtes} 
TNFalpha + IL-lbeta 


0.2 


LAK cells IL-2 


18.9 


I iver cirrhosis 


8.1 


LAK cells IL-2+IL- 12 


6.0 


T nnii^ kidnev 


3.4 


LAK cells IL-2+IFN \ 
gamma 


6.8 


NCI-H292 none 


4.3 


LAK cells IL-2+ IL-18 


3.5 


NCI-H292 IL-4 


2.1 


LAK cells 
PM A/ionomvcin 


8.8 


NCI-H292 IL-9 


1.8 


NK Cells IL-2 rest 


4.6 


NCI-H292 IL-13 


0.2 


Two Way MLR 3 day 


12.4 


NCI-H292 IFN gamma " 


0.5 


Two Way MLR 5 day 


6.9 


ril none 




Two Way MLR 7 day 


5.1 


HP A EC TNF alpha + IL-1 
beta 


6.3 


Jr divii_x re si 




Lung fibroblast none 


9.9 


PBMC PWM 


37.4 


Lung fibroblast TNF alpha 
+ IL-1 beta 


2.5 


PBMC PHA-L 


12.9 


Lung fibroblast IL-4 


12.7 


Ramos (B cell) none 


11.3 


Lung fibroblast IL-9 


8.7 


Ramos (B cell) 
ionomycin 


13.7 


Lung fibroblast IL-13 


12.1 


B lymphocytes PWM 


1.1 


Lung fibroblast IFN 
gamma 


9.3 


B lymphocytes CD40L 
and IL-4 


0.5 


Dermal fibroblast 
CCD 1070 rest 


9.9 


EOL-1 dbcAMP 


0.4 


Dermal fibroblast 
CCD 1070 TNF alpha 


29.3 


EOL-1 dbcAMP 
PMA/ionomycin 


0.9 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.7 


Dendritic cells none 


0.7 


Dermal fibroblast IFN 
gamma 


1.4 


Dendritic cells LPS 


1.5 


Dermal fibroblast IL-4 


5.1 


Dendritic cells anti- 
CD40 


4.7 


IBD Colitis 2 


16.6 
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Monocytes rest 


5.6 


IBD Crohn's 


7.9 


Monocytes LPS 


0.3 


Colon 


24.7 


Macrophages rest 


3.6 


Lung 


13.4 


Macrophages LPS 


0.4 


Thymus 


72.7 


HUVEC none 


1.2 


Kidney 


65.1 


HUVEC starved 


1.2 







CNS_neurodegeneration_vl.O Summary: Ag2913 No significant expression 
detected. Potential failed chemistry reaction or bad probe/primer set (data not shown). 

Panel 1.3D Summary: Ag2913 The NOV66 gene represents a novel G-protein 
5 coupled receptor (GPCR) with expression in the brain. The GPCR family of receptors contains 
a large number of neurotransmitter receptors, including the dopamine, serotonin, a and b- 
adrenergic, acetylcholine muscarinic, histamine, peptide, and metabotropic glutamate 
receptors. GPCRs are excellent drug targets in various neurologic and psychiatric diseases. All 
antipsychotics have been shown to act at the dopamine D2 receptor; similarly novel 

10 antipsychotics also act at the serotonergic receptor, and often the muscarinic and adrenergic 

receptors as well. While the majority of antidepressants can be classified as selective serotonin 
reuptake inhibitors, blockade of the 5-HT1 A and a2 adrenergic receptors increases the effects 
of these drugs. The GPCRs are also of use as drug targets in the treatment of stroke. Blockade 
of the glutamate receptors may decrease the neuronal death resulting from excitotoxicity; 

15 further more the purinergic receptors have also been implicated as drug targets in the treatment 
of cerebral ischemia. The b-adrenergic receptors have been implicated in the treatment of 
ADHD with Ritalin, while the a-adrenergic receptors have been implicated in memory. 
Therefore this gene may be of use as a small molecule target for the treatment of any of the 
described diseases. 

20 In addition, this gene is expressed in clusters of cell lines derived from lung cancer and 

colon cancer. Thus, expression of this gene could be used to differentiate between these 
sample and other samples on this panel and as a marker to detect the presence of colon and 
lung cancers. Furthermore, therapeutic modulation of the expression or function of this gene 
may be effective in the treatment of lung and colon cancers. 

25 References: 

El Yacoubi M, Ledent C, Parmentier M, Bertorelli R, Ongini E, Costentin J, Vaugeois 
JM. Adenosine A2A receptor antagonists are potential antidepressants: evidence based on 
pharmacology and A2A receptor knockout mice. Br J Pharmacol 2001 Sep;134(l):68-77 
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1. Adenosine, an ubiquitous neuromodulator, and its analogues have been shown to 
produce 'depressant' effects in animal models believed to be relevant to depressive disorders, 
while adenosine receptor. antagonists have been found to reverse adenosine-mediated 
'depressant* effect. 2. We have designed studies to assess whether adenosine A2A receptor 
antagonists, or genetic inactivation of the receptor would be effective in established screening 
procedures, such as tail suspension and forced swim tests, which are predictive of clinical 
antidepressant activity. 3. Adenosine A2A receptor knockout mice were found to be less 
sensitive to 'depressant' challenges than their wildtype littermates. Consistently, the adenosine 
A2A receptor blockers SCH 58261 (1 - 10 mg kg(-l), i.p.) and KW 6002 (0.1 - 10 mg kg(-l), 
p.o.) reduced the total immobility time in the tail suspension test. 4. The efficacy of adenosine 
A2A receptor antagonists in reducing immobility time in the tail suspension test was 
confirmed and extended in two groups of mice. Specifically, SCH 58261 (1 - 10 mg kg(-l)) 
and ZM 241385 (15 - 60 mg kg(-l)) were effective in mice previously screened for having 
high immobility time, while SCH 58261 at 10 mg kg(-l) reduced immobility of mice that were 
selectively bred for their spontaneous 'helplessness' in this assay. 5. Additional experiments 
were carried out using the forced swim test. SCH 58261 at 10 mg kg(-l) reduced the 
immobility time by 61%, while KW 6002 decreased the total immobility time at the doses of 1 
and 10 mg kg(-l) by 75 and 79%, respectively. 6. Administration of the dopamine D2 receptor 
antagonist haloperidol (50 - 200 microg kg(-l) i.p.) prevented the antidepressant-Iike effects 
elicited by SCH 58261 (10 mg kg(-l) i.p.) in forced swim test whereas it left unaltered its 
stimulant motor effects. 7. In conclusion, these data support the hypothesis that A2A receptor 
antagonists prolong escape-directed behaviour in two screening tests for antidepressants. 
Altogether the results support the hypothesis that blockade of the adenosine A2A receptor 
might be an interesting target for the development of effective antidepressant agents. 

Blier P. Pharmacology of rapid-onset antidepressant treatment strategies. Clin 
Psychiatry 2001 ;62 Suppl 15:12-7 

Although selective serotonin reuptake inhibitors (SSRIs) block serotonin (5-HT) 
reuptake rapidly, their therapeutic action is delayed. The increase in synaptic 5-HT activates 
feedback mechanisms mediated by 5-HT1 A (cell body) and 5-HT1B (terminal) autoreceptors, 
which, respectively, reduce the firing in 5-HT neurons and decrease the amount of 5-HT 
released per action potential resulting in attenuated 5-HT neurotransmission. Long-term 
treatment desensitizes the inhibitory 5-HT1 autoreceptors, and 5-HT neurotransmission is 
enhanced. The time course of these events is similar to the delay of clinical action. The 
addition of pindolol, which blocks 5-HT1 A receptors, to SSRI treatment decouples the 
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feedback inhibition of 5-HT neuron firing and accelerates and enhances the antidepressant 
response. The neuronal circuitry of the 5-HT and norepinephrine (NE) systems and their 
connections to forebrain areas believed to be involved in depression has been dissected. The 
firing of 5-HT neurons in the raphe nuclei is driven, at least partly, by alpha 1 -adrenoceptor- 
mediated excitatory inputs from NE neurons. Inhibitory alpha2-adrenoceptors on the NE 
neuroterminals form part of a feedback control mechanism. Mirtazapine, an antagonist at 
alpha2-adrenoceptors, does not enhance 5-HT neurotransmission directly but disinhibits the 
NE activation of 5-HT neurons and thereby increases 5-HT neurotransmission by a 
mechanism that does not require a time-dependent desensitization of receptors. These 
neurobiological phenomena may underlie the apparently faster onset of action of mirtazapine 
compared with the SSRIs. 

Tranquillini ME, Reggiani A. Glycine-site antagonists and stroke. Expert Opin Investig 
Drugs 1999Nov;8(ll):1837-1848 

The excitatory amino acid, (S)-glutamic acid, plays an important role in controlling 
many neuronal processes. Its action is mediated by two main groups of receptors: the 
ionotropic receptors (which include NMDA, AM PA and kainic acid subtypes) and the 
metabotropic receptors (mGluR(l-8)) mediating G-protein coupled responses. This review 
focuses on the strychnine insensitive glycine binding site located on the NMDA receptor 
channel, and on the possible use of selective antagonists for the treatment of stroke. Stroke is a 
devastating disease caused by a sudden vascular accident. Neurochemically, a massive release 
of glutamate occurs in neuronal tissue; this overactivates the NMDA receptor, leading to 
increased intracellular calcium influx, which causes neuronal cell death through necrosis. 
NMDA receptor activation strongly depends upon the presence of glycine as a co-agonist. 
Therefore, the administration of a glycine antagonist can block overactivation of NMDA 
receptors, thus preserving neurones from damage. The glycine antagonists currently identified 
can be divided into five main categories depending on their chemical structure: indoles, 
tetrahydroquinolines, benzoazepines, quinoxalinediones and pyrida-zinoquinolines. 

Monopoli A, Lozza G, Forlani A, Mattavelli A, Ongini E. Blockade of adenosine A2A 
receptors by SCH 58261 results in neuroprotective effects in cerebral ischaemia in rats. 
Neuroreport 1998 Dec l;9(17):3955-9 

Blockade of adenosine receptors can reduce cerebral infarct size in the model of global 
ischaemia. Using the potent and selective A2A adenosine receptor antagonist, SCH 58261, we 
assessed whether A2A receptors are involved in the neuronal damage following focal cerebral 
ischaemia as induced by occluding the left middle cerebral artery. SCH 58261 (0.01 mg/kg 
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either i.p. or i.v.) administered to normotensive rats 10 min after ischaemia markedly reduced 
cortical infarct volume as measured 24 h later (30% vs controls, p < 0.05). Similar effects were 
observed when SCH 58261 (0.01 mg/kg, i.p.) was administered to hypertensive rats (28% 
infarct volume reduction vs controls, p < 0.05). Neuroprotective properties of SCH 58261 
5 administered after ischaemia indicate that blockade of A2A adenosine receptors is a 
potentially useful biological target for the reduction of brain injury. 

Panel 2D Summary: Ag2913 The NOV66 gene is a diagnostic marker for gastric 
thyroid and bladder cancer and a target for therapeutic intervention in gastric, thyroid and 
bladder cancer through the use of antibodies or small molecule drugs. This is based on the 
10 expression profile of this gene that shows higher expression in some gastric, thyroid and 
bladder cancer samples compared to normal tissues. 

Panel 4D Summary: Ag2913 The NOV66 gene, an olfactory receptor homolog is 
expressed at moderate levels in activated and resting T lymphocytes (CT range 30.13-32.98). 
Small molecules or therapeutic antibodies that antagonize the function of the NOV66 gene 
15 prodcut may reduce or eliminate the symptoms of autoimmune and inflammatory diseases, 

including Crohn's disease, ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary 
disease, asthma, emphysema, rheumatoid arthritis, lupus erythematosus, or psoriasis. 

Panel CNS_1 Summary: Ag2913 No significant expression detected. Potential 
probe/primer failure (data not shown). 

20 NOV67 

Expression of gene NOV67 was assessed using the primer-probe set Ag295 1, 
described in Table BJA. Results of the RTQ-PCR runs are shown in Tables BJB and BJC. 

Table BJA. Probe Name Ag2951 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -acctctcacttgtggacatctg-3 ' 


22 


243 


1270 


Probe 


TET-5 ' -tacacctccagcagggtccctcagat-3 • - " 
TAMRA 


26 


266 


1271 


Reverse 


5 1 -ggcaaaggagatggtctttct-3 • 


21 


314 


1272 



25 Table BJB. AI_comprehensive panelvl.O 



Tissue Name 


Rel. Exp.(%) 
Ag2951, Run 
248065290 


Tissue Name 


Rel. Exp.(%) 
Ag2951,Run 
248065290 


110967 COPD-F 


3.1 


112427 Match Control 
Psoriasis-F 


49.3 
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110980 COPD-F 


4.9 


1 12418 Psoriasis-M 


13.6 


110968 COPD-M 


2.4 


1 12723 Match Control 
Psoriasis-M 


0.0 


1 10977 COPD-M 


45.7 


112419 Psoriasis-M 


20.2 


110989 Emphysema- 
F 


25.9 


1 12424 Match Control 
Psoriasis-M 


9.5 


1 10992 Emphysema- 
F 


33.9 


1 12420 Psonasis-M 


93.3 


110993 Emphysema- 
F 


5.8 


112425 Match Control 
Psoriasis-M 


32.8 i 


110994 Emphysema- 
F 


0.0 


104689 (MF) OA 
Bone-Backus 


0.0 


1 10995 Emphysema- 
F 


55.5 


104690 (MF) Adj 
"Normal" Bone-Backus 


0.0 


1 10996 Emphysema- 
F 


7.0 


104691 (MF) OA 
Synovium-Backus 


0.0 


1 10997 Asthma-M 


12.8 


104692 (BA) OA 
Cartilage-Backus 


0.0 


lllOOl Asthma-F 


18.6 


104694 (BA) OA 
Bone-Backus 


0.0 


111002 Asthma-F 


27.7 


104695 (BA) Adj 
"Normal" Bone-Backus 


5.5 


111003 Atopic 
Asthma-F 


16.7 


104696 (BA) OA 
Synovium-Backus 


10.2 


111004 Atopic 
Asthma-F 


37.4 


104700 (SS) OA Bone- 
Backus 


7.1 


111005 Atopic 
Asthma-F 


31.6 


104701 (SS) Adj 
"Normal" Bone-Backus 


14.3 


111006 Atopic 
Asthma-F 


1.8 


104702 (SS) OA 
Synovium-Backus 


28.3 


111417 Aller&v-M 




117093 OA Cartilage 
Rep7 


UJ . o 


112347 Allergy-M 


2.5 


112672 OA Bone5 


26.6 


112349 Normal Lung- 
F 


0.0 


112673 OA Synovium5; 


21.6 


1 12357 Normal Lung- 
F 


12.8 


112674 OA Synovial 
Fluid cells5 


9.3 


112354 Normal Lung- 
M 


4.5 


117100 OA Cartilage 
Rep 14 


2.9 


112374 Crohns-F 


16.3 


112756 OA Bone9 


7.3 


11 2389 Match 
Control Crohns-F 


15.0 


112757 OA Synovium9 


3.3 


112375 Crohns-F 


7.5 


112758 OA Synovial 
Fluid Cells9 


12.9 


112732 Match 
Control Crohns-F 


41.5 


117125 RA Cartilage j 
Rep2 


0.0 
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112725 Crohns-M 


2.0 


113492 Bone2 RA 


8.5 


112387 Match 
Control Crohns-M 


5.3 


113493 Synovium2 RA 


1.9 


112378 Crohns-M 


4.0 


113494 Syn Fluid Cells 
RA 


7.6 


112390 Match 
Control Crohns-M 


61.6 


1 13499 Cartilace4 RA 


1 ^ 7 


1 12726 Crohns-M 


21.3 


113500 Bone4 RA 


8.3 


112731 Match 
Control Crohns-M 


14.6 


113501 Synovium4 RA 


20.4 


112380 Ulcer Col-F 


18.3 


113502 Syn Fluid 
Cells4 RA 


6.5 


112734 Match 
Control Ulcer Col-F 


100.0 


1 1 JtyJ V_.cH 11 lclt!,C 3 rVrV 


t 1 7 
11./ 


112384 Ulcer Col-F 


60.7 


113496 Bone3 RA 


4.4 


112737 Match 
Control Ulcer Col-F 


22.1 


113497 Synovium3 RA; 


3.0 


112386 Ulcer Col-F 


4.6 


113498 Syn Fluid 
Cells3 RA 


17.3 


112738 Match 
Control Ulcer Col-F 


3 7 


117106 Normal 
Cartilage Rep20 


11/1 
1 1 .4 


112381 Ulcer Col-M 


0.0 


113663 Bone3 Normal 


0.0 


1 12735 Match 
Control Ulcer Col-M 


15.4 


1 13664 Synovium3 
Normal 


0.0 


112382 Ulcer Col-M 


58.2 


113665 Syn Fluid 
Cells3 Normal 


1.3 


112394 Match 
Control Ulcer Col-M 


4.2 


117107 Normal 
Cartilage Rep22 


0.0 


112383 Ulcer Col-M 


63.3 


113667 Bone4 Normal 


39.5 


112736 Match 
Control Ulcer Col-M 


12.1 


1 1 3668 Synovium4 
Normal 


25.5 


112423 Psoriasis-F 


26.2 


113669 Syn Fluid 
Cells4 Normal 


26.1 



Table BJC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2951,Run 
164403342 


Tissue Name 


Rel. Exp.(%) 
Ag2951, Run 
164403342 


Secondary Thl act 


0.0 


HUVECIL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.8 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-1 1 


0.0 
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m -~- 


? ... o j "3 x o.i? 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Th 1 act 


0.7 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 . 


Primary Thl rest 


1.3 


Bronchial epithelium 
TNFalpha + ILlbeta 


1.0 


Primary Th2 rest 


7.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


17.2 


Small airway epithelium 
TNFalpha + IL-lbeta 


5.2 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


1.5 


Coronery artery SMC 
TNFalpha + IL-lbeta 


u.u 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


1.0 


CD4 lymphocyte none 


13.3 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ryThl/Th2/Trl anti- 
CD95 CHI 1 


0.9 


CCD1 106 (Keratinocytes) 
none 


1.2 


LAK cells rest 


1 1.8 


CCD 1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


u.u 


LAK cells 1L-2 


0.0 


Liver cirrhosis 


7.7 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


3.2 


JNCl-rizyz none 




LAK cells IL-2+ IL-18 


4.6 


NCI-H292 IL-4 


L9 


LAK cells 
PMA/ionomycin 


0.0 




n n 
u,u 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


1.6 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


u.u 


PBMC rest 


7.6 


Lung fibroblast none 


0.0 


rt>lvlv_, r WJV1 


j 1UU.U 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


EZ ?■» „ 


Lung fibroblast IL-4 


0.0 
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Ramos (B cell) none 


™» ■'»■■>--' — ■ — ^>-^>»> i »>-^^^....>».>.>^^ — ^ 

0.0 


Lung fibroblast IL-9 


1.7 


Ramos (B cell) 
ionomycin 


0 0 


I una flhroHlaQt TI -1 3 


0.0 


o lympnocyies r w lvi 


1 0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


5.2 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


19.9 


Dermal fibroblast 
CCD 1 070 TNF alpha 


1.4 


EOL-1 dbcAMP 
PMA/ionomycin 


7.8 


Dermal fibroblast 
Lv/Uiu/u iL-i ueia * 


0.0 


Dendritic cells none 


9.2 


uermai iiDroDiasi iriN 
gamma 


0.0 


Dendritic cells LPS 


2.9 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


3.0 


IBD Colitis 2 


1.8 


Monocytes rest 


26.8 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


2.5 


Macrophages rest 


2.4 


Lung 


1.2 


Macrophages LPS 


0.0 


Thymus 


0.8 


HUVEC none 


0.0 


Kidney J 8.7 


HUVEC starved 


0.0 





AI_comprehensive panel_vl.O Summary: Ag2951 Highest expression of the NOV67 
gene is seen in normal tissue adjacent to colon from an ulcerative colitis patient (CT=33). 
Thus, expression of this gene could be used to distinguish this sample from other samples on 
5 this panel. Please see Panel 4D for further discussion of utility of this gene in inflammation. 
CNS_neurodegeneration_vl.O Summary: Ag2951 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel UD Summary: Ag2951 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 
10 Panel 4.1D Summary: Ag295 1 Expression of this gene is low/undetectable (CTs > 

35) across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag2951 The NOV67 gene is expressed at a moderate level 
(CT=32.78) in pokeweed mitogen-stimulated peripheral blood leukocytes, consisting primarily 
of activated B lymphocytes. Small molecule antagonists or therapeutic antibody antagonists 
15 that block the function of the CG56571-gene product may be useful in several autoimmune 
and inflammatory diseases in which activated B cells can play major roles as sources of 
autoantibody-producing cells and as powerful antigen-presenting cells, including, but not 
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limited to, Crohn's disease, ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary 
disease, asthma, emphysema, rheumatoid arthritis, lupus erythematosus, or psoriasis. 

NOV69a and NOV69b 

Expression of gene NOV69a and variant NOV69b was assessed using the primer-probe 
5 sets Ag2460 and Ag349, described in Tables BKA and BKB. Results of the RTQ-PCR runs 
are shown in Tables BKC, BKD, BKE and BKF. 

Table BKA. Probe Name Ag2460 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -tcatagcagtcccgaggaa-3 ' 


19 


89 


1273 


Probe 


TET- 5 1 -tcactattgccttaatctcatgccga-3 1 - 
TAMRA 


26 


125 


1274 


Reverse 


5 ' -ttctcaagggtctccacatg-3 ' 


20 


151 


1275 



Table BKB. Probe Name Ag349 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward! 


5 1 -gggaaagccacagactcgaa-3 1 


20 


289 


1276 


Probe 


TET- 5 ' - cttctaccacagccagagtggcaggaact- 
3 1 -TAMRA 


29 


255 


1277 


Reverse 


5 ' -acccgagcctgtgaagtcct-3 ' 


20 


231 


1278 



10 

Table BKC. Panel 1 



Tissue Name 


Rel. Exp.(%) Ag349, 
Run 97804233 


Tissue Name 


Rel. Exp.(%) Ag349, 
Run 97804233 


Endothelial cells 


0.0 


Renal ca. 786-0 


0.0 


Endothelial cells 
(treated) 


0.0 


Renal ca. A498 


0.2 


Pancreas 


0.0 


Renal ca. RXF 393 


0.0 


Pancreatic ca. CAP AN 
2 


0.0 


Renal ca. ACHN 


0.0 


Adrenal gland 


0.0 


Renal ca. UO-31 


0.0 


Thyroid 


0.5 


Renal ca. TK-10 


0.0 


Salivary gland 


25.5 


Liver 


0.0 


Pituitary gland 


0.0 


Liver (fetal) 


0.0 


Brain (fetal) 


0.0 


Liver ca. (hepatoblast) 
HepG2 


0.0 


Brain (whole) 


0.0 


Lung 


0.0 


Brain (amygdala) 


0.0 


Lung (fetal) 


0.0 


. Brain (cerebellum) 


0.0 


Lung ca. (small cell) 


0.0 
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LX-1 




Rr?iin f h i n r» f* a tti r> i i q ^ 

J_>1U1II ^lilJjpU^CUIljJLlD^ 


0.0 


Lung ca. (small cell) ; 
NCI-H69 


1.5 


JDIdlll ^bUUbldlllld Ill^Id^ 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


R m i n "f"Vi die* nri i i c \ 
JJldlll ^llldldlllUo^ 


0.0 


Lung ca. (large 
ce!l)NCI-H460 


1.3 


Did in ^iiypuiiididiiiub^ 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 


opindi coru 


u.u 


Lung ca. (non-s.cell) 
NCI-H23 


0 0 


glio/astro U87-MG 




Lung ca. (non-s.cell) ! 
HOP-62 


0.0 


glio/astro U-118-MG 


U.U 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SW1783 


U.U 


Lung ca. (squam.) SW 
900 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


astrocytoma SF-539 


0.0 


Mammary gland 


0,0 


astrocytoma SNB-75 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SNB-19 


3.1 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


glioma U251 


0.0 


Breast ca.* (pi. ef) 
T47D 


0.0 


glioma SF-295 


0.0 


Breast ca. BT-549 


0.0 


Heart 


0.0 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


u.u 


Ovarian ca. OVCAR- 

3 


0.0 


Thymus 


66.4 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.1 


Ovarian ca. OVCAR- 
5 


1.5 


Lymph node 


3.1 


Ovarian ca. OVLAK- 
8 


0.0 


Colon (ascending) 


29.9 


Ovarian ca. IGROV-1 


0.0 


Stomach 


77.4 


Ovarian ca. (ascites) 
SK-OV-3 


0.0 


Small intestine 


0.0 


Uterus 


0.0 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* SW620 
(SW480 met) 


0.0 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 


0.0 
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15. II 







met) PC-3 




Colon ca. HCT-116 


0.0 


Testis 


0.0 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. HCT-15 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 ; 


0.0 


Gastric ca. * (liver met) 
NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


15.5 


Melanoma* (met) SK-; 
MEL-5 


0.0 


Kidney 


0.0 


Melanoma SK-MEL- : 
28 


100.0 


Kidney (fetal) 


0.0 







Table BKD. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag2460, 
Run 157914666 


Tissue Name 


Rel. Exp.(%) Ag2460, 
Run 157914666 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


0.0 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAP AN 
2 


0.0 


Renal ca. A498 


9.6 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


5.6 


Renal ca. ACHN 


0.0 


Salivary gland 


19.8 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.0 


Liver (fetal) 


0.0 


Brain (amygdala) 


0.0 


Liver ca. 
(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.0 


Lung 


2.2 


Brain (hippocampus) 


0.0 


Lung (fetal) 


0.0 


Brain (substantia nigra) 


0.0 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NC1-H69 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


45.7 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG ! 


2.4 


Lung ca. (non-sm. 
cell) A549 


0.0 
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glio/astro U-l 18-MG 


2.5 


Lung ca. (non-s.cell) 
NCI-H23 


0 0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.0 


Mammary gland 


100.0 


glioma U25 1 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


Heart (fetal) 


0.0 . 


Breast ca.* (pl.ef) 
T47D 


0.0 


Heart 


0.0 


Breast ca. BT-549 


0.0 


Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 

3 


0 0 


Thymus 


61.1 


Ovarian ca. OVCAR- 
4 


0 0 


Spleen 


0.0 


Ovarian ca. OVCAR- 
5 


0.0 


Lymph node 


8.4 


Ovarian ca. OVCAR- 
8 


0.0 


Colorectal 

— — -~— 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


77.4 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


A A 
0.0 


Uterus 


0.0 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
oWozU(j>W4o(J met) 


0.0 


Prostate 


2.3 


Colon ca. HT29 


0.0 


r rosidie ca. ^oone 
met)PC-3 


0.0 


Colon ca.HCT-1 16 


0.0 


Testis 


0.0 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 


0.0 


Melanoma M14 


0.0 
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NCI-N87 








Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


59.5 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.0 


Adipose 


0.0 



Table BKE. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2460, Run 
157914720 


Tissue Name 


Rel. Exp.(%) 
Ag2460, Run 
157914720 


Normal Colon 


0.0 


Kidney Margin 
8120608 


0.0 


CC Well to Mod Diff 
(OD03866) 


0.0 


Kidney Cancer 
8120613 


0.0 


Vsv- iviargin ^ULUjJoooj 


A A 


Kidney Margin 
8120614 


0.0 


CC Gr.2 rectosigmoid 
(OD03868) 


A A 

0.0 


Kidney Cancer 
9010320 


0.0 


CC \yfcirrrir» /Y iFlO^ 

ividrgin ^uuujoOoj 


A A 
U.U 


Kidney Margin 
9010321 


0.0 


CC Mod Diff (ODO3920) 


0.0 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 06401 1 


0.0 


CC Gr.2 ascend colon 
(OD03921) 


o.o 


Normal Thyroid 


0.9 


CC Margin (OD03921) 

.._ 


0.0 


Thyroid Cancer 
064010 


0.0 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


0.0 


Thyroid Cancer 
A302152 


0.0 


Liver Margin (ODO4309) 


0.0 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


0.0 


Lung Margin (OD0445 1 - 
02) 


0.0 


Breast Cancer 
(OD04566) 


0.0 


Normal Prostate 6546-1 


0.0 


Breast Cancer 
(OD045 90-01) 


0.0 


Prostate Cancer 
(OD04410) 


0.0 


Breast Cancer Mets 
(OD045 90-03) 


0.2 


Prostate Margin 
(OD04410) 


0.0 


Breast Cancer 

Metastasis 
(OD04655-05) 


0.6 


Prostate Cancer 
(OD04720-01) 


0.0 


Breast Cancer 064006 


0.0 
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Prostate Margin 
(OD04720-02) 


0.0 


Breast Cancer 1024 


0.0 


Normal Lung 061010 


0.4 


Breast Cancer 
s yiuuzoo 


0.0 


Luns? Met to IVfu^rle 
(OD04286) 


0.0 


oreast iviargin 
9100265 


0.0 


Muscle Margin 
(OD04286) 


0.0 


Breast Cancer 
A209073 


0.0 


Lung Malignant Cancer 
(OD03126) 


0.0 


Breast Margin 
A2090734 


0.0 


Lung Margin (OD03126) 


0.0 


Normal Liver 


0 0 


Lung Cancer (OD04404) 


100.0 


Liver Cancer 064003 


0.0 


T unp Ma rain fOr)044.(kn 

L/Ltn^ ivicll gill ^vy JLyv^r^UT^ J 


u.u 


Liver Cancer 1025 


0.1 


T una Cancer iTVnOdS^S^ 




Liver cancer lOzo 


0.0 


Lung Margin (OD04565) 


0.0 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237- 
01) 


0.0 


Liver Tissue 6004-N 


0.0 


Lung Margin (OD04237- 
02) 


0.0 


Liver Cancer 6005-T 


0.0 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Margin (ODO4310) 


0.0 


Normal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


0.0 


Bladder Cancer 
A302173 


0.0 


Normal Kidney 


0.0 


Bladder Cancer 
(OD047 18-01) 


0.0 


Kidney Ca, Nuclear grade i 
2 (OD04338) 


0.0 


Bladder Normal 
Adjacent (OD04718- 
03) 


0.0 


Kidney Margin 
(UL)U4JJo) 


0.0 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 
1/2 (OL/043J9) 


0.0 


Ovarian Cancer 
064008 


0.0 


Kidney Margin 
(0004339^ 


0.0 


Ovarian Cancer 

{KJUKJH /Oo-U / ) 


0.0 


Kidney Ca, Clear cell 
type (OD04340) 


0.0 


Ovary Margin 
(OD04768-08) 


0.0 


Kidney Margin 
(OD04340) 


0.0 


Normal Stomach 


0.0 


Kidney Ca, Nuclear grade 
3 (OD04348) 


0.0 


Gastric Cancer 
9060358 


0.0 


Kidney Margin 
(OD04348) 


0.0 


~~ — ~ 

Stomach Margin 
9060359 


0.0 
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Kidney Cancer 
(0004677-01^ 

\\~/ yj i } 


q q 1 Gastric Cancer 
1 9060395 


0.0 


Kidney Margin 
(OD04622-03) 


0.0 


Stomach Margin 
9060394 


0.0 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


0.0 


Kidney Margin 
(OD04450-03) 


0.0 


Stomach Margin 
9060396 


0.0 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


0.6 



Table BKF. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2460, Run 
157914794 


Tissue Name 


Rel. Exp.(%) 
Ag2460, Run 
157914794 


oeconaary i n i act 


0.4 


T XT T 1 T I~" IT t 1 j 

HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.5 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


2.3 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.5 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


3.7 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.6 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


1.1 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


1.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


15.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


9.2 


Small airway epithelium 
none 


2.1 


Primary Trl rest 


4.3 


Small airway epithelium 
TNFalpha + IL-lbeta 


100.0 


CD45RA CD4 
lymphocyte act 


0.9 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


1.0 


Coronery artery SMC 
TNFalpha I IL-lbeta i 


0.0 


CD8 lymphocyte act 


1.1 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.5 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.5 


KU-8 12 (Basophil) rest 


0.0 
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CD4 lymphocyte none 


1.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 


2rv Thl /Th?/TY1 anti- 

Z-ljr 1111/1 11Z./ Ill dllll" 

CD95CHlf 


2.6 


t^L,ui iuo (Reratinocytes) 
none 


0.0 


LAK cells rest 


0.7 


CL-JJi iuo (Reratinocytes) 
TNFalDha + IT -1 heta 


0.5 


LAK cells IL-2 


0.0 


Liver cirrho<xi<; 


0 0 

VJ.VJ 


LAK cells IL-2+IL-12 


0.0 


T 11mm ViHnRv 
\-j w yj li r> iviuiiC/V 


u.u 


LAK cells IL-2+IFN 
gamma 


0.5 


NCI-H292 none 


0.3 


LAK cells IL-2+ IL-1 8 


0.0 


NCI-H29? II -4 


u.o 


LAK cells 
PMA/ionomy c in 


0.0 


NCI-H292 IL-9 


1.1 


NK Cells IL-2 rest 


0.0 


NCT-H292 II -1 T, 


n n 
u.u 


Two Way MLR 3 day 


[ 0.0 


NCI-H292 TFN fxflmma 

lli.7L 11 IX J^CHllllld. 


u.u 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.8 


HPALC iNF alpha + IL-1 
beta 


0.0 


i^oivi^ rest 


1. 1 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


2.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


1.8 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


1.6 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
iJNr alpha 


0.8 


FOI -1 dhrAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD^ 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


1.1 


Colon 


0.0 


Macrophages rest 


0.6 


Lung 


0.0 


Macrophages LPS 


0.0 | 


Thymus 


0.0 


HUVEC none 


0.0 j 


Kidney 


9.9 


HUVEC starved | 


0.0 ] 
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CNS_neurodegeneration_vl.O Summary: Ag2460 Expression of the NOV69a gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown). 

Panel 1 Summary: Ag349 Highest expression of the NOV69a gene is seen in a 
melanoma cell line (CT=28.7). Thus, expression of this gene could be used to differentiate 
between this sample and other samples on this panel. There is also significant expression in 
thymus. Please see Panel 4D for discussion of utility of this gene in autoimmunity. 

Panel 1.3D Summary: Ag2460 Expression of the NOV69a gene is limited to a few 
samples that are all derived from normal tissue. Significant levels of expression are seen in 
mammary gland, trachea, stomach, thymus, and spinal cord. Thus, expression of this gene can 
be used to differentiate between these samples and other samples on this panel. 

Panel 2D Summary: Ag2460 Expression of the NOV69a gene is limited to a few 
samples, with highest expression in a lung cancer (CT=27.5). Thus, expression of this gene 
could be used to differentiate between this sample and other samples on this panel and as a 
marker to detect the presence of lung cancer. Furthermore, therapeutic modulation of the 
expression or function of this gene may be effective in the treatment of lung cancer. 

Panel 4D Summary: Ag2460 The NOV69a gene encodes a homolog of the IL-1 
epsilon. Interleukin 1 (IL-1) is a member of a large family of cytokines, which modulates 
immune and inflammatory responses. IL-1 molecules such as IL-1 alpha, -beta, -delta, - 
gamma, and IL1 -receptor agonist (IL-lra) are typically secreted by macrophages, 
mononuclear cells, epithelial and endothelial cells. IL-1 molecules are first produced as 
precursors of about 30 kDa and do not contain a signal sequence. The IL-1 precursors are then 
proteolytically cleaved into their secreted active forms (-17 kDa). Their immuno-modulatory 
functions are mediated by two IL-1 receptors, which are members of the immunoglobulin 
superfamily. The biological functions of IL-1 include: activation of vascular endothelial cells 
to secrete IL-6, increase leukocyte adhesion and activate mononuclear phagocytes that activate 
inflammatory leukocytes; tissue destruction, and fever. Given the biological potency of the IL- 
1 family of proteins, a need exists to identify new members of this family as well as 
understand the biological function of its members. The high levels of expression of this gene 
in small airway epithelium activated by treatment with TNF-alpha + IL-1 beta(CT=28.9) 
indicate that CG561 36-01 may play a substantial role in mediating inflammation in the lung. 
Thus, therapeutic targeting of CG56 136-01 with a monoclonal antibody is anticipated to limit 
or block the extent of inflammation potential and thus the symptoms, caused by pro- 
inflammatory cytokines such as IL-1 epsilon, when these cytokines are induced in allergic, 
asthma and COPD patients. 
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References: 

Smith,D.E., Renshaw,B-R., Ketchem,R.R., Kubin,M, Garka,K.E. and Sims,J.E.Four 
new members expand the interleukin-1 superfamily J. Bioi. Chem. 275 (2), 1 169-1 175 (2000) 
Abstract: We report here the cloning and characterization of four new members of the 
5 interleukin-1 (IL-1) family (FILldelta, FILlepsilon, FILlzeta, and FILleta, with FIL1 

standing for "Family of IL-1 "). The novel genes demonstrate significant sequence similarity to 
IL-lalpha, IL-lbeta, IL-lra, and IL-18, and in addition maintain a conserved exon-intron 
arrangement that is shared with the previously known members of the family. Protein structure 
modeling also suggests that the FIL1 genes are related to IL-lbeta and IL-lra. The novel genes 
0 form a cluster with the IL-ls on the long arm of human chromosome 2. 

NOV71 

Expression of gene NOV71 was assessed using the primer-probe set Ag3049, 
described in Table BLA. Results of the RTQ-PCR runs are shown in Tables BLB, BLC, BLD 
and BLE. 



Table BLA. Probe Name Ag3049 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gctggcgatatttaatgaattg-3 » 


22 


696 


1279 


Probe 


TET-5 • -catgcagacgtggatctttacgcact-3 ' - 
TAMRA 


26 


718 


1280 


Reverse 


5 ' -agtacaatggcaacagcatcat-3 ' 


22 


767 


1281 



Table BLB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3049, 
Run 209823735 


Tissue Name 


Rel. Exp.(%) Ag3049, 
Run 209823735 


AD 1 Hippo 


10.2 


Control (Path) 3 
Temporal Ctx 


5.9 


AD 2 Hippo 


33.7 


Control (Path) 4 
Temporal Ctx 


28.1 


AD 3 Hippo 


9.3 


AD 1 Occipital Ctx 


14.2 


AD 4 Hippo 


7.7 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


99.3 


AD 3 Occipital Ctx; 


5.8 


AD 6 Hippo 


46.3 


AD 4 Occipital Ctx 


16.0 


Control 2 Hippo 


44.1 


AD 5 Occipital Ctx 


17.1 


Control 4 Hippo 


6.0 


AD 6 Occipital Ctx 


63.7 


Control (Path) 3 
Hippo j 


7.6 


Control 1 Occipital 
Ctx 


2.2 
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AD 1 Temporal Ctx 


13.0 


Control *J Orrinital 

Ctx 


68.3 


AD 2 Temporal Ctx 


33.4 


Control 3 Orrinital 

Ctx 


11.5 


AD 3 Temporal Ctx 


6.8 


Control 4 Orrinital 
Ctx 


5.5 


AD 4 Temporal Ctx 


16.5 


Control fPflthl 1 
Occipital Ctx 


77.9 


AD 5 Inf Temnoral 

1 1J-/ -*s 1111 -1 VI11L/VI til 

Ctx 


100.0 


Control fPatM ? 
w 1 1 ti kj i y i civil ) i-i 

Occipital Ctx 


8.6 


AD 5 SunTemnoral 
Ctx 


48.0 


Control 1 
in ui ^ir etui J J 

Occipital Ctx 


2.1 


AD 6 TnTTernnoral 

1 VJ_/ V* 1111 X vlllL/v/l ui 

Ctx 


37.9 


Control (Vi\\\\\ A 
cui in \ji yi diii j *+ 

Occipital Ctx 


12.9 


AD ft Sinn Tfimnor^l 
Ctx 


41.5 


V_,*JI1UU1 1 I culCltU 

Ctx 


4.1 


Control 1 Tpmnnral i 
Ctx 


6.3 


f^rmtrnl 9 Parietal 

Ctx 


29.9 


Control 2 Temporal 
Ctx 


62.4 


Control 3 Parietal 
Ctx 


18.0 


Control 3 Temporal 
Ctx 


14.2 


Control (Path) 1 
Parietal Ctx 


97.9 


Control 4 Temporal 
Ctx 


7.7 


Control (Path) 2 
Parietal Ctx 


19.3 


Control (Path) 1 j 
Temporal Ctx 


64.6 


Control (Path) 3 
Parietal Ctx 


6.3 


Control (Path) 2 
Temporal Ctx 


29.9 


Control (Path) 4 
Parietal Ctx 


42.6 



Table BLC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3049, 
Run 167972763 


Tissue Name 


Rel. Exp.(%) Ag3049, 
Run 167972763 


Liver adenocarcinoma 


37.6 


Kidney (fetal) 


7.5 


Pancreas 


1.8 


Renal ca. 786-0 


4.7 


Pancreatic ca. CAPAN 
2 


8.0 


Renal ca. A498 


8.7 


Adrenal gland 


2.1 i 


Renal ca. RXF 393 


28.3 


Thyroid 


3.0 


Renal ca. ACHN 


6.3 


Salivary gland 


1.5 


Renal ca. UO-31 


36.9 


Pituitary gland 


8.5 


Renal ca. TK-10 


9.5 


Brain (fetal) 


10.4, 


Liver 


0.2 


Brain (whole) 


9.0 


Liver (fetal) 


1.7 


Brain (amygdala) 


18.7 


Liver ca. 
(hepatoblast) HepG2 


6.3 
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Brain (cerebellum) 


32.5 


Lung 


0.4 


Brain (hippocampus) 


14.0 


Lung (fetal) 


2.0 


Brain (substantia nigra) 


6.0 


Lung ca. (small cell) 
LX-1 


5.4 


Brain (thalamus) 


2.9 


Lung ca. (small cell) 
NC1-H69 


3.6 


Cerebral Cortex 


30.1 


Lung ca. (s.cell var.) 
SHP-77 


15.9 


Spinal cord 


3.4 


Lung ca. (large 
cell)NCI-H460 


1.7 


glio/astro U87-MG 


27.4 


Lung ca. (non-sm. 
cell) A549 


28.9 


glio/astro U-118-MG 


16.0 


Lung ca. (non-s.cell) 
NCI-H23 


19.6 


astrocytoma SW1783 


40.3 


Lung ca. (non-s.cell) 
HOP-62 


18.3 


neuro*; met SK-N-AS 


0.6 


Lung ca. (non-s.cl) 
NCI-H522 


23.5 


astrocytoma SF-539 


2.9 


Lung ca. (squam.) 
SW 900 


15.4 


astrocytoma SNB-75 


26.8 


Lung ca. (squam.) 
NC1-H596 


4.0 


L^IUJllld ljJMD" 1 7 




Mammary gland 


i.i 


glioma U251 


100.0 


Breast ca.* (pl.ef) 
MCF-7 


37.4 


glioma SF-295 


39.2 


Breast ca.* (pl.ef) 
MDA-MB-23 1 


25.7 


Heart (fetal) 


1.3 


Breast ca.* (pl.ef) 
T47D 


55.5 


Heart 


1 9 

1 .z. 


oreast ca. r> l 04y 




Skeletal muscle (fetal) 


2.0 


Breast ca. MDA-N 


2.0 


Skeletal muscle 


9.3 


Ovary 


2.7 


Bone marrow 


0.3 


Ovarian ca. OVCAR- 
3 


2.6 


Thymus 


3.0 


Ovarian ca. OVCAR- 
4 


17.4 


Spleen 


2.2 


Ovarian ca. OVCAR- 
5 


50.7 


Lymph node 


5.0 


Ovarian ca. OVCAR- 
8 


1.1 


Colorectal 


1.9 


Ovarian ca. IGROV- 
1 


5.2 


Stomach 


3.3 


Ovarian ca.* (ascites) 
SK-OV-3 


24.7 


Small intestine 


0.2 


Uterus 0.1 
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Colon ca. SW480 


7.9 


Placenta 


0.2 


Colon ca.* 
o w ozu^o w^ou meij 


30.4 


Prostate 


i 

3.7 


Colon ca. HT29 


8.7 


Prostate ca.* (bone 
met)PC-3 


91.4 


Colon ca. HCT-116 


12.4 


Testis 


0.5 


Colon ca. CaCo-2 


18.7 


Melanoma 
Hs688(A).T 


6.0 


Colon ca. 
tissue(DD03 866"> 


3.7 


Melanoma* (met) 

nSOOO^D ). 1 


10.1 


Colon ca. HCC-2998 


8.9 


Melanoma UACC-62 


11.0 


Gastric ca.* (liver met) 
NCI-N87 


8.7 


Melanoma M14 


0.4 


Bladder 


10.7 


Melanoma LOX 
IMVI 


17.8 


Trachea 


1.0 


Melanoma* (met) 
SK-MEL-5 


1.7 


Kidney 


2.2 


Adipose 


3.9 



Table BLD. Panel 2.2 



Tissue Name 


Rel. Exp.(%) 
Ag3049, Run 
174441445 


Tissue Name 


Rel. Exp.(%) 
Ag3049, Run 
174441445 


Normal Colon 


11.7 


Kidney Margin 
(OD04348) 


64.6 


Colon cancer 
(OD06064) 


57.4 


Kidney malignant 
cancer (OD06204B) 


10.5 


Colon Margin 
(OD06064) 


3.4 


Kidney normal adjacent 
tissue (OD06204E) 


10.1 


Colon cancer 
(OD06159) 


12.1 


Kidney Cancer 
(OD04450-01) 


24.1 


Colon Margin 
(OD06159) 


1.8 


Kidney Margin 
(OD04450-03) 


21.6 


Colon cancer 
(OD06297-04) 


7.1 


Kidney Cancer 8120613 


9.9 


Colon Margin 
(OD06297-015) 


14.7 


Kidney Margin 
8120614 


13.6 


CC Gr.2 ascend colon 
(ODQ3921) 


5.1 


Kidney Cancer 9010320 


10.3 


CC Margin (OD03921) 


5.4 


Kidney Margin 
9010321 


11.3 


Colon cancer metastasis ; 
(OD06104) 


1.8 


Kidney Cancer 8120607 


30.6 


Lung Margin 
(OD06104) 


14.8 


Kidney Margin 
8120608 


11.2 
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1 rsfj'-prs i p 



Colon mets to lung 
(OD04451-01) 


25.5 


Normal Uterus 


2.0 


Lung Margin 
(OD04451-02) 


5L4 


Uterine Cancer 06401 1 


6.8 


Normal Prostate 


28.9 


Normal Thyroid 


6.3 


Prostate Cancer 
(OD04410) 


13.4 


Thvroid Cancer 06401 0 


14 2 


Prostate Margin 
(OD04410) 


20.3 


Thyroid Cancer 
A302152 


51.4 


Normal Ovary 


7.9 


Thyroid Margin 
A302153 


8.5 


Ovarian cancer 
(OD06283-03) 


56.3 


Normal Breast 


53.6 


Ovarian Margin 
(OD06283-07) 


1 1 fx 


Breast Cancer 
(OD04566) 


f Q 


Ovarian Cancer 064008 


24.1 


Breast Cancer 1024 


100.0 


Ovarian cancer 
(OD06145) 




Breast Cancer 
(OD04590-01) 




Ovarian Margin 
(OD06145) 


14 0 


Breast Cancer Mets 
(OD04590-03) 




Ovarian f^n^pr 

(OD06455-03) 


10.4 


Breast Cancer 
Metastasis (OD04655- 
05) 


65.1 


Ovarian Margin 
(OD06455-07) 


1.0 


Breast Cancer 064006 


26.2 


Normal Lung 


11.8 


Breast Cancer 9100266 


24.5 


Invasive poor diff. lung 
adeno(ODO4945-01 


10.7 


Rrpa^t Marcrin 
ijiuaji i vi cii ^iii i 




Lung Margin 
(ODO4945-03) 


6.2 


Breast Cancer A?0907^ 




Lung Malignant Cancer 
(OD03126) 


5.9 


Breast Margin 
A2090734 


58.6 


Lung Margin 
(OD03126) 


3.3 


Breast cancer 
(OD06083) 


79.0 


Lung Cancer 
(OD05014A) 


42.3 


Breast cancer node 
metastasis (OD06083) 


40.1 


Lung Margin 


13.0 


Normal Liver 


31.0 


Lung cancer (OD06081) 


13.1 


Liver Cancer 1026 


12.5 


Lung Margin 
(OD06081) 


8.1 


Liver Cancer 1025 


25.5 


Lung Cancer 
(OD04237-01) 


35.8 


Liver Cancer 6004-T 


14.8 


Lung Margin 
(OD04237-02) 


14.7 


Liver Tissue 6004-N 


4.7 


Ocular Melanoma 


0.0 


Liver Cancer 6005-T 


25.9 
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Metastasis 




r 




Ocular Melanoma 
Margin (Liver) 


8.5 


Liver Tissue 6005-N 


70.7 


Melanoma Metastasis 


4.3 


Liver Cancer 064003 


14.5 


Melanoma Margin 
(Lung) 


6.5 


Normal Bladder 


32.3 


Normal Kidney 


16.5 


Bladder Cancer 1023 


6.6 


Kidney Ca, Nuclear 
grade2(OD04338) 


41.2 


Bladder Cancer 
A302173 


51 1 

J I • 1 


Kidney Margin 1 
(OD04338) 


10.8 


Normal StomarH 


72 7 


Kidney Ca Nuclear 
grade 1/2 (OD04339) 


45.7 


Gastric Cancer 9060397 


4.7 


Kidney Margin 
(OD04339) 


8.2 


Stomach Margin 
9060396 


40.6 


Kidney Ca, Clear cell > 
type (OD04340) 


18.4 


Gastric Cancer 9060395 


10.4 


Kidney Margin 
(OD04340) 


7.4 


Stomach Margin 
9060394 


57.8 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


17.9 


Gastric Cancer 064005 


24.7 



Table BLE. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3049, Run 
164334396 


Tissue Name 


Rel. Exp.(%) 
Ag3049, Run 
164334396 


Secondary Thl act 


9.4 


HUVEC IL-lbeta 


1.0 


Secondary Th2 act 


7.8 


HUVEC I FN gamma 


3.1 


Secondary Trl act 


8.1 


HUVEC TNF alpha + IFN 
gamma 


2.1 


Secondary Th 1 rest 


0.3 


HUVEC TNF alpha + 1L4 


1.0 


Secondary Th2 rest 


2.0 


HUVEC IL-11 


0.7 


Secondary Trl rest 


2.1 


Lung Microvascular EC 
none 


0.3 


Primary Thl act 


13.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.2 


Primary Th2 act 


13.5 


Microvascular Dermal EC 
none 


0.5 


Primary Trl act 


16.2 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.1 


Primary Thl rest 


3.8 


Bronchial epithelium 
TNFalpha + ILlbeta 


4.5 


Primary Th2 rest 


3.7 


Small airway epithelium 
none 


2.6 
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Primary Trl rest 


1.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


8.5 


CD45RA CD4 
lymphocyte act 


6.6 


Coronery artery SMC rest 


11.8 


CD45RO CD4 
lymphocyte act 


7.8 


Coronery artery SMC 
TNFalpha + IL-1 beta 


6.8 


CD8 lymphocyte act 


5.6 


Astrocytes rest 


3.8 


Secondary CD8 
lymphocyte rest 


4 0 


Astrocytes TNFalpha + 
IL-lbeta 




Secondary CD8 
lymphocyte act 


9.2 


KU-812 (Basophil) rest 


1.5 


CD4 lymphocyte none 


0.6 


KU-812 (Basophil) 
PMA/ionomycin 


10.4 


2ry Thl/Th2/Trl anti- 
CD95CH11 


3.7 


CCD 1 106 (Keratinocytes) 
none 


3.7 


LAK cells rest 


9.3 


CCD1 106 (Keratinocytes) 

T'NTP^lrkTi^* -4- TT 1 K<=»f o 

iiNraipna • li^-iDeta 


4.5 






iviver cirrnosis 


V.J 






L-upus K.iuney 




LAK cells IL-2+IFN 

^ctllilllcl 


9.6 


NCI-H292 none 


9.5 


T Ak' rpllc TT 9-1- IT 18 




Txir^T I 1909 TT A 
INLvl-l 1ZVZ 


1 ^ 1 

1 D. 1 


LAK cells 

PTV/f A /innrtrnvf in 
it ivi/A./ luuui i \y\^\\\ 


4.7 


NC1-H292 IL-9 


12.5 


NK Oik TT -? rest 




\fpj T-T9Q9 TT 1 ^ 


7 7 


Two Wav MI R 1 Hav 


^ 6 


TMPT-T490? IFtsJ ( yom inn 


Q O 


Two Way MLR 5 day 


3.7 


HPAEC none 


2.1 


Two Way MLR 7 day 


2.3 


HPAEC TNF alpha + IL-1 
beta 


1.1 


PBMC rest 


1.1 


Lung fibroblast none 


2.5 


PBMC PWM 


17.6 


Lung fibroblast TNF alpha 

i TT 1 1 

+ 1L- 1 beta 


1.2 


PBMC PHA-L 


10.8 


Lung fibroblast IL-4 


5.0 - 


Ramos (B cell) none 


23.2 


Lung fibroblast IL-9 


7.5 


Ramos (B cell) 
ionomycin 


100.0 


Luni? fibroblast IL-1 3 


3.9 


B lymphocytes PWM 


40.9 


Lung fibroblast IFN 
gamma 


14.6 


B lymphocytes CD40L 
and IL-4 


25.2 


Dermal fibroblast 
CCD 1070 rest 


9.4 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


14.8 


EOL-1 dbcAMP 
PMA/ionomycin 


0.4 


Dermal fibroblast 
CCD 1070 IL-1 beta 


6.0 


Dendritic cells none 


7.7 


Dermal fibroblast IFN 


10.2 
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I . i 


gamma 




Dendritic cells LPS 


10.8 


Dermal fibroblast IL-4 


9.2 


Dendritic cells anti- 
CD40 


13.1 


IBD Colitis 2 


0.7 


Monocytes rest 


0.9 


IBD Crohn's 


0.1 


Monocytes LPS 


0.3 


Colon 


1.1 


Macrophages rest 


19.9 


Lung 


1.1 


Macrophages LPS 


4.3 


Thymus 


3.7 


HUVEC none 


2.7 


Kidney 


7.4 


HUVEC starved 


2.9 







CNS_neurodegeneration_vl.O Summary: Ag3049 This panel does not show 
differential expression of the NOV71 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1 .3D for discussion of 
5 utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag3049 The NOV71 gene is expressed at moderate to high 
leves in the cancer cell lines in this panel with the highest expression shown by a glioma cell 
line U251 (CT=26.6). Because normal tissues show a lower level of expression of this gene, 
expression of this gene migh be used as a diagnostic marker for cancer. Furthermore, 
10 therapeutics designed using antibodies and small molecule inhibitors of this gene may be 
effective in the treatment of cancer. 

Among tissues with metabolic function, this gene has moderate levels of expression in 
pancreas, adrenal, thyroid, pituitary, heart, skeletal muscle and adipose. Therefore, this gene 
product may be a small molecule target for the treatment of endocrine and metabolic diseases, 
15 including obesity, and Types 1 and 2 diabetes. 

In addition, moderate expression of this gene in the CNS suggests a role for this gene 
product in brain processes. Inhibition of SODIUM/HYDROGEN EXCHANGER function in 
the brain is associated with the activity of several enzymes known to play a positive role in 
cell survival and learning and memory, such as PKA and PKC. Therefore, inhibitors of the 
20 protein encoded by this gene may have utility in mimicking the potentially therapeutic action 
of these enzymes in the treatment of neurodegenerative diseases including Alzheimer's and 
Parkinson's diseases, as well as in memory loss due to aging. 

References: 

Am J Physiol Cell Physiol 2001 Oct;281(4):Cl 146-Acute inhibition of brain-specific 
25 Na(+)/H(+) exchanger isoform 5 by protein kinases A and C and cell shrinkage. Attaphitaya S, 
Nehrke K, Melvin JE. 
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Little is known of the functional properties of the mammalian, brain-specific 
Na(+)/H(+) exchanger isoform 5 (NHE5). Rat NHE5 was stably expressed in NHE-deficient 
PS 120 cells, and its activity was characterized using the fluorescent pH-sensitive dye 2',7'- 
bis(2-carboxyethyl)-5(6)-carboxyfluorescein. NHE5 was insensitive to ethylisopropyl 
5 amiloride. The transport kinetics displayed a simple Michaelis-Menten relationship for 
extracellular Na(H-) (apparent K(Na) = 27 +/- 5 mM) and a Hill coefficient near 3 for the 
intracellular proton concentration with a half-maximal activity at an intracellular pH of 6.93 
+/- 0.03. NHE5 activity was inhibited by acute exposure to 8-bromo-cAMP or forskolin 
(which increases intracellular cAMP by activating adenylate cyclase). The kinase inhibitor H- 

10 89 reversed this inhibition, suggesting that regulation by cAMP involves a protein kinase A 
(PKA)-dependent process. In contrast, 8-bromo-cGMP did not have a significant effect on 
activity. The protein kinase C (PKC) activator phorbol 12-myristrate 13-acetate inhibited 
NHE5, and the PKC antagonist chelerythrine chloride blunted this effect. Activity was also 
inhibited by hyperosmotic-induced cell shrinkage but was unaffected by a hyposmotic 

15 challenge. These results demonstrate that rat brain NHE5 is downregulated by activation of 
PKA and PKC and by cell shrinkage, important regulators of neuronal cell function. 

Panel 2.2 Summary: Ag3049 The NOV71 gene can be used as a diagnostic marker 
for stomach, breast, lung, ovarian and some colon cancers as expression in the normal adjacent 
tissue and the tumor tissue differs. Antibodies and small molecule inhibitors designed with this 

20 gene product may also be used for therapy in breast, lung, ovarian and some colon cancers. 

Panel 4D Summary: Ag3049 The NOV71 gene, a sodium/hydrogen Exchanger 
homolog is expressed at a high level in Ramos (B cell) activated with ionomycin (CT=24.72), 
and at a moderate to high level in other activated B cell preparations. Therefore, small 
molecule antagonists or therapeutic antibody antagonists that block the function of the NOV71 

25 gene product may be useful in several autoimmune and inflammatory diseases in which 

activated B cells can play major roles as sources of autoantibody -producing cells and also as 
powerful antigen-presenting cells, including, but not limited to, Crohn's disease, ulcerative 
colitis, multiple sclerosis, chronic obstructive pulmonary disease, asthma, emphysema, 
rheumatoid arthritis, lupus erythematosus, or psoriasis. 

30 NOV72: UBIQUITIN-SPECIFIC PROTEASE 

Expression of gene NOV72 was assessed using the primer-probe set Ag3050, 
described in Table BMA. Results of the RTQ-PCR runs are shown in Tables BMB, BMC, 
BMD and BME. 
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Table BMA. Probe Name Ag3050 



Primers 


Sequences 


Length ; 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' - tgaccaggtattaaccatggaa- 3 ' 


22 


799 


1282 


Probe 


TET-5 • - ttactgctgcagggacatgctctcct-3 ' - 
TAMRA 


26 


823 


1283 


Reverse 


5 ' -taggcaaaggtctctttgtcaa-3 1 


22 


852 


1284 



Table BMB. CNS neurodegeneration vl.O 



Tissue Name 


ReL Exp.(%) Ag3050, 
Run 211012446 


Tissue Name 


Rel. Exp.(%) Ag3050, 
Run 211012446 


AD 1 Hippo 


12.5 


Control (Path) 3 
Temporal Ctx 


4.3 


AD 2 Hippo 


22.1 


Control (Path) 4 
Temporal Ctx 


29.1 


AD 3 Hippo 


3.0 


AD 1 Occipital Ctx 


15.0 


AD 4 Hippo 


11.9 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


59.0 


AD 3 Occipital Ctx 


7.1 


AD 6 Hinno 


51.1 


AD 4 Occioital Ctx 


22.2 


Control 2 Hippo 


17.6 


AD 5 Occipital Ctx 


24.7 


Control 4 Hippo 


a 1 

0. 1 


i\yj o v_Jccipuai i^tx 




Control (Path) 3 
Hippo 


7.0 


Control 1 Occipital 
t^ix 


2.9 


AD 1 Temporal Ctx 


17.4 


Control 2 Occipital 

V^IX 


39.0 


AD 2 Temporal Ctx 


23.5 


Ctx 


16.8 


AD 3 Temporal Ctx 


7.9 ! 


Control 4 Occipital 
Ctx 


6.0 


AD 4 Temporal Ctx 


27.5 


Control (Path) 1 
Occipital Ctx 


51.4 


AD 5 Inf Temporal 
Ctx 


100.0 j 


Control (Path) 2 
Occipital Ctx 


12.2 


AD 5 Sup Temporal 
Ctx 


34.9 


Control (Path) 3 
Occipital Ctx 


2.4 


AD 6 Inf Temporal 
Ctx 


54.7 


Control (Path) 4 
Occipital Ctx 


15.9 


AD 6 Sup Temporal 
Ctx 


60.3 


Control 1 Parietal 
Ctx 


8.4 


Control 1 Temporal 
Ctx 


3.7 j 


Control 2 Parietal 
Ctx 


36.3 


Control 2 Temporal 
Ctx 


20.3 i 


Control 3 Parietal 
Ctx 


15.9 


Control 3 Temporal 


9.7 


Control (Path) 1 


39.0 
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Ctx 




Parietal Ctx 


_. 


Control 3 Temporal 

Ctx 


7.2 


Control (Path) 2 
Parietal Ctx 


16.2 


Control (Path) 1 
Temporal Ctx 


46.7 


Control (Path) 3 
Parietal Ctx 


2.2 


Control (Path) 2 
Temporal Ctx 


34.9 


Control (Path) 4 
Parietal Ctx 


36.1 



Table BMC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3050, 
Run 167985384 


Tissue Name 


Rel. Exp.(%) Ag3050, 
Run 167985384 


Liver adenocarcinoma 


10.1 


Kidney (fetal) 


28.7 




^ 7 


Jxciiai ca. / oo _ u 


1 A 0 


Pancreatic ca. CAPAN 


2.5 


Renal ca. A498 


6.6 


Adrenal gland 


2.5 


Renal ca. RXF 393 


8.8 


Thyroid 


3.7 


T"fc 1 A ^ — < T "TV T 

Renal ca. ACHN 


5.4 


Salivary gland 


1.5 


Renal ca. UO-31 


6.2 


Pituitary gland 


6.8 


Renal ca. TK-10 


6.5 


Brain (fetal) 


100.0 


Liver 


2.2 


Brain (whole) 


23.3 


Liver (fetal) 


6.4. 


Brain (amygdala) 


zz.z 


Liver ca. 
(hepatoblast) HepG2 


1 1 .3 


Brain (cerebellum) 


88.9 


Lung 


0.8 


Brain (hippocampus) 


19.5 


Lung (fetal) 


15.1 


Brain (substantia nigra) 


19.3 


Lung ca. (small cell) 
LX-1 


14.4 


Brain (thalamus) 


7.7 


Lung ca. (small cell) 
NCI-H69 


18.4 


Cerebral Cortex 


12.1 


Lung ca. (s.cell var.) 
SHP-77 


43.2 


Spinal cord 


11.1 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


4.6 


Lung ca. (non-sm. 
cell) A549 


5.2 


glio/astro U-118-MG 


18.9 


Lung ca. (non-s.cell) 
NCI-H23 


7.4 


astrocytoma SW 1783 


9.4 


Lung ca. (non-s.cell) 
HOP-62 


7.2 


neuro*;met SK-N-AS 


15.7 


Lung ca. (non-s.cl) 
NCI-H522 


21.0 


astrocytoma SF-539 


17.1 


Lung ca. (squam.) 
SW 900 


7.7 


astrocytoma SNB-75 


12.2 


Lung ca. (squam.) 


60.7 
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i\ h V, fi- 



ll 



If BiT" 





- 


NCI-H596 




glioma SNB-19 


11.9 


Mammary gland 


1.3 


glioma U251 


15.7 


Breast ca.* (pl.ef) 
MCF-7 


14.5 


glioma SF-295 


16.4 


Breast ca.* (pl.ef) 
MDA-MB-231 


24.7 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) 
T47D 


21.8 


Heart 


3.8 


Breast ca. BT-549 


7.3 


Skeletal muscle (fetal) 


8.1 


Breast ca. MDA-N 


13.5 


Skeletal muscle 


11.0 


Ovary 


4.0 


Done marrow 




Ovarian ca. OVCAR- 
3 


Z..Z, 


i nymus 




Ovarian ca. OVCAR- 
4 


A 7 


Spleen 


5.4 


Ovarian ca. OVCAR-j 
5 


11.7 


Lymph node 


14.6 


Ovarian ca. OVCAR- 
8 


4.9 


Colorectal 


5.6 


Ovarian ca. IGROV- 
1 


2.2 


Stomach 


2.0 


Ovarian ca.* (ascites) 
SK-OV-3 


24.3 


Small intestine 


1.6 


Uterus 


6.9 


Colon ca. SW480 


5.1 


Placenta 


1.3 


Colon ca.* 
SW620(SW480 met) 


22.2 


Prostate 


5.9 


Colon ca. HT29 


4.9 


Prostate ca.* (bone 
met)PC-3 


4.8 


Colon ca. HCT-116 


9.3 


Testis 


11.2 


Colon ca. CaCo-2 


18.2 


Melanoma 
Hs688(A).T 


1.7 


Colon ca. 
tissue(OD03 866) 


2.4 


Melanoma* (met) 
Hs688(B).l 


0.0 


Colon ca. HCC-2998 


10.1 


Melanoma UACC-62 


4.0 


Gastric ca.* (liver met) 
NCI-N87 


9.8 


Melanoma M14 


2.1 


Bladder 


6.3 


Melanoma LOX 
IMVI 


5.8 


Trachea 


2.2 


Melanoma* (met) 
SK-MEL-5 


2.1 


Kidney 


6.7 


Adipose 


4.3 
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Table BMD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3050, Run 
164317257 


Tissue Name 


Rel. Exp.(%) 
Ag3050, Run 
164317257 


Secondary Th 1 act 


21.0 


HUVEC IL-lbeta 


5.3 


Secondary Th2 act 


22.4 


HUVEC IFN gamma 


6.3 


Secondary Trl act 


23.3 


HUVEC I NF alpha + IFN 
gamma 


11.6 


Secondary Thl rest 


4.0 


HUVEC TNF alpha + IL4 


7.5 


Secondary Th2 rest 


7.9 


HUVEC IL-11 


2.6 


Secondary Trl rest 


9.8 


Lung Microvascular EC 
none 


9.5 


Primary Thl act 


24.3 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


9.9 


Primary Th2 act 


13.5 


Microvascular Dermal EC 
none 


13.4 


Primary Trl act 


21.6 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


11.7 


Primary Thl rest 


36.6 


Bronchial epithelium 
TNFalpha + ILlbeta 


21.9 


Primary Th2 rest 


17.6 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


19.6 


Small airway epithelium 
TNFalpha + IL-lbeta 


15.7 


CD45RA CD4 
lymphocyte act 


7.1 


Coronery artery SMC rest 


A f\ 

4.U 


CD45RO CD4 
lymphocyte act 


14.5 


Coronery artery SMC 
TNFalpha + IL-lbeta 


1 A 

1 .4 


CD8 lymphocyte act 


7.7 


Astrocytes rest 


11.8 


Secondary CD8 
lymphocyte rest 


17.2 


Astrocytes TNF alpha + 
IL-lbeta 


1 .o 


Secondary CD8 
lymphocyte act 


13.9 


KU-8 12 (Basophil) rest 


15.6 


vj'-x lyiiipiHJcy ic nunc 




KU-812 (Basophil) 
PMA/ionomy c i n 


40.1 


2ryThl/Th2/Trl anti- 
CD95 CH11 


17.0 


CCD1 106 (Keratinocytes) 
none 


11.0 


LAK cells rest 


16.7 


CCD 1 106 (Keratinocytes) 
TNFalpha 4- IL-lbeta 


4.6 


LAK cells IL-2 


15.1 


Liver cirrhosis 


5.3 


LAK cells IL-2+IL- 12 


12.6 


Lupus kidney 


1.6 


LAK cells IL-2+IFN 
gamma 


15.3 


NCI-H292 none 


17.1 


LAK cells IL-2+ IL-18 


14.9 


NCI-H292 IL-4 


15.3 


LAK cells 


7.2 


NCI-H292 IL-9 


15.5 
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NIC Pell^ II -? rpQt 

1 > IX. \^-C«ll3 1 1 j Z. 1 Col 


1 H.J 


iNV^l-rlZVZ 


D.Z 


Two Wav MT R ^ Hav 


7 ^ 

/ .J 


XTf^T TI7M tram ma 

JN^i-rizvz iriM gamma 




Two Way MLR 5 day 


7.0 


HPAEC none 


9.7 


Two Way MLR 7 day 


6.0 


HPAEC TNF alpha + IL-1 
beta 


13.9 


PBMC rest 


3.4 


Lung fibroblast none 


5.3 


PBMC PWM 


38.4 


Lung fibroblast TNF alpha 
+ IL-T beta 


5.7 


PBMC PHA-L 


14.5 


Lung fibroblast IL-4 


7.7 


Ramos (B cell) none 


24.8 


Lung fibroblast IL-9 


7.0 


Ramos (B cell) 
ionomycin 


48 0 


I lin CT T 1 r\ f/"\ r\ loot II 11 

l^UIlg llDlODiaSl 1 1 


D.O 


W Ivmnhnrvtps P\A/1VI 


54 7 


Lung fibroblast IFN 
gamma 


1 A 4 


B lymphocytes CD40L 
and IL-4 


43.2 


Dermal fibroblast 
CCD 1070 rest 


12.6 


EOL-1 dbcAMP 


10.7 


Dermal fibroblast 
CCD 1070 TNF alpha 


30.1 


EOL-1 dbcAMP 
rrvi/v ionomycin 


10.8 


Dermal fibroblast 
LLUIU/U IL-l beta 


2.1 


Dendritic cells none 


6.9 


Dermal fibroblast IFN 

gam m a 

wUllllilU 


4.5 


Dendritic cells LPS 


5.6 


Dermal fibroblast IL-4 


15.8 


Dendritic cells anti- 
CD40 


5.5 


IBD Colitis 2 


0.9 


Monocytes rest 


9.9 


IBD Crohn's 


1.2 


Monocytes LPS 


5.0 


Colon 


3.2 


Macrophages rest 


10.4 


Lung 


2.6 


Macrophages LPS 


2.0 


Thymus 


8.6 


HUVEC none 


8.1 


Kidney 


100.0 


HUVEC starved 


23.7 







Table BME. Panel CNS 1 



Tissue Name 


Rel. Exp.(%) Ag3050, 
Run 171694540 


Tissue Name 


Rel. Exp.(%) Ag3050, 
Run 171694540 


BA4 Control 


20.7 


BA17PSP 


30.4 


BA4 Control2 


35.8 


BA17PSP2 


15.7 


BA4 
Alzheimer's2 


0.0 


Sub Nigra Control 


38.2 


BA4 Parkinson's 


45.7 


Sub Nigra Control2 


14.3 


BA4 
Parkinson's2 


59.9 


Sub Nigra 1 
Alzheimer's2 


10.4 



131 1 



n i!' 



li.S H 



is ni 



BA4 
Huntington's 


24.7 


Sub Nigra 
Parkinson's2 


46.7 


BA4 
Huntington's2 


4.8 


Sub Nigra 
Huntington's 


49.0 


BA4 PSP 


31.4 


Sub Nigra 
Huntington's2 


11.6 


BA4 PSP2 


32.3 


Sub Nigra PSP2 


6.7 


da^ i^epression ; 


1 .D 


Sub Nigra 
Depression 


0 ? 

y .z. 


BA4 
Depression2 


1 1 .D 


Sub Nigra 
Depression2 


1 j.Z 


/ v^onxroi 


jo.Z 


Glob Palladus 
Control 


ZJ> .Z 


t>/\ / controiz 


1 .z 


Glob Palladus 
Control2 


/.J 


BA7 
Alzheimer's2 




Glob Palladus 
Alzheimer's 


fx 1 


BA7 Parkinson's 


43.5 


Glob Palladus 
Alzheimer's2 


0.0 


BA7 
Parkinson's2 


57.8 


Glob Palladus 
Parkinson's 


100.0 


BA7 
Huntington's 


43.8 


Glob Palladus 
Parkinson's2 


20.2 


BA7 
Huntington's2 


59.5 


Glob Palladus PSP 


0.0 


BA7 PSP 


42.6 


Glob Palladus PSP2 


3.0 


BA7 PSP2 


31.9 


Glob Palladus 
Depression 


16.4 


BA7 Depression 


12.6 


Temp Pole Control 


18.9 


BA9 Control 


26.2 


Temp Pole Control2 


45.7 


D AO C^nntrn\2 


1 0 R 
1 y ,<y 


Temp Pole 
Alzheimer's 




RAO A l7liAimr»r'c 


o.o 


Temp Pole 
Alzheimer's2 


10 0 

1 \J.\J 


BA9 
Alzheimer's2 


1 1 5 


Temp Pole 
Parkinson's 


22 8 


RA9 Parkinson's 

Liny X CXI rv 1 1 1 o U 1 1 k5 


32.1 


Temp Pole 
Parkinson's2 


16.7 


BA9 
Parkinson's2 


30.1 


Temp Pole 
Huntington's 


34.9 


BA9 
Huntington's 


38.2 


Temp Pole PSP 


5.8 


BA9 
Huntington's2 


31.6 


Temp Pole PSP2 


10.7 


BA9 PSP 


17.6 


Temp Pole 
Depression2 


24.3 



1312 



BA9 PSP2 


2.8 


Cing Gyr Control 


62.9 


BA9 Depression 


20.3 


Cing Gyr Control2 


13.4 


BA9 
Depression2 


11.9 


Cing Gyr 
Alzheimer's 


23.7 


BA 17 Control = 


65.5 


Cing Gyr 
Alzheimer's2 


11.7 


BA17Control2 


31.4 


Cing Gyr Parkinson's 


36.6 


BA17 
Alzheimer's2 


1 1 n 

1 l.U 


Cing Gyr 
Parkinson's2 




BA17 
Parkinson's 


OZ.7 


Cing Gyr 
Huntington's 




BA17 
Parkinson's2 




Cing Gyr 
Huntington's2 




BA17 
Huntington's 


72.2 


Cing Gyr PSP 


11.6 


BA17 
Huntington's2 • 


38.7 


Cing Gyr PSP2 


2.0 


BA17 
Depression 


12.2 


Cing Gyr Depression 


13.0 


BA17 
Depression2 


42.9 


Cing Gyr 
Depression2 


20.0 



CNS_neurodegeneration_vl.O Summary: Ag3050 This panel does not show 
differential expression of the NOV72 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag3050 The NOV72 gene exhibits brain-preferential 
expression and is a member of a family of proteins that mediates ubiquitin-mediated protein 
degradation. Misprocessing of proteins involved in ubiquitin-mediated protein degradation is 
thought to be the cause of many neurodegenerative disorders such as Parkinson's disease, as 
well as those resulting from CAG repeat expansion genes, such as Huntingtin's disease. 
Therefore, therapeutic modulation of the expression or function of this gene may affect the 
protein degradation dysfunction seen in these diseases. 

In addition, this gene is expressed at a slightly higher level in cancer cell lines 
compared to the normal lung, ovary, breast, and colon samples on this panel. This suggests 
that expression of this gene could be used as a diagnostic marker of cancer. Furthermore, 
inhibition of this gene product using small molecule drugs may be useful in the treatment of 
cancer in these tissues. 

Among tissues with metabolic function, this gene is has a low level of expression in 

pancreas, thyroid, pituitary, heart, skeletal muscle, and adipose. This gene product may be a 
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small molecule target for the treatment of metabolic and endocrine diseases, including obesity 
and Types 1 and 2 diabetes. 

Panel 4D Summary: Ag3050 The NOV72 gene is expressed at moderate to low levels 
(CT=29-34) in a wide range of cell types and tissues of significance in the immune response in 
5 health and disease, Highest expression of this gene is seen in kidney tissue (CT=29.36). 

Therefore, targeting of this gene product with a small molecule drug or antibody therapeutic 
may modulate the functions of cells of the immune system as well as resident tissue cells and 
lead to improvement of the symptoms of patients suffering from autoimmune and 
inflammatory diseases such as asthma, allergies, inflammatory bowel disease, lupus 
10 erythematosus, and arthritis, including osteoarthritis and rheumatoid arthritis. 

Panel CNS_1 Summary: Ag3050 This panel confirms expression of the NOV72 gene 
in the brain. Please see Panel 1.3D for discussion of utility of this gene in the central nervous 
system. 

NOV73 

1 5 Expression of gene NOV73 was assessed using the primer-probe set Ag3030, 

described in Table BNA. 

Table BNA. Probe Name Ag3030 



Primers; 

J 


Sequences 


Length ! 


Start 
Position 


SEQ ID NO: 


Forward- 


5 ' - tgaaattcagaaccaggaaatg- 3 ' 


22 


838 


1285 


i 

Probe 


TET-5 ' -aaagagtgcttagcaggcacctccct-3 ■ - 
TAMRA 


26 


872 


1286 


Reverse 


5 * -aaccctggcaatatgattcata-3 1 


22 


914 


1287 



CNS neurodegeneration_vl.O Summary: Ag3030 Expression of this gene is 
20 low/undetectable (CTs > 34.5) across all of the samples on this panel (data not shown). 

Panel 1.3D Summary: Ag3030 Expression of this gene is low/undetectable (CTs > 
34.5) across all of the samples on this panel (data not shown). 

Panel 2D Summary: Ag3030 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 
25 Panel 3D Summary: Ag3030 Expression of this gene is low/undetectable (CTs > 35) 

across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag3030 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 



1314 



• 



i! n in y ;p n "li ? -*n t s 1 f«?' 



Panel 5 Islet Summary: Ag3030 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 

NOV74 

Expression of gene NOV74 was assessed using the primer-probe set Ag30l6, 
5 described in Table BOA. Results of the RTQ-PCR runs are shown in Tables BOB, BOC, BOD 
and BOE. 



Table BOA. Probe Name Ag3016 



Primers; 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward; 


5 ' -atctcagtgacctgctctcaga-3 1 


22 


83 


1288 


Probe 


TET- 5 1 -cagtggctgctacagcctcccaag-3 1 - 
TAMRA 


24 


108 


1289 


Reverse 


5 ' -aaatcttcagggtgacctcatt-3 1 


22 


142 


1290 



Table BOB. CNSneurodegenerationvl.O 



Tissue Name 


Rel. Exp.(%) Ag3016, 
Run 209820675 


Tissue Name 


Rel. Exp.(%) Ag3016, 
Run 209820675 


AD 1 Hippo 


4.6 


Control (Path) 3 
Temporal Ctx 


3.1 


AD 2 Hippo 


7.9 


Control (Path) 4 
Temporal Ctx 


33.9 


AD 3 Hippo 


2.9 


AD 1 Occipital Ctx 


17.9 


AD 4 Hippo 


3.3 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


100.0 


AD 3 Occipital Ctx 


7.5 


AD 6 Hippo 


8.5 


AD 4 Occipital Ctx 


12.5 


Control 2 Hippo 


5.4 


AD 5 Occipital Ctx 


11.7 


Control 4 Hippo 


1.8 


AD 6 Occipital Ctx 


12.9 


Control (Path) 3 
Hippo 


1.5 


Control 1 Occipital 
Ctx 


3.7 


AD 1 Temporal Ctx 


10.8 


Control 2 Occipital 
Ctx 


18.8 


AD 2 Temporal Ctx 


20.4 


Control 3 Occipital 
Ctx 


0.7 


AD 3 Temporal Ctx 


3.5 


Control 4 Occipital 
Ctx 


5.8 


AD 4 Temporal Ctx 


22.2 


Control (Path) 1 
Occipital Ctx 


50.3 


AD 5 Inf Temporal 
Ctx 


60.7 


Control (Path) 2 
Occipital Ctx 


23.3 


AD 5 Sup Temporal: 


17.7 - 


Control (Path) 3 


1.5 
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Ctx 




Occipital Ctx 




AD 6 Inf Temporal 
Ctx 


Ad. / 


Control (Path) 4 
Occipital Ctx 




AD 6 Sup Temporal 
Ctx 


15. 1 


Control 1 Parietal 
Ctx 


O. I 


Control 1 Temporal 
Ctx 




Control 2 Parietal 
Ctx 


Zo. / 


Control 2 Temporal 
Ctx 


s o 

D.y 


Control 3 Parietal 

Ctx : 


91 ^ 

jL 1 . J 


Control 3 Temporal 
Ctx 


11.1 


Control (Path) 1 
Parietal Ctx 


49.3 


Control 3 Temporal 
Ctx 


6.3 


Control (Path) 2 
Parietal Ctx 


27.0 


Control (Path) 1 
Temporal Ctx 


45.4 


Control (Path) 3 
Parietal Ctx 


5.6 


Control (Path) 2 
Temporal Ctx 


36.9 


Control (Path) 4 
Parietal Ctx 


39.0 



Table BOC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3016, 
Run 167819111 


Tissue Name 


Rel. Exp.(%) Ag3016, 
Run 167819111 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


9.2 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 

2 


0.0 


Renal ca. A498 


0.0 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 


Salivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca. TK-10 


0.0 


Brain (fetal) 


99.3 


Liver 


0.0 


Brain (whole) 


96.6 


Liver (fetal) 


0.0 


Brain (amygdala) 


21.5 


Liver ca. 
(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


12.0 


Lung 


0.0 


Brain (hippocampus) 


24.7 


Lung (fetal) 


6.2 


Brain (substantia nigra) 


15.0 


Lung ca. (small cell) 
LX-1 


0.0 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NCI-H69 


8.2 


Cerebral Cortex 


100.0 


Lung ca. (s.cell var.) 
SHP-77 


6.9 


Spinal cord 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


46.0 


Lung ca. (non-sm. 


10.9 
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cell) A549 




glio/astro U-l 18-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


o.u 


astrocytoma SW1783 


6.3 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


7.1 


i Lung ca. (squam.) 
SW 900 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.0 


Mammary gland 


0.0 


glioma U251 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SF-295 


32.5 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) 
T47D 


81.2 


T T 4. 

Heart 


0.0 


Breast ca. BT-549 


0.0 


Skeletal muscle (fetal) 


10.6 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


0 0 


Ovarian ca. OVCAR- 
3 


A A 
U.O 


Thymus 


7.0 


Ovarian ca. OVOAR- 
4 




Spleen 


0.0 


Ovarian ca. OVCAR- 

5 


0.0 


Lymph node 


6.3 


Ovarian ca. OVCAR- 
8 


100.0 


Colorectal 


0.0 

, — m<1 — , — ^ 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


72.7 


Small intestine 0.0 


Uterus 


0.0 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


15.4 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


19.5 


Colon ca. 
tissue(OD03866) 


0.0 j 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


00 1 


Melanoma UACC-62 57.0 
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Gastric ca.* (liver met) 
NCI-N87 


0.0 


Melanoma Ml 4 


26.8 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.0 


Melanoma* (met) 
SK-MEL-5 


19.8 


Kidney 


0.0 


Adipose 


0.0 



Table BOD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3016, Run 
164404251 


Tissue Name 


Rel. Exp.(%) 
Ag3016, Run 
164404251 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


6.2 


HUVEC IFN gamma 


0.0 


oC^UIIUaiy III dCL 


u.u 


HUVEC TNF alpha + IFN 
gamma 


u.u 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


.0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Th 1 act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


13.7 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-8 12 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ryThimi2/Trl_anti- 


0.0 


CCD1 106 (Keratinocytes) j 


0.0 
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CD95 CH11 




none 




LAK cells rest 


0.0 


LlL)1 IOo (Reratinocytesj 
TNFalnha + IL-lbeta 


0.0 


T AF <-p>11c TT -9 
!_,/ A IS. is CI 1 S 


0 0 


T ivpr cirrhosis 


56.6 


I A 1/ ,~ ~ 1 1 c TT 0_i_TT 10 

LA.lV CeilS 1 1 i 1L- 1 z 


U.U 


T nniiQ IriHnpv 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


1 a if ~^11 c TT 1-4- TT 1 O 
LAK CeilS 1L-Z^ IL-lo 


U.U 


NCT-H79? Tl -4 


0.0 


LAK cells 
r ivi a/ lonomycin 


0.0 


NCI-H292 IL-9 


0.0 


JNiv ceils il-z rest 


U.U 


NFCT-H7Q? TT -1 1 


0.0 


Two Way MLK 5 cay 


U.U 


MPT T-I9Q? I FN aamrrm 


0 0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0 0 


T nno fibroblast TL-1^ 


0.0 


d lyiiipiiOL/y iC5» r w ivi 


0 0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 1 Nr alpha 


0.0 


EOL-1 dbcAMP 
f ivi/\/ lonoiiiyciii 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


7.6 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


6.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


5.6 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


6.7 


Macrophages LPS 


0.0 


Thymus 


4.3 


HUVEC none 


0.0 


Kidney 


100.0 


HUVEC starved 


0.0 
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Table BOE. Panel CNS 1 



Tissue Name 


Rel. Exp.(%) Ag3016, 
Run 171688428 


' 1 'lectio M^iin** 


Rel. Exp.(%) Ag3016, 
Run 171688428 


BA4 Control 


9.9 


BA17PSP 


28.5 


BA4 Control2 


12.0 


BA17PSP2 


17.8 


BA4 
Alzheimer's2 


2.9 


Sub Nigra Control 


5.4 


BA4 Parkinson's 1 


28.9 


Sub Nigra ControI2 


0.0 


BA4 
Parkinson's2 


35.1 


Sub Nigra 
Alzheimer's2 


7.7 


BA4 
Huntington's 


12.2 


Sub Nigra 
Parkinson's2 


15.1 


BA4 
Huntington's2 


8.3 


Sub Nigra 
Huntington's 


3.5 


BA4 PSP 


11.4 


Sub Nigra 
Huntington's2 


0.0 


BA4 PSP2 


17.9 


Sub Nigra PSP2 


0.0 


BA4 Depression 


9.7 


Sub Nigra 
Depression 


5.2 


BA4 
Depression2 


11.7 


Sub Nigra 
Depression2 


14.8 


BA7 Control 


31.4 


Glob Palladus 
Control 


5.0 


BA7 Control2 


11.7 


Glob Palladus 
Control2 


0.0 


BA7 
Alzheimer f s2 


20.2 


Glob Palladus 
Alzheimer's 


0.0 


BA7 Parkinson's 


15.3 


Glob Palladus 
Alzheimer's2 


10.6 


BA7 
Parkinson's2 


J7.U 


Glob Palladus 
Parkinson's 


58 2 


BA7 
Huntington's 




Glob Palladus 
Parkinson's2 


0.0 


BA7 
Huntington's2 




Olnh Palladus PSP 

VJ1UU 1 cl 1 1 Cl\A \A o I OJT 


0.0 


BA7 PSP 


37.6 


Glob Palladus PSP2 


5.0 


BA7 PSP2 


21.9 


Glob Palladus 
Depression 


5.7 


BA7 Depression 


21.9 


Temp Pole Control 


5.9 


BA9 Control 


14.8 


Temp Pole Control2 


0.0 


BA9 Control2 


24.5 


Temp Pole 
Alzheimer's 


0.0 


BA9 Alzheimer's 


10.2 


Temp Pole 
AIzheimer's2 


9.4 


BA9 


12.2 


Temp Pole 


30.4 
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Alzheimer's2 




Parkinson's 




BA9 Parkinson's ! 


^9 1 

jZ. 1 


Temp Pole 
Parkinson's2 


37.1 


BA9 
Parkinson's2 


45.4 


Temp Pole 
Huntington's 


11.5 


BA9 
Huntington's 


20.2 


Temp Pole PSP 


19.3 


BA9 
Huntington's2 


35.1 


Temp Pole PSP2 


0.0 


BA9 PSP 


24.3 


Temp Pole 
Depression2 


12.5 


BA9 PSP2 


9.5 


Cing Gyr Control 


21.0 


BA9 Depression 


9.8 


Cing Gyr Control2 


16.2 


BA9 
Depression2 


12.2 


Cing Gyr 
Alzheimer's 


0.0 


BA 17 Control \ 


100.0 


Cing Gyr 
Alzheimer's2 


0.0 


BA17Control2 


40.9 


Cing Gyr Parkinson's 


10.5 


BA17 
Alzheimer's2 




Cing Gyr 
Parkinson's2 




BA17 
Parkinson's 




Cing Gyr 
Huntington's 


4 


BA17 
Parkinson's2 


28.1 


Cing Gyr 
Huntington's2 


5.0 


BA17 
Huntington's 


25.9 


Cing Gyr PSP 


0.0 


BA17 
Huntington's2 


19.3 


Cing Gyr PSP2 


9.8 


BA17 
Depression 


33.4 


Cing Gyr Depression 


5.0 


BA17 
Depression2 


56.3 


Cing Gyr 
Depression2 


24.8 



CNS_neurodegeneration_yl.O Summary: Ag3016 This panel does not show 
differential expression of the NOV74 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag3016 The NOV74 gene represents a dual specificity 
phosphatase that is expressed preferentially at low to moderate levels across the CNS. Dual- 
specificity phosphatases comprise a family of MAP kinase regulating enzymes, members of 
which are upregulated in brains subjected to insults such as ischemia and seizure activity. 
10 MAP kinases are kown to regulate neurotrophic and neurotoxic pathways. Consequently, 
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agents that modulate the activity of this gene may have utility in attenuating the apoptotic and 
neurodegenerative processes following brain insults. 

In addition, there are low but significant levels of expression in samples derived from 
breast cancer, ovarian cancer, and melanoma cell lines. Thus, expression of this gene could be 
5 used to differentiate between these samples and other samples on this panel and as a marker to 
detect the presence of these cancers. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of breast cancer, ovarian cancer, and 
melanoma. 

References: 

10 Wiessner C. The dual specificity phosphatase PAC-1 is transcriptionally induced in the 

rat brain following transient forebrain ischemia. Brain Res Mol Brain Res 1995 
Feb;28(2):353-6 

PAC-1 mRNA has previously been found only in activated T-cells in vitro and in vivo. 
The gene encodes a dual specificity protein phosphatase that regulates MAP kinase activity. 

15 Here, I describe that PAC-1 mRNA is induced also in neurons in the rat brain following 30 
min of forebrain ischemia. At 6, 12 and 24 h after ischemia, PAC-1 mRNA was found most 
prominently in hippocampal cells which are resistant to 30 min of forebrain ischemia, but not 
in the selectively vulnerable CA1 sector. At later time points and in control animals no PAC-1 
mRNA could be detected in any brain region. The protein-tyrosine/threonine phosphatase 

20 PAC-1 , therefore, may be involved in adaptational responses of hippocampal cells resistant to 
ischemic injury. 

Boschert U, Muda M, Camps M, Dickinson R, Arkinstall S. Induction of the dual 
specificity phosphatase PAC1 in rat brain following seizure activity. Neuroreport 1997 Sep 
29;8(14):3077-80 

25 Recurrent seizure activity leads to delayed neuronal death as well as to inflammatory 

responses involving microglia in hippocampal subfields CA1, CA3 and CA4. Since mitogen 
activated protein (MAP) kinases control neuronal apoptosis and trigger generation of 
inflammatory cytokines, their activation state could determine seizure-related brain damage. 
PAC1 is a dual specificity protein phosphatase inactivating MAP kinases which we have 

30 found to be undetectable in normal brain. Despite this, kainic acid-induced seizure activity 

lead to rapid (approximately 3 h) but transient appearance of PAC1 mRNA in granule cells of 
the dentate gyrus as well as in pyramidal CA1 neurons. This pattern changed with time and 
after 2-3 days PAC1 was induced in dying CA1 and CA3 neurons. At this time PAC1 mRNA 
was also expressed in white matter microglia as well as in microglia invading the damaged 
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hippocampus. PAC1 may play an important role controlling MAP kinase involvement in both 
neuronal death and neuro-inflammation following excitotoxic damage. 

Panel 4D Summary: Ag3016 The NOV74 gene is only expressed at detectable levels 
in the kidney (CT = 34.2) among the samples on this panel. Thus, expression of this gene 
5 could be used to distinguish kidney from the other samples on this panel. In addition, the dual- 
specificity protein phospatase encoded for by this gene could allow cells within the kidney to 
respond to specific microenvironmental signals. Furthermore, small molecule therapies 
designed with the protein encoded for by this gene coulcLmodulate kidney function and be 
important in the treatment of inflammatory or autoimmune diseases that affect the kidney, 
1 0 including lupus and glomerulonephritis. 

Panel CNS_1 Summary: Ag3016 This panel confirms expression of the NOV74 gene 
in the brain. Please see Panel 1 .3D for discussion of utility of this gene in the central nervous 
system. 

NOV75 

15 Expression of gene NOV75 was assessed using the primer-probe sets Ag3020 and 

Ag2968, described in Tables BP A and BPB. Results of the RTQ-PCR runs are shown in 
Tables BPC, BPD, BPE, BPF and BPG. 

Table BPA. Probe Name Ag3020 



Primersj Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forwardjs ' -ggaatcacccacat t ctgaat - 3 1 


21 


265 


1291 


JteT-5 1 -cgtttacactggccccgaattctaca-3 ' - j 
r ° 6 JTAMRA 


26 


303 


1292 


Reversejs 1 - cctctacacccaggtactggat-3 » 


22 


340 


1293 



20 Table BPB. Probe Name Ag2968 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ggaatcacccacattctgaat-3 ' 




265 


1294 


Probe 


TET-5 ' -cgtttacactggccccgaattctaca-3 ' - 
TAMRA 


26 


303 


1295 


Reverse 


5 ' -cctctacacccaggtactggat-3 ' 


22 


340 


1296 



Table BPC. General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) Ag3020, 
Run 221998694 


Tissue Name 


Rel. Exp.(%) Ag3020, 
Run 221998694 


Adipose 


0.4 


Renal ca. TK-10 


0.0 
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Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.1 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* Ml4 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


0.2 


Squamous cell 
carcinoma SCC-4 | 


0.0 


Colon ca.* (SW480 
met) ^WozU 


0.0 


Testis Pool 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.2 


Colon ca. CaCo-2 


0.0 


Placenta 


0.1 


Colon cancer tissue 


31.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


U.U 


i^oion ca. v^oio-zuz> 


Iftft ft 


Ovarian ca. SK-OV- 
3 


0.3 


Colon ca. oW-4o 


S 7 
D. 1 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


0.0 


Ovarian ca. 
OVCAR-5 


0.1 


Small Intestine Pool 


0.0 


Ovarian ca. IGROV- 
l 


18.9 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.1 


Fetal Heart 


12.7 


Breast ca. MCr-7 


U.U 


riearc rooi 


4 0 


Breast ca. MDA- 

A A D Oil 

MB-23 1 


0.0 


Lymph Node Pool 


0.1 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


23.3 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


20.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Breast Pool 


0.0 


Thymus Pool 


0.0 


Trachea 


0.3 


CNS cancer (glio/astro) 
U87-MG 


0.0 


Lung 


0.0 


CNS cancer (glio/astro) 
U-118-MG 


0.1 


Fetal Lung 


0.2 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.7 
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Lung ca. LX-1 


12.7 


CNS cancer (astro) 
SNB-75 


0.1 


Lung ca. NCI-H146 


0.1 


CNS cancer (glio) 

OXTD 1 Q 


15.9 


Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF- 


0.5 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


U. 1 


Lung ca. NC1-H526 


0.0 


Brain (cerebellum) 


u.u 


Lung ca. NCI-H23 


0.0 


Brain (ietal ) 


n i 


Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) 
Pool 


0.1 


i_/Uiig td. nwr-uz. 


0 1 

V/. 1 


(^Vrphral C^ortpx Pool 


0.1 


Lung ca. NCI-H522 


0.0 


Ul UUI 1 wj U KJ OldllLld 1 1 1 £^1 / 

Pool 


0.1 


Liver 


0.0 


Brain (Thalamus) Pool 


0.1 


Fetal Liver 


0.1 


Brain (whole) 


0.1 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


0.0 


Kidney Pool 


0.1 


Adrenal Gland 


0.0 


Fetal Kidney 


0.1 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.1 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


0.4 


Renal ca. UO-31 


0.0 


Pancreas Pool 


0.0 



Table BPD. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Ag2968, Run 
166220058 


Rel. Exp.(%) 
Ag3020, Run! 
167819114 


Tissue Name 


Rel. Exp.(%) 
Ag2968, Run 
166220058 


Rel. Exp.(%) 
Ag3020, Run 
167819114 


Liver 
adenocarcinoma 


0.0 


0.1 


Kidney (fetal) 


0.0 


0.3 


Pancreas 


0.0 


0.0 


Renal ca. 786- 
0 


0.0 


0.0 


Pancreatic ca. 
CAPAN 2 


0.1 


0.2 


Renal ca. 
A498 


0.0 


0.0 


Adrenal gland 


0.0 


0.0 


Renal ca. RXF 
393 


0.0 


0.0 


Thyroid 


0.3 


1.6 


Renal ca. 
ACHN 


0.0 


0.0 


Salivary gland 


0.8 


0.6 


Renal ca. UO- 
31 


0.0 


0.0 


Pituitary gland 


0.0 


0.0 


Renal ca. TK- 
10 


0.0 


0.0 


Brain (fetal) 


0.0 


0.1 


Liver 


0.0 


0.0 


Brain (whole) 


0.4 


1.0 


Liver (fetal) 


0.5 


0.0 



1325 




Brain (amygdala) 


0.2 


"1.0 


Liver ca. 
(hepatoblast) 
HepG2 


0.0 


0.0 


Brain (cerebellum) 


0.0 


0.0 


Lung 


0.0 


0.5 


Brain 
(hippocampus) 


0.0 


0.0 


Lung (fetal) 


0.0 


0.0 


Brain (substantia 
nigra) 


0.2 


0.1 


Lung ca. 
(small cell) 
LX-1 


10.7 


16.2 


Brain (thalamus) 


0.0 


0.0 


Lung ca. 
(small cell) 
NCI-H69 


0.0 


0.5 


Cerebral Cortex 


0.4 


. 0.1 


Lung ca. 
(s.cell var.) 
SHP-77 


0.0 


0.0 


Spinal cord 


0.0 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


0.0 


glio/astro U87-MG 


0.0 


0.0 


Lung ca. (non- 
sm. cell) A549 


0.0 


0.2 


glio/astro U-118- 
MG 


0.0 


0.1 


Lung ca. (non- 
s.cell) NC1- 
H23 


0.0 


0.0 


astrocytoma 
SW1783 


0.0 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


0.1 


0.1 


neuro*; met SK-N- 
AS 


0.0 


0.0 


Lung ca. (non- 
s.cl) NC1- 
H522 


0.0 


0.0 


astrocytoma SF- 
539 


12.1 


8.4 


Lung ca. 
(squam.) SW 
900 


0.0 


0.0 


astrocytoma SNB- 
75 


0.0 


0.3 


Lung ca. 
(squam.) NCI- 
H596 


0.1 


0.4 


glioma SNB-19 


0.0 


0.0 


Mammary 
gland 


0.2 


0.2 


glioma U251 


0.4 


0.0 


Breast ca.* 
(pl.ef) MCF-7 


0.0 


0.0 


glioma SF-295 


0.3 


0.3 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


0.0 


0.0 


Heart (fetal) 


7.4 


26.8 


Breast ca.* 
(pl.ef) T47D 


0.0 


0.0 


Heart 


29.9 


35.4 


Breast ca. BT- 
549 


0.0 


0.0 


Skeletal muscle 
(fetal) 


10.8 


33.9 


Breast ca. 
MDA-N 


0.0 


0.0 


Skeletal muscle 


100.0 


100.0 


, Ovary 


0.0 


0.1 
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Bone marrow 


O 1 

U. 1 


U.O j 


Ovarian ca. 
OVCAR-3 


0.0 | 


0.0 


Thymus 


0.1 


0.1 


Ovarian ca. 
OVCAR-4 


0.0 


0.0 


Spleen 


U.U 


U.U 


Ovarian ca. 
OVCAR-5 


0.0 ] 0.6 


Lymph node 


0.0 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


0.0 


Colorectal 


0.2 


U.z 


Ovarian ca. 
IGROV-1 


26.2 


26.2 


Stomach 


0.0 


0.0 


Ovarian ca.* 
(ascites) SK- 
OV-3 


0.2 


1.0 


Small intestine 


0.0 


0.0 


Uterus 


0.0 


0.0 


Colon ca. SW480 


0.0 


0.2 


Placenta 


1 0 


0.0 


Colon ca * 
SW620(SW480 
met) 


1.6 


6.1 


Prostate 


0.2 


0.1 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met)PC- 
3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Testis 


0.2 


0.2 


Colon ca. CaCo-2 


0.0 


0.0 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03866) 


21.9 


30.6 


Melanoma* 

(met) 
Hs688(B).T 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.0 


0.5 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 
met) NCI-N87 


0.0 


0.0 


Melanoma 
M14 


0.0 


0.0 


Bladder 


0.1 


0.0 


Melanoma 
LOX IMVI 


0.0 


0.0 


Trachea 


0.8 


0.6 


Melanoma* 
(met) SK- 
MEL-5 


0.0 


0.0 


Kidney 0.0 


0.0 


Adipose 


1.1 


1.7 



Table BPE. Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Ag2968, Run 
170188142 


Tissue Name 


Rel. Exp.(%) 
Ag2968, Run 
170188142 


Daoy- Medulloblastoma 


0.0 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


0.0 
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1 t,o / 1 - MedulloDlastoma 


z.Z 


ES-2- Ovarian clear cell 
carcinoma 


u.u 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498- CNS 


0.0 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.0 


SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


0.0 


Daudi- Burkitt's lymphoma : 


0.0 


T98G- Glioblastoma 


0.0 


U266- B-cell plasmacytoma | 


0.0 


SK-N-SH- 
Neuroblastoma 
(metastasis) 


0.0 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.0 


RL- non-Hodgkin's B-cell 
lymphoma 


0.0 


Cerebellum 


0.0 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


0.0 


Jurkat- T cell leukemia 


0.0 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


0.0 


TF-1- Erythroleukemia 


0.0 


DMS-114- Small cell 
lung cancer 


0.0 


HUT 7o- 1 -cell lymphoma 


a a 
U.U 


DMS-79- Small cell lung 
cancer 


0.0 


U937- Histiocytic lymphoma 


A A 
U.U 


NCI-H146- Small cell 
lung cancer 


0.0 


KU-812- Myelogenous 
leukemia 


0.0 


NCI-H526- Small cell 
lung cancer 


0.0 


769-P- Clear cell renal 
carcinoma 


0.0 


NCI-N417- Small cell 
lung cancer 


0.0 


Caki-2- Clear cell renal 
carcinoma 


0.0 


NCI-H82- Small cell 
lung cancer 


0.0 


S W 839- Clear cell renal 
carcinoma 


0.0 


NCI-H157- Squamous 
cell lung cancer 
(metastasis) 


0.0 


G401- Wilms' tumor 


0.0 


NCI-H1155- Large cell 
lung cancer 


0.0 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


0.0 


NCI-H1299- Large cell 
lung cancer 


0.0 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


0.0 


NCI-H727- Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


0.0 


NCI-UMC-ll-Lung 
carcinoid 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


0.0 
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LX-1- Small cell lung 
cancer 


A A 

4.4 


HP AC- Pancreatic 
adenocarcinoma 


A A 
U.U 


Colo-205- Colon cancer 


i aa a 
1UU.U 


MIA PaCa-2- Pancreatic 
carcinoma 


A A 
U.U 


KMu- Colon cancer 


A A 
U.U 


CFPAC- 1- Pancreatic ductal 
adenocarcinoma 


A A 

U.U 


kMzuLz- Colon cancer 


A A 
U.U 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


A A 
U.U 


JNCl-H/lo- Colon cancer 


U.U 


T24- Bladder carcinma 
(transitional cell) 


A A 
U.U 


SW-48- Colon 
adenocarcinoma 


4.3 


5637- Bladder carcinoma 


A A 
U.U 


SW1116- Colon 
adenocarcinoma 


U.U 


H I -l 1 97- Bladder carcinoma 


U.U 


LS 174T- Colon 
adenocarcinoma 


10.1 


UM-UC-3- Bladder carcinma 
(transitional cell) 


u.o 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


0.0 


SW-480- Colon 
adenocarcinoma 


0.0 


HT-1080- Fibrosarcoma 


0.0 


NCI-SNU-5- Gastric 
carcinoma 


0.0 


MG-63- Osteosarcoma 


0.0 


KATO III- Gastric 
carcinoma 


1 o 

1.2 


SK-LMS-1- Leiomyosarcoma, 
(vulva) 


U.U 


NCI-SNU-16- Gastric 
carcinoma 


A A 
U.U 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


A *7 
U. / 


NCI-SNU-1- Gastric 
carcinoma 


A 1 
U. 1 


A431- Epidermoid carcinoma 


A A 
U.U 


RF-1- Gastric 
adenocarcinoma 


A A 
U.U 


w ivizoo-4- ivieianoma 


A A 
U.U 


RF-48- Gastric 
adenocarcinoma 


A A 
U.U 


DU 145- Prostate carcinoma 
(brain metastasis) 


a n 

U.U 


MKN-45- Gastric 
carcinoma 


Lr.U 


MDA-MB-468- Breast 
adenocarcinoma 


0 0 


NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 


SCC-15- Squamous cell 
carcinoma of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous cell 
carcinoma of tongue 


0.0 



Table BPF. Panel 4D 
Tissue Name [ Rel Exp (%) ^^^^^^^^^T [ Rcl Exp (%) 
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Ag3020, Run 
164528102 




Ag3020, Run 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC 1FN gamma 


0.0 


Qf^r^onrl £i r\/ Tr 1 apt 
otv^ vu iV-icii y 111 cii/i 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 1L4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0 0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha+ IL-lbeta 


n n 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


u.u 


Primary Trl act 


0.0 


Micros vasular Dermal EC 
TNFalpha + IL-lbeta 


u.u 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


u.u 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


65.1 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


71 0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


10.7 


Y\A 1 \/m r^ti oPA/tf* np»nf 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD 1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


13.3 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 1L-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


8.0 


NCI-H292IL-13 | 0.0 
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Two Way MLR 3 day 


0.0 


NCI-H292 I FN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


10.6 


HPAEC TNF alpha + IL-1 
beta 


11.2 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


r olvll^ r W 1V1 


U.U 


Lung fibroblast TNF alpha 
+ IL-1 beta 




PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


9.9 


Dermal fibroblast 
CCD 1070 rest 


0.0 


hUL-1 abcAJVLr 


A A 


Dermal fibroblast 
CCD 1070 TNF alpha 


A A 

U.U 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none j 


23.8 


Dermal fibroblast IFN 
gamma 


0.0 


Uenaritic cells Lro 


A A 
U.U 


Dermal fibroblast IL-4 


A A 
U.U 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


44.4 


Macrophages rest 


0.0 


Lung 


26.6 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







Table BPG. Panel 5D 



Tissue Name 


Rel. 

Exp.(%) 
Ag2968, 
Run 
169270971 


Rel. 
Exp.(%) 
Ag3020, 

Run 
172171108 


Tissue Name 


Rel. 

Exp.(%) 
Ag2968, 
Run 
169270971 


Rel. 
Exp.(%) 
Ag3020, 

Run 
172171108 


97457_Patient- 
02go_adipose 


0.0 


0.2 


94709_Donor 2 AM - 
Aadipose 


0.0 


0.0 


97476_Patient- 
07sk_skeletal 
muscle 


1.2 


8.4 


94710_Donor 2 AM - 
Badipose 


0.0 


0.0 


97477_Patient- 
07ut_uterus 


0.1 


0.0 


9471 I Donor 2 AM - 
Cadipose 


0.0 


0.0 
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97478_Patient- 
07pl_placenta 


0.0 


1.3 


94712_Donor 2 AD - 
Aadipose 


0.0 


3.0 


97481_Patient- 
08sk_skeletal 
muscle 


2.8 


12.6 


94713_Donor 2 AD - 
B_adipose 


100.0 


0.0 


97482 Patient- 
08ut uterus 


0.0 


0.0 


94714 Donor 2 AD - 
Cadipose 


0.0 


0.0 


97483_Patient- 
08pl_placenta 


0.0 


0.0 


94742_Donor 3 U - 
A Mesenchymal Stem Cells 


0.0 


0.0 


97486_Patient- 
09sk_skeletal 
muscle 


3.0 


12.9 


94743_Donor 3 U - 
B_Mesenchymal Stem Cells 


0.0 


0.0 


97487_Patient- 
09ut uterus 




U- / 


94730_Donor 3 AM - 
Aadipose 


0.0 


0.0 


97488_Patient- 
09pl_placenta 


0.1 


0.2 


94731_Donor 3 AM - 
Badipose 


0.0 


0.0 


97492_Patient- 
1 Oututerus 


0.0 


0.2 


94732_Donor 3 AM - 
Cadipose 


0.0 


0.0 


97493_Patient- 
10pl_placenta 


0.2 


1.1 


94733_Donor 3 AD - 
Aadipose 


0.0 


0.0 


97495_Patient- 
1 1 goadipose 


0.0 


0.0 


94734_Donor 3 AD - 
Badipose 


0.0 


0.0 


97496_Patient- 
1 lskskeletal 
muscle 


8.5 


53.6 


94735_Donor 3 AD - 
C_adipose 


0.0 


0.0 


97497_Patient- 
1 1 ututerus 


0.0 


0.3 


77 1 38_Liver_HepG2untreated 


0.0 


0.0 


97498_Patient- 
1 lpl placenta 


0.1 


2.3 


73556_Heart_Cardiac stromal 
cells (primary) 


0.0 


0.0 


97500_Patient- 
12go_adipose 


0.0 


0.4 


8 1735_SmaIl Intestine 


0.1 


0.0 


9750 1 Pat ient- 
12sk_skeletal 
muscle 


19.1 


100.0 


72409_Kidney_Proximal 
Convoluted Tubule 


0.0 


0.0 


97502_Patient- 
1 2ut uterus 


0.1 


0.0 


82685_Small 
intestineDuodenum 


0.0 


0.0 


97503 Patient- 
1 2pl_placenta 


0.1 


1.3 


sKJvr^J \j rvuitnai r\Ul tllU^Ul tl^/Cll 

adenoma 


0.0 


0.2 


94721 Donor 2 
U - 

AMesenchymal 
Stem Cells 


0.0 


0.0 


72410_Kidney_HRCE 


0.0 


0.3 


94722 Donor 2 
U- 

BMesenchymal 
Stem Cells 


0.0 


0.0 


72411_Kidney_HRE 


0.0 


0.0 


94723_Donor 2 


0.0 


0.0 


73 1 39_Uterus_Uterine smooth 


0.0 


0.0 



1332 







1 HO V'-iPO 1 ?? ... n Mi " 

f 






# 








u- 

C Mesenchymal 
Stem Cells 






muscle cells 







General screening panel vl.4 Summary: Ag3020 The NOV75 gene is expressed in 
brain, colon, lung and ovarian cancer cell lines with highest expression in a colon cancer cell 
line Colo-205 (CT=24.37). This suggests that this gene can be used as a diagnostic marker for 
5 these types of cancer. Furthermore, inhibition of the protein using small molecule drugs could 
potentially be useful for the treatment of brain, colon, lung and ovarian cancer. 

In addition, this gene has low expression in adipose and high expression in adult and 
fetal heart and skeletal muscle. Thus, this protein phosphatase may be a small molecule target 
for the treatment of obesity, Type 2 diabetes and cardiac and skeletal muscle disease. 

10 Panel 1.3D Summary: Ag2968/Ag3020 Results from two experiments using identical 

probe/primer sets are in excellent agreement. Expression of the NOV75 gene is highest in 
adult skeletal muscle (CTs = 26-28). Significant but somewhat lower expression is also seen in 
fetal skeletal muscle and adult/fetal heart. Thus, expression of this gene may be used to 
distinguish these samples from the other samples on this panel. 

15 This gene is also expressed in brain, colon, lung and ovarian cancer cell lines, 

consistent with General screen ing_panel v 1.4. This suggests that this gene can be used as a 
diagnostic marker for these types of cancer and inhibition of the protein using small molecule 
drugs can be used for the treatment of brain, colon, lung and ovarian cancer. 

Panel 3D Summary: Ag2968 Expression of the NOV75 gene is highest in colon 

20 cancer cell line Colo-205 (CT = 25.6). In addition, significant expression of this gene is seen 
in two other colon cancer cell lines. Thus, expression of this gene may be used to distinguish 
these colon cancer cell lines from the other samples on this panel. Moreover, therapeutic 
modulation of the activity of this gene or its protein product, using small molecules, antibodies 
or protein therapeutics, may be of benefit in the treatment of colon cancer. 

25 Panel 4D Summary: Ag3020 Expression of the NOV75 gene is highest in a liver 

cirrhosis sample (CT = 33.3). Furthermore, expression of this gene is not detected in normal 
liver in Panels 1 .3D or 1 .4, suggesting that its expression is unique to liver cirrhosis. This gene 
encodes a putative protein phosphatase; therefore, antibodies or small molecule therapeutics 
could reduce or inhibit fibrosis that occurs in liver cirrhosis. In addition, antibodies to this 

30 protein could also be used for the diagnosis of liver cirrhosis. Low levels of expression are 
also seen in colon and resting astrocytes. 
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Ag2968 Expression of this gene is low/undetectable (CTs > 35) across all of the 
samples on this panel (data not shown). 

Panel 5D Summary: Ag3020 Expression of the NOV75 gene is primarily restricted to 
samples from skeletal muscle. This specific expression is in agreement with the results in 
5 Panels 1.3D and 1.4. Thus, expression of this gene could be used to differentiate between 
skeletal muscle and other samples on this panel, and as a marker of skeletal muscle. 

NOV76a 

Expression of gene NOV76a was assessed using the primer-probe sets Ag3022 and 
Ag4891, described in Tables BQA and BQB. Results of the RTQ-PCR runs are shown in 
1 0 Tables BQC and BQD. 

Table BQA. Probe Name Ag3022 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' - tgcttgcaggtttattcttagg-3 * 


22 


431 


1297 


Probe 


TET-5 ' -tgctgagttctctcgttgtttccctg-3 * - 
TAMRA 


26 


459 


1298 


Reverse 


5 ' - tgagaaatgcaggtagggacta-3 ' 


22 


509 


1299 



Table BQB. Probe Name Ag4891 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -aatacctgtccaaagcctgact-3 ' 


22 


661 


1300 


Probe 


TET-5 * - ttatccccgagtctcatttcctgcgt-3 * - 
TAMRA 


26 


683 


1301 


Reverse 


5 * -ttctcacaaaagctgtcattca-3 1 


22 


716 


1302 



1 5 Table BQC. General_screening_panel_vl.5 



Tissue Name 


Rel. Exp.(%) Ag4891, 
Run 228714709 


Tissue Name 


Rel. Exp.(%) Ag4891, 
Run 228714709 


Adipose 


10.8 


Renal ca. TK-10 


24.5 


Melanoma* 
Hs688(A).T 


5.6 


Bladder 


22.8 


Melanoma* 
Hs688(B).T 


6.3 


Gastric ca. (liver met.) 
NCI-N87 


46.7 


Melanoma* Ml 4 


27.4 


Gastric ca. KATO III 


22.2 


Melanoma* 
LOXIMVI 


9.5 


Colon ca. SW-948 


11.5 


Melanoma* SK- 
MEL-5 


12.4 


Colon ca. SW480 


20.7 
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Squamous cell 
carcinoma SCC-4 


10.2 


Colon ca.* (SW480 
met) SW620 


11.6 


Testis Pool 


11.3 


Colon ca. HT29 


10.6 


Prostate ca.* (bone 
met) PC-3 


6.7 


Colon ca. HCT-116 


12.2 


Prostate Pool 


16.8 


Colon ca. CaCo-2 


73.2 


Placenta 


33.7 


Colon cancer tissue 


45.4 


Uterus Pool 


9.7 


Colon ca. SW1116 


3.0 


Ovarian ca. 
OVCAR-3 


30.4 


Colon ca. Colo-205 


5.4 


Ovarian ca. SK-OV- 

3 


10.2 


Colon ca. SW-48 


10.2 


Ovarian ca. 
OVCAR-4 


o.Z> 


i^oion rOOl 


1 ^ Q 

1J.7 


Ovarian ca. 
OVCAR-5 


91 8 


oiTidii lniesiine rooi 


I J.Z 


Ovarian ca. IGROV- 
1 


10.6 


Stomach Pool 


7.4 


Ovarian ca. 

r\\/f^ At? q 
KJ V l^/\K-o 


5.6 


Bone Marrow Pool 


5.7 


Ovary 


6.5 


Fetal Heart 


11.3 


Breast ca. MCF-7 


52.5 


Heart Pool 


7.0 


Breast ca. MDA- 
MB-231 


11.5 


Lymph Node Pool 


18.4 


Breast ca. BT 549 


41.8 


Fetal Skeletal Muscle 


9.2 


Breast ca. T47D 


4.4 


Skeletal Muscle Pool 


16.3 


Breast ca. MDA-N 


12.1 


Spleen Pool 


19.3 


Breast Pool 


17.8 


Thymus Pool 


14.7 


Trachea 


12.1 


CNS cancer (glio/astro) 
U87-MG 


12.2 


Lung 


4.7 


CNS cancer (glio/astro) 
U-118-MG 


1 1 o 

1 1 .5 


Fetal Lung 


36.3 


CNS cancer 
(neuro;met) SK-N-AS 


24.5 


T nncr ca NCI-N417 


4 4 


CNS cancer (astro) SF- 
539 


S 4 


Lung ca. LX-1 


9.9 


CNS cancer (astro) 
SNB-75 


13.0 


Lung ca.NCI-H146 


15.0 


CNS cancer (glio) 
SNB-19 


11.7 


Lung ca. SHP-77 


6.8 


CNS cancer (glio) SF- 
295 


15.7 


Lung ca. A549 


28.7 


Brain (Amygdala) Pool 


10.8 


Lung ca. NCI-H526 


0.8 


Brain (cerebellum) 


35.8 


Lung ca. NCI-H23 


38.4 


Brain (fetal) 


9.3 
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Lung ca. NCI-H460 


10.0 


Brain (Hippocampus) 

rOOl 


10.2 


I linn r«o UHP £%D 


117 
11./ 


Cerebral Cortex Pool 




Lung ca. NCI-H522 


29.5 


Brain (Substantia nigra) 
Pool 


10.3 


Liver 


6.1 


Brain (Thalamus) Pool 


15.1 


Fetal Liver 


19.3 


Brain (whole) 


10.3 


Liver ca. HepG2 


34.4 


Spinal Cord Pool 


10.3 


Kidney Pool 


23.7 


Adrenal Gland 


100.0 


Fetal Kidney 


11.6 


Pituitary gland Pool 


5.3 


Renal ca. 786-0 


14.6 


Salivary Gland 


12.5 


Renal ca. A498 


5.4 


Thyroid (female) 


6.7 


Renal ca. A CRN 


21.6 


Pancreatic ca. CAPAN2 


21.0 


Renal ca. UO-31 


25.9 


Pancreas Pool 


25.9 



Table BQD. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag4891, Run 
214253687 


Tissue Name 


Rel. Exp.(%) 
Ag4891, Run 
214253687 


Secondary Thl act 


55.1 


HUVEC IL-lbeta 


17.7 


Secondary Th2 act 


100.0 


HUVEC IFN gamma 


6.2 


Secondary Trl act 


74.2 


HUVEC TNF alpha + IFN 
gamma 


9.0 


Secondary Thl rest 


18.8 


HUVEC TNF alpha + IL4 


11.7 


Secondary Th2 rest 


34.4 


HUVEC IL-11 


2.9 


Secondary Trl rest 


20.9 


Lung Microvascular EC 
none 


21.3 


Primary Thl act 


18.8 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


26.2 


Primary Th2 act 


60.3 


Microvascular Dermal EC 
none 


6.7 


Primary Trl act 


35.4 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


15.1 


Primary Th 1 rest 


17.7 


Bronchial epithelium 
TNFalpha + IL1 beta 


16.8 


Primary Th2 rest 


24.8 


Small airway epithelium 1 
none 


4.0 


Primary Trl rest 


14.8 


Small airway epithelium 
TNFalpha + IL-lbeta 


13.3 


CD45RA CD4 
lymphocyte act 


15.1 


Coronery artery SMC rest 


8.4 


CD45RO CD4 
lymphocyte act 


63.3 


Coronery artery SMC 
TNFalpha + IL-lbeta 


8.2 


CD8 lymphocyte act 


23.0 


Astrocytes rest 


8.4 
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Secondary CD8 
lymphocyte rest 


46.0 


Astrocytes TNFalpha + 
IL-1 beta 


3.1 


Secondary CD8 
lymphocyte act 


18 6 


KU-812 (BasoDhiD rest 


0.5 


v~'LJ'-t I y 1 1 1 uiiut y lc nunc 


1 S S 


KU-812 (Basophil) 
PMA/ionomycin 


2.3 


2ryThl/Th2/Trl anti- 
CD95CHH 


47.0 


CCD 1 106 (Keratinocytes) 
none 


3.3 


LAK cells rest 


15.4 


CCD1 106 (Keratinocytes) 
TNFalpha + IL- 1 beta 


10.7 


LAK cells IL-2 


30.4 


Liver cirrhosis 


13.7 


LAK cells IL-2+IL-12 


12.9 


NCI-H292 none 


28.9 


LAK cells IL-2+IFN 
gamma 


12.8 


NCI-H292 IL-4 


24.0 


LAK cells IL-2+ IL-18 


22.4 


NCI-H292 IL-9 


52.9 


LAK cells 
PMA/ionomycin 


24.8 


NCI-H292 IL-13 


26.6 


NK Cells IL-2 rest 


20.0 


NCI-H292 IFN gamma 


30.8 


Two Way MLR 3 day 


29.7 


HPAEC none 


4.2 


Two Way MLR 5 day 


32.5 


HPAEC TNF alpha + IL-1 
beta 


58.2 


Two Way MLR 7 day 


42.6 


Lung fibroblast none 


10.0 


PBMC rest 


10.3 


Lung fibroblast TNF alpha 
+ IL-1 beta 


7.5 


PBMC PWM 


47.0 


Lung fibroblast IL-4 


4.5 


PBMC PHA-L 


49.7 


Lung fibroblast IL-9 


5.9 


Ramos (B cell) none 


1.6 


Lung fibroblast IL-13 


3.4 


Ramos (B cell) 
ionomycin 


1 C 

1.5 


Lung fibroblast IFN 
gamma 


Q A 

6.4 


R Ivmnhnrvtps PWIVT 


28.1 


Dermal fibroblast 
CCD 1070 rest 


4.1 


B lymphocytes CD40L 
and IL-4 


1 1.4 


Dermal fibroblast 
CCD 1070 TNF alpha 


42.9 


EOL-1 dbcAMP 


1.5 


Dermal fibroblast 
CCD 1070 IL-1 beta 


4.9 


EOL-1 dbcAMP 

PlVf A /tnnnmvpi'n 

1 l\Jll\Jlliy 1/111 


1.9 


Dermal fibroblast IFN 

i_l Ci. 1 1 1 1 1 1 CI 


7.3 


Dendritic cells none 


10.2 


Dermal fibroblast IL-4 


6.8 


Dendritic cells LPS 


7.3 


Dermal Fibroblasts rest 


7.7 


Dendritic cells anti- 
CD40 


6.0 


Neutrophils TNFa+LPS 


13.5 


Monocytes rest 


2.5 


Neutrophils rest 


12.8 


Monocytes LPS 


34.9 


Colon 


6.0 


Macrophages rest 


9.4 


Lung 


10.6 


Macrophages LPS 


7.1 


Thymus 


20.3 
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HUVEC none 


5.4 


j Kidney 


1. 


10.2 


HUVEC starved 


8.5 i r 



CNS_neurodegeneration_vl.O Summary: Ag3022 No significant expression 
detected. The amp plot indicates that there is possibility of a potential chemistry or 
probe/primer failure (data not shown). 
5 General _screening_j)anel_vl.4 Summary: Ag3022 The amp plot indicates that there 

is possibility of a potential chemistry or probe/primer failure (data not shown). 

Generai_screening_panel_vl.5 Summary: Ag4891 TheNOV76a gene has moderate 
levels of expression in adipose, liver, heart, skeletal muscle, pituitary, thyroid and pancreas, 
and high levels of expression in adrenal gland. Thus, this gene product may be a small 
10 molecule target for the treatment of metabolic, endocrine and adrenal diseases, including 
obesity, Types 1 and 2 diabetes, and Addison's disease. 

In addition, this gene is expressed at moderate levels in the cancer cell lines in this 
panel. A higher level of expression is observed in colon, lung, breast and ovarian cancer cell 
lines when compared to samples from the normal colon, lung, breast and ovary. Thus, this 
15 gene could be used as a diagnostic marker of cancer in these tissues. Furthermore, inhibition of 
the activity of this gene product using small molecule drugs may be useful for the treatment of 
cancer in these tissues. 

This gene encodes a homolog of a dual specificity phosphatase that is also expressed at 
low to moderate levels across the CNS. Dual-spec ificity phosphatases comprise a family of 
20 MAP kinase regulating enzymes, members of which are upregulated in brains subjected to 

insults such as ischemia and seizure activity. MAP kinases are known to regulate neurotrophic 
and neurotoxic pathways. Consequently, agents that modulate the activity of this gene may 
have utility in attenuating the apoptotic and neurodegenerative processes following brain 
insults. 
25 References: 

Wiessner C. The dual specificity phosphatase PAC-1 is transcriptionally induced in the 
rat brain following transient forebrain ischemia. Brain Res Mol Brain Res 1995 
Feb;28(2):353-6 

PAC-1 mRNA has previously been found only in activated T-cells in vitro and in vivo. 

30 The gene encodes a dual specificity protein phosphatase that regulates MAP kinase activity. 

Here, I describe that PAC-1 mRNA is induced also in neurons in the rat brain following 30 

min of forebrain ischemia. At 6, 12 and 24 h after ischemia, PAC-1 mRNA was found most 

prominently in hippocampal cells which are resistant to 30 min of forebrain ischemia, but not 
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in the selectively vulnerable CA1 sector. At later time points and in control animals no PAC-1 
mRNA could be detected in any brain region. The protein-tyrosine/threonine phosphatase 
PAC-1, therefore, may be involved in adaptational responses of hippocampal cells resistant to 
ischemic injury. 

5 Boschert U, Muda M, Camps M, Dickinson R, Arkinstall S. Induction of the dual 

specificity phosphatase PAC1 in rat brain following seizure activity. Neuroreport 1997 Sep 
29;8(14):3077-80 

Recurrent seizure activity leads to delayed neuronal death as well as to inflammatory 
responses involving microglia in hippocampal subfields CA1, CA3 and CA4. Since mitogen 

10 activated protein (MAP) kinases control neuronal apoptosis and trigger generation of 

inflammatory cytokines, their activation state could determine seizure-related brain damage. 
PAC1 is a dual specificity protein phosphatase inactivating MAP kinases which we have 
found to be undetectable in normal brain. Despite this, kainic acid-induced seizure activity 
lead to rapid (approximately 3 h) but transient appearance of PAC1 mRNA in granule cells of 

15 the dentate gyrus as well as in pyramidal CA1 neurons. This pattern changed with time and 

after 2-3 days PAC1 was induced in dying CA1 and CA3 neurons. At this time PAC1 mRNA 
was also expressed in white matter microglia as well as in microglia invading the damaged 
hippocampus. PAC1 may play an important role controlling MAP kinase involvement in both 
neuronal death and neuro-inflammation following excitotoxic damage. 

20 Panel 4.1D Summary: Ag4891 The NOV76a gene is expressed in a wide range of cell 

types and tissues (CT=26-34) of significance in the immune response in health and disease. 
Highest expression of this gene is detected in activated secondary Th2 cells (CT=26.48). 
Therefore, targeting of this gene product with a small molecule drug or antibody therapeutic 
may modulate the functions of cells of the immune system as well as resident tissue cells and 

25 lead to improvement of the symptoms of patients suffering from autoimmune and 

inflammatory diseases such as asthma, allergies, inflammatory bowel disease, lupus 
erythematosus, and arthritis, including osteoarthritis and rheumatoid arthritis. 

Panel 4D Summary: Ag3022 No significant expression detected. Potential 
probe/primer failure (data not shown). 

30 NOV77 

Expression of gene NOV77 was assessed using the primer-probe sets Ag3023 and 
Ag3373, described in Tables BRA and BRB. Results of the RTQ-PCR runs are shown in 
Tables BRC, BRD, BRE and BRF. 
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Table BRA. Probe Name Ag3023 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ctaatgctggatttgtccatca-3 ' 


22 


572 


1303 


Probe 


TET-5 * -tcaggaatatgaagccatctacctagca- 
3 ' -TAMRA 


28 


597 


1304 


Reverse 


5 ' -tggagtggtgacatcatctgta-3 ■ 


22 
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Table BRB. Probe Name Ag3373 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -atttgtccatcaacttcaggaa-3 1 


22 


582 


1306 


Probe 


TET-5' -tgaagccatctacctagcaaaattaaca- 
3' -TAMRA 


28 


606 


1307 


Reverse 


5 1 -tggagtggtgacatcatctgta-3 1 


22 


635 
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Table BRC. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag3023, Run 
209821074 


Rel. Exp.(%) ! 
Ag3373, Run 
210154071 


Tissue 
Name 


Rel. Exp.(%) 
Ag3023, Run 
209821074 


Rel. Exp.(%) 
Ag3373, Run 
210154071 


AD 1 Hippo 


10.9 


16.8 


Control 
(Path) 3 
Temporal : 
Ctx 


9.1 


8.0 


AD 2 Hippo 


34.2 


37.6 


Control 
(Path) 4 
Temporal 
Ctx 


40.6 


65.5 


AD 3 Hippo 


12.0 


15.8 


AD 1 
Occipital 
Ctx 


24.7 


29.1 


AD 4 Hippo 


13.8 


10.3 


AD 2 
Occipital 

Ctx 
(Missing) 


0.0 


0.0 


AD 5 hippo 


60.7 


57.8 


AD 3 
Occipital 
Ctx 


14.7 


15.0 


AD 6 Hippo 


80.7 


72.2 


AD 4 
Occipital 
Ctx 


35.4 


22.4 


Control 2 
Hippo 


35.8 


38.4 


AD 5 
Occipital 
Ctx 


3.9 


30.4 


Control 4 
Hippo 


16.5 


11.7 


AD 6 
Occipital 


46.0 


37.4 
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Ctx ) 




Control (Path) 
3 Hippo 


13.1 


15.4 


Control 1 | 

fW> c ♦ i n i t n 1 

Ctx 


9 9 


10.7 


AD 1 Temporal 
Ctx 


39.0 


31.4 


Control 2 
Occipital 
^ix 


39.0 


38.4 


AD 2 Temporal 
Ctx 


38.7 


73.2 


Control 3 
Occipital 
Ctx 


23.0 


20.6 


AD 3 Temporal 
Ctx 


9 S 


1 3 2 


Control 4 
Occipital 

Ctx 


13.3 


13.3 


AD 4 Temporal 


27.9 


34.9 


Control 
(Path) 1 
Occipital 
Ctx 


80.1 


76.3 


AD 5 Inf 


59.0 


100.0 


Control 
(Path) 2 
Occipital 

^✓LX 


17.3 


20.0 


AD 5 
SupTemporal 
Ctx 


33.2 

- 


44.1 


Control 

^.r din ) D 

Occipital 
Ctx 


8.4 


8.7 


AD 6 lnf 
Temporal Ctx 


100.0 


73.2 


Control 
(Path) 4 
Occipital 
Ctx 


21.2 


20.6 


AD 6 Sup 
Temporal Ctx 


79.6 


80.1 


Control 1 
Parietal Ctx 


12.1 


16.3 


Control 1 
Temporal Ctx 


10.2 


13.7 


Control 2 
Parietal Ctx 


48.0 


4U.y 


Control 2 
Temporal Ctx 


41.2 


31.9 


Control ^ 

Parietal Ctx 


17.9 


16.3 


Control 3 
Temporal Ctx 


20.3 


20.0 


Control 
(Path) 1 
Parietal Ctx 


74.7 


64.2 


Control 4 
Temporal Ctx 


9.7 


9.9 


Control 
(Path) 2 
Parietal Ctx 


28.9 


59.9 


Control (Path) 
1 Temporal Ctx 


59.9 


68.3 


Control 
(Path) 3 
Parietal Ctx 


10.2 


9.0 


Control (Path) 
2 Temporal Ctx 


40.3 


41.2 


Control 
(Path) 4 
Parietal Ctx 


44.8 


43.8 
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Table BRD. General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) Ag3373, 
Run 217043119 


Tissue Name 


Rel. Exp.(%) Ag3373, 
Run 217043119 


Adipose 


12.0 


Renal ca. TK-10 


20.3 


Melanoma* 
Hs688(A).T 


30.8 


Bladder 


Zj.Z 


Melanoma* 
Hs688(B).T 


69.3 


Gastric ca. (liver met.) 
NCI-N87 


25.3 


Melanoma* Ml 4 ; 


15.0 


Gastric ca. KATO III 


30.8 


Melanoma* 
LOXIMVI 


zo.o 


v^UlOIl L/d. O VV "VtO 


9.7 


Melanoma* SK- 
MEL-5 


21.5 


Colon ca. SW480 


35.1 


Squamous cell 
carcinoma SCC-4 


33.0 


Colon ca.* (SW480 
met) SW620 


13.9 


Testis Pool 


19.8 


Colon ca. HT29 


8.5 


Prostate ca.* (bone 
met) PC-3 


1 (\(\ A 


l^oion Ca. rlv^ i - 1 i o 


36 9 

JUi7 


Prostate Pool 


9.2 


Colon ca. CaCo-2 


42.9 


Placenta 


3.8 


Colon cancer tissue 


9.0 


Uterus Pool 


7.4 


Colon ca. SW1116 


5.8 


Ovarian ca. 
OVCAR-3 


28^5 


Colon ca. Colo-205 


* 4.3 


Ovarian ca. SK-OV- 
3 


40.3 


Colon ca. SW-48 


4.2 


Ovarian ca. 
OVCAR-4 


20.0 


Colon Pool 


20.7 


Ovarian ca. 
OVCAR-5 


j j. i 


^Imall TnfpQtinp Pool 


12.2 


Ovarian ca. 1GROV- 
1 


10.9 


Stomach Pool 


9.9 


Ovarian ca. 
OVCAR-8 


9.2 


Bone Marrow Pool 


11.6 


Ovary 


9.7 


Fetal Heart 


20.7 


Breast ca. MCF-7 


37.6 


Heart Pool 


10.6 


Breast ca. MDA- 
MB-231 


37.1 


Lymph Node Pool 


17.9 


Breast ca. BT 549 


62.4 


Fetal Skeletal Muscle 


12.3 


Breast ca. T47D 


61.1 


Skeletal Muscle Pool 


16.0 


Breast ca. MDA-N 


10.0 


Spleen Pool 


11.6 


Breast Pool 


17.3 


Thymus Pool 


12.2 


Trachea 


12.0 


CNS cancer (glio/astro) 
U87-MG 


29.1 


Lung 


6.7 


CNS cancer (glio/astro) 
U-118-MG 


69.3 
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Fetal Lung 




CNS cancer 
(neuro;met) SK-N-AS 


34.9 


Lung ca. NCI-N417 


5.4 


CNS cancer (astro) SF- 
539 


19.1 


Lungca. LX-1 


17.2 


CNS cancer (astro) 
SNB-75 


35.8 


Lung ca. NCI-H146 


3.0 


CNS cancer (guo) 
SNB-19 


11.3 


Lung ca. SHP-77 


18.6 


CNS cancer (glio) SF- 
295 


26.4 


Lung ca. A549 


29.1 


Rrni'n ( A ttivctHh Pool 

J3Id.Hl ^/Alliy ^KJKllcLJ I \JKJl 


4.5 


Lung ca. NCI-H526 4.6 


131 CX 111 ^^/l^l^L/^IILIIll J 


8.1 


Lung ca. NCI-H23 


31.6 


Dldlll ^iWLCliy 


13.2 


Lung ca. NCI-H460 


18.2 


DIalil ^ril|Jp*Juaiii^»uc>y 

Pool 


5.3 


Lung ca. HOP-62 


14.1 


Cerebral Cortex Pool 


5.4 


Lung ca. NCI-H522 


31.6 


Brain (Substantia nigra) 
Pool 


A O 

4.8 


Liver 


1.2 


Brain (Thalamus) Pool 


8.0 


Fetal Liver 


32.3 


Brain (whole) 


6.2 


Liver ca. HepG2 


14.6 


Spinal Cord Pool 


6.6 


Kidney Pool 


j 22.1 


Adrenal Gland 


8.1 


Fetal Kidney 


26.1 


Pituitary gland Pool 


3.0 


Renal ca. 786-0 


F 28.7 


Salivary Gland 


4.7 


Renal ca. A498 


j 11.3 


Thyroid (female) 


4.4 


Renal ca.ACHN j 12.2 


Pancreatic ca. CAPAN2 


17.3 


Renal ca.UO-31 1 24.1 


Pancreas Pool 


17.1 


Table BRE. Panel 1.3D 


Tissue Name 


Rel. Exp.(%) Ag3023, 
Run 167966931 


Tissue Name 


Rel. Exp.(%) Ag3023, 
Run 167966931 


Liver adenocarcinoma 


51.1 


Kidney (fetal) 


26.2 


Pancreas 


6.1 


Renal ca. 786-0 


34.2 


Pancreatic ca. CAPAN 
2 


17.7 


Renal ca. A498 


17.6 


Adrenal gland 


3.8 


Renal ca. RXF 393 


17.2 


Thyroid 


3.0 


Renal ca. ACHN 


13.5 


Salivary gland 


3.9 


Renal ca. UO-31 


0.0 


Pituitary gland 


3.6 


Renal ca. TK-10 


23.0 


Brain (fetal) 


_ 


Liver 


11.7 


Brain (whole) 


| 8.5 


Liver (fetal) 


8.0 


Brain (amygdala) 


j 6.7 


Liver ca. 
(hepatoblast) HepG2 


26.2 
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Brain (cerebellum) 


15.2 


Lung 


3.1 


Brain (hippocampus) 


5.4 


Lung (fetal) 


11.0 


Brain (substantia nigra) 


9.0 


Lung ca. (small cell) 
LX-1 


12.9 


Brain (thalamus^ 


4.2 


Lung ca. (small cell) 
NCI-H69 


9 9 


Cerebral Cortpx 


2 0 


Lung ca. (s.cell var.) 
SHP-77 


67 8 


^ninal cnrrl 




Lung ca. (large 
cell)NCI-H460 




p1 in/astro I J87-MO 




Lung ca. (non-sm. 
cell) A549 


4S 1 


crIWa<;trr» 11-11 8-Mfi 

gnu/ ClMl U LJ 1 lO IVlvJ 




Lung ca. (non-s.cell) 
NCI-H23 


10 7 


astrocytoma S W 1 783 


40.6 


Lung ca. (non-s.cell) 
HOP-62 


25.7 


neuro*; met SK-N-AS 


27.2 


Lung ca. (non-s.cl) 
NCI-H522 


38.2 


astrocytoma SF-539 


29.7 


Lung ca. (squam.) 
SW 900 


27.4 


astrocytoma SNB-75 


35.1 


Lung ca. (squam.) 
NCI-H596 


29.9 


glioma SNB-19 


15.6 


Mammary gland 


5.1 


glioma U251 


37.9 


Breast ca.* (pl.ef) 
MCF-7 


47.0 


glioma SF-295 


18.4 


Breast ca.* (pl.ef) 
MDA-MB-231 


22.7 


Heart (fetal) 


2.9 


Breast ca.* (pl.ef) 
T47D 


86.5 


Heart 


12.9 


Breast ca. BT-549 


15.9 


Skeletal muscle (fetal) 1 


3.4 


Breast ca. MDA-N 


10.4 


Skeletal muscle 


36.3 


Ovary 


2.9 


Bone marrow 


4.5 


Ovarian ca. OVCAR- 

3 


26.1 


Thymus 


14.3 


Ovarian ca. OVCAR- 
4 


16 3 


Spleen 


8.7 


Ovarian ca. OVCAR- 
5 


83.5 


Lymph node 


11.8 


Ovarian ca. OVCAR- 
8 


9.3 


Colorectal 


10.4 


Ovarian ca. IGROV- 
1 


12.0 


Stomach 


7.8 


Ovarian ca.* (ascites) 
SK-OV-3 


100.0 


Small intestine 


5.1 


Uterus 


4.9 
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Colon ca. SW480 


19.3 


Placenta 


1.3 


Colon ca.* 
SW620(SW480 met) 


42.9 


Prostate 


3.9 


Colon ca. HT29 


9.9 


Prostate ca.* (bone 
met)PC-3 


78.5 


Colon ca. HCT-116 


26.2 


Testis 


9.7 


Colon ca. CaCo-2 


41.5 


Melanoma 
Hs688(A).T 


5.9 


Colon ca. 
tissue(OD03866) 


6.3 


Melanoma* (met) 
Hsoo»(tJ). 1 


14.2 


Colon ca. HCC-2998 


16.0 


Melanoma UACC-62 


14.0 


Gastric ca.* (liver met) 
NCI-N87 


18.8 


Melanoma M 1 4 


5.7 


Bladder 


30.6 


Melanoma LOX 
IMVI 


8.8 


Trachea 


3.2 


Melanoma* (met) 
SK-MEL-5 


14.7 


Kidney 


9.6 


Adipose 


18.9 



Table BRF. Panel 4D 



Tissue Name 


Rel. 
Exp.(%) 
Ag3023, 
Run 
164516146 


Rel. 

Exp.(%) ! 
Ag3373, ! 
Run 
165296617 


Tissue Name 


Rel. 
Exp.(%) 
Ag3023, 

Run 
164516146 


Rel. 
Exp.(%) 
Ag3373, 

Run 
165296617 


Secondary Thl act 


18.6 


17.9 


HUVEC IL-lbeta 1 


20.3 


18.6 


Secondary Th2 act 


24.3 


28.5 


HUVEC IFN 
gamma 


25.3 


22.7 


Secondary Trl act 


22.8 


21.8 


HUVEC TNF 
alpha + IFN 
gamma 


16.3 


18.0 


Secondary Thl rest 


7.5 


6.8 


HUVEC TNF 
alpha + IL4 


18.2 


13.4 


Secondary Th2 rest 


11.6 


9.5 


HUVEC IL- 11 ; 


13.7 


9.9 


Secondary Trl rest 


12.1 


10.7 


Lung 
Microvascular EC 
none 


25.7 


21.6 


Primary Thl act 


20.7 


16.5 


Lung 
Microvascular EC 
TNFalpha + IL- 
lbeta 


26.2 


18.3 


Primary Th2 act 


20.2 


19.3 


Microvascular 
Dermal EC none 


27.5 


21.3 


Primary Trl act 


23.3 


27.7 


Microsvasular 
Dermal EC 


20.7 


19.9 
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TNFalpha + IL- 
lbeta j 






Primary Thl rest 


51.1 


51.4 


Dronciiiai 
epithelium 
TNFalpha + 
ILlbeta 


13.0 


16.3 


Primary Th2 rest 


26.2 


29.5 


Small airway 
epithelium none 


8.1 


8.5 


Primary Trl rest 


23.7 


26.1 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


50.3 


39.8 


lymphocyte act 


14.6 


11.0 


Coronery artery 
SMC rest 


20.2 


18.9 


CD45RO CD4 
lymphocyte act 


25.2 


22.4 


Coronery artery \ 
SMC TNFalpha + 
IL-lbeta 


12.0 


9.8 


act 


20.4 


15.8 


Astrocytes rest 


10.4 


11.1 


Secondary CD8 
lymphocyte rest 


16 5 


19.9 


Astrocytes 
TNFalpha + IL- \ 
lbeta 


11.7 


9.8 


Secondary CD8 
lymphocyte act 


13.2 


9.3 


KU-812 
(Basophil) rest 


47.6 


38.2 


CD4 lymphocyte 
none 


17.1 


11.6 


rvU-o i z 
(Basophil) 
PMA/ionomycin 


94.0 


92.0 


2ry 

Th1/Th2/Trl anti- 
CD95 CH11 


18.3 


16.6 


CCD 11 06 
(Keratinocytes) 
none 


19.9 


13.2 


LAK cells rest 


25.5 


16.0 


CCD 11 06 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


f\ 0 
o.u 


4.8 


LAK cells IL-2 


27.2 


22.5 


Liver cirrhosis 


3.1 


2.7 


LAK cells IL-2+IL- 
12 


27.2 


19.3 


Lupus kidney 


2.1 


1.7 


LAK cells IL- 
2+IFN gamma 


36.3 


34.4 


I NC1-H292 none 


30.1 


18.9 


LAK cells IL-2+ 
IL-18 


35.1 


29.7 


NCM1292 IL-4 


33.9 


34.6 


LAK cells 
PMA/ionomycin 


12.4 


11.0 


NCI-H292IL-9 


40.1 


29.1 


NK Cells IL-2 rest 


20.0 


15.0 


! NCI-H292IL-13 


16.2 


14.2 


Two Way MLR 3 
day 


24.0 


16.7 


| NCI-H292 I FN 
gamma 


16.6 


18.4 


Two Way MLR 5 


12.9 


10.1 


HPAEC none 


13.6 


13.5 
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day 










Two Way MLR 7 
day 


1L4 


9.5 


HPAEC TNI-' 
alpha + IL-1 beta 


25.3 


25.3 


PBMC rest 


13.7 


10.5 


Lung fibroblast 
none 


11.4 


14.2 


PBMC PWM 


69.3 


66.4 


Lune fibroblast 
TNF alpha + IL-1 
beta 


6.1 


7.2 


PBMC PHA-L 


22.8 


17.7 


Lung fibroblast i 
IL-4 


28.5 


9Q 1 


Ramos (B cell) 
none 


24.1 


19.3 


I un2 fibroblast 
IL-9 


23.0 


23.3 


Ramos (B cell) 
ionomycin 


100.0 


100.0 


Lung fibroblast ! 
IT -13 


20.6 


18.9 


B Ivmnhocvtes 
PWM 


71.7 


74.2 


Lung fibroblast | 

1JT1N gcuuum 


39.0 


32.5 


ft 1 vm nh opvtPQ 

CD40LandIL-4 


29.1 


28.7 


Dermal fibroblast | 
PCD 1070 rest 


33.9 


31.0 


EOL-1 dbcAMP ' 


12.1 


10.5 


Dermal fibroblast 
CCD1070 TNF 
alpha 


76.8 


62.0 


EOL-1 dbcAMP \ 

r iviy-v/ \\j\viy v^iii 


14.5 


10.9 


Dermal fibroblast 
CCD 1070 IL-1 
beta 


20.3 


13.9 

.......... 


Dendritic cells 
none 


13.2 


14.8 


Dermal fibroblast 
IFN gamma 


14.2 


9.5 

— ■ — ——— 


Dendritic cells LPS 


11.7 


8.3 


uermai iiDroDiasi 
IL-4 


26.4 


20.4 


Dendritic cells anti- 
CD40 


17.7 


12.7 


IBD Colitis 2 


2.6 


2.2 


Monocytes rest 


16.7 


17.6 


IBD Crohn's 


2.0 


1.9 


Monocytes LPS 


6.4 


5.0 


Colon 


11.9 


10.5 


Macrophages rest 


23.5 


22.8 


Lung 


13.3 


11.2 


Macrophages LPS 


9.9 


7.1 


Thymus 


14.4 


12.9 


HUVEC none 


20.6 


17.9 


i Kidney 


27.5 


19.6 


HUVEC starved 43.5 


38.4 


i 


i 



CNS_neurodegeneration_vl.O Summary: Ag3023/Ag3373 This panel does not show 
differential expression of the NOV77 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system. 

General_screening_panel_vl.4 Summary: Ag3373 Highest expression of the 
NOV77 gene is seen in a prostate cancer cell line (CT=27). Overall, this gene is expressed at 
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moderate levels in the cancer cell lines in this panel. A higher level of expression is observed 
in clusters of cell lines derived from prostate, brain, melanoma, colon, lung, breast and ovarian 
cancer when compared to expression in normal prostate, brain, colon, lung, breast and ovary. 
Thus, this gene could potentially be used as a diagnostic marker of cancer in these tissues. 
5 Furthermore, inhibition of the activity of this gene product using small molecule drugs may be 
effective in the treatment of cancer in these tissues. 

Among tissues with metabolic function, this gene product has moderate levels of 
expression in adipose, heart, skeletal muscle, adrenal, pituitary, thyroid and pancreas. Thus, 
this gene product may be a small molecule target for the treatment of endocrine and metabolic 
10 diseases, including obesity and Types 1 and 2 diabetes. 

In addition, this gene appears to be differentially expressed in fetal (CT value = 29) vs 
adult liver (CT value -33) and may be useful for differentiation between the two sources of 
this tissue. 

This gene is also expressed at moderate levels in all central nervous system samples 
1 5 present on this panel. Please see Panel 1 .3D for discussion of utility of this gene in the central 
nervous system. 

Panel 1.3D Summary: Ag3023 The NOV77 gene is ubiquitously expressed among 
the samples on this panel, with highest expression in an ovarian cancer cell line (CT=28.8). 
Overall, the expression of this gene shows good agreement with panel 1.4. A higher level of 

20 expression is observed in prostate, brain, melanoma, colon, lung, pancreatic, breast and 

ovarian cancer cell lines than the normal prostate, brain, colon, lung, pancreas, breast and 
ovary. Thus, expression of this gene could be used as a diagnostic marker of cancer in these 
tissues. Furthermore, inhibition of the activity of this gene product using small molecule drugs 
may be effective in the treatment of cancer in these tissues. 

25 Among tissues with metabolic function, expression of this gene is widespread, as in the 

previous panel. Please see Panel 1.4 for discussion of utility of this gene in metabolic disease. 

This gene represents a dual specificity phosphatase that is also expressed at low to 
moderate levels across the CNS. Dual-specificity phosphatases comprise a family of MAP 
kinase regulating enzymes, members of which are upregulated in brains subjected to insults 

30 such as ischemia and seizure activity. MAP kinases are kown to regulate neurotrophic and 
neurotoxic pathways. Consequently, agents that modulate the activity of this gene may have 
utility in attenuating the apoptotic and neurodegenerative processes following brain insults. 
References: 
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Wiessner C. The dual specificity phosphatase PAC-1 is transcriptionally induced in the 
rat brain following transient forebrain ischemia. Brain Res Mol Brain Res 1995 
Feb;28(2):353-6 

PAC-1 mRNA has previously been found only in activated T-cells in vitro and in vivo. 
5 The gene encodes a dual specificity protein phosphatase that regulates MAP kinase activity. 
Here, I describe that PAC-1 mRNA is induced also in neurons in the rat brain following 30 
min of forebrain ischemia. At 6, 12 and 24 h after ischemia, PAC-1 mRNA was found most 
prominently in hippocampal cells which are resistant to 30 min of forebrain ischemia, but not 
in the selectively vulnerable CA1 sector. At later time points and in control animals no PAC-l 
10 mRNA could be detected in any brain region. The protein-tyrosine/threonine phosphatase 

PAC-1, therefore, may be involved in adaptational responses of hippocampal cells resistant to 
ischemic injury. 

Boschert U, Muda M, Camps M, Dickinson R, Arkinstall S. Induction of the dual 
specificity phosphatase PAC1 in rat brain following seizure activity. Neuroreport 1997 Sep 

15 29;8(14):3077-80 

Recurrent seizure activity leads to delayed neuronal death as well as to inflammatory 
responses involving microglia in hippocampal subfields CA1, CA3 and CA4. Since mitogen 
activated protein (MAP) kinases control neuronal apoptosis and trigger generation of 
inflammatory cytokines, their activation state could determine seizure-related brain damage. 

20 PAC1 is a dual specificity protein phosphatase inactivating MAP kinases which we have 
found to be undetectable in normal brain. Despite this, kainic acid-induced seizure activity 
lead to rapid (approximately 3 h) but transient appearance of PAC1 mRNA in granule cells of 
the dentate gyrus as well as in pyramidal CA1 neurons. This pattern changed with time and 
after 2-3 days PAC1 was induced in dying CA1 and CA3 neurons. At this time PAC1 mRNA 

25 was also expressed in white matter microglia as well as in microglia invading the damaged 

hippocampus. PAC1 may play an important role controlling MAP kinase involvement in both 
neuronal death and neuro-inflammation following excitotoxic damage. 

Panel 4D Summary: Ag3023/Ag3373 The NOV77 gene is expressed at high to 
moderate levels in a wide range of cell types and tissues of significance in the immune 

30 response in health and disease. Highest expression of this gene is seen in ionomycin treated 
Ramos B cells (CT=26.83). Therefore, targeting of this gene product with a small molecule 
drug or antibody therapeutic may modulate the functions of cells of the immune system as 
well as resident tissue cells and lead to improvement of the symptoms of patients suffering 
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from autoimmune and inflammatory diseases such as asthma, allergies, inflammatory bowel 
disease, lupus erythematosus, and arthritis, including osteoarthritis and rheumatoid arthritis. 

NOV78 

Expression of gene NOV78 was assessed using the primer-probe set Ag3025, 
5 described in Table BSA. Results of the RTQ-PCR runs are shown in Tables BSB, BSC and 
BSD. 



Table BSA. Probe Name Ag3025 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward; 


5 * -gctgctgcaattgtaataggtt-3 ' 


22 


596 


1309 


Probe 


TET-5 ' -tcctgatgaattctgaacaaacctca-3 ' - 
TAMRA 


26 


618 


1310 


Reverse 


5 ■ -catatggaaggtcttgcatttt-3 ' 


22 


669 


1311 



Table BSB. CNS_neurodegeneration_yl.O 



Tissue Name 


Rel. Exp.(%) Ag3025, 
Run 209821733 


Tissue Name 


ReL Exp.(%) Ag3025, 
Run 209821733 


AD 1 Hippo 


21.3 


Control (Path) 3 
Temporal Ctx 


4.8 


AD 2 Hippo 


29.1 


Control (Path) 4 
Temporal Ctx 


50.7 


AD 3 Hippo 


15.9 


AD 1 Occipital Ctx 


22.7 


AD 4 Hippo 


11.4 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


66.4 


AD 3 Occipital Ctx 


9.0 


AD 6 Hippo 


26.6 


AD 4 Occipital Ctx 


13.7 


Control 2 Hippo 


32.5 


AD 5 Occipital Ctx 


54.7 


Control 4 Hippo 


7.2 


AD 6 Occipital Ctx 


8.1 


Control (Path) 3 
Hippo 


4.8 


Control 1 Occipital 
Ctx 


2.3 


AD 1 Temporal Ctx 


19.9 ; 


Control 2 Occipital 
Ctx 


40.1 


AD 2 Temporal Ctx 


29.9 5 


Control 3 Occipital 
Ctx 


13.8 


AD 3 Temporal Ctx 


9.3 ! 


Control 4 Occipital 
Ctx 


5.6 


AD 4 Temporal Ctx 


25.2 


Control (Path) 1 
Occipital Ctx 


84.7 


AD 5 Inf Temporal 
Ctx 


79.6 


Control (Path) 2 
Occipital Ctx 


10.0 


AD 5 Sup Temporal 


56.6 


Control (Path) 3 


0.6 
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Ctx 




Occipital Ctx 




AH 6 Inf Tpmnnral 

r\L/ \J 1111 1 t>lll Cll 

Ctx 


193 


r'ontrnl (Path} 4 

Occipital Ctx 


12.3 


AH f\ Qiin Tfmnnral 
rv i J \J o nyj i sziiLyjKji cLi ; 

Ctx 


24.5 


Ctx 


3.8 


Ctx 


9.7 


V^UIllIiJl Z. I ctl ICLcll 

Ctx 


30.1 


Control 2 Temnoral 
Ctx 


43.5 


Control 1 Parietal 

V UI 111 v/1 _/ 1 til I^/IUI 

Ctx 


21.6 


Control 3 Temporal 
Ctx 


15.2 


Control (Path) 1 
Parietal Ctx 


100.0 


Control 3 Temporal 
Ctx 


7.4 


Control (Path) 2 j 
Parietal Ctx 


34.9 


Control (Path) 1 
Temporal Ctx 


75.3 


Control (Path) 3 
Parietal Ctx 


1.5 


Control (Path) 2 
Temporal Ctx 


35.6 


Control (Path) 4 j 
Parietal Ctx 


52.1 



Table BSC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3025, 
Run 167968622 


Tissue Name 


Rel. Exp.(%) Ag3025, 
Run 167968622 


Liver adenocarcinoma 


3.6 


Kidney (fetal) 


16.3 


Pancreas 


6.8 


Renal ca. 786-0 


6.2 


Pancreatic ca. CAPAN 
2 


2.7 


Renal ca. A498 


10.9 


Adrenal gland 


2.3 


Renal ca. RXF 393 


6.7 


Thyroid 


7.2 


Renal ca. ACHN 


3.3 


Salivary gland 


3.2 


Renal ca. UO-3 1 


5.3 


Pituitary gland 


16.3 


Renal ca. TK-10 


7.2 


Brain (fetal) 


37.6 


Liver 


3.8 


Brain (whole) 


33.2 


Liver (fetal) 


3.4 


Brain (amygdala) 


29.7 


Liver ca. 
(hepatoblast) HepG2 


2.5 


Brain (cerebellum) 


9.7 


Lung 


4.6 


Brain (hippocampus) 


21.9 


Lung (fetal) 


0.0 


Brain (substantia nigra) 


12.2 


Lung ca. (small cell) 
LX-1 


3.8 


Brain (thalamus) 


10.4 


Lung ca. (small cell) 
NCI-H69 


2.5 


Cerebral Cortex 


16.3 


Lung ca. (s.cell var.) 
SHP-77 


100.0 


Spinal cord 


10.7 


Lung ca. (large 
cell)NCI-H460 


0.8 


glio/astro U87-MG 


6.0 


Lung ca. (non-sm. 


5.8 
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cell) A549 




_ ., 

glio/astroU-118-MG 


8.8 


Lung ca. (non-s.cell) 
NCI-H23 


15.4 


astrocytoma SW1 783 


15.3 


Lung ca. (non-s.cell) 
HOP-62 


7.7 


neuro*;metSK-N-AS 


6.4 


Lung ca. (non-s.cl) 
NCLH522 


13.4 


astrocytoma SF-539 


5.3 . 


Lung ca. (squam.) 
SW 900 


1.7 


astrocytoma SNB-75 


n i 


Lung ca. (squam.) 
NCI-H596 




glioma SNB-19 


6.3 


Mammary gland 


"~ 4.1 


glioma U251 


9.9 


Breast ca.* (pl.ef) 
MCF-7 


8.7 


glioma SF-295 


10.4 


Breast ca.* (pl.ef) 
MDA-MB-231 


1.7 


Heart (fetal) 


3.3 


Breast ca.* (pl.ef) 
T47D 


14.1 


Heart 


20.3 


Breast ca. BT-549 


3.6 


Skeletal muscle (fetal) 


0.5 


Breast ca. MDA-N 


15.8 


Skeletal muscle 


15.5 


Ovary 


9.0 


Bone marrow 


2.8 


Ovarian ca. OVCAR- 
3 


3.5 


Thymus 


11.9 


Ovarian ca. OVCAR- 
4 


1.0 


Spleen 


J.O 


Ovarian ca. OVCAR- 
5 


1 Q 1 


Lymph node 


D. 1 


Ovarian ca. OVCAR- 
8 


D. 1 


Colorectal 


A O 


Ovarian ca. IGROV- 
1 


u.o 


Stomach 


11.9 


Ovarian ca.* (ascites) 
SK-OV-3 


52.1 


Small intestine 


6.2 


Uterus 


7.3 


Colon ca. SW480 


0.3 


Placenta 


3.1 


Colon ca.* 
SW620(SW480 met) 


17 0 


1 1 Uo IcllC/ 


1 4 


Colon ca. HT29 


2.8 


Prostate ca.* (bone 
met)PC-3 


11.6 


Colon ca. HCT-116 


5.1 


Testis 


5.4 


Colon ca. CaCo-2 


15.2 


Melanoma 
Hs688(A).T 


2.2 


Colon ca. 
tissue(OD03866) 


1.7 


Melanoma* (met) 
Hs688(B).T 


2.2 


Colon ca. HCC-2998 


9.0 


Melanoma UACC-62 


12.2 
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Gastric ca.* (liver met) 
NCI-N87 


3.2 


Melanoma M14 


3.6 


Bladder 


3.1 


Melanoma LOX 
IMVI 


3.5 


Trachea 


5.2 


Melanoma* (met) 
SK-MEL-5 


11.8 


Kidney 


17.4 


Adipose 


7.9 



Table BSD. Panel 4D 



Tissue Name 


ReL Exp.(%) 
Ag3025, Run 
164528140 


Tissue Name 


ReL Exp.(%) 
Ag3025, Run 
16452i»140 


Secondary Thl act 


2.6 


HUVEC IL-lbeta 


12.5 


Secondary Th2 act 


3.3 


HUVEC I FN gamma 


9.5 


Secondary Trl act 




HUVEC TNF alpha + IFN 
gamma 


5.1 


Secondary Thl rest 


1.2 


HUVEC TNF alpha + IL4 


5.1 


Secondary Th2 rest 


2.7 | 


HUVEC 1L-1 1 


3.4 


Secondary Trl rest 


1.4 


Lung Microvascular EC 
none 


10.7 


Primary Thl act 


6.2 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


13.5 


Primary Th2 act 


3.5 


Microvascular Dermal EC 
none 


6.0 


Primary Trl act 


2.5 


Microsvasular Dermal EC 
l iNraipna + ii^-iDeia 


2.6 


Primary Thl rest 


5.4 


Bronchial epithelium 
TNFalpha + ILlbeta 


6.2 


Primary Th2 rest 


6.7 


Small airway epithelium 
none 


0.5 


Primary Trl rest 


1.4 


Small airway epithelium 
TNFalpha + IL-lbeta 


8.0 


CD45RA CD4 
lymphocyte act 


4.4 


Coronery artery SMC rest 


32.5 


CD45RO CD4 
lymphocyte act 


4.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


6.9 


CD8 lymphocyte act 


16.8 


Astrocytes rest 


9.2 


Secondary CD8 
lymphocyte rest 


5.2 


Astrocytes TNFalpha + 
IL-lbeta 


3.4 


Secondary CD8 
lymphocyte act 


1.9 


KU-812 (Basophil) rest 


7.3 


CD4 lymphocyte none 


2.3 


KU-812 (Basophil) 
PMA/ionomycin 


29.5 


2ry Thl/Th2/Trl_anti- 


2.6 


CCD! 106 (Keratinocytes) 


1.3 
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CD95 CH11 




none 




LAK cells rest 


— - _ 

5.8 


frni i r\f, (K eratinocvtes^ i 
TNFalpha + IL-lbeta 


' 0.0 


LAK cells IL-2 


21.0 


Liver cirrhosis 


3.7 


LAK cells IL-2+IL- 12 j 


1 1.3 


Lupus kidney 


3.7 


LAK cells IL-2+IFN 
gramma 


12.4 


NCI-H292 none 


12.2 


I AK cells IL-2+ IL-18 


67.4 


NC1-H292 IL-4 


10.2 


LAK cells 
PA/I A/ionomvcin 


0.6 


NCI-H292 IL-9 


100.0 


NK Cells TI -2 rest 


4.2 j 


NCI-H292 IL-13 


5.1 


Two Wav Ml R ^ Hav 

l w u vv ay l v i l > i v _j \x<xj 


3.5 


NC1-H292 IFN gamma 


2.7 


Two Way MLR 5 day ! 


1.3 


HPAEC none 


9.3 


Two Way MLR 7 day j 


4.9 


upAPf T>jr nlnhn + IT -1 
11 r/\cU 1 IN I dipiid • iij'i 

beta 


2.4 


PBMC rest 




i^ung iiuroDiabi nunc 


5 5 


PBMC PWM 


24.7 


Lung fibroblast TNF alpha 

i 1 \->~ 1 UcLcl 


6.3 


PBMC PHA-L 


4.3 


Lung fibroblast IL-4 


6.8 


Ramos (B cell) none ! 


19.1 


Lung fibroblast IL-9 


10.4 


Ramos (B cell) 
ionomycin 


33.4 


Lung fibroblast IL-13 


4.5 


B lymphocytes PWM 


19.2 


Lung fibroblast IFN 
gamma 


15.2 


B lymphocytes CD40L 
and IL-4 


5.2 


Dermal fibroblast 
CCD 1070 rest 


17.8 


EOL-1 dbcAMP 


0.7 


Dermal fibroblast 
CCD 1070 TNF alnha 


36.6 


GDCAJVLr 

PMA/ionomycin 


0.9 


nprm a 1 flHrnHla^t 

CCD 1070 IL-1 beta 


6.5 


Dendritic cells none 


3.7 


Derma! fibroblast IFN 
gamma 


0.6 


Dendritic cells LPS 


6.0 


Dermal fibroblast IL-4 


6.6 


Dendritic cells anti- 
CD40 


3.3 


IBD Colitis 2 


0.7 


Monocytes rest 


2.4 


IBD Crohn's 


0.8 


Monocytes LPS 


0.6 


Colon 


21.5 


Macrophages rest 


5.6 


Lung 


18.3 


Macrophages LPS 


0.8 


Thymus 


52.1 


HUVEC none 


6.6 


Kidney 


19.2 


HUVEC starved 


21.9 







CNS_neurodegeneration_vl.O Summary: Ag3025 This panel does not show 
differential expression of the NOV78 gene in Alzheimer's disease. However, this expression 
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profile confirms the presence of this gene in the brain. Please see Panel 1 .3D for discussion of 
utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag3025 Highest expression of the NOV78 gene is seen in a 
lung cancer cell line (CT=30.5). Higher levels of expression are observed in prostate, lung, 
5 breast and ovarian cancer cell lines when compared with the normal prostate, lung, breast and 
ovary. Thus, expression of this gene may be used as a diagnostic marker of cancer in these 
tissues. Furthermore, inhibition of the activity of this gene product using small molecule drugs 
may be effective in the treatment of cancer in these tissues. 

Among tissues with metabolic function, this gene has a low level of expression in 
10 pancreas, thyroid, pituitary, heart, and adipose. Therefore, this gene product may be a small 
molecule target for the treatment of metabolic and endocrine diseases, including obesity and 
Types 1 and 2 diabetes. 

This gene represents a dual specificity phosphatase that is also expressed at low to 
moderate levels across the CNS. Dual-specificity phosphatases comprise a family of MAP 
15 kinase regulating enzymes that are upregulated in brains subjected to insults such as ischemia 
and seizure activity. MAP kinases are kown to regulate neurotrophic and neurotoxic pathways. 
Consequently, agents that modulate the activity of this gene may have utility in attenuating the 
apoptotic and neurodegenerative processes following brain insults. 
References: 

20 Wiessner C. The dual specificity phosphatase PAC-1 is transcriptionally induced in the 

rat brain following transient forebrain ischemia. Brain Res Mol Brain Res 1995 
Feb;28(2):353-6 

PAC-1 mRNA has previously been found only in activated T-cells in vitro and in vivo. 
The gene encodes a dual specificity protein phosphatase that regulates MAP kinase activity. 

25 Here, I describe that PAC-1 mRNA is induced also in neurons in the rat brain following 30 
min of forebrain ischemia. At 6, 12 and 24 h after ischemia, PAC-1 mRNA was found most 
prominently in hippocampal cells which are resistant to 30 min of forebrain ischemia, but not 
in the selectively vulnerable CA1 sector. At later time points and in control animals no PAC-1 
mRNA could be detected in any brain region. The protein-tyrosine/threonine phosphatase 

30 PAC-1, therefore, may be involved in adaptational responses of hippocampal cells resistant to 
ischemic injury. 

Boschert U, Muda M, Camps M, Dickinson R, Arkinstall S. Induction of the dual 
specificity phosphatase PAC1 in rat brain following seizure activity. Neuroreport 1997 Sep 
29;8(14):3077-80 
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Recurrent seizure activity leads to delayed neuronal death as well as to inflammatory 
responses involving microglia in hippocampal subfields CA1, CA3 and CA4. Since mitogen 
activated protein (MAP) kinases control neuronal apoptosis and trigger generation of 
inflammatory cytokines, their activation state could determine seizure-related brain damage. 
5 PAC1 is a dual specificity protein phosphatase inactivating MAP kinases which we have 
found to be undetectable in normal brain. Despite this, kainic acid-induced seizure activity 
lead to rapid (approximately 3 h) but transient appearance of PAC1 mRNA in granule cells of 
the dentate gyrus as well as in pyramidal CA1 neurons. This pattern changed with time and 
after 2-3 days PAC1 was induced in dying CA1 and CA3 neurons. At this time PAC1 mRNA 

10 was also expressed in white matter microglia as well as in microglia invading the damaged 

hippocampus. PAC1 may play an important role controlling MAP kinase involvement in both 
neuronal death and neuro-inflammation following excitotoxic damage. 

Panel 4D Summary: Ag3025 The NOV78 gene is expressed at moderate to low levels 
in a wide range of cell types of significance Highest expression is detected in IL-9 treated 

15 NCI-H292 mucoepidermoid cells (CT=31.81) with lower expression levels in non-treated 

NCI-H292 cells. Expression is also seen in (i) LAK cells stimulated with IL-2, 1L-2 +IL-12, 
IL-2 + IL-18, and IL-2 + IFNgamma (ii) stimulated and non-stimulated Ramos B cells and 
polkweed mitogen stimulated B lymphocytes, (iii) starved and IL-1 treated HUVECs, (iv) 
TNF alpha+IL-1 beta treated and non treated lung microvascular endothelial cells and resting 

20 coronary artery smooth muscle cells (v) treated Ku-812 basophils (vi) IFN gamma treated lung 
fibroblasts, and (vii) normal tissues represented by colon, lung, thymus and kidney. Based on 
this pattern of expression, this gene product may be involved in both disease and homeostatic 
processes for these and other cell types and tissues. Therefore, modulation of this gene product 
with a functional therapeutic may lead to the alteration of functions associated with these cell 

25 and tissue types and improvement of the symptoms of patients suffering from autoimmune and 
inflammatory diseases such as COPD, emphysema, asthma, allergies, inflammatory bowel 
disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and osteoarthritis. 

NOV79: Dual Specificity Phosphatase 

Expression of gene NOV79 was assessed using the primer-probe set Ag3039, 
30 described in Table BTA. Results of the RTQ-PCR runs are shown in Tables BTB, BTC and 
BTD. 

Table BTA. Probe Name Ag3039 

Pr imer s] ' ^ " ^ J""" " „".', I 17.. .I s eq[Uerice s . 1.1 7 ^ ^ ^ ^ 1^.' Ifl^^?^. JI^ I S ^^'^ . . ..^ . . J S ?9 1 j?JL 
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Position 




Forward 


5 1 -gccgaaataagatcacacacat -3 1 


22 


320 


1312 


Probe 


TET-5 ' -tctatccatgagtcaccccagcctct-3 1 - : 
TAMRA 


26 


346 


1313 


Reverse 


5 1 -atgcgaaggtaggtgatatcct-3 ' 


22 


377 
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Table BTB. CNS_neurodegeneration_vl.O 





ReLExp (%) Ag3039, | Name 
Run 211012103 } 


Rel. Exp.(%) Ag3039, 
Run 211012103 


AH 1 Uinnn 


1 O A 

1 5.4 


Control (Path) 3 j 
Temporal Ctx 


8.2 


r\LJ Z. riippo 


48.0 


Control (Path) 4 
Temporal Ctx 


36.3 


AD 3 Hippo 


9.8 


AD 1 Occipital Ctx 


9.5 


AD 4 Hippo 


13.6 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


70.2 


AD 3 Occipital Ctx 


6.3 


AD 6 Hippo 


69.3 


AD 4 Occipital Ctx 


20.9 


Control 2 Hippo 


25.5 


AD 5 Occipital Ctx 


18.3 


r^nntrol 4 Hinno 




AD 6 OrHnital Ctx 


43.2 


Control (Path) 3 
Hippo 


7.6 


Control 1 Occipital 
Ctx 


6.0 


AD 1 Temporal Ctx 


24.3 


Control 2 Occipital 
Ctx 


57.0 


AD 2 Temporal Ctx ; 


36.9 


Control 3 Occipital 
Ctx 


18.7 


AD 3 Temporal Ctx 


4.7 


Control 4 Occipital 
Ctx 


13.9 


AD 4 Temporal Ctx 


24.5 


Control (Path) 1 
Occipital Ctx 


74 ? 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


14.8 


AD 5 SupTemporal 
Ctx 


62.9 


Control (Path) 3 
Occipital Ctx 


4.3 


AD 6 Inf Temporal 
Ctx 


58.2 


Control (Path) 4 
Occipital Ctx 


25.2 


AD 6 Sup Temporal 
Ctx 


49.3 


Control 1 Parietal 
Ctx 


15.9 


Control 1 Temporal 
Ctx 


11.6 


Control 2 Parietal 
Ctx 


58.2 


Control 2 Temporal 
Ctx 


34.4 


Control 3 Parietal 
Ctx 


32.1 


Control 3 Temporal 
Ctx 


20.0 


Control (Path) 1 
Parietal Ctx 


66.9 


Control 4 Temporal 


20.7 


Control (Path) 2 


39.0 
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"1 h li ft Ti iv Ai t 




., i.vi .n*= 


m 


i 




m 






Ctx 




Parietal Ctx 




Control (Path) 1 
Temporal Ctx 


44.4 


Control (Path) 3 

r aricLdi ^ L A 


4.6 


Control (Path) 2 
Temporal Ctx 


30.4 


Control (Path) 4 
Parietal Ctx 


35.8 



Table BTC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3039, 
Run 167961816 


Tissue Name 


Rel. Exp.(%) Ag3039, 
Run 167961816 


Liver adenocarcinoma 


1.7 


Kidney (fetal) 


38.2 


Pancreas 


2.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


0.4 


Adrenal gland 


1.2 


Renal ca. RXF 393 


0.1 


TVi a/it* i rl 
1 Ilj I U1U 


6.0 


Renal ca. ACHN 


1.5 


Salivary gland 


0.8 


Renal ca. UO-31 i 


0.0 


Pituitary gland 


J.U 


Renal ca. TK- 10 


0.0 


Brain (fetal) 


7.4 


Liver 


0.3 


Brain (whole) 


7.7 


Liver (fetal) 


0.7 


Brain (amygdala) 


6.0 


Liver ca. 
(hepatoblast) HepG2 


0.1 


Brain (cerebellum) 


5.9 


Lung 


1.3 


Brain (hippocampus) 


3.9 


Lung (fetal) 3.0 


r>rain ^suDSianiia nigra^ 




Lung ca. (small cell) 
LX-1 


0.0 


DTdlll ^llldialllUb^ 




Lung ca. (small cell) 
NCI-H69 


1.5 


Cerebral Cortex 


10.8 


Lungca. (s.cell var.) 
SHP-77 


3.7 


Spinal cord 


15.2 


Lung ca. (large 
cell)NCI-H460 


0.1 


glio/astro U87-MG 


0.2 


Lung ca. (non-sm. 
cell) A549 


1.6 


glio/astro U-118-MG 


0.1 


Lung ca. (non-s.cell) 
NCI-H23 


0.4 


astrocytoma SW1783 


0.6 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.3 


Lung ca. (non-s.cl) 
NCI-H522 


1.4 


astrocytoma SF-539 


1.5 


Lung ca. (squam.) 
SW 900 


0.7 


astrocytoma SNB-75 


1.2 


Lung ca. (squam.) 
NCI-H596 


5.7 


glioma SNB-19 


2.1 


Mammary gland 


2.4 
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glioma U251 


0.3 


DICdbL Ld. ^pi.Cl^ 

MCF-7 


0.7 


glioma SF-295 


2.3 


DlCdsl ta. ^pi.Cl^ 

MDA-MB-231 


0.1 


Heart (fetal) 


14.5 


DICdjI td. ^JJl.Cl^ 

T47D 


13.5 


rieart 


M , — , v 


DlCaol td. L> 1 -JH7 




Skeletal muscle (fetal) I 


5.1 


Breast ca. MDA-N 


6.4 


Skeletal muscle 


0.0 


Ovary 


3.9 


Bone marrow 


0.4 


Ovarian ca. OVCAR- 

3 | 


0.3 


Thymus 


0.2 


Ovarian ca. OVCAR- 
4 


13.8 


Spleen 


3.1 


Ovarian ca. OVCAR- 


1.5 


Lymph node 


0.9 


Ovarian ca. OVCAR- 

o 


0.0 


Colorectal 


0.4 


Ovarian ca. IGROV- 

i 
i 


0.0 


Stomach 


0.6 


Ovarian ca.* (ascites) 
SK-OV-3 


0.4 


Small intestine 


1 A 

l.U 


Uterus 


z.z 


Colon ca. SW480 


6.7 


Placenta 


0.0 


Colon ca.* 
bwo2U(oW4oU met) 


0.0 


Prostate 


2.7 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
mei )\ v^-j 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


100.0 


Colon ca. CaCo-2 


12.2 


Melanoma 
Hs688(A).T 


0.1 


Colon ca. 


1.2 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


3.0 


Melanoma UACC-62 


5.2 


Gastric ca.* (liver met) 
NCI-N87 


1.2 


Melanoma M14 


0.8 


Bladder 


1.8 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.6 


Melanoma* (met) 
SK-MEL-5 


2.0 


Kidney 


41.8 


Adipose 


0.6 



Table BTD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3039, Run 


Tissue Name 


Rel. Exp.(%) 
Ag3039, Run 
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162427949 




162427949 


Secondary Th 1 act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


n n 
u.w 


LIT ivpr TF7\T rrnmmn 
rl VJ VD\j 1_T1N gcllllllla 




Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 

gdlllllld. 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-U 


0.3 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


1.3 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC j 
none 


0.3 


Primary Trl act 


0.0 


Microsvasular Dermal EC \ 
TNFalpha + IL-lbeta 


0.4 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


1.4 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


1.1 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.3 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


4.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


2.3 


Secondary CD8 
lympnocyie act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
r ivlav lonomycin 


0.0 


zry ini/inz/iri ami- 
CD95CH11 


0.0 


v^v^ YJ 1 1WU ^ JVC! dill l\J\^y lCi>y 

none 


0.4 


LAK cells rest 


0.1 


V^V^v YJ 1 1 VJyj ^JVCI dLIIUJL'y ICo^J 

TNFalpha + IL-lbeta 


0.4 


LAK cells IL-2 


0.4 


Liver cirrhosis 


0.7 


LAK cells TL-2+IL-12 


0.0 


Lupus kidney 


5.2 


LAK cells IL-2+IFN 
gamma 


0.4 


NCI-H292 none 


3.5 


LAK cells IL-2+ IL-18 


0.8 


NCI-H292 IL-4 


0.9 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


1.8 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


1.2 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


3.6 
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Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day j 


0.0 


II P A P ( ' I'M L' olnhs 4- II 1 

beta 


0.0 


r o iviL^ rest 




L.ung iiDroDidi>L nunc 




PBMC PWM 


2.5 


Lung fibroblast TNF alpha; 

i 1 DC Id 


0.0 


PBMC PHA-L 


1.6 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-1 3 


0.5 


B lymphocytes PWM 


5.5 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


1.3 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.6 


Dermal fibroblast 
L>v^L/iu/u iinx* aipna 


0.0 


bUL-1 UDCAMr 
P1V1 A / i on om vf* l n 


0.0 


Dermal fibroblast 
CCD 1070 1L-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.4 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


IBD Crohn's 


1.1 


Monocytes LPS 


0.4 


Colon 


2.5 . 


Macrophages rest 


1.0 


Lung 


5.1 


Macrophages LPS 


0.4 


Thymus 


100.0 


HUVEC none 


0.6 


Kidney 


3.1 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag3039 No differential expression of the 
NOV79 gene is detected between the postmortem brains of Alzheimer's diseased patients and 
those of non-demented controls. However, this panel confirms the expression of this gene in 
5 the CNS. Please see panel 1.3D for a discussion of utility of this gene in the central nervous 
system. 

Panel 1.3D Summary: Ag3039 Highest expression of the NOV79 gene is seen in the 
testis (CT=29). In addition, expression of this gene is extremely low in renal and brain cancer 
cell lines but is expressed in the normal brain and kidney tissues on this sample. Therefore, 
10 this gene may be used as a diagnostic marker for brain and kidney cancer and prostate tissue. 
Furthermore, therapeutic modulation of the expression or function of this gene may be 
effective in the treatment of brain and renal cancers. 
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In addition, this gene is expressed at low levels in metabolic tissues including pancreas, 
adrenal, thyroid, pituitary, adult and fetal heart, and adipose. This novel protein phosphatase 
may be a small molecule target for the treatment of metabolic and endocrine disease, including 
obesity and Types 1 and 2 diabetes. This gene is also differentially expressed in fetal (CT 
5 values = 32-33) vs adult skeletal muscle (CT values = 35-40) and may be useful for the 
differentiation of adult and fetal skeletal muscle. 

This gene represents a dual specificity phosphatase that is also expressed at low to 
moderate levels across the CNS. Dual-specificity phosphatases comprise a family of MAP 
kinase regulating enzymes that are upregulated in brains subjected to insults such as ischemia 
10 and seizure activity. MAP kinases are kown to regulate neurotrophic and neurotoxic pathways. 
Consequently, agents that modulate the activity of this gene may have utility in attenuating the 
apoptotic and neurodegenerative processes following brain insults. 

References: 

Wiessner C. The dual specificity phosphatase PAC-1 is transcriptionally induced in the 
15 rat brain following transient forebrain ischemia. Brain Res Mol Brain Res 1995 
Feb;28(2):353-6 

PAC-1 mRNA has previously been found only in activated T-cells in vitro and in vivo. 
The gene encodes a dual specificity protein phosphatase that regulates MAP kinase activity. 
Here, I describe that PAC-1 mRNA is induced also in neurons in the rat brain following 30 

20 min of forebrain ischemia. At 6, 12 and 24 h after ischemia, PAC-1 mRNA was found most 
prominently in hippocampal cells which are resistant to 30 min of forebrain ischemia, but not 
in the selectively vulnerable CA1 sector. At later time points and in control animals no PAC-1 
mRNA could be detected in any brain region. The protein-tyrosine/threonine phosphatase 
PAC-1, therefore, may be involved in adaptational responses of hippocampal cells resistant to 

25 ischemic injury. 

Boschert U, Muda M, Camps M, Dickinson R, Arkinstall S. Induction of the dual 
specificity phosphatase PAC1 in rat brain following seizure activity. Neuroreport 1997 Sep 
29;8(14):3077-80 

Recurrent seizure activity leads to delayed neuronal death as well as to inflammatory 
30 responses involving microglia in hippocampal subfields CA1, CA3 and CA4. Since mitogen 
activated protein (MAP) kinases control neuronal apoptosis and trigger generation of 
inflammatory cytokines, their activation state could determine seizure-related brain damage. 
PAC1 is a dual specificity protein phosphatase inactivating MAP kinases which we have 
found to be undetectable in normal brain. Despite this, kainic acid-induced seizure activity 
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lead to rapid (approximately 3 h) but transient appearance'of PAC1 mRNA in granule cells of 
the dentate gyrus as well as in pyramidal CA1 neurons. This pattern changed with time and 
after 2-3 days PAC1 was induced in dying CA1 and CA3 neurons. At this time PAC1 mRNA 
was also expressed in white matter microglia as well as in microglia invading the damaged 
5 hippocampus. PAC1 may play an important role controlling MAP kinase involvement in both 
neuronal death and neuro-inflammation following excitotoxic damage. 

Panel 4D Summary: Ag3039 Expression of the NOV79 gene is highest and almost 
exclusive to the thymus (CTs=29-30). Expression of this gene could be used to distinguish 
thymus from the other samples on this panel. The putative dual-specificity phosphatase 
10 encoded by this gene may play an important role in T cell development. Small molecule 

therapeutics designed against the protein encoded by this gene could therefore be utilized to 
modulate immune function (T cell development) and be important for organ transplant, AIDS 
treatment or post chemotherapy immune reconstitiution. 

NOV80 

15 Expression of gene NOV80 was assessed using the primer-probe set Ag3044, 

described in Table BUA. Results of the RTQ-PCR runs are shown in Tables BUB and BUC. 

Table BUA. Probe Name Ag3044 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO : 


Forward 


5 1 - tgacgcagaatggaataagc t - ? 1 


21 


650 


1315 


Probe 


TET-5 ' -acgtcctctatgccagcaactcctg-3 ' - 
TAMRA 


25 


671 


1316 


Reverse 


5 1 -gcaagaagtggctctggtagat-3 ' 


22 


712 


1317 



Table BUB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3044, 
Run 167972762 


Tissue Name 


Rel. Exp.(%) Ag3044, 
Run 167972762 


Liver adenocarcinoma 


0.1 


Kidney (fetal) 


0.5 


Pancreas 


0.1 


Renal ca. 786-0 


0.2 


Pancreatic ca. CAP AN 

2 


0.0 


Renal ca. A498 


0.0 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.1 


Thyroid 


0.0 


Renal ca. ACHN 


0.4 


Salivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.1 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.4 


Liver 


0.0 


Brain (whole) 


0.2 


Liver (fetal) 


0.0 
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Brain (amygdala) 


0.1 


Liver ca. 
^nepatODiastj riepoz 


0.1 


Brain (cerebellum) 


0.5 


Lung 


0.0 


Brain (hippocampus) 


0.1 


Lung (fetal) 


0.1 


Brain (substantia nigra) 


0.1 


Lung ca. (small cell) 
LX-1 


0.1 


Brain (thalamus) 


0.1 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


0.1 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


0.1 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
. cell) A549 


0.2 


glio/astro U-l 18-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.4 


astrocytoma SW1783 


0.1 


Lung ca. (non-s.cell) 
HOP-62 


0.1 


neuro*; met SK-N-AS 


0.1 


Lung ca. (non-s.cl) 
NCI-H522 


0.3 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


0.2 


astrocytoma SNB-75 \ 


0.2 


Lung ca. (squam.) 
NC1-Hj9o 


0.0 


glioma oNd-iv 


A 1 
U. 1 


Mammary gland 


A A 
U.U 


glioma U251 


0.1 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SF-295 


0.1 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) 
T47D 


0.2 


oean 


U. 1 


t>reasi ca. t> i o^v 


U.U 


Skeletal muscle (fetal) 


0.1 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.1 


Ovary 


0.1 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 

3 


0.1 


Thymus 


0.1 


Ovarian ca. OVCAR-; 
4 


0.1 


Spleen 


0.0 


Ovarian ca. OVCAR- 
5 


0.3 


Lymph node 


0.0 


Ovarian ca. OVCAR- 
8 


0.1 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.1 


Ovarian ca.* (ascites) 


0.1 
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jfl "Is 



SMI i 







SK-OV-3 




Small intp^tinp 


0.0 




0.0 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW6?0fSW480 mef* 

O VV UZ#\J^O vv tou iiiviy 


0.2 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
metiPC-^ 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


0.1 


Colon ca. CaCo-2 


0.2 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 


0.0 


Melanoma* (met) 


0.0 


Colon ca. HCC-2998 


0.1 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


0.1 


Melanoma Ml 4 


0.0 


Bladder 


0.1 


Melanoma LOX 
IMVI 


0.0 


Trachea 


100.0 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.2 


Adipose 


0.0 



Table BUC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3044, Run 
164334372 


Tissue Name 


Rel. Exp.(%) 
Ag3044, Run 
164334372 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


3.5 


Secondary Th2 act 


5.1 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


1.3 


HUVEC TNF alpha + IFN 
gamma 


2.0 


Secondary Th 1 rest 


1.2 


HUVEC TNF alpha + IL4 


2.0 


Secondary Th2 rest 


0.7 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


2.1 


Lung Microvascular EC 
none 


1.6 


Primary Th 1 act 


2.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.3 


Primary Th2 act 


4.0 


Microvascular Dermal EC 
none 


1.6 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta 


2.7 


Primary Th 1 rest 


9.5 


Bronchial epithelium 
TNFalpha + ILlbeta 


6.7 


Primary Th2 rest 


11.9 


Small airway epithelium 
none 


2.5 


Primary Trl rest 


4.6 


Small airway epithelium 


20.9 
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TNFalpha + lL-lbeta 




CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


4.3 


CD45RO CD4 
lymphocyte act 


1.5 


Coronery artery SMC 
TNFalpha + IL-1 beta 


0.6 


CD8 lymphocyte act 


5.3 


Astrocytes rest 


4.3 


Secondary CD8 
lymphocyte rest 


1.9 

— _____ — , — , — 


Astrocytes TNFalpha + 
IL-lbeta 


2.9 


Secondary CD8 
lyuipiiocyic dti 


22.8 


KU-812 (Basophil) rest 


6.8 


CD4 lymphocyte none 


0.1 


KU-812 (Basophil). 

r W\.r\J 1 KJl lUI I iy L. 1 1 1 


9.5 


9r\/ Th 1 /Th9/Tr 1 onti- 

zry ini/inz/iri dini- 
CD95CH11 


2.5 


none 


2.1 


LAK cells rest 


4.5 


s^y^LJ 1 1 uo ^iveraimocyies ) 
TNFalpha + IL- 1 beta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


17.7 


LAK cells IL-2+IL- 12 


2.9 


Lupus kidney 


8.1 


LAK cells IL-2+IFN 
gamma 


8.6 


NCI-H292 none 


52.5 


LAK cells IL-2+ IL-18 


6.5 


NCI-H292 IL-4 


20.6 


LAK cells 
PMA/ionomycin 


0.2 


NCI-H292 IL-9 


31.4 


NK Cells IL-2 rest 


2.1 


NCI-H292 IL-13 


16.3 


Two Way MLR 3 day 


1.8 


NCI-H292 IFN gamma 


11.9 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


riwvr/V^ i in r dipiid. i i l« i 
beta 


1.0 






I 1 1 n <y |-N rr\ [~\ \ 'iizt nr\np 
J__<Ullg J.1U1 UUldol llVJllt 


^ 8 


PBMC PWM 


3.2 


Lung fibroblast TNF alpha 
+ TT -1 heta 


3.6 


PBMC PHA-L 


3.0 


Lung fibroblast IL-4 


5.6 


Ramos (B cell) none 


0.0 


Lung fibroblast 1L-9 


o.y 


Ramos (B cell) 
ionomycin 


0.2 


Lung fibroblast IL-1 3 


2.9 


B lymphocytes PWM 


11.7 


Lung fibroblast IFN 
gamma 


3.0 


B lymphocytes CD40L 
and IL-4 


86.5 


Dermal fibroblast 
CCD 1 070 rest 


5.8 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


9.9 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.9 


Dendritic cells none 


4.1 


Dermal fibroblast IFN 
gamma 


1.0 
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Dendritic cells LPS 


1.1 


Dermal fibroblast IL-4 


4.6 


Dendritic cells anti- 
CD40 


1.1 


IBD Colitis 2 


0.0 


Monocytes rest 


1.7 


IBD Crohn's 


3.8 


Monocytes LPS 


2.5 


Colon 


17.9 


Macrophages rest 


4.1 


Lung 


7.6 


Macrophages LPS 


0.0. 


Thymus 


100.0 


HUVEC none 


4.8 


Kidney 


10.1 


HUVEC starved 


2.2 







Panel 1.3D Summary: Ag3044 Results from one experiment with the NOV80 gene 
are not included. The amp plot indicates that there were experimental difficulties with this run 
(data not shown). 

5 Panel 4D Summary: Ag3044 The NOV80 gene is expressed at low levels in a wide 

range of cell types of significance in the immune response in health and disease. These cells 
include: (i) resting LAK and LAK cells stimulated with IL-2+IL-12, IL-2 + IL-18, and IL-2 + 
IFNgamma (ii) activated primary and secondary Th2 cells, resting primary Thl, Th2 and Trl 
cells, and activated CD8 and secondary CD8 lymphocytes, (iii) IL-1 beta treated HUVECs, 

10 (iv) polkweed mitogen stimulated and CD40L + IL-4 stimulated B lymphocytes, (v) treated 
and non-treated Ku-812 basophils and non-treated dendritic cells, (vi) treated and non-treated 
peripheral blood mononuclear cells and resting macrophages (vii) treated and non-treated 
NCI-H292 mucoepidermoid, (viii) treated and non-treated lung fibroblasts, (viii) treated and 
non-treated astrocytes (ix) resting coronery artery SMCs, (x) resting and TNFalpha treated 

15 CCD1070 dermal fibroblasts and IL-4 treated dermal fibroblasts (xi) IBD Crohn's diseases 
tissue and normal tissues represented by colon, lung, thymus and kidney with the highest 
expression being detected in thymus tissue (CT=29.81). This expression profile suggests that 
this gene product may be involved in both disease and homeostatic processes in these and 
other cell types and tissues. Therefore, modulation of this gene product with a functional 

20 therapeutic may lead to the alteration of functions associated with these cell and tissue types 
and lead to improvement of the symptoms of patients suffering from autoimmune and 
inflammatory diseases such as COPD, emphysema, asthma, allergies, inflammatory bowel 
disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and osteoarthritis. 

NOV81a and NOV81b 
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Expression of gene NOV81a and the full length clone, NOV81b, was assessed using 
the primer-probe set Ag2906, described in Table BVA. Results of the RTQ-PCR runs are 
shown in Tables BVB, BVC and BVD. 



Table BVA. Probe Name Ag2906 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ctacctgggtgaggtctttacc-3 1 


22 


845 


1318 


Probe 


TET- 5 1 -ctccggaagccaggaggaccctt - 3 ' - 
TAMRA 


23 


879 


1319 


Reverse 


5 ' -agaaggactcgggcacatag- 3 1 


20 


902 


1320 



Table BVB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag2906, 
Run 162556445 


Tissue Name 


ReL Exp.(%) Ag2906, 
Run 162556445 


Liver adenocarcinoma 


2.0 


Kidney (fetal) 


13.5 


Pancreas 


1.2 


Renal ca. 786-0 


1.0 


Pancreatic ca. CAPAN 
2 


0.0 


Renal ca. A498 


2.8 


Adrenal gland 


1.1 


Renal ca. RXF 393 


1.9 


Thyroid 


4.0 


Renal ca. ACHN 


0.7 


Salivary gland 


3.3 


Renal ca. UO-3 1 


0.0 


Pituitary gland 


2.4 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.0 


Liver (fetal) 


6.8 


Brain (amygdala) 


2.0 


Liver ca. 
(hepatoblast) HepG2 


5-4 


Brain (cerebellum) 


3.8 


Lung 


42.0 


Brain (hippocampus) 


4.7 


Lung (fetal) 


11.8 


Brain (substantia nigra) 


0.9 


Lung ca. (small cell) 
LX-1 


0.7 


Brain (thalamus) 


4.6 


Lung ca. (small cell) 
NCI-H69 


1.0 


Cerebral Cortex 


6.4 


Lung ca. (s.cell var.) 
SHP-77 


4.5 


Spinal cord 


7.6 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


3.0 


Lung ca. (non-sm. 
cell) A549 


2.1 


glio/astro U-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


1.6 


astrocytoma SW1783 


4.1 ; 


Lung ca. (non-s.cell) 
HOP-62 


4.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 


0.0 
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| NCI-H522 




astrocytoma SF-539 


. . j Lung ca. (squam.) 
1 SW900 


8.4 


astrocytoma SNB-75 


. q 1 Lung ca. (squam.) 

j NCI-H596 


0.2 


glioma SNB-19 


7.2 j Mammary gland 


4.0 


glioma U251 


t _ j Breast ca.* (pl.ef) 
j MCF-7 


0.0 


glioma SF-295 


. ft j Breast ca.* (pl.ef) 
J MDA-MB-231 


3.9 


Heart (fetal) 


100.0 


Breast ca.* (pl.ef) 
T47D 


1.0 


Heart 


1A 7 

1 u. / 


rsreast ca. t> i -dhv 


1.8 


Skeletal muscle (fetal) 


35.8 


Breast ca. MDA-N 


0.3 


Skeletal muscle 


0.3 


Ovary 


2.2 


Bone marrow 


20.6 


Ovarian ca. OVCAR- 
3 


1.8 


Thymus 




15.8 


Ovarian ca. OVCAR- 
4 


0.6 


Spleen 


13.8 


Ovarian ca. OVCAR- 


0.8 


Lymph node 


5.5 


Ovarian ca. OVCAR- 

Q 
o 


0.0 


Colorectal 


31.2 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


7.3 


Ovarian ca.* (ascites): 
SK-OV-3 


0.0 


Small intestine 


31.9 


Uterus 


4.8 


Colon ca. SW480 


1.0 


Placenta 


9.9 


L^oion ca. 
SW620(SW480 met) 


0.0 


Prostate 


9.8 


Colon ca. HT29 


1.6 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-116 


1.5 


Testis 


2.4 


Colon ca. CaCo-2 


4.4 


Melanoma 
Hs688(A).T 


1.0 


tissue(OD03866) 


8.7 


Melanoma* (met) 
Hs688(B).T 


2.6 


Colon ca. HCC-2998 


8.8 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


2.3 


Melanoma M14 


0.0 


Bladder 


9.4 


Melanoma LOX 
IMVI 


0.0 


Trachea 


14.1 


Melanoma* (met) 
SK-MEL-5 


2.1 
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Kidney 



11.8 



Adipose 



6.9 



Table BVC. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2906, Run 
162345752 


Tissue Name 


Rel. Exp.(%) 
Ag2906, Run 
162345752 


Normal Colon 


39.5 


Kidney Margin 
8120608 


3.0 


CC Well to Mod Diff 
(OD03866) 


16.3 


Kidney Cancer 
8120613 


17.3 


CC Margin (OD03866) 


33.9 


Kidney Margin 
8120614 


6.5 


CC Gr.2 rectosigmoid 
(OD03868) 


14.4 


Kidney Cancer 
9010320 


3.4 


CC Margin (OD03868) 


1.2 


Kidney Margin 
9010321 


7.2 . 


CC Mod Diff(ODO3920) 


27.9 


Normal Uterus 


4.0 


CC Margin (ODO3920) 


30.8 


Uterus Cancer 06401 1 


4.3 


CC Gr.2 ascend colon 
(OD03921) 


100.0 


Normal Thyroid 


2.6 


CC Margin (OD03921) 


33.9 


Thyroid Cancer 
064010 


3.3 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


16.2 


Thyroid Cancer 
A3 02 152 


10.7 


Liver Margin (ODO4309) 


2.9 


Thyroid Margin 
A302153 


5.5 


Colon mets to lung 
(OD04451-01) 


15.6 


Normal Breast 


7.5 


Lung Margin (OD0445 1 - ! 
02) 


15.8 


Breast Cancer 
(OD04566) 


7.4 


Normal Prostate 6546-1 


24.5 


Breast Cancer 
(OD04590-01) 


11.0 


Prostate Cancer 
(OD04410) 


4.8 


Breast Cancer Mets 
(OD04590-03) 


30.8 


Prostate Margin 
(OD04410) 


5.0 


Breast Cancer 

Metastasis 
(OD04655-05) 


7.0 


Prostate Cancer 
(OD04720-01) 


9-5 


Breast Cancer 064006 


5.7 


Prostate Margin i 
* (OD04720-02) 


19.5 


Breast Cancer 1024 


7.5 


Normal Lung 061010 


16.5 


Breast Cancer 
9100266 


13.7 


Lung Met to Muscle 


3.9 


Breast Margin j 2.7 
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(ODQ4286) 




9100265 




Muscle Margin 
(ODU4286) 


6.2 


Breast Cancer 
A209073 


6.1 


Lung Malignant Cancer 
(OD031261 


64.2 


Breast Margin 


5.6 


Lune Mamin fODO^l?^ 


90 1 


Normal Liver 


1 .9 


Lun^ Cancer (CiDftddCid\ 


1 ^ 7 
1 o. / 


i^iver cancer uo4UU3 


A A 

0.4 


Lune Margin (00044041 


1 S 1 


i^iver cancer 1 uz_) 


0.6 


Lung Cancer (OD04565) 


27.7 


Liver Cancer 1026 


2.0 


Lung Margin (OD04565) 


19.8 


i Liver Cancer 6004-T 


0.9 


Lung Cancer (OD04237- 
01) 


4.5 


Liver Tissue 6004-N 


2.8 


Lung Margin (OD04237- 
02) 


12.9 


Liver Cancer 6005-T 


0.4 


Ocular Mel Met to Liver 
(ODO4310) 


1.8 


| Liver Tissue 6005-N 


1.0 


Liver Margin (OD043 1 0) 


2.6 


Normal Bladder 


8.5 


Melanoma Mets to Lung 
(OD04321) 


4.2 


Bladder Cancer 1023 


2.4 


Lung Margin (OD04321) 


35.6 


Bladder Cancer 
A302173 


0.8 


Normal Kidney 


5.4 


Bladder Cancer 
(OD047 18-01) 


5.8 


Kidney Ca, Nuclear grade 
2 (OD04338) 


4.1 


Bladder Normal 
Adjacent (OD04718- 
03) 


2.0 


Kidney Margin 
(OD04338) 


11.0 


Normal Ovary 


3.1 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


5.0 


Ovarian Cancer 
064008 


11.0 


Kidney Margin 
(OD04339) 


2.0 


Ovarian Cancer 
(OD04768-07) 


7.9 


Kidney Ca, Clear cell 
type (OD04340) 


3.8 


Ovary Margin 
(OD04768-08) 


1.3 


Kidney Margin 
(OD04340) 


5.6 


Normal Stomach 


2.9 


Kidney Ca, Nuclear grade 

O f f~\ I "\ A A ^ A 0\ 

3 (OD04348) 


2.7 


Gastric Cancer 
9060358 


1.0 


Kidney Margin 
(OD04348) 


5.0 


Stomach Margin 
9060359 


3.7 


Kidney Cancer 
(OD04622-01) 


5.9 

— — — — 


Gastric Cancer 
9060395 


12.0 


rviuncy iviaigiii 

(OD04622-03) 


2.1 


Stomach Margin 
9060394 


14.2 


Kidney Cancer 


0.8 


Gastric Cancer 


31.9 
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(OD04450-01) 




9060397 




Kidney Margin 
(OD04450-03) 


4.5 


Stomach Margin 
9060396 


5.6 


Kidney Cancer 8120607 


14.6 


Gastric Cancer 
064005 


8.7 



Table BVD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2906,Run 
159078634 


Tissue Name 


Rel. Exp.(%) 
Ag2906, Run 
159078634 


Secondary Thl act 


5.6 


HUVEC IL-lbeta 


5.7 


Secondary Th2 act 


5.3 


HUVEC IFN gamma 


9.2 


^prnnHarv Trl act 


8 6 


HUVEC TNF alpha + IFN 
gamma 


o s 

y. j 


Secondary Th 1 rest 


11.0 


HUVEC TNF alpha + IL4 


4.6 


Secondary Th2 rest 


6.6 


HUVEC IL-11 


9.1 


Secondary Trl rest 


10.5 


Lung Microvascular EC 
none 


10.0 


Primary Thl act 


2.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


3.8 


Primary Th2 act 


1.2 


Microvascular Dermal EC 
none 


10.2 


Primary Trl act 


1.1 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


2.5 


Primary Thl rest 


10.7 


Bronchial epithelium 
TNFalpha-hlLlbeta 


20.4 


Primary Th2 rest 


4.4 


Small airway epithelium 
none 


8.0 


Primary Trl rest 


18.0 


Small airway epithelium 
TNFalphaH- IL-lbeta 


22.4 


CD45RA CD4 
lymphocyte act 


8.0 


Coronery artery SMC rest 


4.0 


CD45RO CD4 
lymphocyte act 


5.1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


6.1 


CD8 lymphocyte act 


3.7 


Astrocytes rest 


7.4 


Secondary CD8 
lymphocyte rest 


2.5 


Astrocytes TNFalpha + : 
IL-lbeta 


3.1 


Secondary CD8 
lymphocyte act 


1.5 


KU-812 (Basophil) rest 


2.6 


CD4 lymphocyte none 


6.5 


KU-8 12 (Basophil) 
PMA/ionomycin 


2.7 


2ry Thl/Th2/Trl anti- 
CD95CH11 


11.6 


CCD 1 106 (Keratinocytes) 
none 


6.4 


LAK cells rest 


14.6 


CCD1 106 (Keratinocytes) 


3.6 
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■ ] TNFalpha + IL-lbeta 




LAK cells IL-2 


8.6 


Liver cirrhosis 


8.0 


LAK cellsIL-2+IL-12 


6.3 


Lupus kidney 


2.5 


LAK cells IL-2+IFN 
gamma 


14.8 


NCI-H292 none 


3.7 


LAK cells IL-2+IL-18 


2.5 


NCI-H292 IL-4 


4.9 


LAK cells 
PMA/ionomycin 


7.6 


NCI-H292 IL-9 


0.7 


NK Cells IL-2 rest 


1L2 


NCI-H292 IL-13 


1.4 


Two Way MLR 3 day 


6.4 


NCI-H292 I FN gamma 


L3 


Two Way MLR 5 day 


5.3 


HFALC none 


4.0 


Two Way MLR 7 day 


2.7 


HPAEC TNF alpha + IL-1 
beta 


8.0 


i Diviv^ rcbi 


6.2 


Lung fibroblast none 


O.J 


PBMC PWM 


8.2 


Lung fibroblast TNF alpha 
+ IL-1 beta 


0.0 


PBMC PHA-L 


2.6 


Lung fibroblast IL-4 


1.6 


Ramos (B cell) none 


7.5 


Lung fibroblast IL-9 


2.0 


Ramos (B cell) 
ionomycin 


2.2 


Lung fibroblast IL-13 


3.6 


B lymphocytes PWM 


2.4 


Lung fibroblast IFN 
gamma 


2.7 


B lymphocytes CD40L 
and IL-4 


100.0 


Dermal fibroblast 
CCD 1070 rest 


2.6 


EOL-1 dbcAMP 


44.4 


Dermal fibroblast 
LLUiu/u iJNr' aipna 


8.5 


EOL-1 dbcAMP 
PMA/ionomyc in 


77.9 


Dermal fibroblast 
CCD 1070 IL-1 beta 


5.6 


Dendritic cells none 


21.2 


Dermal fibroblast IFN 
gamma 


5.7 


Dendritic cells LPS 


3.0 


Dermal fibroblast IL-4 


9.0 


Dendritic cells anti- 
CD40 


9.9 


IBD Colitis 2 


0.0 


Monocytes rest 


28.1 


IBD Crohn's 


2.6 


Monocytes LPS 


10.3 


Colon 


97.9 


Macrophages rest 


33.0 


Lung 


41.5 


Macrophages LPS 


12.7 


Thymus 


5.5 


HUVEC none 


11.9 


Kidney 


6.9 


HUVEC starved 


20.4 







Panel 1.3D Summary: Ag2906 The NOV81a gene has a low level of expression in 
adipose and may be a small molecule target for the treatment of obesity and obesity-related 
diseases, including Type 2 diabetes. In addition, this gene product appears to be differentially 
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expressed in fetal (CT value = 3 1) vs adult heart (CT value = 34) and may be useful for the 
differentiation between the two tissue types. 

Overall, there appears to be higher expression of this gene in the normal tissues 
compared to the cell lines. Thus, this difference in expression might be of use as a diagnostic 
5 marker of cancer. 

Panel 2D Summary: Ag2906 The NOV81a gene is expressed at low levels in this 
panel. A higher level of expression is observed in gastric, bladder, thyroid, breast and ovarian 
cancer samples when compared to expression in the normal adjacent gastric, bladder, thyroid, 
breast and ovary tissues. Thus, this gene could potentially be used as a diagnostic marker of 

10 cancer in these tissues. Furthermore, inhibition of the activity of this gene product using small 
molecule drugs may be useful for the treatment of cancer in these tissues. 

Panel 4D Summary: Ag2906 Expression of the NOV81a is widespread in this panel, 
with highest expression in B lymphocytes treated with CD40L and IL-4 (CT=29.8). 
Significant expression is also seen in treated eosinophils, resting macrophages and monocytes, 

15 and normal colon and lung. Based on this pattern of expression, this gene product may be 

involved in both disease and homeostatic processes for these and other cell types and tissues. 
Therefore, modulation of this gene product with a functional therapeutic may lead to the 
alteration of functions associated with these cell and tissue types and improvement of the 
symptoms of patients suffering from autoimmune and inflammatory diseases such as COPD, 

20 emphysema, asthma, allergies, inflammatory bowel disease, lupus erythematosus, psoriasis, 
rheumatoid arthritis, and osteoarthritis. In addition, the higher levels of expression in colon 
(CT=30) when compared to colon from patients with inflammatory bowel diseases 
(IBD)(CTs=35-40) suggests that expression of this gene could be used to differentiate between 
normal and inflamed colon. Therapeutic modulation of the expression or function of this gene 

25 may be effective in the treatment of IBD. 

NOV82 



Expression of gene NOV82 was assessed using the primer-probe sets Ag3198 and 
Ag3063, described in Tables BWA and BWB. 

Table BWA. Probe Name Ag3198 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 * -cgtggtcaccagacagttaatt-3 1 


22 


179 


1321 


Probe 


TET-5 ' -cctaccagacaccattgtgtccaagg-3 ■ - 
TAMRA 


26 


212 


1322 


Reverse 


5 ' -gtctttcctttgtgcttgtgaa-3 ' 


22 


246 


1323 
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Table BWB, Probe Name Ag3063 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -cgtggtcaccagacagttaatt-3 ■ 


22 


179 


1324 


Probe 


TET-5 ' -cctaccagacaccattgtgtccaagg-3 1 - 
TAMRA 


26 


212 


1325 


Reverse 


5 ■ -gtctttcctttgtgcttgtgaa-3 ' 


22 


246 


1326 



CNS_neurodegeneration_vl.O Summary: Ag3 198 Expression of this gene is 
5 low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). The amp 
plot indicates that there is a high probability of a potential probe or chemistry failure. 

Panel 1.3D Summary: Ag3063 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). The amp plot indicates that there 
is a high probability of a potential probe or chemistry failure. 
10 Panel 4D Summary: Ag3198 Expression of this gene is low/undetectable (CTs > 35) 

across all of the samples on this panel (data not shown). The amp plot indicates that there is a 
high probability of a potential probe or chemistry failure. 

NOV83 

Expression of gene NOV83 was assessed using the primer-probe sets Ag3046 
15 and Ag4125, described in Tables BXA and BXB. Results of the RTQ-PCR runs are 
shown in Tables BXC and BXD. 



Table BXA. Probe Name Ag3046 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gaagcaaagaactctgcaagac- 3 ' 


22 


1215 


1327 


Probe 


TET-5* -ttccagcatgataacttcacagagga-3 ' - 
TAMRA 


26 


1246 


1328 


Reverse? 


5 1 -gagcctgcaaatatcttttgct-3 ' 


22 


1272 


1329 



Table BXB. Probe Name Ag4125 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gaagcaaagaactctgcaagac-3 ' 


22 


1215 


1330 


Probe 


TET-5 ■ -ttccagcatgataacttcacagagga-3 ' - 
TAMRA 


26 


1246 


1331 


Reverse 


5 ' -gagcctgcaaatatcttttgct-3 ' 


22 


1272 


1332 



20 
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Table BXC. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag3046, Run 
162559104 


' ■ " 

Tissue Name 


Rel. Exp.(%) 
Ag3046, Run 
162559104 


iNormai ^oion 


n n 
u.u 


Kidney Margin 
8120608 


U.U 


CC Well to Mod Diff 
(OD03866) 


0.0 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


0.0 


Kidney Margin 
8120614 


0.0 


CC Gr.2 rectosigmoid 
(OD03868) 


0.0 


Kidney Cancer 
9010320 


0.0 


iviargin (uuujoooj 


A A 
U.U 


Kidney Margin 
9010321 


0.0 


CC Mod Diff (ODO3920) 


0.0 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 064011 


0.0 


CC Gr.2 ascend colon 
(OD03921) 


0.0 


Normal Thyroid 


0.0 


CC Margin (OD03921) 


0.0 


Thyroid Cancer 
064010 


0.0 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


0.0 


Thyroid Cancer 
A302152 


0.0 


Liver Margin (ODO4309) 


0.0 


Thyroid Margin 
A302153 


0.0 


Colon mets to lung 
(OD04451-01) 


0.0 


Normal Breast 


0.0 


Lung Margin (OD04451- 
02) 


0.0 


Breast Cancer 
(OD04566) 


100.0 


Normal Prostate 6546-1 


0.1 


Breast Cancer 
(OD04590-01) 


0.0 


Prostate Cancer 
(OD04410) 


0.0 


Breast Cancer Mets 
(OD04590-03) 


0.0 


Prostate Margin 
(OD04410) 


0.0 


Breast Cancer 

Metastasis I 
(OD04655-05) 


0.0 


Prostate Cancer 
(OD04720-01) 


0.1 


Breast Cancer 064006 


0.0 


Prostate Margin 
(OD04720-02) 


0.0 


Breast Cancer 1024 


0.0 


Normal Lung 061010 


0.0 


Breast Cancer 
9100266 


0.0 


Lung Met to Muscle 
(OD04286) 


0.0 


Breast Margin 
9100265 


0.0 


Muscle Margin 
(ODQ4286) 


0.0 


Breast Cancer 
A209073 


0.0 
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Luns? Malignant Chancer 
(OD03126) 


0.0 


Rrpa<st lVtarcrin 

A2090734 


0.0 


Lun2 Marein fOD03126^ 


0.0 


Normal 1 iver 


0 0 


Lung Cancer (OD04404) 


0.0 


Liver Cancer 064003 


0.0 


Lung Margin (OD04404) 


0.0 


Liver Cancer 1025 


0.0 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 1026 


0.0 


Lung Margin (OD04565) 


0.0 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237- 
01) 


0.0 


Liver Tissue 6004-N 


0.0 


Lung Margin (OD04237- 
02) 


0.0 


Liver Cancer 6005-T 


0.0 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Margin (OD043 10) 


0.0 


Normal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


0.0 


Lung Margin (OD04321) 


0.0 


Bladder Cancer 
A302173 


0.0 


Normal Kidney 


0.0 


Bladder Cancer 
(OD04718-01) 


0.0 


Kidney Ca, Nuclear grade 
2 COD04338 > } 


0.0 


Bladder Normal 
Adjacent (OD04718- 
03) 


0.0 


Kidney Margin 
(OD04338) 


0.0 


Normal Ovarv 


0 0 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


0.0 


Ovarian Cancer 
064008 


0.0 


Kidney Margin 
(OD04339) 


0.0 


Ovarian Cancer 
(OD04768-07) , 


0.0 


Kidney Ca, Clear cell 
type (OD04340) 


0.0 


Ovary Margin 
(OD04768-08) 


0.0 


Kidney Margin 
(OD04340) 


0.0 


Normal Stomach 


0.0 


Kidney Ca, Nuclear grade 
3 (OD04348) 


o.o ! 


Gastric Cancer 
9060358 


0.0 


Kidney Margin 
(OD04348) 


0.0 


Stomach Margin 
9060359 


0.0 


Kidney Cancer 
(OD04622-01) 


0.0 


Gastric Cancer 
9060395 


0.0 


Kidney Margin 
(OD04622-03) 


0.0 


Stomach Margin 
9060394 


0.0 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


0.0 


Kidney Margin 
(OD04450-03) 


0.0 


Stomach Margin 
9060396 


0.0 
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Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


0.0 



Table BXD. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag4125, Run 
172859315 


Tissue Name 


Rel. Exp.(%) 
Ag4125, Run 
172859315 


Secondary Thl act 


3.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


4.2 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


^prnnHarv Th9 rest 


0.0 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


2.7 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


3.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


11.9 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


7.4 


Primary Thl rest 


7.2 


Bronchial epithelium 
TNFalpha + IL1 beta 


7.5 


Primary Th2 rest 


3.1 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-1 beta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-81 2 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD1 106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


3.3 


CCD 1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


0.0 


LAK cells IL-2+IL-12 


5.3 


NCI-H292 none 


0.0 
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LAK cells IL-2+IFN 
gamma 


1.8 


NCI-H292 IL-4 


5.8 


LAK cells IL-2+IL-18 


3.6 


NCI-H292 IL-9 i 


3.5 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-13 


1.3 


NK Cells IL-2 rest 


5.9 


NCI-H292 IFN gamma 


0.8 


Two Way MLR 3 day 


3.3 


HPAEC none 


0.0 


Two Wav Ml R S rlav 

i w\j yy ay ivj L/ iv \.i ay 


3 2 


HPAEC TNF alpha + IL-1 
beta 


0.0 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


0.0 


PBMC rest 


0.0 


Lung fibroblast TNF alpha 
+ IL-lbeta 


2.3 


PBMC PWM 


3.0 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


1.6 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL- 1 3 


0.0 


Ramos fR celH 
ionomycin 


0.0 


Liins? fibroblast IFN 
gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast 
CCD 1070 rest 


1 .0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


FOT -1 dhcAMP 


2.1 


Dermal fibroblast 
CCD1070IL-1 beta ; 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


3.3 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells LPS 


A A 

u.o 


Dermal Fibroblasts rest 


A A 


Dendritic cells anti- 
CD40 


0.0 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


0.0 


Neutrophils rest 


0.0 


Monocytes LPS 


13.5 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


5.4 


Macrophages LPS 


0.0 


Thymus 


18.4 


HUVEC none 


0.0 


Kidney 


100.0 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag3046 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Ag4125 Results from one experiment with this gene are not included. The amp plot 
indicates that there were experimental difficulties with this run (data not shown). 

General screening panel vl.4 Summary: Ag4125 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



1379 




Panel 1.3D Summary: Ag3046 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 

Panel 2D Summary: Ag3046 Significant expression of this gene is seen exclusively in 
a breast cancer sample (CT = 25.2). Therefore, expression of this gene may be used to 
5 distinguish breast cancers from the other samples on this panel. Furthermore, therapeutic 
modulation of the activity of the GPCR encoded by this gene may be beneficial in the 
treatment of breast cancer. 

Panel 3D Summary: Ag3046 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 
10 Panel 4.1D Summary: Ag4125 This gene is only expressed at detectable levels in the 

kidney (CT = 32.6). The putative GPCR encoded for by this gene could allow cells within the 
kidney to respond to specific microenvironmental signals (For example, ref. 1). Therefore, 
antibody or small molecule therapies designed with the protein encoded for by this gene could 
modulate kidney function and be important in the treatment of inflammatory or autoimmune 
15 diseases that affect the kidney, including lupus and glomerulonephritis. 
References: 

1. Mark M.D., Wittemann S., Herlitze S. (2000) G protein modulation of recombinant 
P/Q-type calcium channels by regulators of G protein signalling proteins. J. Physiol. 528 Pt 1 : 
65-77. 

20 1 . Fast synaptic transmission is triggered by the activation of presynaptic Ca2+ 

channels which can be inhibited by Gbetagamma subunits via G protein-coupled receptors 
(GPCR). Regulators of G protein signalling (RGS) proteins are GTPase-accelerating proteins 
(GAPs), which are responsible for > 100-fold increases in the GTPase activity of G proteins 
and might be involved in the regulation of presynaptic Ca2+ channels. In this study we 

25 investigated the effects of RGS2 on G protein modulation of recombinant P/Q-type channels 
expressed in a human embryonic kidney (HEK293) cell line using whole-cell recordings. 2. 
RGS2 markedly accelerates transmitter-mediated inhibition and recovery from inhibition of 
Ba2+ currents (IBa) through P/Q-type channels heterologously expressed with the muscarinic 
acetylcholine receptor M2 (mAChR M2). 3. Both RGS2 and RGS4 modulate the prepulse 

30 facilitation properties of P/Q-type Ca2+ channels. G protein reinhibition is accelerated, while 
release from inhibition is slowed. These kinetics depend on the availability of G protein alpha 
and betagamma subunits which is altered by RGS proteins. 4. RGS proteins unmask the Ca2+ 
channel beta subunit modulation of Ca2+ channel G protein inhibition. In the presence of 
RGS2, P/Q-type channels containing the beta2a and beta3 subunits reveal significantly altered 
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kinetics of G protein modulation and increased facilitation compared to Ca2+ channels 
coexpressed with the betal b or beta4 subunit. 
PMID: 11018106 

Panel 4D Summary: Ag3046 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

NOV84 



Expression of gene NOV84 was assessed using the primer-probe set Ag3051, 
described in Table BYA. 

Table BYA. Probe Name Ag3051 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gcagctcattcagacctatgag-3 1 


22 


847 


1333 


Probe 


TET-5 ' -ctctcctgccacccctatgacactg-3 • - 
TAMRA 


25 


883 


1334 


Reverse 


5 ' -cgacaacaggtacatcatgaag-3 * 


22 


913 


1335 



Panel 1.3D Summary: Ag3051 Results from one experiment with this gene are not 
included. The amp plot suggests that there were experimental difficulties with this run (data 
not shown). 

Panel 2D Summary: Ag3051 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag3051 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

NOV85 



Expression of gene NOV85 was assessed using the primer-probe set Ag3057, 
described in Table BZA. Results of the RTQ-PCR runs are shown in Tables BZB, BZC, BZD 
and BZE. 



Table BZA. Probe Name Ag3057 



Primers 


Sequences 


j Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 » - aacat tggaaggacaggagtc t - 3 1 


! 22 


2314 


1336 


Probe 


TET-5 1 -ccccaggagatgtatcagattcagct-3 ' 
TAMRA 


~ j 26 


2336 


1337 


Reverse 


5 • -cagatccccaagaaccctta-3 ' 


I 20 \ 


2382 


1338 
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Table BZB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3057, 
Run 211012795 


Tissue Name 


Rel. Exp.(%) Ag3057, 
Run 211012795 


AD 1 Hippo 


12.9 


Control (Path) 3 
Temporal Ctx 


8.4 


AD 2 Hippo 


27.2 


Control (Path) 4 
Temporal Ctx 


25.0 


AD 3 Hippo 


11.2 


AD 1 Occipital Ctx 


29.5 


AD 4 Hippo 


13.0 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


23.8 


AD 3 Occipital Ctx 


15.4 


AD 6 Hippo 


77.4 


AD 4 Occipital Ctx 


28.9 


Control 2 Hippo 


34.9 


AD 5 Occipital Ctx 


15.9 


Control 4 Hippo 


15.5 


AD 6 Occioital Ctx 


14.7 


Control (Path) 3 
Hippo 


11.2 


Control 1 Occipital 
Ctx 


8.0 


AD 1 Temporal Ctx 


45.7 


Control 2 Occipital 
Ctx 


28.3 


AD 2 Temporal Ctx 


31.4 


Control 3 Occipital 
Ctx 


14.8 


AD 3 Temporal Ctx 


19.3 


Control 4 Occipital 
Ctx 


23.8 


AD 4 Temporal Ctx 


28.1 


Control (Path) 1 
Occipital Ctx 


73.7 


AD 5 Inf Temporal 
Ctx 


41.2 


Control (Path) 2 
Occipital Ctx 


16.8 


AD 5 SupTemporal 
Ctx 


23.3 


Control (Path) 3 
Occipital Ctx 


7.3 


AD 6 Inf Temporal 
Ctx 


100.0 


Control (Path) 4 
Occipital Ctx 


13.0 


AD 6 Sup Temporal 

Ctx ] 


75.3 


Control 1 Parietal 
Ctx 


14.8 


Control 1 Temporal 
Ctx 


9.7 


Control 2 Parietal 
Ctx 


29.3 


Control 2 Temporal 

Ctx \ 


21.8 


Control 3 Parietal 
Ctx 


12.8 


Control 3 Temporal ; 
Ctx 


16.0 


Control (Path) 1 
Parietal Ctx 


42.6 


Control 4 Temporal 
Ctx 


9.3 


Control (Path) 2 
Parietal Ctx 


30.4 


Control (Path) 1 
Temporal Ctx j 


45.4 


Control (Path) 3 
Parietal Ctx 


9.7 


Control (Path) 2 
Temporal Ctx 


28.9 


Control (Path) 4 
Parietal Ctx 


30.8 
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Table BZC. Panel 1.3D 



Tissue Name 

— — — — ^ — — ^ 


ReL Exp.(%) Ag3057, 
Run 165519995 


Tissue Name 


Rel. £xp.(%) Ag3057, 
Run 165519995 


Liver adenocarcinoma 


9.9 


Kidney (tetal) 


0. 1 


Pancreas 


3.8 


Renal ca. 786-0 


8.8 


Pancreatic ca. CAP AN 

2 


40.9 


Renal ca. A498 


15.1 


Adrenal gland 


14.5 


Renal ca. RXr 393 


13.0 


Thyroid 


6.2 


Renal ca. ACHN 


2.1 


Salivary gland 


10.8 


Renal ca. UO-31 


13.1 


Pituitary gland 


10.2 


Renal ca. TK-10 


3.9 


Brain (fetal) 


51.4 


Liver 


1.9 


Brain (whole) 


100.0 


Liver (fetal) 


4.8 


Brain (amygdala) 


48.0 


Liver ca. 
(hepatoblast) HepG2 


17.3 


Brain (cerebellum) 


49.3 


Lung 


10.4 


Brain (hippocampus) 


47.6 


Lung (fetal) 


7.2 


Brain (substantia nigra) 


70.2 


Lung ca. (small cell) 
LX-1 


7.6 


Brain (thalamus) 


51.8 


Lung ca. (small cell) 
NCI-H69 


1.0 


Cerebral Cortex 


11.3 


Lung ca. (s.cell var.) 
SHP-77 


7.3 


Spinal cord 


85.9 


Lung ca. (large 
cell)NCI-H460 


16.6 


glio/astro U87-MG 


16.5 


Lung ca. (non-sm. 
cell) A549 


4.5 


glio/astro U-l 1 8-MG 


27.9 


Lung ca. (non-s.cell) 
NCI-H23 


6.1 


astrocytoma SW1783 


13.5 


Lung ca. (non-s.cell) 
HOP-62 


8.5 


neuro*; met SK-N-AS 


12.2 


Lung ca. (non-sxl) 
NCI-H522 


3.0 


astrocytoma SF-539 


14.2 


Lung ca. (squam.) 
SW 900 


8.3 


astrocytoma SNB-75 


20.9 


Lung ca. (squam.) 
NCI-H596 


5.9 


glioma SNB-19 


23.0 


Mammary gland 


17.3 


glioma U251 


25.7 


Breast ca.* (pl.ef) 
MCF-7 


7.3 


glioma SF-295 


9.8 


Breast ca.* (pl.ef) 
MDA-MB-231 


35.8 


Heart (fetal) 


0.2 


Breast ca.* (pl.ef) 
T47D 


5.5 


Heart 


11.1 


Breast ca. BT-549 


32.1 
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Skeletal muscle (fetal) 


3.2 


Breast ca. MDA-N 


3.3 


Skeletal muscle 


18.9 


Ovary 


1.9 


Done marrow 




Ovarian ca. OVCAR- 
3 


J .o 


Thymus 


1 .3 


Ovarian ca. OVCAR- 
4 




Spleen 


5.6 


Ovarian ca. OVCAR- 
5 


7.7 


Lymph node 


7.3 


Ovarian ca. OVCAR- 
8 


5.7 


Colorectal 


9.2 


Ovarian ca. IGROV- 
1 


2.5 


Stomach 


15.0 


Ovarian ca.* (ascites) 
SK-OV-3 


20.9 


Small intestine 


10.4 


Uterus 


23.5 


Colon ca. SW480 


5.6 


Placenta 


6.0 


Colon ca.* 
SW620(SW480 met) 


4.9 


Prostate 


3.4 


Colon ca HT29 


2.8 


Prostate ca.* (bone 
met)PC-3 


8.4 


Colon ca. HCT-116 


3.7 


Testis 


9.8 


Colon ca. CaCo-2 


10.3 


Melanoma 
Hs688(A).T 


6.1 


Colon ca. 
tissue(OD03866) 


9.2 


Melanoma* (met) 
Hs688(B).T 


6.0 


Colon ca. HCC-2998 


5.3 


Melanoma UACC-62 


1.8 


Gastric ca.* (liver met) 
NCI-N87 


34.9 


Melanoma M14 


14.5 


Bladder 


14.0 


Melanoma LOX 
IMVI 


0.9 


Trachea 


6.5 


Melanoma* (met) 
SK-MEL-5 


3.8 


Kidney 


3.0 


Adipose 


12.8 



Table BZD* Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag3057, Run 
163577596 


Tissue Name 


Rel. Exp.(%) 
Ag3057, Run 
163577596 


Normal Colon 


81.8 


Kidney Margin 
8120608 


1.1 


CC Well to Mod Diff 
(OD03866) 


10.7 


Kidney Cancer 
8120613 


8.1 


CC Margin (OD03866) 


17.6 


Kidney Margin 
8120614 


1.1 
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CC Gr.2 rectosigmoid 
(OD03868) 


11.2 


Kidney Cancer 
9010320 


2.7 


CC Margin (OD03 868) 


4.8 


Kidney Margin 
9010321 


2.8 


CC Mod Diff (ODO3920) 


15.5 


Normal Uterus 


10.7 


k^Ks iviargin ^uuujvzuj 


zu.z 


uierus cancer uowi j 


J l .z 


CC Gr.2 ascend colon 

{yJLJKjjyZ i ) 


36.9 


Normal Thyroid 


19.1 


CC Margin (OD03921) 


9.7 

.... _ - ; 


Thyroid Cancer 

A/C/1 A1 A 
UO^U 1 U 


7.6 


CC from Partial 
Hepatectomy (U1XJ4JUV) 
Mets 


ZO.Z c 


Thyroid Cancer 
A302152 


J.O 


Liver Margin (ODO4309) 


13.1 


Thyroid Margin 
A302153 


13.7 


Colon mets to lung 

//"\T~\ A A A C 1 A1 \ 

(OD04451-01) 


2.9 


Normal Breast 


41.5 


Lung Margin (OD04451- 
02) 


6.0 


Breast Cancer 

//\ p\ A/1 

(<JJJ04:>oo) 


7.5 


Normal Prostate 6546-1 


60.3 


Breast Cancer 
(OUU4590-01) 


40.6 


Prostate Cancer 

o 

//^r\A A A 1 A\ 

(OL/044 1 U) 


20.2 


Breast Cancer Mets 

/T^T"\A/1 ^ AA 

(L)L/U4jVU-U3) 


32.3 


Prostate Margin 
(OD04410) 


z4.3 


Breast Cancer 

Metastasis 
(OD04655-05) 




Prostate Cancer 
(OD04720-01) 


23.8 


Breast Cancer 064006 


15.7 


rrosiaie lviargin 
(OD04720-02) 


37.4 


Breast Cancer 1024 


10.2 


Normal Lung 061010 


42.6 


Breast Cancer 
9100266 


6.5 


Lung Met to Muscle 
(OD04286) 


3o. / 


Breast Margin 
9100265 


0.4 


Muscle Margin 
(OD04286) 


9.2 


Breast Cancer 
A209073 


20.2 


Lung Malignant Cancer 
(OD03126) 


20.7 


Breast Margin 
A2090734 


15.2 


Lung Margin (OD03 126) 


17.8 


Normal Liver 


7.4 


Lung Cancer (OD04404) 


36.9 


Liver Cancer 064003 


7.2 


Lung Margin (OD04404) 


11.0 


Liver Cancer 1025 


3.2 


Lung Cancer (OD04565) 


11.3 


Liver Cancer 1026 


3.5 


Lung Margin (OD04565) 


10.6 


Liver Cancer 6004-T 


4.8 


Lung Cancer (OD04237- 
01) 


30.4 


Liver Tissue 6004-N 


4.8 
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Lung Margin (OD04237- j 
02) 


21.0 


Liver Cancer 6005 -T 


2.5 


Ocular Me! Met to Liver 
(ODO4310) 


11.7 


Liver Tissue 6005-N 


1.1 
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Normal Bladder 

1 1 V ' 1 1 1 1 LI 1 l^lUUUvl 


32.1 


Melanoma Mets to Lung 


10.2 


Bladder Cancer 1023 


5.6 


Lung Margin (OD04321) 


36.3 


Bladder Cancer 
A3021 73 


49.3 


Normal Kidney 


100.0 


Bladder Cancer 


28.7 


Kidney Ca, Nuclear grade 
2 (OD04338) 




Bladder Normal 
03) 


24.8 


jviuney iviargin 
(OD04338) 


16.8 


Normal Ovary 


6.5 


Jtvianey ^a iNuciear graae 
1/2(OD04339) 


11.0 


vyvtlilclll V^dllV-'Cl 

064008 


21.5 


iviuney iviargin 
(OD04339) 


21.9 


V_/V<lllCllI V_ClIlC'tl 

(OD04768-07) 


35.8 


i / ; o loot* i^i^ll 

jsjaney i^-a, oiear cen 
type (OD04340) 


66.4 


(~}\/av\/ avert n 
KJvcLiy IvlclI^Ill 

(OD04768-08) 


5.4 


iviuney iviargin 
(OD04340) 


18.0 


Normal Stomach 


80.1 


IVlClIlCy V^d, INUV^ICdl 1^1 clUC 

3 (OD04348) 


3.4 


9060358 


2.2 


kf^ irlnPA/ Fl4 ,iro in 
rvivJiic^' iviciii^iii 

(OD04348) 


10.2 


Stomach A/Tarcnn 
9060359 


12.3 


Kidney Cancer 
(OD04622-01) 


12.3 


Gastric Cancer 
9060395 


23.2 


Kidney Margin 
(OD04622-03) 


1.2 


Stomach Margin 
9060394 


14.6 


Kidney Cancer 
(OD04450-01) 


26.1 


Gastric Cancer 
9060397 


11.8 


Kidney Margin 
(OD04450-03) 


21.5 


Stomach Margin 
9060396 


5.8 


Kidney Cancer 8120607 


2.2 


Gastric Cancer 
064005 


55.5 



Table BZE. Panel CNS 1 



Tissue Name 


Rel. Exp.(%) Ag3057, 
Run 171694175 


Tissue Name 


Rel. Exp.(%) Ag3057, 
Run 171694175 


BA4 Control 


7.9 


BA17PSP 


21.3 


BA4 Control2 


16.0 


BA17PSP2 


5.4 


BA4 


2.9 


Sub Nigra Control 


58.6 
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Alzheimer's2 








BA4 Parkinson's 


32.8 


Sub Nigra Control2 


25.9 


BA4 
Parkinson's2 


46.7 


Sub Nigra 
Alzheimer's2 


20.0 


BA4 
Huntington's 


25.9 


Sub Nigra 
Parkinson's2 


85.3 


BA4 
Huntington's2 


4.1 


Sub Nigra 
Huntington's 


100.0 


BA4 PSP 


5.7 


Sub Nigra 
Huntington's2 


59.0 


BA4 PSP2 


36.9 


Sub Nigra PSP2 


25.2 


BA4 Depression 


15.6 


^nH Ni Qrpi 

O UU Ml 1 C4 

Depression 


35.8 


RA4 
Depression2 


27.2 


Slnh Nipra 

JUU 1^1 £^1 CI 

Depression2 


26.1 


BA7 Control 


12.9 


nioh Palladus 

V J JUU J Ul 

Control 


54.3 


BA7 Control2 


12.4 


Gloh Palladiis 
Control2 


15.2 


RA7 
Alzheimer's2 


5.2 


CMnh Palladus 
Alzheimer's 


17.4 


BA7 Parkinson's 


15.4 


Glob Palladus 
Alzheimer's2 


15.6 


BA7 
Parkinson's2 


23.2 


Glob Palladus 
Parkinson's 


73.7 


BA7 
Huntington's 


34.9 


Glob Palladus 
Parkinson's2 


15.8 


BA7 
Huntington's2 


56.3 


Glob Palladus PSP 


17.7 


BA7 PSP 


29.1 


Glob Palladus PSP2 


8.4 


BA7 PSP2 


12.6 


Glob Palladus 
Depression 


1/1 1 

Z4.1 


BA7 Depression 


12.2 


Temp Pole Control 


0.8 


BA9 Control 


4.4 


Temp Pole Control2 


13.9 


BA9 Control2 


36.1 


Temp Pole 
Alzheimer's 


2.7 


BA9 Alzheimer's 


3.8 


Temp Pole 
Alzheimer's2 


2.0 


BA9 
Alzheimer's2 


2.1 


Temp Pole 
Parkinson's 


15.9 


BA9 Parkinson's 


17.2 


Temp Pole 
Parkinson's2 


12.9 


BA9 
Parkinson's2 


20.7 


Temp Pole 
Huntington's 


19.2 


BA9 


36.9 


Temp Pole PSP 


6.2 
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Huntington's 








BA9 
Huntington's2 


15.2 


Temp Pole PSP2 


1 o 

1.8 


BA9 PSP 


27.4 


Temp Pole 
Depression2 


10.8 


BA9 PSP2 


14.0 


Cing Gyr Control 


35.6 


BA9 Depression 


7.5 


Cing Gyr Control2 


15.2 


BA9 
Depression2 


11.7 


Cing Gyr 
Alzheimer's 


15.7 


BA 17 Control 


20.7 


Cing Gyr 
Alzheimer's2 


10.7 


BA17 Control2 


15.3 


Cing Gyr Parkinson's 


47.0 


RA 1 7 

Alzheimer's2 


4.2 


viiig vj y i 

Parkinson's2 


48.6 


DA17 

Parkinson's 


41.8 


V_.lilg v jy r 

Huntington's 


85.3 


RA17 
Parkinson's2 


27.5 


Cine* Gvr 
Huntington's2 


53.2 


BA17 
Huntington's 


17.3 


Cing Gyr PSP 


80.1 


BA17 
Huntington's2 


27.0 


Cing Gyr PSP2 


6.9 


BA17 
Depression 


81.8 


Cing Gyr Depression 


18.6 


BA17 
Depression2 


51.4 


Cing Gyr 
Depression2 


69,7 



CNS__neurodegeneration_vl.O Summary: Ag3057 The NOV85 gene is found to be 
slightly but significantly (p=0.016) upregulated in the Alzheimer's disease (AD) temporal 
cortex. The temporal cortex is the region of the brain where neurons degenerate in the mid 
5 stages of AD. This increase in expression is not apparent in the occipital cortex, which does 
not experience neurodegeneration in AD. Since the upregulation of this gene appears to be 
neurodegeneration-specific both within an individual brain and between brains, this gene is an 
excellent small molecule target. Therefore, treatment with an antagonist may decrease the 
pathology seen in Alzheimer's disease. 
10 Panel 1.3D Summary: Ag3057 Highest expression of the NOV85 gene is seen in the 

CNS. Please see CNS Neurodegeneration for discussion of utility of this gene in the central 
nervous system. 

Among tissues with metabolic function, this gene has low levels of expression in 
pancreas, adrenal, thyroid, pituitary, skeletal muscle and adipose. Therefore, modulation of 
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this gene product may be a treatment for metabolic and endocrine diseases, including obesity 
and Types 1 and 2 diabetes. 

In addition, this gene is expressed at low levels in the cancer cell lines in this panel. 
This difference in expression is particularly prominent in the CNS cancer cell lines when 
5 compared to the normal brain tissues. Thus, this gene could potentially be used as a diagnostic 
marker in CNS cancers. 

Panel 2D Summary: Ag3057 The NOV85 gene is expressed at moderate to low levels 
in this panel. A higher level of expression is observed in lung, kidney, uterine, gastric and 
ovarian cancer when compared to the normal adjacent lung, kidney, uterine, gastric and 
10 ovarian tissues in this panel. Thus, this gene could be used as a diagnostic marker of cancer in 
these tissues. Futhermore, inhibition of the activity of this gene product using small molecule 
drugs may be useful for the treatment of cancer in these tissues 

Panel 4D Summary: Ag3057 The amp plot indicates that there is a high probability of 
experimental failure. (Data not shown.) 
15 Panel CNS1 Summary: Ag3057 These results confirm expression of the NOV85 

gene in the brain. Please see CNS Neurodegeneration for discussion of utility of this gene in 
the central nervous system. 

NOV86: GTPASE- ACTIVATING PROTEIN 

Expression of gene NOV86 was assessed using the primer-probe set Ag3058, 
20 described in Table CAA. Results of the RTQ-PCR runs are shown in Tables CAB, CAC and 
CAD. 



Table CAA. Probe Name Ag3058 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -agtacccgctgctgaacac-3 * 


19 ^ 1 \ 


534 


1339 


Probe 


TET-5 ' -accctcattgccaaggtcaaagcct-3 1 - \ 
TAMRA 


25 


578 


1340 


Reverse 


5 * -tcattgttgctctcataatgga-3 • 


22 


603 


1341 



Table CAB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) A g 3058, 
Run 165533238 


Tissue Name 


Rel. Exp.(%) Ag3058, 
Run 165533238 


Liver adenocarcinoma 


5.5 


Kidney (fetal) 


0.9 


Pancreas 


1.2 


Renal ca. 786-0 


0.6 


Pancreatic ca. CAP AN 
2 


3.9 


Renal ca. A498 


0.1 
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Adrenal gland 


1.8 


Renal ca. RXF 393 


1.0 


Thyroid 


1.7 


Renal ca. ACHN 


0.4 


Salivary gland 


6.4 


Renal ca. UO-31 


15.7 


Pituitary gland 


1.1 


Renal ca. TK-10 


0.1 


Brain (fetal) 


0.6 


Liver 


3.5 


Brain (whole) 


2.7 


Liver (fetal) 


4.9 


Brain (amygdala) 


3.0 


Liver ca. 
(hepatoblast) HepG2 


5.3 


Brain (cerebellum) 


0.4 


Lung 


16.5 


Brain (hippocampus) 


2.4 


Lung (fetal) 


8.4 


Brain (substantia nigra) 


4.7 


Luns? ca Ismail cell) 
LX-l 


1.8 


Brain (thalamus) 


2.7 


T lint* pa f^mall ppII^ 

NCI-H69 


1.0 


Cerebral Cortex 


1.2 


T iintr pa (<z ppll var ^ 

SHP-77_ 


1.3 


Spinal cord 


4.2 


T Pa (\<\Y&€: 

I . \_ 1 1 1 i^, K,<X. \ 1 <-l 1 L,v 

cell)NCI-H460 


0.9 


glio/astro U87-MG 


0.0 


Liitip ca fnon-sm 
cell) A549 


0.1 


glio/astroU-118-MG 


0.2 


T utip pa fnon-s ppIH 
NCI-H23 


0.2 


astrocytoma SW1783 


0.1 


Lung ca. (non-s.cell) 
HOP-62 


4.6 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


2.5 


Lung ca. (squam.) 
SW900 


6.9 


astrocytoma SNB-75 


5.4 


Lung ca. (squam.) 
NCI-H596 


U.3 


glioma SNB-19 


0.7 


Mammary gland 


1.7 


glioma U25 1 


1.1 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SF-295 


0.3 


Breast ca.* (pl.ef) 
MDA-MB-231 


19.2 


Heart (fetal) 


1.0 


Breast ca.* (pl.ef) 
T47D 


2.4 


Heart 


1.4 


Breast ca. BT-549 


4.7 


Skeletal muscle (fetal) 


0.5 


Breast ca. MDA-N 


0.9 


Skeletal muscle 


1.0 


Ovary 


1.8 


Bone marrow 


32.1 


Ovarian ca. OVCAR- 
3 


0.3 


Thymus 


29.3 


Ovarian ca. OVCAR- 
4 


0.3 


Spleen 


46.3 


Ovarian ca. OVCAR- 


9.0 
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Lymph node 


100.0 


Dvarian ca. OVCAR- 

o 
o 


0.7 


Colorectal 


1.4 m 


Ovarian ca. IGROV- 

i 
i 


0.7 


Stomach 


12.1 ' 


ovarian ca. ^ascuesj 
SK-OV-3 


0.2 


Small intestine 


16.6 


Uterus 




Colon ca. SW480 


2.3 


Placenta 


13.8 


Colon ca.* 
SW620(SW480 met) 


0.9 


Prostate 


0.9 


Colon ca. HT29 


1.2 


Prostate ca.* (bone 
met^rL,-3 


3.1 


Colon ca.HCT-116 


2.9 


Testis 


0.2 


Colon ca. CaCo-2 


3.5 


Melanoma 
Hs688(A).T 


0.1 


Colon ca. 
tissue(OD03866) 


4.6 


Melanoma* (met) 
Hsooo(B). 1 


0.0 


Colon ca. HCC-2998 


2.8 


Melanoma UACC-62 


0.1 


Gastric ca.* (liver met) 
NCI-N87 


4.6 


Melanoma M 14 


7.2 


Bladder 


1.9 


Melanoma LOX 
IMVI 


1.6 


Trachea 


8.1 


Melanoma* (met) 
SK-MEL-5 


0.1 


Kidney 


0.3 


Adipose 


! 3.5 


Table CAC. Pane! 2D 


Tissue Name 


Rel. Exp.(%) 
Ag3058, Run 
162569974 


Tissue Name 


Rel. Exp.(%) 
Ag3058, Run 
162569974 


Normal Colon 


30.8 


Kidney Margin 
8120608 


6.5 


CC Well to Mod Diff 
(OD03866) 


6.8 


Kidney Cancer 
8120613 


2.9 


CC Margin (OD03 866) 


6.7 


Kidney Margin 
8120614 


6.3 


CC Gr.2 rectosigmoid 
(OD03868) 


8.5 


Kidney Cancer 
9010320 


35.4 


CC Margin (OD03868) 


1.3 


Kidney Margin 
9010321 


17.0 


CC Mod Diff (ODO3920) 


9.9 


Normal Uterus 


2.4 


CC Margin (ODO3920) 


11.3 


Uterus Cancer 06401 1 


10.2 


CC Gr.2 ascend colon 


14.7 


Normal Thyroid 6.8 
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(OD03921) 








CC Margin (OD03921) 


10.7 


Thyroid Cancer 
064010 


5.1 


CC from Partial 
Hepatectomy (ODO4309)! 
Mets 


18.7 


Thyroid Cancer 
A302152 


6.8 


T iver Martrin fOnO4309Y 

JL/lvt'l Ivldl gill ^ ' V-/^r ->\J y j 


12.4 


Thyroid Margin j 
A302153 . 


7.9 


Colon mets to lung 
(OD04451-01) 


12.7 


Normal Breast 


17.7 


Lung Margin (OD0445 1- j 
02) 


18.7 


Breast Cancer 
(OD04566) 


10.4 


Normal Prostate 6546-1 


13.8 


Breast Cancer 
(OD04590-01) 


18.2 


Prostate Cancer 
(OD04410) 


13.8 


Breast Cancer Mets 
(OD04590-03) 


59.9 


Prostate Margin 


11.7 


Breast Cancer 

Metastasis 
(OD04655-05) 


52.5 


Prostate Cancer 
(OD04720-01) 


7 "3 


Rr^Qt ratipff 064006 

UICojL ^-dll^d \j\j^\j\JK) 


21 3 


Prostate Margin 
(OD04720-02) 




Rrpj^^t r'fitir'pr 1 074 


9.7 


Normal Lung 061010 


100.0 


Breast Cancer 
9100266 


12.9 


Lung Met to Muscle 

/TYT1049RM 


29.5 


Breast Margin 
Q1 0076S 

J 1 \J\J Z-\J J 


14.1 


lV/Ti i If* lV/l^rcrin 

(OD04286) 


8.2 


LJlLuoL V c.l I 1 V v_ 1 

A209073 


11.6 


idling ivid.iignd.iii ^diiccr 
(OD03126) 


31.6 


1)IC i_l > I IVldlglll 

A2090734 


6.6 


T una Mnrain ^000^1?^ 

J_/LUlg 1 VI dig, HI 1 Z*\J J 


48 6 


Normal T iver 

1 lVJl 111U1 1 > I V VI 


7.2 


Lung Cancer (OD04404) 


16.4 


Liver Cancer 064003 


5.4 


Lung Margin (OD04404) 


17.1 


Liver Cancer 1025 


O.O 


Lung Cancer (OD04565) 


6.7 


Liver Cancer 1026 


Yd. A 


Lung Margin (OD04565) 


20.3 


Liver Cancer 6004-T 


7.2 


Lung Cancer (OD04237- 
01) 


24.0 


Liver Tissue 6004-N 


6.7 


Lung Margin (OD04237- 
02) 


43.2 


Liver Cancer 6005 -T 


14.7 


Ocular Mel Met to Liver 
(ODO4310) 


3.4 


Liver Tissue 6005-N 


4.5 


Liver Margin (OD043 1 0) 


8.2 


Normal Bladder 


20.2 


Melanoma Mets to Lung 
(OD04321) 


6.0 


Bladder Cancer 1023 


9.2 
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Lung Margin (OD04321) i 


69.3 


Bladder Cancer 
A302173 


9.6 


Normal Kidney 


13.0 


Bladder Cancer 
(OD047 18-01) 


38.2 


Kidney Ca, Nuclear gradej 


22.5 


Bladder Normal 
Adjacent (OD04718- 
03) 


14.9 


Kidney Margin 
(OD04338) 


11.1 


Normal Ovary 


5.8 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


20.4 


Ovarian Cancer 
064008 


17.4 


Kidney Margin 
(OD04339) 


6.5 


Ovarian Cancer 
(OD04768-07) 


3.7 


Kidney Ca, Clear cell 
type (OD04340) 


28.5 


Ovary Margin 
(OD04768-08) 


5.2 


Kidney Margin 
(OD04340) 


13.7 


Normal Stomach 


14.8 


Kidney Ca, Nuclear grade 
3 (OD04348) 


9 7 


Gastric Cancer 
9060358 


8.0 


Kidney Margin 
(OD04348) 


11.3 


Stomach Margin 
9060359 


17.4 


Kidney Cancer 
(OD04622-01) 


27.9 


Gastric Cancer 
9060395 


17.7 


Kidney Margin 
(OD04622-03) 


2.6 


Stomach Margin 
9060394 


26.8 


Kidney Cancer 
(OD04450-01) 


2.3 


Gastric Cancer 
9060397 


20.2 


Kidney Margin 
(OD04450-03) 


5.3 


Stomach Margin 
9060396 


12.9 


Kidney Cancer 8120607 


7.4 


Gastric Cancer 
064005 


29.7 



Table CAD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3058, Run 
162562989 


Tissue Name 


Rel. Exp.(%) 
Ag3058, Run 
162562989 


Secondary Thl act 


5.9 


HUVEC IL-lbeta 


21.9 


Secondary Th2 act 


6.2 


HUVEC IFN gamma 


16.8 


Secondary Trl act 


9.5 


HUVEC TNF alpha + IFN 
gamma 


20.6 


Secondary Thl rest 


1.0 


HUVEC TNF alpha + IL4 


28.1 


Secondary Th2 rest 


1.2 


HUVEC IL-11 


12.3 


Secondary Trl rest 


1.4 


Lung Microvascular EC 
none 


28.3 


Primary Thl act 


8.7 


Lung Microvascular EC 


46.7 
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TNFalpha + IL-lbeta 




Primfirv Th^? act 

1 111 Hill ^ 1 Ux- CIV^L 


3.5 


Microvascular Dermal EC 
none 


38.7 


Prim^rv TY1 art 

1 llllldlj' 111 CIV-' L 


8.3 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


24.7 


t>rim^rv Th 1 rp<st 

1 1 lllldl jr llll it/Ot 


7.6 


Bronchial epithelium 
TNFalpha + ILlbeta 


9.2 


Primary Th2 rest 


3.1 


Small airway epithelium 
none 


13.8 


Primary Trl rest 


4.8 


Small airway epithelium 
TNFalpha + IL-lbeta 


100.0 


CD45RA CD4 
lymphocyte act 


13.7 


Coronery artery SMC rest 


22.2 


CD45RO CD4 
lymphocyte act 


7.3 


Coronery artery SMC 
TNFalpha + IL-lbeta 


11.3 


CD8 lymphocyte act 


2.3 


Astrocytes rest 


14.7 


Secondary CD8 1 ^ ^ 
lymphocyte rest 


Astrocytes TNFalpha + 
IL-lbeta 


12.0 


Secondary CD8 
lymphocyte act 


4.4 


KU-812 (Basophil) rest 


12.2 


CD4 lymphocyte none 


1.5 


KU-812 (Basophil) 
PMA/ionomycin 


46.7 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


1.3 


CCD1 106 (Keratinocytes) 
none 


24.0 


LAK cells rest 


15.3 


CCD1 lUo (Reratinocytesj 
TNFaloha + IL-lbeta 


1.6 


LAK cells IL-2 


4.5 


Liver cirrhosis 


3.7 


LAK cells IL-2+IL-12 


4.5 


Lupus kidney 


3.9 


LAK cells IL-2+IFN 
gamma 


8.1 


NCI-H292 none 


46.3 


LAK cells IL-2+ IL-18 


7.9 


NCI-H292 IL-4 


57.4 


LAK cells 
PMA/ionomycin 


10.2 


NCI-H292 IL-9 


63.3 


NK Cells IL-2 rest 


2.0 


NCI-H292 IL-13 


28.1 


Two Way MLR 3 day 


2.6 


NCI-H292 IFN gamma 


25.9 


Two Way MLR 5 day 


3.1 


HPAEC none 


14.8 


Two Way MLR 7 day 


2.3 


HPAEC TNF aloha + IL-1 
beta 


19.9 


PBMC rest 


2.4 


Lung fibroblast none 


29.1 


PBMC PWM 


12.3 


Lung fibroblast TNF alpha 
+ IL-1 beta 


13.3 


PBMC PHA-L 


5.1 


Lung fibroblast IL-4 


39.8 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


24.5 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


19.1 
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R Ivrn nhnpvtfs PWM 
u i y 1 1 1 i y i vv ivi 


32.3 


Lung fibroblast IFN 
gamma 


45.7 


B lymphocytes CD40L 
and IL-4 


4.1 


Dermal fibroblast 
CCD 1070 rest 


57.4 


EOL-1 dbcAMP 


0.7 


Dermal fibroblast 
CCD 1 070 TNF alpha 


81.8 


EOL-1 dbcAMP 
r ivi/\/ ioinjiiiyL/iii 


6.7 


Dermal fibroblast 
PCD! 070 TT -1 beta 


25.0 


Dendritic cells none 


12.8 


gamma 


51.1 


Dendritic cells LPS 


11.8 


Dermal fibroblast IL-4 


52.9 


Dendritic cells anti- 
CD40 


13.8 


IBD Colitis 2 


0.7 


Monocytes rest 


6.2 


IBD Crohn's 


3.4 


Monocytes LPS 


6.5 


Colon 


22.5 


Macrophages rest 


18.3 


Lung 


19.2 


Macrophages LPS 


10.0 


Thymus 


24.7 


HUVEC none 


33.7 


Kidney 


13.0 


HUVEC starved 


49.0 







Panel 1.3D Summary: Ag3058 Highest expression of the NOV86 gene, a GTPase- 
activating protein homolog, is seen in the lymph node (CT=27.8). Among tissues with 
metabolic function, this gene has low levels of expression in pancreas, adrenal, thyroid, 
5 pituitary, heart, skeletal muscle, liver and adipose. Rab GTPases are integral to vesicular 

transport in the secretory and endocytic pathways. Therefore, therapeutic modulation of this 
gene product may be a treatment for metabolic and endocrine diseases, including obesity and 
Types 1 and 2 diabetes. 

This GTPase activating enzyme like molecule is also expressed at low levels in the 
10 CNS. Thus, it may be useful in treating diseases of the nervous system, stroke or CNS trauma. 

In addition, this gene is expressed at low levels in the cancer cell lines in this panel. 
Therefore, modulation of expression of this gene may be useful in treating cancer. 

Panel 2D Summary: Ag3058 The NOV86 gene is expressed at low levels in this 
panel. There is higher expression in kidney, breast, liver and bladder cancer samples compared 
15 to the adjacent normal tissue. Conversely, there is lower expression in lung cancer samples 
compared to the adjacent normal tissue. Thus, the expression of this gene could be used as a 
diagnostic marker for kidney, breast, liver, bladder and lung cancers. Furthermore, modulation 
of expression of this gene may also be used for therapy of these cancers. 

Panel 4D Summary: Ag3058 The NOV86 gene is expressed at high to moderate 
20 levels in a wide range of cell types of significance in the immune response and tissue response 
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in health and disease, with the highest expression being detected in TNF alpha plus IL-1 beta 
treated small airway epithelial cells (CT-28.03). Therefore, targeting of this gene product with 
a small molecule drug or antibody therapeutic may modulate the functions of cells of the 
immune system as well as resident tissue cells and lead to improvement of the symptoms of 
5 patients suffering from autoimmune and inflammatory diseases such as COPD, emphysema, 
asthma, allergies, inflammatory bowel disease, lupus erythematosus, and arthritis, including 
osteoarthritis and rheumatoid arthritis. 

NOV87a and NOV87b 

Expression of gene NOV87a and full length clone NOV87b was assessed using the 
10 primer-probe set Ag3040, described in Table CBA. 

Table CBA. Probe Name Ag3040 



Primers 


Sequences 


| Length 


Start 
Position 


SEQ XD NO: 


Forward 


5 ' -gagccctgaagctcttctttc-3 1 


}21 


617 


1342 


Probe 


TET-5 * -cttctcgcacttccgccagttcatt-3 • - 
TAMRA 


25 


663 


1343 


Reverse 


5 ■ -cctggtcctgctcactgat-3 ' 


jx, 


694 


1344 



CNSneurodegenerationvl.O Summary: Ag3040 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 
15 Panel 1.3D Summary: Ag3040 Expression of this gene is low/undetectable (CTs > 

35) across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag3040 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

NOV88 

20 Expression of gene NOV88 was assessed using the primer-probe set Ag2923, 

described in Table CCA. 

Table CCA. Probe Name Ag2923 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -agtcaacagatttggccacat-3 1 • 


21 


45 


1345 


Probe 


TET-5 • -tcaccagggctgcttttaactctggt-3 • - 
TAMRA 


26 


77 


1346 


Reverse 


5 ' -cagtgaaggggtcactgatg-3 * 


20 


120 


1347 
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CNSneurodegenerationvl.O Summary: Ag2923 Expression of this gene is 
low/undetectable (CTs > 34.5) across all of the samples on this panel (data not shown). 

Panel 1.3D Summary: Ag2923 Expression of this gene is low/undetectable (CTs > 
34.5) across all of the samples on this panel (data not shown). 
5 Panel 2D Summary: Ag2923 Expression of this gene is low/undetectable (CTs > 35) 

across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag2923 Expression of this gene is low/undetectable (CTs > 
34.5) across all of the samples on this panel (data not shown). 

NOV89 

10 Expression of gene NOV89 was assessed using the primer-probe set Ag2924, 

described in Table CDA. 

Table CDA. Probe Name Ag2924 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -gaacgggaagcttgttatcaat-3 * 


22 


231 


1348 


Probe 


TET-5 ' -agatctcaccaaaat caaatggggca- 3 • - 
TAMRA 


26 


282 


1349 


Reverse 


5 ' -atgatgtactcagtgccagcat-3 ' 


22 


308 


1350 



CNS_neurodegeneration_vl,0 Summary: Ag2924 Expression of this gene is 
1 5 low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel 1.3D Summary: Ag2924 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag2924 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 
20 Panel 5 Islet Summary: Ag2924 Run 242285280 Expression of this gene is 

low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). Run 
243564308 The amp plot indicates that there were experimental difficulties with this run. 

NOV90 



Expression of gene NOV90 was assessed using the primer-probe set Ag3045> 
25 described in Table CEA. Results of the RTQ-PCR runs are shown in Tables CEB, CEC, CED, 
CEE, CEF and CEG. 
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Table CEA. Probe Name Ag3045 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -caccggatcatatgaaatcaat-3 ' 


22 


605 


1351 


Probe 


TET-5 ' -tgtaattgaccctgttcctgcaccag-3 * - 
TAMRA 


26 


639 


1352 


Reverse 


5 ' -ccaccatcaacatttgaatca-3 ' 


21 


669 


1353 



Table CEB. CNS_neurodegeneration_vl.O 



Tissue Name 


j Rel. Exp.(%) Ag3045, 
Run 211012233 


I'icCllA IV'imii 

i issue iiaiiic 


ReL Exp.(%) Ag3045, 
Run 211012233 


AD 1 Hippo 


0.8 


Control (Path) 3 
Temporal Ctx 




AD 2 Hippo 


0.1 


Control (Path) 4 
Temporal Ctx 


A A 


AD 3 Hippo 


0.5 


AD 1 Occipital Ctx 


2.1 


AD 4 Hippo 


0.3 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


8.5 


AD 3 Occipital Ctx 


0.3 


AD 6 Hippo 


0.3 


AD 4 Occipital Ctx 


2.0 


Control 2 Hippo 


0.5 


AD 5 Occipital Ctx 


0.8 


Control 4 Hippo 


0.5 


AD 6 Occipital Ctx 


i i 

1.1 


Control (Path) 3 
Hippo 


0.4 


Control 1 Occipital 
Ctx 


0.0 


AD 1 Temporal Ctx 


1.4 


Control 2 Occipital 
Ctx 


2.2 


i vi_y Zr l ciiipurd.i ^ix 


U.U 


Control 3 Occipital 
Ctx 


2.1 


AD 3 Temporal Ctx 


0.7 


Control 4 Occipital 
Ctx 


0.2 


AD 4 Temporal Ctx 


2.7 


Control (Path) 1 
Occipital Ctx 


4.1 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


2.4 


AD 5 SupTemporal j 
Ctx 


4.4 


Control (Path) 3 
Occipital Ctx 


0.0 


AD 6 Inf Temporal 1 
Ctx 


0.9 


Control (Path) 4 
Occipital Ctx 


2.7 


AD 6 Sup Temporal 
Ctx 


0.9 


Control 1 Parietal 
Ctx 


0.0 


Control 1 Temporal 1 
Ctx 


0.0 


Control 2 Parietal 
Ctx 


4.1 


Control 2 Temporal j 

ctx i 


0.7 


Control 3 Parietal 
Ctx 


1.5 


Control 3 Temporal j 


1.5 


Control (Path) 1 


2.6 
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.1 ill « 7'r » CI Ji 7r* ,, O. 



£1 



Ctx 




Parietal Ctx 




Control 4 Temporal 
Ctx 


0.6 


Control (Path) 2 
Parietal Ctx 


2.6 


Control (Path) 1 
Temporal Ctx 


3.1 


Control (Path) 3 
Parietal Ctx 


0.2 


Control (Path) 2 
Temporal Ctx 


3.6 


Control (Path) 4 
Parietal Ctx 


3.4 



Table CEC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3045, 
Run 165519994 


Tissue Name 


Rel. Exp.(%) Ag3045, 
Run 165519994 


Liver adenocarcinoma 


10.3 


Kidney (fetal) 


1.9 


Pancreas 


3.7 


Renal ca. 786-0 


4.8 


Pancreatic ca. CAPAN 
2 


3.5 


Renal ca. A498 


16.2 


Adrenal gland 


7.5 


Renal ca. RXF 393 


6.6 


Thyroid 


2.2 


Renal ca APHTM 




Salivary gland 


8.6 


Renal ca. UO-31 


6.7 


i liuiiary giana 


Zo. / 


Renal ca. TK-10 


1.1 


Brain (fetal) 


20.3 


Liver 


4.4 


Brain (whole) 


100.0 


Liver (fetal) 


1.9 


Brain (amygdala) 


8.4 


Liver ca. 
(hepatoblast) HepG2 


3.6 


Brain (cerebellum) 


50.7 


Lung 


4.1 


Brain (hippocampus) 


36.6 


Lung (fetal) 


2.3 


Brain (substantia nigra) 


12.1 


Lung ca. (small cell) 
LX-1 


0.8 


Brain (thalamus) 


29.7 


Lung ca. (small cell) 
NCI-H69 


2.5 


Cerebral Cortex 


4.2 


Lung ca. (s.cell var.) 
SHP-77 


0.6 


Spinal cord 


10.4 


Lung ca. (large 
cell)NCI-H460 


2.1 


glio/astro U87-MG 


3.2 


Lung ca. (non-sm. 
cell) A549 


0.0 


glio/astro U-118-MG 


6.6 


Lung ca. (non-s.cell) : 
NC1-H23 


3.2 


astrocytoma SW1783 


2.4 


Lung ca. (non-s.cell) 
HOP-62 


4.5 


neuro*; met SK-N-AS 


3.5 


Lung ca. (non-s.cl) 
NCI-H522 


0.6 


astrocytoma SF-539 


35.4 


Lung ca. (squam.) 
SW 900 


16.4 


astrocytoma SNB-75 


14.0 


Lung ca. (squam.) 


2.6 
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NCI-H596 




glioma SNB-ly 


37.9 


Mammary gland 


2.9 


glioma U251 


44.4 


Breast ca. (pi er) 
MCF-7 


4.8 


glioma SF-295 


0.4 


Breast ca.* (pl.ei) 
MDA-MB-23 1 


7.1 


Heart (fetal) 


0.4 


Breast ca.* (pl.er) 
T47D 


8.5 


Heart 


9.5 


Breast ca. BT-549 


3.8 


Skeletal muscle (fetal) 


1.0 


Breast ca. MDA-N 


1.3 


Skeletal muscle 


5.4 


Ovary 


1.3 


Bone marrow 


9.9 


Ovarian ca. OVCAR- 

3 


2 8 


Thymus 


8.8 


Ovarian ca. OVCAR- 
4 


0 9 


Spleen 


9.4 


Ovarian ca. OVCAR- 
5 


3.6 


Lymph node 


21.8 


Ovarian ca. OVCAR- 
8 


20.2 


Colorectal 


3.3 


Ovarian ca. IGROV- 
1 


0.6 


Stomach 


6.3 


Ovarian ca.* (ascites) 
SK-OV-3 


11.4 


Small intestine 


8.0 


Uterus 


18.8 


Colon ca. SW480 


4.1 


Placenta 


7.0 


Colon ca.* 
s WozU(;SW45U met) 


2.1 


Prostate 


15.7 


Colon ca. HT29 


0.5 


Prostate ca.* (bone \ 
met)PC-3 


2.8 


Colon ca. HCT-116 


2.5 


Testis 


11.4 


Colon ca. CaCo-2 


1.9 


Melanoma 
Hs688(A).T 


0.4 


Colon ca. 


2.4 


Melanoma* (met) 

nS05o(D ). 1 


0.4 


Colon ca. I ICC-2998 


2.4 


Melanoma UACC-62 


3.1 


Gastric ca.* (liver met) 
NCI-N87 


3.3 


Melanoma M14 


7.9 


Bladder 


11.3 


Melanoma LOX 
IMVI 


0.0 


Trachea 


1.7 


Melanoma* (met) 
SK-MEL-5 


1.8 


Kidney 


10.2 


Adipose 


1.4 
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Tissue Name 


ReL Exp.(%) 
Ag3045, Run 
163577595 


Tissue Name 


Rel Fxn (%} 
Ag3045, Run 
163577595 


Normal Colon 


3U.4 


Kidney Margin 
8120608 


1.1 


CC Well to Mod Diff 
(OD03866) 


2.9 


Kidney Cancer 
8120613 


2.5 


CC Margin (OD03866) 


1.1 


Kidney Margin 
8120614 


0.2 


CC Gr.2 rectosigmoid 
(OD03868) 


2.7 


Kidney Cancer 
9010320 


2.9 


CC Margin (UUUiooo) 


4.1 


Kidney Margin 
9010321 


4.0 


CC Mod Diff (ODO3920) 


1.9 


Normal Uterus 


2.1 


CC Margin (ODO3920) 


3.9 


Uterus Cancer 06401 1 


22.2 


CC Gr.2 ascend colon 
(OD03921) 


2.7 


Normal Thyroid 


7.1 


CC Margin (OD03921) 


2.6 


064010 


2.0 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


3.6 


Thyroid Cancer 
A302152 


2.0 


Liver Margin (ODO4309) 


3.6 


Thyroid Margin 
A302153 


20.9 


Colon mets to lung 
(OD04451-01) 


1.8 


Normal Breast 


9.6 


Lung Margin (OD04451- 
02) 


2.5 


Breast Cancer 
(OD04566) 


5.3 


Normal Prostate 6546-1 


100.0 


Breast Cancer 
(OD04590-01) 


55.5 


Prostate Cancer 
(OD04410) 


39.0 


Breast Cancer Mets 
(OD045 90-03) 


92.7 


Prostate Margin 
(OD04410) 


17.3 


Breast Cancer 

Metastasis 
(OD04655-05) 


31.9 


Prostate Cancer 
(OD04720-01) 


I5.l 


Breast Cancer 064006 


21.8 


Prostate Margin 
(OD04720-02) 


37.1 


Breast Cancer 1024 


6.5 


Normal Lung 061010 


47.0 


Breast Cancer 
9100266 


8.4 


Lung Met to Muscle 
(OD04286) 


4.4 


Breast Margin 
9100265 


4.0 


Muscle Margin 
(OD04286) 


10.3 


Breast Cancer 
A209073 


12.2 
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Table CED. Panel 2D 



T liner lVla iicmant Panrer 

(OD03126) 


8.1 


DICclM IVldlglll 

A2090734 


2.2 


T ung Margin fOD0^1?6^ 


o . o 


INUlllIdi L/IVCI 


i n 
i .\j 


Lung Cancer (OD04404) 


47.0 


Liver Cancer 064003 


0.8 


Lung Margin (OD04404) 


1 1.0 


Liver Cancer 1025 


0.7 


Lung Cancer (OD04565) 


6.9 


Liver Cancer 1026 


0.6 


Lung Margin (OD04565) 


1.7 


Liver Cancer 6004-T 


3.1 


Lung Cancer (OD04237- 
01) 


12.2 


Liver Tissue 6004-N 


1.7 


Lung Margin (OD04237- 
02) 


5.4 


Liver Cancer 6005-T 


0.8 


Ocular Mel Met to Liver 
(ODO4310) 


5.1 


Liver Tissue 6005-N 


0.8 


Liver Margin (OD043 1 0) 


1.0 


Normal Bladder 


12.5 


Melanoma Mets to Lung 
(OD04321) 


3.3 


Bladder Cancer 1023 


1.4 


Lung Margin (OD04321) 


5.7 


Bladder Cancer 
A302173 


6.7 


Normal Kidney 


7.2 


Bladder Cancer 
(OD047 18-01) 


16.4 


Kidney Ca, Nuclear grade 

^- ^VyL^U^J JO J 


8.4 


Bladder Normal 
Adjacent (OD04718- 
03) 


23.3 


Kidney Margin 
(OD04338) 




M nrin q 1 (~}\/ c\ t*\/ 




Kidney Ca Nuclear grade 
1/2 (OD04339) 


0 8 


Ovarian Cancer 
064008 




Kidney Margin 
(OD04339) 


1.8 


Ovarian Cancer 
(OD04768-07) 


2.4 


Kidney Ca, Clear cell 
type (OD04340) 


2.1 


Ovary Margin 
(OD04768-08) 


2.4 


Kidney Margin 
(OD04340) 


4.2 


Normal Stomach 

1 i Ul 111CII O IUI 1 1 CX\^ 1 1 


1.0 


Kidney Ca, Nuclear grade 
3 (OD04348) 


1.6 


Gastric Cancer 
9060358 


0.5 


Kidney Margin 
(OD04348) 


10.4 


Stomach Margin 
9060359 


2.4 


Kidney Cancer 
(OD04622-01) 


1.3 


Gastric Cancer 
9060395 


4.5 


Kidney Margin 
(OD04622-03) 


1.1 


Stomach Margin 
9060394 


1.9 


Kidney Cancer 
(OD04450-01) 


9.0 


Gastric Cancer 
9060397 


7.7 


Kidney Margin 
(OD04450-03) 


14.0 


Stomach Margin 
9060396 


0.3 
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Kidney Cancer 8120607 



0.9 



Gastric Cancer 
064005 



8.3 



Table CEE. Panel 3D 



Tissue Name 


Rel Exo.f%^ 
Ag3045, Run 
164886427 


Tissue Name 


Rel. Exp.(%) 
Ag3045, Run 
164886427 


Daoy- Medulloblastoma 


3.4 


Ca Ski- Cervical epidermoid j 
carcinoma (metastasis) 


6.6 ; 


TE671- Medulloblastoma 


4.4 


ES-2- Ovarian clear cell 
carcinoma 


2.1 


D283 Med- 
Medulloblastoma 


4.2 


Ramos- Stimulated with 
PMA/ionomycin 6h 


6.0 


PFSK-1- Primitive 
Neuroectodermal 


1 7 


Ramos- Stimulated with 
PMA/ionomycin 14h 


1 7 


XF-498- CNS 


7.1 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


4.2 1 


SNB-78- Glioma 


2.2 


Raji- Burkitt's lymphoma 


2.0 


SF-268- Glioblastoma 


0.8 


Daudi- Burkitt's lymphoma 


3.4 


T98G- Glioblastoma \ 


1.3 


U266- B-cell plasmacytoma 


4.9 


SK-N-SH- 
Neuroblastoma 
(metastasis) 


3.1 


CA46- Burkitt's lymphoma 


0.5 


Sr-295- ulioblastoma 


2.0 


RL- non-Hodgkin's B-cell 
lymphoma 


O 1 

L. 1 


Cerebellum 


100.0 


JM1- pre-B-cell lymphoma 


5.8 


Cerebellum 


1.4 


Jurkat- T cell leukemia 


3.5 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


23.0 


TF- 1 - Erythroleukemia 


8.3 


DMS-114- Small cell 
lung cancer 


8.4 


HUT 78- T-cell lymphoma 


3.4 


DMS-79- Small cell lung 
cancer 


26.6 


U937- Histiocytic lymphoma 


1.2 


NCI-H146- Small cell 
lung cancer 


5.0 


KU-812- Myelogenous 
leukemia 


2.6 


NCI-H526- Small cell 
lung cancer 


30.6 


769-P- Clear cell renal 
carcinoma 


1.4 


NCI-N417- Small cell 
lung cancer 


1.3 


Caki-2- Clear cell renal 
carcinoma 


3.3 


NCI-H82- Small cell 
lung cancer 


4.2 


SW 839- Clear cell renal 
carcinoma 


1.6 


NCI-H157- Squamous 
cell lung cancer 


2.6 


G401- Wilms' tumor 


2.0 
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(metastasis) 








NCI-HI 155- Large cell 
lung cancer 


50.0 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


5.3 


NCI-H1299- Large cell 
lung cancer 


10.5 


C APAN- 1 - Pancreatic 
adenocarcinoma (liver 
metastasis) 


2.6 


NCI-H727- Lung 
carcinoid 


2.6 


SU86.86- Pancreatic 
carcinoma (liver metastasis) ! 


6.5 


NCI-UMC-ll-Lung 
carcinoid 


8.5 


BxPC-3- Pancreatic 
adenocarcinoma 


1.5 


LX-1- Small cell lung 
cancer 


2.0 


HPAC- Pancreatic 
adenocarcinoma 


2.8 


Colo-205- Colon cancer 


3.0 


MIA PaCa-2- Pancreatic 
carcinoma 


1.9 


KM 12- Colon cancer 


2.6 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


6.9 


KM20L2- Colon cancer 


0.8 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


7.5 


NCI-H716- Colon cancer 


3.7 


T24- Bladder carcinma 
(transitional cell) 


2.3 


SW-48- Colon 
adenocarcinoma 


2.6 


5637- Bladder carcinoma 


4.0 


SW1116- Colon 
adenocarcinoma 


1.0 


HT-1 197- Bladder carcinoma 


3.8 


LS 174T- Colon 
adenocarcinoma 


3.2 


UM-UC-3- Bladder carcinma 
(transitional cell) 


1.0 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


1.2 


SW-480- Colon 
adenocarcinoma 


1.1 


HT-1 080- Fibrosarcoma 


6.8 


NC1-SNU-5- Gastric 
carcinoma 


3.2 


MG-63- Osteosarcoma 


1.3 


KATO III- Gastric 
carcinoma 


3.9 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


6.3 


NCI-SNU-16- Gastric 
carcinoma 


1.6 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


2.6 


NCI-SNU-1- Gastric 
carcinoma 


3.2 


A431- Epidermoid carcinoma 


5.1 


RF-1- Gastric 
adenocarcinoma 


4.2 


WM266-4- Melanoma 


4.5 


RF-48- Gastric 
adenocarcinoma 


4.5 


DU 145- Prostate carcinoma 
(brain metastasis) 


1.0 


MKN-45- Gastric 
carcinoma 


4.7 


MDA-MB-468- Breast 
adenocarcinoma 


2.3 


NCI-N87- Gastric 
carcinoma 


0.9 


SCC-4- Squamous cell 
carcinoma of tongue 


1.4 
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OVCAR-5- Ovarian 
carcinoma 


,.4 


SCC-9- Squamous cell 
carcinoma of tongue 


0.3 


RL95-2- Uterine 
carcinoma 


0.3 


SCC-15- Squamous cell 
carcinoma of tongue 


0.7 


HelaS3- Cervical 
adenocarcinoma 


2.3 


CAL 27- Squamous cell 
carcinoma of tongue 


8.0 



I 
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Table CEF. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3045, Run 
162559632 


Tissue Name 


Rel. Exp.(%) 
Ag3045, Run 
162559632 


Secondary Th 1 act 


10.0 


HUVEC IL-lbeta 


2.2 


Secondary Th2 act 


12.0 


HUVEC IFN gamma 


4.2 


Secondary Trl act 


21.9 


HUVEC TNF alpha + IFN 
gamma 


3.2 


Secondary Thl rest 


3.3 


HUVEC TNF alpha + IL4 


4.4 


Secondary Th2 rest 


5.6 


HUVEC IL-11 


2.9 


SlpprvnHnrv Trl rpQt 


6.7 


Lung Microvascular EC 
none 


6.9 


P r i m fi r\/ Th 1 art 
i i illicit y i li i cl\^l 


13.1 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


4.2 


Primarv Th? act 


16.2 


Microvascular Dermal EC 
none 


8.2 


Primarv Trl act 

a l illicit ±11 ciwi 


17.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


4.8 


Primary Th 1 rest 


47.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


1.1 


Primary Th2 rest 


21.9 


Small airway epithelium 
none 


2.3 


Primary Trl rest 


33.2 


Small airway epithelium 
TNFalpha + IL-lbeta 


34.2 


CD45RA CD4 
lymphocyte act 


5.1 


Coronery artery SMC rest 


2.4 


CD45RO CD4 
lymphocyte act 


11.6 


Coronery artery SMC 
TNFalpha + IL-lbeta 


2.1 


CD8 lymphocyte act 


8.6 


Astrocytes rest 


5.4 


Secondary CDS 
lymphocyte rest 


8.1 


Astrocytes TNFalpha + 
IL-lbeta 


3.6 


Secondary CD8 
lymphocyte act 


6.0 


KU-812 (Basophil) rest 


4.3 


CD4 lymphocyte none 


2.3 


KU-8 12 (Basophil) 
PMA/ionomycin 


14.1 


2ry Thl/Th2/Trl anti- 
CD95CH11 


12.0 


CCD1 106 (Keratinocytes) 
none 


6.0 
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LAK cells rest 


10.7 


K^K^ui iuo ^Jverdiinocyiesi ; 
TNFalpha + 1L- 1 beta 


0.5 


LAK cells IL-2 


8.6 


Liver cirrhosis 


3.3 


LAK cells IL-2+IL- 12 ; 


8.5 


Lupus kidney 


1.7 


LAK cells IL-2+IFN 
gamma 


14.7 


NCI-H292 none 


100.0 


LAK cells IL-2+ IL-18 


18.0 


NCI-H292 IL-4 


84.1 


LAK cells 
PMA/ionomycin 


3.7 


NCI-H292 IL-9 


96.6 


NK Cells IL-2 rest 


6.8 


NCI-H292 IL-13 


24.1 


Two Way MLR 3 day 


6.6 


NCI-H292 IFN gamma 


22.8 


Two Way MLR 5 day 


8.8 


HPAEC none 


4.9 


Two Way MLR 7 day 


5.7 


mi /\t,L^ i iNr aipna + il-i 
beta 


3.9 


roMt rest 




Lung fibroblast none 


i ft 

J.O 


PBMC PWM 


27.5 


Lung fibroblast TNF alpha 
+ il>- i Deta 


2.6 


PBMC PHA-L 


7.0 


Lung fibroblast IL-4 


9.5 


Ramos (B cell) none 


6.6 


Lung fibroblast IL-9 


5.5 


Ramos (B cell) 
ionomycin 


34.4 


Lung fibroblast IL-13 


6.0 


B lymphocytes PWM 


29.1 


Lung fibroblast IFN 
gamma 


5.2 


B lymphocytes CD40L 1 
and IL-4 


10.7 


Dermal fibroblast 
CCD 1070 rest 


11.4 


EOL-1 dbcAMP 


6.3 


Dermal fibroblast 
i^L^L/iu/u iiNr aipna 


32.5 


POT 1 A\\c A \/TP 
I \_J 1 j- 1 UuC/\IVJ i 

PMA/ionomycin 


12.2 


ijermai Tiurouiasi 
CCD 1070 IL-1 beta 


5.7 


Dendritic cells none 


9.7 


Dermal fibroblast IFN 
gamma 


3.6 


Dendritic cells LPS 


6.1 


Dermal fibroblast IL-4 


5.9 


Dendritic cells anti- 
CD40 


11.7 


IBD Colitis 2 


0.3 


Monocytes rest 


10.2 


IBD Crohn's 


0.4 


Monocytes LPS 


9.5 


Colon 


8.7 


Macrophages rest 


22.5 


Lung 


8.0 


Macrophages LPS 


2.4 


Thymus 


13.5 


HUVEC none 


5.7 


Kidney 


43.5 


HUVEC starved 


9.1 







Table CEG. Panel CNS_1 
Tissue Name } RelTE xp^(%) A g3045, _J~" Tissue Name J Jj^jj^PrC^jij^i 
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Run 171694539 




Run 171694539 


□A4 Control 




DA 17 PQP 


u.u 


BA4Control2 


0.0 


BA17PSP2 


6.5 . 


BA4 
Alzheimer's2 


9.3 


Sub Nigra Control 


2.6 


BA4 Parkinson's 


55.1 


Sub Nigra Control2 


0.0 


BA4 
Parkinson's2 


19.1 


Sub Nigra 
Alzheimer's2 


11.2 


BA4 
Huntington's 


18.7 


Sub Nigra 
Parkinson's2 


9.0 


BA4 
Huntington's2 


6.5 


Sub Nigra 
Huntington's 


22.8 


BA4 PSP 


0.0 


Sub Nigra 
Huntington's2 


21.9 


BA4 PSP2 


6.1 


Sub Nigra PSP2 


2.3 


BA4 Depression 


15.1 


Sub Nigra 
Depression 


0.0 


BA4 
Depression2 


6.6 


Sub Nigra 
Depression2 


3.0 


BA7 Control 


8.7 


Glob Palladus 
Control 


2.1 


BA7 Control2 


0.0 


Glob Palladus 
Control2 


0.0 


BA7 
Alzheimer's2 


14.7 


Glob Palladus 
Alzheimer's 


0.9 


BA7 Parkinson's 


26.8 


Glob Palladus 
Alzheimer's2 


2.8 


BA7 
Parkinson's2 


16.3 


Glob Palladus 
Parkinson's 


61.6 


p A 7 
t> A / 

Huntington's 


61.1 


VjIOD rallaQUS 

Parkinson's2 


2.9 


DAT 

dA / 
Huntington ? s2 


23.5 


Glob Palladus PSP 


0.0 


D A 7 DC D 


\).y) 


n\r^V\ Pcillcirhic PQPO 

oioD raiiaaus rorz 


n ft 


BA7 PSP2 


11.6 


Glob Palladus 
Depression 


0.0 


Dr\ / ivcprcssiun 


R 1 


iciiip ruic v^uiiluji 


0 0 


BA9 Control 


0.9 


Temp Pole Control2 


1.3 


BA9 Control2 


0.0 


Temp Pole 
Alzheimer's 


0.0 


BA9 Alzheimer's 


0.0 


Temp Pole 
Alzheimer's2 


5.6 


BA9 
Alzheimer's2 


20.6 


Temp Pole 
Parkinson's 


27.9 


BA9 Parkinson's 


31.4 


Temp Pole 


2.0 
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| Parkinson's2 




BA9 
Parkinson's2 


21.9 


Temp Pole 
Huntington's 


29.1 


BA9 
Huntington's 


23.0 


Temp Pole PSP 


0.0 


BA9 
Huntington's2 


15.2 


Temp Pole PSP2 


0.9 


BA9 PSP 


0.0 


Temp Pole 
Depression2 


5.5 


dAv rsrz 


3.6 


Cing Gyr Control 




BA9 Depression 


2.0 


Cing Gyr Control2 


0.9 


BA9 
Depression2 


2 9 


Cing Gyr 
Alzheimer's 


0.0 


BA 17 Control 


1 S 1 


Cing Gyr 
Alzheimer's2 


14.5 [ 


BA17Control2 


3.3 


Cing Gyr Parkinson's 


26.4 


BA17 
Alzheimer's2 


15.5 


Cing Gyr 
Parkinson's2 


12.0 


BA17 
Parkinson's 


100.0 


Cing Gyr 
Huntington's 


46.0 


BA17 
Parkinson's2 


54.0 


Cing Gyr 
Huntington's2 


19.8 


BA17 
Huntington's 


32.3 


Cing Gyr PSP 


0.5 


BA17 
Huntington's2 


5.9 


Cing Gyr PSP2 


1.9 


BA17 
Depression 


4.0 


Cing Gyr Depression 


3.9 


BA17 
Depression2 


5.6 . 


Cing Gyr 
Depression2 


8.4 



CNS_neurodegeneration_vl.O Summary: Ag3045 The NOV90 gene is not 
differentially expressed in the postmortem brains of Alzheimer's diseased patients when 
compared to non-demented control. However, this panel does confirm the expression of this 
5 gene in the CNS of an independent sample of individuals. See panel 1 for a discussion of 
utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag3045 Expression of the NOV90 gene shows a brain- 
preferential expression profile, and is expressed at moderate levels in all regions examined. 
Thus, this gene may be of utility as a small molecule target in neurologic disease. 
10 In addition, significant expression is seen in a cluster of brain cancer cell lines. Thus, 

expression of this gene could be used to differentiate between this sample and other samples 
on this panel and as a marker to detect the presence of brain cancer. Furthermore, therapeutic 
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modulation of the expression or function of this gene may be effective in the treatment of 
brain cancer. 

Panel 2D Summary: Ag3045 The NOV90 gene is expressed at low levels in this 
panel. There is higher expression in gastric, breast, uterus and lung cancers then the normal 
samples from these organs. Expression of this gene could therefore be used as a diagnostic 
marker for gastric, lung, breast and uterine cancers. Furthermore, modulation of the gene 
product using small molecule inhibitors could be used for the treatment of these cancers. 

Panel 3D Summary: Ag3045 The NOV90 gene is expressed at a low level in most of 
the cancer cell lines on this panel. Modulation of the gene product using small molecule 
inhibitors culd therefore be used for the treatment of cancer. Highest expression of this gene is 
seen in the cerebellum, confirming the results seen in Panel 1.3D. 

Panel 4D Summary: Ag3045 The NOV90 gene, a serine/threonine-protein kinase 
homolog is expressed at moderate levels in pulmonary mucoepidermoid cells prepared under 
several conditions of cell activation: NCI-H292 none (CT=29.01), NCI-H292 IL-4 
(CT-29.26), NCI-H292 IL-9 (CT-29.06), NCI-H292 IL-13 (CT=3 L06), NCI-H292 IFN 
gamma (CT=31.14). Therefore, small molecule antagonists that block the function of the 
NOV90 gene product may be useful in several autoimmune and inflammatory diseases of the 
lung including, but not limited to, chronic obstructive pulmonary disease, asthma, and 
emphysema. 

Panel CNS_1 Summary: Ag3045 The expression in this panel confirms the presence 
of the NOV90 in the brain. Thus, this gene may be of utility as a small molecule target in 
neurologic disease. 

NOV91 

Expression of gene NOV91 was assessed using the primer-probe set Ag3018, 
described in Table CFA. Results of the RTQ-PCR runs are shown in Tables CFB, CFC and 
CFD. 

Table CFA. Probe Name Ag3018 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -ctgcagggttggagaaatg- 3 ' 


19 


71 


1354 


Probe 


TET-5 * -ccatcctgggcaaacccaaggat-3 1 - 
TAMRA 


23 


107 


1355 


Reverse 


5 ' -ctacacccatcatgttcacatg-3 ' 


22 


130 


1356 
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Table CFB. CNS_neurodegeneration_vl.O 



Tissiif* Alsimf* 


Rel. Exp.(%) Ag3018, 
Run 209820676 


Tissue Name 

M Uk7ml%*- J. l 14111V 


Rel. Exp.(%) Ag3018, 
Run 209820676 


AD 1 Hippo 


0.0 


Control (Path) 3 
Temporal Ctx 


4.5 


AD 2 Hippo 


12.5 


Control (Path) 4 ; 
Temporal Ctx 


22.7 


AD 3 Hippo 


2.4 


AD 1 Occipital Ctx 


11.3 


AD 4 Hippo 


7.7 


AD 2 Occipital Ctx 
(Missing) 


42.6 


AD 5 hippo 


72.7 


AD 3 Occipital Ctx! 


3.0 


AD 6 Hippo 


47.3 


AD 4 Occipital Ctx 


14.2 


Control 2 Hippo 


12.9 


AD 5 Occipital Ctx 


52.1 


Control 4 Hippo 


3.6 


AD 6 Occipital Ctx 


48.0 


Control (Path) 3 
Hippo 


3.8 


Control 1 Occipital 
Ctx 


1.6 


AD 1 Temporal Ctx ; 


13.6 


Control 2 Occipital 
Ctx 


32.1 


AD 2 Temporal Ctx 


29.9 


Control 3 Occipital 
Ctx 


6.7 


AD 3 Temporal Ctx 


4.0 


Control 4 Occipital 
Ctx 


0.0 


AD 4 Temporal Ctx 


26.8 


Control (Path) 1 
Occipital Ctx 


73.7 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


12.6 


AD 5 SupTemporal 
Ctx 


54.7 


Control (Path) 3 
Occipital Ctx 


1.7 


AD 6 Inf Temporal 
Ctx 


48.6 


Control (Path) 4 
Occipital Ctx 


23.3 


AD 6 Sup Temporal 
Ctx 


49.0 


Control 1 Parietal 
Ctx 


6.5 


Control 1 Temporal 
Ctx 


1.7 


Control 2 Parietal 
Ctx 


57.8 


Control 2 Temporal 
Ctx 


9.9 


Control 3 Parietal 
Ctx 


7.3 


Control 3 Temporal 
Ctx 


14.4 


Control (Path) 1 
Parietal Ctx 


57.8 


Control 4 Temporal 
Ctx 


2.3 


Control (Path) 2 
Parietal Ctx 


21.2 


Control (Path) 1 
Temporal Ctx 


30.4 


Control (Path) 3 
Parietal Ctx 


3.6 


Control (Path) 2 
Temporal Ctx 


36.9 


Control (Path) 4 
Parietal Ctx 


17.6 
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Table CFC. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3018, 


Tissue Name 


Rel. Exp.(%) Ag3018, 


iuiver adenocarcinoma 




jvianey ^icidi^ 


7 1 


Pancreas 


6.0 


Renal ca. 786-0 


9.9 


Pancreatic ca. CAPAN 

z 


12.9 


Renal ca. A498 


4.7 


Adrenal gland 


1 7 


Jxenai ca. tvvvr 


91 9 
Zl.Z 


Thyroid 


1.3 


Renal ca. ACHN 


9.5 


Salivary gland 


0.0 


Renal ca. UO-3 1 


4.1 


Pituitary gland 


7.9 


Renal ca. TK-10 


12.7 


Brain (fetal) 


7.9 


Liver 


2.6 


Brain (whole) 


44.1 


Liver (fetal) 


2.9 


Brain (amygdala) 


3.2 


Liver ca. 
(hepatoblast) HepG2 


10.2 


Brain (cerebellum) 


65.1 


Lung 


6.7 


Brain (hippocampus) 


9.8 


Lung (fetal) 


14.9 


Brain (substantia nigra) 


4.9 


Lung ca. (small cell) 
LX-1 


26.4 


Brain (thalamus) 


8.8 


Lung ca. (small cell) 
NCI-H69 


4.6 


Cerebral Cortex 


10.7 


Lung ca. (s.cell var.) 
SHP-77 


75.3 


Spinal cord 


7.6 


Lung ca. (large 
cell)NCI-H460 


2.9 


glio/astro U87-MG 


23.2 


Lung ca. (non-sm. 
cell) A549 


55.1 


glio/astro U-l 18-MG 


37.9 


Lung ca. (non-s.cell) 
NCI-H23 


9.6 


astrocytoma SW1783 


35.1 


Lung ca. (non-s.cell) 
HOP-62 


8.6 


sic j. ("11/ x T AO 

neuro*; met SK-N-AS 


2.7 


Lung ca. (non-s.cl) 
NCI-H522 


1 1 A 
1 1 .0 


astrocytoma SF-539 


21.2 


Lung ca. (squam.) 
SW 900 


4.7 


astrocytoma SNB-75 


10.6 


Lung ca. (squam.) 
NCI-H596 


17.6 


glioma SNB-19 


24.8 


Mammary gland 


2.2 


glioma U251 


28.5 


Breast ca.* (pl.ef) 
MCF-7 


17.3 


glioma SF-295 


34.6 


Breast ca.* (pl.ef) 
MDA-MB-231 


17.3 


Heart (fetal) 


6.4 


Breast ca.* (pl.ef) 
T47D 


6.5 


Heart 


16.2 


Breast ca. BT-549 


6.3 
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Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N j 


8.4 


Skeletal muscle 


11.7 


Ovary 


0.0 


Rone marrow 


9.2 


Ovarian ca. OVCAR-; 
3 


6.2 


Thvmus 


27.7 


Ovarian ca. OVCAR- 
4 


9.7 


Spleen 


5.6 


Ovarian ca. OVCAR-j 
J 


25.2 


Lymph node 


5.8 


Ovarian ca. OVCAR- 
8 


20.3 


Colorectal 


15.0 


Ovarian ca. IGROV- 

1 


9.2 


Stomach 


1.3 


Ovarian ca.* (ascites); 
SK-OV-3 


26.6 


Small intestine 


1.7 


Uterus 


1 A 

1 .4 


Colon ca. SW480 


14.7 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


100.0 


Prostate 


4.2 


Colon ca, HT29 


2.1 


Prostate ca.* (bone 
met)PC-3 


15.3 


Colon ca. HCT-116 


12.4 


Testis 


4.5 


Colon ca. CaCo-2 


22.7 


Melanoma 
Hs688(A).T 


3.9 


Colon ca. 
tissue(UU(Ji500) 


7.0 


Melanoma* (met) 

nSOoo^D ). 1 


9.8 


Colon ca. HCC-2998 


35.6 


Melanoma UACC-62 


4.1 


Gastric ca.* (liver met) 
NCI-N87 


19.6 


Melanoma Ml 4 


7.9 


Bladder 


20.3 


Melanoma LOX 
IMVI 


11.0 


Trachea 


0.0 


Melanoma* (met) 
SK-MEL-5 


0.7 


Kidney 


5.7 


Adipose 


0.0 



Table CFD. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3018, Run 
164528110 


Tissue Name 


Rel. Exp.(%) 
Ag3018, Run 
164528110 


Secondary Thl act 


14.9 


HUVEC IL-lbeta 


7.9 


Secondary Th2 act 


22.1 


HUVEC IFN gamma 


4.9 


Secondary Trl act 


28.9 


HUVEC TNF alpha + IFN 
gamma 


4.5 


Secondary Thl rest 


4.1 


HUVEC TNF alpha + IL4 


6.7 


Secondary Th2 rest 


5.9 


HUVEC IL-1 1 


4.7 
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^sernnHarv TY1 rest 


4.4 


Lung Microvascular EC ! 
none 


10.9 


PrimarvThl act 

J. till IU1 J A 1 1 1 Uv L 


19.5 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


10.1 


Primarv Th2 act 

1 1 1111(11 J 1 Hi- UVsl- 


9.4 


Microvascular Dermal EC j 
none 


13.2 


Primarv Trl act 


13.6 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


6.7 


Primarv Thl rpst 


45.7 


Bronchial epithelium 
TNFalpha + IL1 beta 


6.2 


Primary Th2 rest 


18.6 


Small airway epithelium 
none 


2.4 . 


Primary Trl rest 


16.5 


Small airway epithelium 
TNFalpha-*- IL-lbeta 


17.6 


CD45RA CD4 
lymphocyte act 


5.6 


Coronery artery SMC rest 


3.5 


CD45RO CD4 
lymphocyte act 


17.1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


1.0 


CD8 lymphocyte act 


16.5 


Astrocytes rest 


7.3 


Secondary CD8 
lymphocyte rest 


9 6 


Astrocytes TNFalpha + 
TL-lbeta 


3.1 


Secondary CD8 
lymphocyte act 


5.3 


KU-812 (Basophil) rest 


2.0 


CD4 lymphocyte none 


4.6 


KU-812 (Basophil) 
PM A/ionomy c in 


6.7 


2ry Thl/Th2/Trl anti- 
CD95CH11 


12.0 


CCD1 106 (Keratinocytes) 
none 


4.1 


LAK cells rest 


13.5 


LCU1 lUo (Keratinocytes) 
TNFalpha + IL-lbeta 


1.8 




1 8 6 

1 O . VJ 


T ivpr f*irrho*sis 


2.1 


T AK relk TT -?+TI -12 


14.6 


Lunus kidnev 

L j U U LKj J\1VI11V J 


1.7 


LAK cells IL-2+IFN 

J^cii Hi i la. 


16.3 


NCI-H292 none 


13.5 


T relk TT -?+ TT -18 

L^/ V IX. I i^i Z. I ll_v lO 


33.9 


NCI-H292 IL-4 


14.5 


LAK cells 

Plv/T A /innnmvp i n 


4.5 


NCI-H292 IL-9 


23.3 


NK Cells IL-2 rest 


17.3 


NCI-H292 IL-13 


8.0 


Two Way MLR 3 day 


15.4 


NCI-H292 IFN gamma 


6.4 


Two Way MLR 5 day 


6.4 


HPAEC none 


4.6 


Two Way MLR 7 day 


6.3 


HPAEC TNF alpha + IL-1 
beta 


6.4 


PBMC rest 


4.2 


Lung fibroblast none 


5.6 


PBMC PWM 


51.1 


Lung fibroblast TNF alpha 
+ IL-1 beta 


1.5 


PBMC PHA-L 


14.7 


Lung fibroblast IL-4 


13.6 



i nn7r«ii i p ... n i "k t nv* 
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Ramos (B cell) none 


24.3 


Lung fibroblast IL-9 


7.2 ! 


Ramos (B cell) 
ionomycin 


100.0 


Lung fibroblast IL- 13 


13.1 


R lvmnhnrvte<? PWM 


34.4 


Lung fibroblast IFN 
gamma 


6.8 


B lymphocytes CD40L 
and IL-4 


33.0 


Dermal fibroblast 
CCD 1070 rest 


15.4 


EOL-1 dbcAMP 


13.3 


Dermal fibroblast 
CCD1U/U lJNr alpna 


44.8 


EOL-1 dbcAMP 

i lVLr\/ lOHUIIiyt/lIl 


7.4 


Dermal fibroblast 
CCD 1070 IL-1 beta 


5.9 


Dendritic cells none 


3.0 


Dermal fibroblast IFT\I 

gamma 


3.6 


Dendritic cells LPS 


8.8 


Dermal fibroblast IL-4 


9.7 


Dendritic cells anti- 
CD40 


6.0 


IBD Colitis 2 


0.9 


Monocytes rest 


5.2 


IBD Crohn's 


0.0 


'Monocytes LPS 


2.6 


Colon 


6.3 


Macrophages rest 


4.2 


Lung 


3.7 


Macrophages LPS 


1.3 


Thymus 


9.9 


HUVEC none 


5.3 


Kidney 


27.2 


HUVEC starved 


12.8 







CNS_neurodegeneration_vl.O Summary: Ag3018 This panel does not show 
differential expression of the NOV91 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag3018 The NOV91 gene is expressed in the brain at low 
levels. D-dopachrome tautomerase has been implicated in the production of neuromelanin 
from the toxic quinone products of dopamine, and this pathway has been implicated in the 
death of dopaminergic neurons in Parkinson's disease. Thus, this gene may represent an 
1 0 excellent small molecule target for the treatment or prevention of Parkinson's disease. 

In addition, significant expression is seen in a cluster of lung, brain, and colon cancer 
cell lines. Thus, expression of this gene could be used to differentiate between thes sample and 
other samples on this panel and as a marker to detect the presence of these cancers. 
Furthermore, therapeutic modulation of the expression or function of this gene may be 
1 5 effective in the treatment of lung, brain, and colon cancer. 

References: 
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Matsunaga J, Sinha D, Solano F, Santis C, Wistow G, Hearing V. Macrophage 
migration inhibitory factor (MIF)--its role in catecholamine metabolism. Cell Mol Biol 
(Noisy-le-grand) 1999 Nov;45(7): 1035-40 

Macrophage migration inhibitory factor (MIF) was originally identified several 
5 decades ago as a lymphokine-derived protein that inhibited monocyte migration. Recently, it 
has been reported that MIF has D-dopachrome tautomerase, phenyl pyruvate tautomerase and 
thiol protein oxidoreductase activities, although the physiological significance of those 
activities is not yet clear. Here we show that MIF is able to catalyze the conversion of 
dopaminechrome and norepinephrinechrome, toxic quinone products of the neurotransmitters 

10 dopamine and norepinephrine, respectively, to indole derivatives that may serve as precursors 
to neuromelanin. Since MIF is highly expressed in human brain, these observations raise the 
possibility that MIF participates in a detoxification pathway for catecholamine products and 
could therefore have an important role for neural tissues. The potential role of MIF in the 
formation of neuromelanin from catecholamines is also an extremely interesting possibility. 

15 Drukarch B, van Muiswinkel FL. Neuroprotection for Parkinson's disease: a new 

approach for a new millennium. Expert Opin Investig Drugs 2001 Oct; 10(10): 1855-68 

Parkinson's disease (PD) is the only neurodegenerative disorder in which 
pharmacological intervention has resulted in a marked decrease in morbidity and a significant 
delay in mortality. However, the medium to long-term efficacy of this pharmacotherapy, 

20 mainly consisting of dopaminomimetics like L -dopa and dopamine receptor agonists, suffers 
greatly from the unrelenting progression of the disease process underlying PD, i.e., the 
degeneration of neuromelanin-containing, dopaminergic neurones in the substantia nigra. 
Efforts concentrated on understanding the mechanisms of dopaminergic cell death in 
Parkinson's disease have led to identification of a large variety of pathogenetic factors, 

25 including excessive release of oxygen free radicals during enzymatic dopamine breakdown, 
impairment of mitochondrial function, production of inflammatory mediators, loss of trophic 
support, and apoptosis. Therapeutic approaches aimed at correcting these abnormalities are 
currently being evaluated on their efficacy as neuroprotectants for PD. Here, we focus on the 
process of dopamine auto-oxidation, the chain of reactions leading to the formation of 

30 neuromelanin, as an often overlooked, yet obvious pathogenetic factor. In particular, we 

discuss the option of drug-mediated stimulation of endogenous mechanisms responsible for 
the detoxification of dopamine auto-oxidation products as a novel means of neuroprotection in 
Parkinson's disease. 
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Panel 4D Summary: Ag3018 The NOV91 gene, a D-dopachrome tautomerase 
homolog, is widely expressed in this panel, with highest expression in Ramos (B cells) 
activated by treatment with ionomycin (CT=31.28). Therefore, small molecule antagonists that 
block the function of the NOV91 gene product may be useful in several autoimmune and 
inflammatory diseases in which activated B cells can play major roles as sources of 
autoantibody-producing cells and also as powerful antigen-presenting cells, including, but not 
limited to, Crohn's disease, ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary 
disease, asthma, emphysema, rheumatoid arthritis, lupus erythematosus, or psoriasis. 

Panel 5 Islet Summary: Ag301 8 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 

NOV92 

Expression of gene NOV92 was assessed using the primer-probe set Ag3048, 
described in Table CGA. Results of the RTQ-PCR runs are shown in Tables CGB, CGC, CGD 
and CGE. 



Table CGA. Probe Name Ag3048 



Primers 


Sequences 


Length ; 


Start 
Position 


SEQ ID NO: 


Forward 


5 1 -cacaaccagctgacagacagt-3 ' 


21 


490 


1357 


Probe 


TET-5* -ccaccaccttcagcaagctgcatag-3 ' - 
TAMRA 


25 


521 


1358 


Reverse 


5 ' -gggagagatccaggtattcaag-3 1 


22 


547 


1359 



Table CGB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3048,| 
Run 168017062 


Tissue Name 


Rel. Exp.(%) Ag3048, 
Run 168017062 


Liver adenocarcinoma 


2.0 


Kidney (fetal) 


0.6 


Pancreas 


0.0 


Renal ca. 786-0 


3.8 


Pancreatic ca. CAPAN \ 

2 


0.3 


Renal ca. A498 


17.7 


Adrenal gland 


0.0 


Renal ca. RXF 393 


12.5 


Thyroid 


0.5 


Renal ca. ACHN 


0.1 


Salivary gland 


0.6 i 


Renal ca. UO-31 


1.1 


Pituitary gland 


0.5 


Renal ca. TK-10 


0.5 


Brain (fetal) 


0.9 


Liver 


0.0 


Brain (whole) 


2.4 


Liver (fetal) 


6.0 


Brain (amygdala) 


2.9 


Liver ca. 
(hepatoblast) HepG2 


0.2 


Brain (cerebellum) 


7.0 


Lung 


1.4 
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Brain (hippocampus) 2.1 


Lung (fetal) 


7.1 


Brain (substantia nigra) 


4.5 


Lung ca. (small cell) 
LX-1 


0.2 


Brain (thalamus) 


4.6 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


1.0 


Lung ca. (s.cell var.) 
SHP-77 


0.9 


Spinal cord 


7.6 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


2.8 


Lung ca. (non-sm. 
cell) A549 


0.0 


glio/astro U-l 18-MG 


5.9 


Lung ca. (non-s.cell) 
NCI-H23 


0.4 


astrocytoma SW 1783 


2.7 


Lung ca. (non-s.cell) 
HOP-62 


41.2 


neuro*; met SK-N-AS 


1.1 


Lung ca. (non-s.cl) 
NCI-H522 


1.6 


astrocytoma SF-539 


1 f\C\ f\ 

100.0 


Lung ca. (squam.) 
SW 900 


0.0 


asirocy lorna jind / j 


41. 2 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.4 


Mammary gland 


1 5.y 


crlinma TT7S1 

til i\Ji iia ^ i 


6.7 


Breast ca. (pl.et) 
MCF-7 


3.5 


glioma SF-295 


7.3 


Breast ca. (pl.et) 
MDA-MB-231 


2.0 


Heart TfetaH 


5.1 


Breast ca.* (pl.ef) 
T47D 


0.3 


Heart 


1.3 


Breast ca. t> l -j4V 


u.u 


Skeletal muscle (fetal) 


4.7 


Breast ca. MDA-N 


0.3 


Skeletal muscle 


2.6 


Ovary 


3.8 


Bone marrow 


0.9 


Ovarian ca. OVCAR- 
3 


0.2 


Thymus 


A Q 


Ovarian ca. OVCAR- 
4 


0.2 


Spleen 


0.3 


Ovarian ca. OVCAR- 

•j 


4.3 ' 


Lymph node 


0.0 


Ovarian ca. OVCAR- 
8 


0.8 


Colorectal 


2.9 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


j 0.8 


Ovarian ca.* (ascites) 
SK-OV-3 


1.5 


Small intestine 


j 1.0 


Uterus 


0.6 


Colon ca. SW480 


| 0.8 


Placenta 


0.9 



1417 



Colon ca.* 
SW620(SW480 met) 


1.9 


Prostate 


3.0 


Colon ca. HT29 


0.3 


Prostate ca.* (bone | 
met)PC-3 


1.7 


Colon ca.HCT-1 16 


0.0 


Testis 


3.9 


Colon ca. CaCo-2 


0.2 


Melanoma 
Hs688(A).T 


32.8 


Colon ca. 
tissue((JD(J3 ooo) 


13.2 


Melanoma* (met) 


31.4 


Colon ca. HCC-2998 1 


4.1 


Melanoma UACC-62 


1.8 


Gastric ca.* (liver met) 
NCI-N87 


1.5 


Melanoma M14 


0.0 


Bladder 


3.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


3.7 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.0 


Adipose 


4.3 



Table CGC. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag3048, Run 
170858352 


Tissue Name 


Rel. Exp.(%) 
Ag3048, Run 
170858352 


Normal Colon 


4.7 


Kidney Margin 
8120608 


0.5 


CC Well to Mod Diff 
(OD03866) 


12.0 


Kidney Cancer 
8120613 


0.1 


CC Margin (OD03 866) 


1.5 


Kidney Margin 
8120614 


0.4 


CC Gr.2 rectosigmoid 
(OD03868) 


10.1 


Kidney Cancer 
9010320 


13.1 


CC Margin (OD03868) 


0.8 


Kidney Margin 
9010321 


0.9 


CC Mod Diff (ODO3920) 


8.4 


Normal Uterus 


0.5 


CC Margin (ODO3920) ^ 


2.6 


Uterus Cancer 06401 1 


1.0 


CC Gr.2 ascend colon 
(OD03921) 


13.6 


Normal Thyroid 


3.3 


CC Margin (OD03921) 


2.1 


Thyroid Cancer 
064010 


0.7 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


3.1 


Thyroid Cancer 
A302152 


4.5 


Liver Margin (ODO4309) 


0.1 


Thyroid Margin 
A302153 


0.8 


Colon mets to lung 
(OD04451-01) 


3.3 


Normal Breast 


20.0 
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# 



Lung Margin (OD04451- 
02) 


1.1 


Breast Cancer 
(OD04566) 


6.5 


Normal Prostate 6546-1 


4.1 


Breast Cancer 
(OD04590-01) 


10.4 


Prostate Cancer 
(OD04410) 


19.8 


Breast Cancer Mets 
(OD04590-03) 


11.4 


Prostate Margin 


4.5 


Breast Cancer 

Metastasis 
(OD04655-05) 


1.0 


Prostate Cancer 
(OD04720-01) 


9.7 


Breast Cancer 064006 


23.2 


Prostate Margin 
(OD04720-02) 


7 S 


Breast Cancer 1 024 


25.7 


Normal Lung 061010 


3.3 


Breast Cancer 
9100266 


16.4 


Lung Met to Muscle 


24.5 


Breast Margin 
Q1 0096S 


24.0 


Muscle Margin 
(OD04286) 


1.7 


A209073 


32.3 


Lung Malignant Cancer 
(OD03126) 


11.6 


A2090734 


33.2 


Lung Margin \y>\J\)5 \ zo ) \ 


J. 1 


r\ VYY\ Q 1 T 1Vf»r 


0.0 


Lung Cancer (OD04404) 


22.2 


Liver Cancer 064003 


0.0 


Lung Margin (OD04404) 


7.2 


Liver Cancer 1025 


U.U 


Lung Cancer (OD04565) 


36. 1 


Liver Cancer l Uzo 




Lung Margin (OD04565) 


2.0 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237- 
01) 


12.6 


Liver Tissue 6004-N 


2.9 


Lung Margin (OD04237- 
02) 


6.7 


Liver Cancer 6005-T 


2.2 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Margin (OD043 1 0) 


0.4 


Normal Bladder 


23.8 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


11.8 


Lung Margin (OD04321) 


1.7 


Bladder Cancer 
A302173 


7.2 


Normal Kidney 


0.8 


Bladder Cancer 
(OD047 18-01) 


12.7 


Kidney Ca, Nuclear grade 
2 (OD04338) 


0.3 


Bladder Normal 
Adjacent (OD04718- 
03) 


4.9 


Kidney Margin 
(OD04338) 


1.5 


Normal Ovary 


3.5 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


0.2 


Ovarian Cancer 
064008 


100.0 
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.1 riflTpO t in» s ,, Ol HI Hr? 



Kidney Margin 
(OD04339) 


j Ovarian Cancer 
09 (OD04768-07) 


2.5 


Kidney Ca ? Clear cell 
type (OD04340) 


i Ovary Margin 
0A | (OD04768-08) 


8.1 


Kidney Margin 
(OD04340) 


1 n 


M 1 \ i*m '. 1 1 ^it/Mil Mi""M 


1.5 


Kidney Ca, Nuclear grade: 
3 (OD04348) 


10.3 


Gastric Cancer 
9060358 


0.5 


Kidney Margin 
(OD04348) 


0.4 


Stomach Margin 
9060359 


0.8 


Kidney Cancer 
(OD04622-01) 


34.4 


Gastric Cancer 
9060395 


7.5 


Kidney Margin 
(OD04622-03) 


0.2 


Stomach Margin 
9060394 


1.9 


Kidney Cancer 
(OD04450-01) 


0.0 


Gastric Cancer 
9060397 


27.0 


Kidney Margin 
(OD04450-03) 


0.2 


Stomach Margin 
9060396 


1.5 


Kidney Cancer 8120607 


9.0 


Gastric Cancer 
064005 


3.3 



Table CGD. Panel 3D 



Tissue Name 


ReL Exp.(%) 
Ag3048, Run 
172133336 


Tissue Name 


Rel. Exp.(%) 
Ag3048, Run 
172133336 


Daoy- Medulloblastoma 


6.5 


Ca Ski- Cervical epidermoid ; 
carcinoma (metastasis) 


0.0 


TE671- Medulloblastoma 


0.0 


ES-2- Ovarian clear cell 
carcinoma 


0.0 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


18.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498- CNS 


1.4 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


2.6 


SNB-78- Glioma 


100.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


0.0 


Daudi- Burkitt's lymphoma 


0.0 


T98G- Glioblastoma 


24.1 


U266- B-cell plasmacytoma 


0.0 


SK-N-SH- 
Neuroblastoma 
(metastasis) 


32.3 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


2.0 


RL- non-Hodgkin f s B-cell 
lymphoma 


0.0 


Cerebellum 


3.0 


JM1- pre-B-cell lymphoma 


0.0 
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Cerebellum 


4.2 


Jurkat- T cell leukemia ! 


0.0 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


38.4 


TF-1- Erythroleukemia 


2.3 


DMS-114- Small cell 
lung cancer 


1.7 


HUT 78- T-cell lymphoma 


0.0 


DMS-79- Small cell lung 
cancer 




119^7- Histiocytic Ivmnhoma 


0.7 


NCI-H 146- Small cell 
lung cancer 


3.3 


KU-8 1 2- Myelogenous 
leukemia 


0.0 


NCI-H526- Small cell ; 
lung cancer 


0.0 


769-P- Clear cell renal 
carcinoma 


0.0 


NCI-N417- Small cell 
lung cancer 


0.0 


Caki-2- Clear cell renal 
carcinoma 


2.5 


NCI-H82- Small cell 
lung cancer 


0.0 


SW 839- dear cell renal 
carcinoma 


u.u 


NCI-H 157- Squamous 
cell lung cancer 
(metastasis) 


1.4 


G401- Wilms' tumor 


2.3 


NCI-H 1 155- Large cell I 
lung cancer 


1.8 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


15 


NCI-H 1299- Large cell | 
lung cancer 


2.8 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


0.0 


NCI-H727- Lung 
carcinoid 


0 0 

\J.\J 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


0.8 


NCI-UMC-ll-Lung 
carcinoid 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


0.7 


LX-1- Small cell lung 
cancer 


0.0 


HP AC- Pancreatic 
adenocarcinoma 


0.0 




7.3 


MIA PaCa-2- Pancreatic 
carcinoma 


0.7 


kT \A 19- f^nlnn ranrer 


1.6 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


1.7 


KM90T 2- Colon cancer 


0.0 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


6.7 


NCI-H716- Colon cancer 


0.0 


T24- Bladder carcinma 
(transitional cell) 


6.1 


SW-48- Colon 
adenocarcinoma 


0.6 


5637- Bladder carcinoma 


2.1 


SW1116- Colon 
adenocarcinoma 


0.8 


HT-1 197- Bladder carcinoma 


1.7 


LS 174T- Colon 
adenocarcinoma 


2.5 


UM-UC-3- Bladder carcinma 
(transitional cell) 


0.7 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


5.4 
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SW-480- Colon 
adenocarcinoma 


2.7 


HT-1080- Fibrosarcoma j 


11.7 


NCI-SNU-5- Gastric 
carcinoma 


1.6 


MG-63- Osteosarcoma j 


0.0 


KATO III- Gastric 
carcinoma 


1.8 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


7.5 


NCI-SNU-16- Gastric 
carcinoma 


1.4 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


0.0 


NCI-SNU-1- Gastric j 
carcinoma 


0.0 


A431- Epidermoid carcinoma 


5.1 


RF-1- Gastric 
adenocarcinoma 


0.0 


WM266-4- Melanoma 


16.6 


RF-48- Gastric 
adenocarcinoma 


0.9 


DU 145- Prostate carcinoma 
(brain metastasis) 


0.0 


MKN-45- Gastric 
carcinoma 


5.6 


MDA-MB-468- Breast 
adenocarcinoma 


2.5 


NCI-N87- Gastric 
carcinoma 


2.3 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


4.8 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 


SCC-15- Squamous cell 
carcinoma of tongue 


0.6 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous cell 
carcinoma of tongue 


0.5 



Table CGE. Panel 4D 



Tissue Name 


ReL Exp.(%) 
Ag3048, Run 
164315038 


Tissue Name 


ReL Exp.(%) 
Ag3048, Run 
164315038 


Secondary Thl act 


0.4 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


1.9 


Secondary Trl act 


0.6 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


1.2 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


^ 0.3 


Secondary Trl rest 


. 0.0 


Lung Microvascular EC 
none 


3.9 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


1.2 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.6 


Primary Trl act 


0.0 


Micros vasular Dermal EC 
TNFalpha + IL-lbeta 


1.4 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


28.7 
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Primary Th2 rest 


0.9 


Small airway epithelium 
none 


39.2 


Primary Trl rest 


0.6 


Small airway epithelium 
TNFalpha + IL-lbeta 


84.7 


CD45RA CD4 
lymphocyte act 


8.6 


Coronery artery SMC rest 


17.9 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


5.5 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


6.1 


Secondary CD8 
lymphocyte rest 


0.6 


Astrocytes TNFalpha + 
IL-lbeta 


5.5 


Secondary CD8 
lymphocyte act 


0.6 


KU-8 12 (Basophil) rest 


0.7 


CD4 lymphocyte none 


0.5 


KU-812 (Basophil) 
PMA/ionomycin 


3.8 


2ry Thl/Th2/Trl anti- 
CD95 CHll 


1.3 


CCD1 lOo (Iveratinocytes) 
none 


79.0 


LAK cells rest 


0.0 


CCD! 1 06 (Keratinocytes) 
TNFalpha + IL- 1 beta 


97.3 


L/\rv CeilS 1L»-Z 


0 0 


Liver cirrhosis 


4.3 


Lj/\jv cens l L(-z, ' i L/- 1 z. 


0 0 


Lupus kidney 


3.4 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


87.1 


i A Lf ^11c TT 9-1- IT -1 8 


0 0 


NCI-H292 IL-4 


77.4 


LAK cells 

r IVl^-V/ 1UI 1UI Iiy C 111 


0.0 


NCI-H292 IL-9 


82.9 


XTT<r fVllQ TT -9 rp«;t 

IN IS. \_.C115> lJL/~Zr 1 CM 


0.0 


NCI-H292 IL-13 


54.0 


T wn WAw TV4T R ^ Hav 

1 WO W cLy IV l LjIv Ud.^ 


0.0 


NCI-H292 I FN gamma 


49.3 


Two Way MLR 5 day 


0.0 


HP A EC none 


0.7 


Two Way MLR 7 day 


0.8 


ri 1 /\JLa^ 1 in r aipna » 11^- 1 
beta 


0.0 


PBMC rest 




T unrr "fi Tm-/"^V"\ 1 act tl r\V\ 
LUng llDIUDldol IHJUC 


35.1 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha 

-4- TT 1 Kf^ta 


7.0 


PBMC PHA-L 


2.2 


Lung fibroblast IL-4 


46.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


38.7 


Ramos (B cell) 
ionomycin 


0.8 


Lung fibroblast IL-13 


26.8 


B lymphocytes PWM 


1.0 


Lung fibroblast IFN 
gamma 


68.8 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


100.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


59.9 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 


39.8 
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PMA/ionomycin 




CCD 1 070 IL-1 beta 




Dendritic cells none 


1.8 


Dermal fibroblast IFN 
gamma 


17.0 


Dendritic cells LPS 


1.7 


Dermal fibroblast IL-4 


30.6 


Dendritic cells anti- 
CD40 


6.8 


IBD Colitis 2 


0.0 


Monocytes rest 


1.2 


IBD Crohn's 


0.7 


Monocytes LPS 


0.0 


Colon 


2.0 


Macrophages rest 


L9 


Lung 


36.1 


Macrophages LPS 


1.3 


Thymus 


0.6 


HUVEC none 


0.0 


Kidney 


1.7 


HUVEC starved 


2.9 







Panel 1.3D Summary: Ag3048 The expression of the NOV92 gene appears to be 
highest in a sample derived from a brain cancer cell line (SF-539) (CT=29.4). In addition, 
there is substantial expression associated with samples derived from another brain cancer cell 
5 line, two melanoma cell lines and a lung cancer cell line. Thus, the expression of this gene 
could be used to distinguish SF-539 cells from other samples in the panel. Moreover, 
therapeutic modulation of this gene, through the use of small molecule drugs, protein 
therapeutics or antibodies might be of benefit in treatment of brain or lung cancer or 
melanoma. 

10 This gene, a leucine-rich repeat homolog, is expressed at low levels in the CNS. The 

leucine-rich repeat region proteins have been implicated in axonal guidance. Therefore, this 
gene may be of therapeutic utility in the treatment of any CNS disorder involving neuronal 
loss, specifically by guiding/enhancing compensatory synaptogenesis and fiber outgrowth, 
including such clinical conditions as Alzheimer's, Parkinson's, or Huntington's diseases, 

1 5 stroke, head and spinal cord trauma, vascular dementia or spinocerebellar ataxia. 

Panel 2D Summary: Ag3048 The expression of the NOV92 gene appears to be 
highest in a sample derived from an ovarian cancer (CT-29). In addition, there appears to be 
substantial expression associated with lung cancer, prostate cancer and colon cancer samples. 
Of note is the differential expression in the lung, colon and prostate cancer samples compared 

20 to their respective normal adjacent tissue. Thus, the expression of this gene could be used to 
distinguish this ovarian cancer sample from other samples in the panel. In addition, the 
expression of this gene could be used to distinguish colon, prostate or lung cancer samples 
from their normal adjacent tissue. Moreover, therapeutic modulation of this gene, through the 
use of small molecule drugs, protein therapeutics or antibodies might be beneficial in the 

25 treatment of ovarian, lung, prostate or colon cancer. 
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Panel 3D Summary: Ag3048 The expression of the NOV92 gene appears to be 
highest in a sample derived from a brain cancer cell line (SNB-78) (CT=30.2). In addition, 
there appears to be substantial expression associated with other brain cancer cell line samples 
and a lung cancer cell line sample. Thus, the expression of this gene could be used to 

5 distinguish SNB-78 cells from other samples in the panel. Moreover, therapeutic modulation 
of this gene, through the use of small molecule drugs, protein therapeutics or antibodies might 
be beneficial in the treatment of brain or lung cancer. 

Panel 4D Summary: Ag3048 The NOV92 gene, a secreted leucine-rich repeat (LRR) 
protein, is expressed selectively at moderate levels (CT range 29-31) in several resting and 

10 cytokine-activated epithelial and connective tissue cells of lung and skin. Therefore, the 

NOV92 gene product may be useful as a therapeutic protein as well as a target for therapeutic 
antibodies and small molecules, each of which may prove beneficial in the reduction or 
elimination of the symptoms in patients with chronic obstructive pulmonary disease, asthma, 
emphysema, or psoriasis. 

1 5 NOV93: IMP DEHYDROGENASE 1 

Expression of gene NOV93 was assessed using the primer-probe sets Ag4520 and 
Ag2904, described in Tables CHA and CHB. Results of the RTQ-PCR runs are shown in 
Tables CHC, CHD, CHE, CHF, CHG, CHH and CHI. 

Table CHA. Probe Name Ag4520 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ■ -ggtacccatgaggatgacaaat-3 ' 


22 


758 


1360 


Probe 


TET-5 1 -acctggacctgctcacccaggtag-3 ' - 
TAMRA 


24 


783 


1361 


Reverse 


5 ' -cgagtccaagcctatgacatt-3 ' 


21 


812 


1362 



20 



Table CHB. Probe Name Ag2904 



Primers! 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward! 


5 ' -atacttcaacgacggggataag-3 ' 


22 


1300 


1363 


Probe 


TET-5 ' -ctccatccaggacaaagggtccatt-3 ■ - 
TAMRA 


25 


1348 


1364 


Reverse 


5 ' -aggtagggcacgaacttctg-3 ' 


20 


1373 


1365 



Table CHC. AI_comprehensive panel_vl.O 



Tissue Name 


ReL Exp.(%) 
Ag4520, Run 


Tissue Name 


ReL Exp.(%) 
Ag4520, Run 
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110967COPD-F 


11.8 


112427 Match Control 


63.7 


110980COPD-F 


12.1 


1 12418 Psoriasis-M ; 


12.5 


110968 COPD-M 


11.0 


1 1 2723 Match Control 
Psoriasis-M 


0.3 


110977 COPD-M 


46.7 


112419 Psonasis-M 


10.4 


1 10989 Emphysema- 
F 


34.9 


112424 Match Control j 
Psoriasis-M 


11.6 


110992 Emphysema- 
F 


22.7 


112420 Psoriasis-M 


52.1 


110993 Emphysema- 
F 


21.9 


112425 Match Control 
Psoriasis-M 


50.0 


110994 Emphysema- 
F 


13.3 


104689 (MF) OA 
Bone-Backus 


30.8 


110995 Emphysema- 
F 


33.2 


104690 (MF) Adj 
"Normal" Bone-Backus 


23.0 


110996 Emphysema- 
F 


8.5 


104691 (MF) OA 
Synovium-Backus 


22.4 


1 1 0997 Asthma-M 


6.3 ! 


104692 (BA) OA 
Cartilage-Backus 


3.4 


1 1 1001 Asthma-F 


32.5 


104694 (BA) OA 
Bone-Backus 


17.6 


111002 Asthma-F 


22.2 


104695 (BA) Adj 
"Normal" Bone-Backus 


24.1 


111003 Atopic 
Asthma-F 


18.9 


104696 (BA) OA 
Synovium-Backus 


13.9 


111004 Atopic 
Asthma-F 


20.9 


104700 (SS) OA Bone- 
Backus 


21.3 


1 1 1 005 Atopic 
Asthma-F 


10.7 


104701 (SS) Adj 
"Normal" Bone-Backus 


21.2 


111006 Atopic 
Asthma-F 


3.6 


104702 (SS) OA 
Synovium-Backus 


42.3 


111417 Allergy-M 


13.1 


1 17093 OA Cartilage 

ivcp / 


27.9 


112347 Allergy-M 


2.6 


112672 OABone5 


39.0 


1 1 2349 Normal Lung- 
F 


2.0 


112673 OA Synoviums 


14.6 


112357 Normal Lung- 
F 


22.4 


112674 OA Synovial 
Fluid cells5 


16.6 


112354 Normal Lung- 
M 


10.4 


117100 OA Cartilage 
Rep 14 


5.1 


112374 Crohns-F 


11.3 


112756 OABone9 


| 6.8 


112389 Match 
Control Crohns-F 


18.6 


112757 OA Synovium9 


| 14.7 
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112375 Crohns-F 


8.5 


1 12758 OA Synovial 
Fluid Cells9 


12.8 


112732 Match 
Control Crohns-F 


41.8 


1 17125 RA Cartilage ; 

rvCpZ 


10.1 


112725 Crohns-M 


3.2 


113492Bone2RA ! 


28.7 


112387 Match 
Control Crohns-M 


13.5 


1 13493 Synovium2 RA; 


13.1 


112378 Crohns-M 1 


1.6 


1 13494 Syn Fluid Cells 
RA 


26.4 


112390 Match 
Control Crohns-M 


49.7 


113499 Cartilage4RA ; 


29.3 


112726 Crohns-M 


12.8 


1 13500 Bpne4RA j 


41.2 


112731 Match 
Control Crohns-M 


16.7 


113501 Synovium4RAi 


30.6 


112380 Ulcer Col-F 


25.5 


113502 Syn Fluid 
Cells4 RA 


15.9 


112734 Match 
Control Ulcer Col-F 


100.0 


113495 Cartilage3 RA 


24.1 


112384 Ulcer Col-F 


48.6 


113496 Bone3 RA 


31.9 


112737 Match 
Control Ulcer Col-F 


7.4 


113497 Synovium3 RA 


13.7 


112386 Ulcer Col-F 


4.4 


113498 Syn Fluid 
Cells3 RA 


26.4 


112738 Match 
Control Ulcer Col-r 


12.9 


117106 Normal 
i^artiiage Kepzu 


6.9 


112381 Ulcer Col-M 


2.0 


113663 Bone3 Normal 


2.1 


1 12735 Match 
Control Ulcer Col-M 


12.0 


1 13664 Synovium3 
Normal 


1.2 


112382 Ulcer Col-M 


12.1 


1 13665 Syn Fluid 
Cells3 Normal 


0.8 


112394 Match 
Control Ulcer Col-M 


4.5 


1 17107 Normal 
Cartilage Rep22 


13.5 


112383 Ulcer Col-M 


26.8 


113667 Bone4 Normal 


18.3 


112736 Match 
Control Ulcer Col-M 


9.5 


1 13668 Synovium4 
Normal 


17.6 


112423 Psoriasis-F 


11.1 


113669 Syn Fluid 
Cells4 Normal 


27.4 



Table CHD. CNS neurodegeneration vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag2904, Run 
206485416 


Rel. Exp.(%) 
Ag4520, Run 
206954220 


Tissue 
Name 


Rel. Exp.(%) 
Ag2904, Run 
206485416 


Rel. Exp.(%) 
Ag4520, Run 
206954220 


AD 1 Hippo 


29.3 


31.9 


Control 
(Path) 3 
Temporal 


25.0 


28.9 
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Ctx 








AD 2 Hippo 


40.6 


34.2 


Control 
(Path) 4 
Temporal 
Ctx 


39.0 


OA o 

30. o 


AD 3 Hippo 


34.2 


26.1 


AD 1 
Occipital 
Ctx 


27.4 


23.7 


AD 4 Hippo 


29.5 


19.1 


AD 2 
Occipital 

Ctx 
(Missing) 


0.0 


0.0 


AD 5 hippo 


60.7 


58.2 


AD 3 
Occipital 
Ctx 


33.0 


24.8 


AD 6 Hippo 


80.7 


92.7 


AD 4 
Occipital 
Ctx 


24.5 


26.2 


Control 2 
Hippo 


25.0 


30.8 


AD 5 
Occipital ! 
Ctx 


41.5 


30.1 


Control 4 
Hippo 


53.2 


40.9 


AD 6 
Occipital 
Ctx 


30.6 


52.1 


Control (Path) 
3 Hippo 


25.0 


22.4 


Control 1 
Occipital 
Ctx 


48.0 


60.3 


AD 1 Temporal 
Ctx 


52.1 


81.2 


Control 2 
Occipital 
Ctx 


33.4 


41.5 


AD 2 Temporal 
Ctx 


24.3 


21.5 


Control 3 
Occipital 
Ctx 


25.3 


23.3 


AD 3 Temporal 
Ctx 


32.3 


26.1 


Control 4 
Occipital 
Ctx 


30.8 


25.3 


AD 4 Temporal 
Ctx 


40.6 


49.7 


Control 
(Path) 1 
Occipital 
Ctx 


49.7 


37.1 


AD 5 Inf 
Temporal Ctx 


52.9 


77.4 


Control 
(Path) 2 
Occipital 
Ctx 


15.5 


12.3 


AD 5 
SupTemporal 
Ctx 


83.5 


100.0 


Control 
(Path) 3 
Occipital 
Ctx 


39.2 


45.1 
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AD 6 Inf 
Temporal Ctx 


77.4 


85.3 


Control 
(Path) 4 i 
Occipital \ 
Ctx 


33.4 


36.1 


AD 6 Sup 
Temporal Ctx 


100.0 


87.1 


Control 1 
Parietal Ctx 1 


37.4 


29.1 


Control 1 
Temporal Ctx 




">Q ^ 

Z7.J 


Control 2 
Parietal Ctx 




JZ.. I 


Control 2 
Temporal Ctx 


20.0 


19.1 


Control 3 
Parietal Ctx 


14.0 


11.0 


Control 3 
Temporal Ctx 


22.4 


14.3 


Control 
(Path) 1 
Parietal Ctx 


43.2 


30.1 


Control 4 
Temporal Ctx 


26.1 


10.4 


Control 
(Path) 2 
Parietal Ctx 


32.8 


33.4 


Control (Path) 
1 Temporal Ctx 


46.3 


25.7 


Control 
(Path) 3 
Parietal Ctx 


34.9 


36.1 


Control (Path) 
2 Temporal Ctx 


34.2 


34.2 


Control 
(Path) 4 
Parietal Ctx 


48.3 


44.4 



Table CHE. Generalscreeningpanel vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag4520, Run j 
219274490 


Rel. Exp.(%) 
Ag4520, Run 
219288511 


Tissue Name 


Rel. Exp.(%) 
Ag4520, Run I 
219274490 


Rel. Exp.(%) 
Ag4520, Run 
219288511 


Adipose 


26.6 


24.5 


Renal ca. TK-10 


47.0 


42.3 


Melanoma* 
Hs688(A).T 


14.1 


12.9 


Bladder 


41.2 


37.6 


Melanoma* 
Hs688(B).T 


13.6 


13.3 


Gastric ca. (liver 
met.) NCI-N87 


100.0 


100.0 


Melanoma* 
M14 


8.5 


11.2 


Gastric ca. 
KATO III 


14.4 


13.5 


Melanoma* 
LOXIMVI 


17.2 


18.6 


Colon ca. SW- 
948 


5.0 


5.3 


Melanoma* 
SK-MEL-5 


10.7 


11.3 


Colon ca. SW480 


18.6 


20.9 


Squamous cell 
carcinoma 
SCC-4 


4.9 


6.6 


Colon ca.* 
(SW480 met) 
SW620 


9.0 


9.0 


Testis Pool 


7.5 


7.5 


Colon ca. HT29 


10.1 


10.0 


Prostate ca.* 
(bone met) 
PC-3 


8.0 


7.2 


Colon ca. HCT- 
116 


16.7 


14.6 
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Prostate Pool 


13.8 


13.3 


Colon ca. CaCo-2 


26.1 


26.4 


Placenta 


18.3 


19.5 


Colon cancer 
tissue 


21.0 


23.8 


Uterus Pool 


9.0 


9.0 


Colon ca. 
SW1116 


1.3 


2.6 


Ovarian ca. 
OVCAR-3 


15.2 


13.5 


Colon ca. Colo- 
205 


3.5 


3.7 


Ovarian ca. 
SK-OV-3 


28.9 


28.3 


Colon ca. SW-48 


4.3 


4.5 


Ovarian ca. 
OVCAR-4 


5.9 


6.2 


Colon Pool 


31.9 


30.1 


Ovarian ca. 
OVCAR-5 


49.0 


49.0 


Small Intestine 
Pool 


29.7 


32.5 


Ovarian ca. 
IGROV-1 


16.3 


15.0 


Stomach Pool 


15.6 


15.2 


Ovarian ca. 
OVCAR-8 


6.9 


6.0 


Bone Marrow 
Pool 


13.4 


13.9 


Ovary 


11.3 


10.0 


Fetal Heart 


20.3 


21.8 


Breast ca. 
MCF-7 


27.7 


27.0 


Heart Pool 


11.3 


15.4 


Breast ca. 
MDA-MB- 

- 231 


37.9 


48.3. 


Lymph Node 
Pool 


26.8 


30.1 


Breast ca. BT 
549 


52.9 


53.2 


Fetal Skeletal 
Muscle 


7.5 


10.7 


Breast ca. 
T47D 


69.7 


74.7 


Skeletal Muscle 
Pool 


20.9 


24.3 


Breast ca. 
MDA-N 


4.7 


6.7 


Spleen Pool 


45.1 


57.8 


Breast Pool 


31.0 


37.1 


Thymus Pool 


31.9 


31.0 


Trachea 


31.0 


28.5 


CNS cancer 
(glio/astro) U87- 
MG 


26.1 


25.9 


Lung 


4.5 


3.7 


CNS cancer 
(glio/astro) U- 
118-MG 


21.3 


24.0 


Fetal Lung 


68.3 


100.0 


CNS cancer 
(neuro;met) SK- 
N-AS 


18.9 


21.0 


Lung ca. NCI- 
N417 


0.5 


0.6 


CNS cancer 
(astro) SF-539 


10.6 


6.9 


Lung ca. LX- 
1 


13.2 


11.9 


CNS cancer 
(astro) SNB-75 


48.0 


47.0 


Lung ca. NCI- 
H146 


4.1 


3.7 


CNS cancer 
(glio) SNB-19 


14.7 


13.4 


Lung ca. 


6.3 


6.3 


CNS cancer 


18.8 


20.4 
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SHP-77 


" 




(glio) SF-295 






Lung ca. 
A549 


21.6 


23.5 


Brain 
(Amygdala) Pool 


2.4 


3.3 


Lung ca. NCI- 
H526 


1.5 


1.7 


Brain 
(cerebellum) 


9.2 


10.1 


Lung ca. NCI- 
H23 


29.7 


29.5 


Brain (fetal) 


7.7 


8.2 


Lung ca. NCI- 
H460 


r 

14.2 


15.1 


Brain 
(Hippocampus) 
Pool 


5.5 


.4.8 


Lung ca. 
HUr-o2 


17.4 


3.6 


Cerebral Cortex 
Pool 


5.4 


3.7 


Lung ca. NCI- 
njzz 


20.3 


25.5 


Brain (Substantia 
nigra) Pool 


3.8 


3.2 


Liver 


2.0 


2.0 


Brain (Thalamus) 
rool 


4.4 


4.7 


Fetal Liver 


14.6 


15.9 


Brain (whole) 


4.0 


4.8 


Liver ca. 
HepG2 


16.2 


18.8 


Spinal Cord Pool 


6.5 


7.0 


Kidney Pool 


36.3 


44.4 


Adrenal Gland 


28.9 


28.3 


Fetal Kidney 


31.9 


29.1 


Pituitary gland 
Pool 


3.3 


2.5 


Renal ca. 786- 
0 


50.7 


54.3 


Salivary Gland 


8.0 


6.0 


Renal ca. 
A498 


28.9 


27.4 


Thyroid (female) 


9.5 


7.6 


Renal ca. 
ACHN 


45.4 


41.2 


Pancreatic ca. 
CAPAN2 


51.4 


48.3 


Renal ca. UO- 
31 


31.9 


31.6 


Pancreas Pool 


35.4 


39.0 



Table CHF. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag2904, 
Run 162556421 


Tissue Name 


Rel. Exp.(%) Ag2904, 
Run 162556421 


Liver adenocarcinoma 


16.0 


Kidney (fetal) 


40.3 


Pancreas 


6.9 


Renal ca. 786-0 


13.3 


Pancreatic ca. CAPAN 
2 


6.4 


Renal ca. A498 


34.2 


Adrenal gland 


9.7 


Renal ca. RXF 393 


16.3 


Thyroid 


8.1 


Renal ca. ACHN 


21.9 


Salivary gland 


7.4 


Renal ca. UO-31 


6.2 


Pituitary gland 


2.1 


Renal ca. TK-10 


9.5 


Brain (fetal) 


1.6 


Liver 


4.2 


Brain (whole) 


1.5 


Liver (fetal) 


11.3 
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Brain (amygdala) 


6.6 


Liver ca. 


11.3 


Brain (cerebellum) 


6.7 


Lung 


70.7 


Brain (hippocampus) 


7.6 


Lung (fetal) 


4z.3 


Brain (substantia nigra) 


2.6 


Lung ca. (small cell) 
LX-1 


4.7 


Brain (thalamus) 


5.0 


Lung ca. (small cell) 
NCI-H69 


1.6 


Cerebral Cortex 


10.3 


Lung ca. (s.cell var.) 
SHP-77 


2.3 


Spinal cord 


15.3 


Lung ca. (large 
cell)NCI-H460 


6.9 


elio/astro U87-MG 


13.3 


Lung ca. (non-sm. 
cell) A549 


3.5 


fflio/astro IJ-l 18-MG 


3.4 


Lung ca. (non-s.cell) 
NCI-H23 


10.6 


astrocytoma SW 1783 


6.1 


Lung ca. (non-s.cell) 
HOP-62 


4.9 


neuro*; met SK-N-AS 


6.4 


Lung ca. (non-s.cl) 
NCI-H522 


6.9 


astrocytoma SF-539 


8.0 


Lung ca. (squam.) 
SW 900 


18.2 


astrocytoma SNB-75 


10.9 


Lung ca. (squam.) 


2.4 


„ 1 ■ _ c\in i r\ 

glioma SNB-19 




ivicuiiiiicu y gidiiu 


9.7 


glioma U251 


3.4 


r>reasi ca. yji.eij 
MCF-7 


18.0 


glioma SF-295 


8.1 


DlCdiL tel. ^JJl.Cl^ 

MDA-MB-231 


10.1 


Heart (fetal) 


9.9 


T47D 


4.1 


Heart 






4.8 


Skeletal muscle (fetal) 


100.0 


Breast ca. MDA-N 


3.6 


Skeletal muscle 


9.8 


Ovary 


to 7 

la./ 


Bone marrow 


35.4 


Ovarian ca. OVCAR- 

3 


2.5 

— — — 


Thymus 


80.7 


4 


1.7 


Spleen 


50.0 


Ovarian ca. OVCAR- 
5 


8.2 


Lymph node 


29.1 


Ovarian ca. OVCAR- 
8 


3.5 


Colorectal 


42.3 


Ovarian ca. IGROV- 
1 


4.9 


Stomach 


10.7 


Ovarian ca.* (ascites) 


5.2 
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SK-OV-3 




Small intestine 




Uterus 


1A 7 
1 U. / 


Colon ca. SW480 


2.4 


Placenta 


36.3 


Colon ca.* 
SW620(SW48U met) 


3.8 


Prostate 


10.5 


Colon ca. HT29 


5.4 


Prostate ca.* (bone 
metjrC-3 


1.7 


Colon ca. HCT-116 


2.6 


Testis 


7.9 


Colon ca. CaCo-2 


10.7 


Melanoma 
Hs688(A).T 


4.6 


Colon ca. 
tissue(UUU3oooj 


21.6 


Melanoma* (met) 

riSOoo^o ). 1 


6.2 


Colon ca. HCC-2998 


10.4 


Melanoma UACC-62 


0.9 


Gastric ca.* (liver met) 
NCI-N87 


27.2 


Melanoma Ml 4 


2.8 


Bladder 


39.5 


Melanoma LOX 
IMVI 


2.9 


Trachea 


36.9 


Melanoma* (met) 
SK-MEL-5 


2.5 


Kidney 


16.0 


Adipose 


27.4 



Table CHG. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2904, Run 
162345750 


Tissue Name 


Rel. Exp.(%) 
Ag2904, Run 
162345750 


Normal Colon 


58.6 


Kidney Margin 
8120608 


5.3 


CC Well to Mod Diff 
(OD03866) 


12.2 


Kidney Cancer 
8120613 


7.9 


CC Margin (OD03 866) 


9.2 


Kidney Margin 
8120614 


14.2 


CC Gr.2 rectosigmoid 
(OD03868) 


50.0 


Kidney Cancer 
9010320 


31.2 


CC Margin (OD03868) 


5.6 


Kidney Margin 
9010321 


27.2 


CC Mod Diff (ODO3920) 


45.4 


Normal Uterus 


6.5 


CC Margin (ODO3920) 


27.2 


Uterus Cancer 06401 1 


26.6 


CC Gr.2 ascend colon 
(OD03921) 


30.4 


Normal Thyroid 


10.7 


CC Margin (OD03921) 


9.9 


Thyroid Cancer 
064010 


14.2 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


34.2 


Thyroid Cancer 
A302152 


36.3 
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Liver Margin*(ODO4309) 


19.1 


Thyroid Margin 
A302153 


22.4 


Colon mets to lung 
(OD04451-01) 


52.1 


Normal Breast 


23.5 


Lung Margin (OD0445 1 - 
02) 


43.2 


Breast Cancer 
(OD04566) 


6.8 


Normal Prostate 6546-1 


66.9 


Breast Cancer 
(OD04590-01) 


13.5 


Prostate Cancer 
(OD04410) 


22.4 


Breast Cancer Mets 
(OD04590-03) 


29.3 


Prostate Margin 
COD044 1 0) 


21.9 


Breast Cancer 

Metastasis 
(OD04655-05) 


18.7 


Prostate Cancer 
(OD04720-01) 


26.2 


Breast Cancer 064006 


27.5 


Prostate Margin 
(OD04720-02) 


28.9 


Breast Cancer 1 024 


20.9 


Normal Lung 061010 


100.0 


Breast Cancer 
9100266 


12.2 


Lung Met to Muscle 


27.2 


Breast Margin 
910096S 


8.8 


IV/Ti icr*1#=* 51 ncri n 
1 vict I gill 

(OD04286) 


13.6 


A209073 


15.7 


T liner lV/Tn 1 1 crrm nt 1* anppr 
L^Ullg, 1 Vict 1 Igl Idl 1 1 \^<\\Ws\>\ 

(OD03126) 


32.5 


A2090734 


17.6 




63 3 


Nnrmal T iver 


13.4 


Lung Cancer (OD04404) 


28.3 


Liver Cancer 064003 


13.9 


Lung Margin (OD04404) 


J4. 9 


Liver Cancer i\)2d 


n o 
1 j.y 


Lung Cancer (OD04565) 


23.3 


Liver Cancer 1026 


O.D 


Lung Margin (OD04565) 


49.0 


Liver Cancer 6004-T 


15.4 


Lung Cancer (OD04237- 
01) 


21.0 


Liver Tissue 6004-N 


9.6 


Lung Margin (OD04237- 
02) 


55.9 


Liver Cancer 6005-T 


6.5 


Ocular Mel Met to Liver 
(ODO4310) 


3.6 


Liver Tissue 6005-N 


2.6 


Liver Margin (OD043 1 0) 


14.9 


Normal Bladder 


46.7 


Melanoma Mets to Lung 
(OD04321) 


7.0 


Bladder Cancer 1023 


10.4 


Lung Margin (OD04321) 


67.8 


Bladder Cancer 
A302173 


16.0 


Normal Kidney 


47.0 


Bladder Cancer 
(OD047 18-01) 


32.1 


Kidney Ca, Nuclear grade 
2 (OD04338) 


39.5 


Bladder Normal 
Adjacent (OD04718- 
03) 


28.3 
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Kidney Margin 
(OD04338) 


45.4 


Normal Ovary 


2.4 


Kidney Ca Nuclear grade 
1/2(OD04339) 


80.1 


Ovarian Cancer 
064008 


33.2 


Kidney Margin 
(OD04339) 


28.1 


Ovarian Cancer 
(OD04768-07) 


80.7 


Kidney Ca, Clear cell 
type (OD04340) 


67.8 


Ovary Margin 
(OD04768-08) 


11.5 


Kidney Margin 
(OD04340) 


57.4 


Normal Stomach 


27.2 


Kidney Ca, Nuclear grade 
3 (OD04348) 


16.3 


Gastric Cancer 
9060358 


6.9 


Kidney Margin 
(OD04348) 


60.7 


Stomach Margin 
9060359 


10.3 


Kidney Cancer 
(OD04622-01) 


27.5 


Gastric Cancer 
9060395 


18.4 


Kidney Margin 
(OD04622-03) 


5.6 


Stomach Margin 
9060394 


22.8 


Kidney Cancer 
(OD04450-01) 


28.5 


Gastric Cancer 
9060397 


27.4 


Kidney Margin 
(OD04450-03) 


34.4 


Stomach Margin 
9060396 


9.2 


Kidney Cancer 8120607 


3.5 


Gastric Cancer 
064005 


77.9 



Table CHH. Panel 4.1D 



Tissue Name 


Rel. 

Exp.(%) ! 
Ag4520, 

Run 
198263642 


Rel. 

Exp.(%) I 
Ag4520, 
Run 
219310605 


Tissue Name 


Rel. 
Exp.(%) 
Ag4520, 
Run 
198263642 


Rel. 
Exp.(%) 
Ag4520, 

Run 
219310605 


Secondary Thl act 


22.5 


26.1 


HUVEC IL-lbeta 


9.5 


11.3 


Secondary Th2 act 


37.4 


40.1 


HUVEC IFN 
gamma 


17.0 


15.6 


Secondary Trl act 


22.8 


22.7 


HUVEC TNF 
alpha + IFN 
gamma 


18.0 


23.8 


Secondary Thl rest 


7.8 


6.3 


HUVEC TNF 
alpha + IL4 


12.0 


12.8 


Secondary Th2 rest 


8.7 


11.7 


HUVEC IL-11 


5.4 


4.6 


Secondary Trl rest 


7.7 


3.8 


Lung 
Microvascular EC 
none 


17.6 


20.7 


Primary Thl act 


35.8 


22.4 


Lung 
Microvascular EC 


19.9 


23.5 
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TNFalpha + IL- 
lbeta 






Primary Th2 act 


36.3 


39.0 


Microvascular 
Dermal EC none 


8.1 


8.7 


Primary Trl act 


34.9 


36.6 


Microsvasular 

Dermal EC 
TNFalpha + IL- 
lbeta 


9.1 


11.1 


Primary Thl rest 


3.8 


2.6 


Bronchial 
epithelium 
TNFalpha + 
1L lbeta 


5.3 


5.0 


Primary Th2 rest 


3.4 


3.1 


Small airway 
epithelium none 


1.6 


1.8 


Primary Trl rest 


A 1 

4. 1 


Q Q 
O.O 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


1 n & 
1 U.o 




CD45RA CD4 
lymphocyte act 


1 c 1 

1X1 


1 A f\ 

14. U 


Coronery artery 
SMC rest 


Z.J 


1 Q 

1 .y 


CJJ4j>K(J CD4 
lymphocyte act 


24.3 


26.2 


Coronery artery 
SMC TNFalpha + 
IL- lbeta 


3.7 


3.4 


CD8 lymphocyte 
act 


1 3.4 


1 ^ ^ 

10. J 


Astrocytes rest 


1 1 
1 . 1 




Secondary CD8 
lymphocyte rest 


16.8 


14.1 


Astrocytes 
TNFalpha + IL- 
lbeta^ 


7.1 


5.8 


Secondary CD8 
lymphocyte act 


9.1 


10.8 


KU-812 
(Basophil) rest 


2.3 


1.7 P 


CD4 lymphocyte 
none 


4.2 


3.7 


KU-812 
(Basophil) 
PMA/ionomycin 


8.4 


7.0 


2ry 

Thl/Th2/Trl anti- 
CD95 CH11 


14.7 


13.1 


CCD 1106 
(Keratinocytes) 
none 


1.2 


1.7 


LAK cells rest 


12.4 


11.7 


CCD 1106 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


6.6 


3.6 


LAK cells IL-2 


14.3 


15.5 


Liver cirrhosis 


2.8 


3.6 


LAK cells IL-2+IL- 
12 


9.7 


6.7 


NCI-H292 none 


2.6 


2.9 


LAK cells IL- 
2+IFN gamma 


9.3 


9.0 


NCI-H292 IL-4 


2.6 


3.6 


LAK cells IL-2+ 
IL-18 


12.8 


11.7 


NCI-H292 IL-9 


3.5 


3.5 
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LAK cells 
PMA/ionomycin 


25.0 


32.1 


NCI-H292 IL-13 


3.4 


3.1 


NK Cells IL-2 rest 


18.2 


19.6 


NCI-H292IFN 
gamma 


3.6 


4.2 


Two Way MLR 3 
day 


20.7 


22.7 


HPAEC none 


5.0 


5.2 


Two Way MLR 5 
day 


11.4 


15.3 


HPAEC TNF 
alpha + IL-1 beta 


19.9 


25.0 


Two Way MLR 7 
day 


13.2 


14.6 


Lung fibroblast 
none 


1.5 


1.6 


PBMC rest 


5.0 


4.7 


Lung fibroblast 
TNF alpha + IL-1 
beta 


3.9 


3.2 


PRMC PWM 


18.0 


21.8 


Lung fibroblast 
IL-4 


1.2 


1.4 


PBMC PHA-L 


15.0 


17.4 


Lung fibroblast 
IL-9 


1.1 


' 2.4 


Ramos (B cell) 
none 


1.9 


2.1 


Lung fibroblast 
IL-13 


1.5 


1.8 


Ramos (B cell) 
ionomycin 


3.0 


3.0 


Lung fibroblast 
IFN gamma 


Z.j 


3. 1 


B lymphocytes 
PWM 


10.4 


\ Z. 1 


Dermal fibroblast 
CCD 1070 rest 


3.4 


1 Q 
1 .5 


B lymphocytes 
CD40L and IL-4 


17.0 


20.0 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


15.3 


18.3 


EOL-1 dbcAMP 


9.8 


17.0 


Dermal fibroblast 
CCD1070 IL-1 
beta 


4.6 


4.6 


EOL-1 dbcAMP 
PMA/ionomycin 


37.4 


32.3 


Dermal fibroblast 
IFN gamma 


3.3 


2.9 


Dendritic cells 
none 


8.4 


4.8 


Dermal fibroblast 
IL-4 


4.2 


2.9 


Dendritic cells LPS 


20.0 


25.2 


Dermal 
Fibroblasts rest 


3.3 


1.8 


Dendritic cells anti- 
CD40 


6.3 


6.3 


Neutrophils 
TNFa+LPS 


51.8 


56.6 


Monocytes rest 


10.8 


9.8 


Neutrophils rest 


23.2 


26.2 


Monocytes LPS 


100.0 


100.0 


Colon 


1.7 


1.6 


Macrophages rest 


7.6 


6.8 


Lung 


4.0 


1.8 


Macrophages LPS 


29.7 


26.6 


Thymus 


7.9 


8.7 


HUVEC none 


3.1 


4.3 


Kidney 


6.3 


3.6 


HUVEC starved 


10.4 


8.0 




i 
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Table CHI. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2904, Run 
159078059 


Tissue Name 


Rel. Exp.(%) 
AgzVU4, Kun 
159078059 


Secondary Th 1 act 


52.5 


HUVEC IL-lbeta 


1 A Q 

\ 4.5 


Secondary Th2 act 


65.5 


HUVEC IFN gamma 


32.5 


Secondary Trl act 


58.2 


HUVEC TNF alpha + IFN 
gamma 


52.9 


Secondary Thl rest 


13.1 


HUVEC TNF alpha + IL4 


28.9 


Secondary Th2 rest 


18.8 


HUVEC IL-11 


8.4 


Secondary Trl rest 


16.6 


Lung Microvascular EC 
none 


22.5 


Primary Thl act 


94.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


39.2 


Primary Th2 act 


66.4 


Microvascular Dermal EC 
none 


20.2 


Primary Trl act 


94.6 


Microsvasular Dermal EC i 
TNFalpha + IL-lbeta 


30.4 


Primary Thl rest 




Bronchial epithelium 
TNFalpha + IL1 beta 


9.2 


Primary 1 hz rest 


jv). i 


Small airway epithelium 
none 


6.1 


Primary Trl rest 


J o.o 


Small airway epithelium 
TNFalpha + IL-lbeta 


53.6 


CD45RA CD4 
lymphocyte act 


9 A 7 


Cnrnnerv arterv SMC rest 


16.7 


CD45RO CD4 
lymphocyte act 


*f / .O 


Coronery artery SMC 
TNFalpha + IL-lbeta 


4.6 


CD8 lymphocyte act 


20.4 


Astrocytes rest 


4.8 


Secondary CD8 
lymphocyte rest 


ZH-.o 


Astrocytes TNFalpha + 
IL-lbeta 


16.2 


Secondary CD8 
lymphocyte act 


| 27.5 


KU-812 (Basophil) rest 


6.6 


CD4 Ivmnhocvte none 


J" 9.9 


KU-8 12 (Basophil) 
PMA/ionomycin 


17.3 


2ry Inl/lhz/lrl anti- 
CD95CH11 


27.5 


rrni 106 flOratinocvtes^ 
none 


2.1 


LAK cells rest 


28.9 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


4.2 


LAK cells IL-2 


33.2 


Liver cirrhosis 


5.6 


LAK cells IL-2+IL-12 


34.9 


Lupus kidney 


6.3 


LAK cells IL-2+IFN 
gamma 


47.3 


NCI-H292 none 


16.2 


LAK cells IL-2+ IL-18 


94.0 


NCI-H292 IL-4 


13.4 


LAK cells 


48.3 


NCI-H292 IL-9 


12.9 
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i n n 
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PM A/ionomyc in 








NK Cells IL-2 rest 


28.1 


NC1-H292 IL-13 


6.2 


Two Way MLR 3 day 


53.2 


NC1-H292 I FN gamma 


13.6 


Two Way MLR 5 day 


20.9 


HPAEC none 


14.6 


Two Way MLR 7 day 


22.1 


HPAFC TNF alnha + TT -1 

111/ VLV^ 1 INI CllJJI Id ' ILj— I 

beta 


35.1 






T untx flHroHlji^t nonp 

L/ LI 1 1 £^ 1 IV J I VJ U IC\Z>1. 1 l\Jl I 


3 6 


PBMC PWM 


100.0 


Lung fibroblast TNF alpha 

4- TT -1 hpta 


6.4 


PBMC PHA-L 


34.9 


Lung fibroblast IL-4 


5.8 i 


Ramos (B cell) none 


4.5 


T C 1 11 j TT C\ 

Lung fibroblast IL-9 


4.0 


Ramos (B cell) 
ionomycin 


13.0 


Lung fibroblast IL-13 


3.0 


B lymphocytes PWM 


96.6 


Lung fibroblast IFN 
gamma 


8.2 


B lymphocytes CD40L 
and IL-4 


84.7 


Dermal fibroblast 
CCD 1070 rest 


9.5 


EOL-1 dbcAMP 


19.5 


Dermal fibroblast 

L/LUlU/u 1 1N.T clipild 


99.3 


POT 1 Hhr- AN/IP 
DVJL- 1 (JUL/ /"VI VI I 

PMA/ionomycin 


74.7 


liPftYlQl T1 rM**"*r\l act 

LvCIIIiai llUlUUIclbL 

CCD 1070 IL-1 beta 


15.2 


Dendritic cells none 


13.4 


Dermal fibroblast IFN 
gamma 


7.5 


Dendritic cells LPS 


52.9 


Dermal fibroblast IL-4 


6.7 


Dendritic cells anti- 
CD40 


12.0 


IBD Colitis 2 


1.3 


Monocytes rest 


22.2 


IBD Crohn's 


3.7 


Monocytes LPS 


67.8 


Colon 


15.7 


Macrophages rest 


18.9 


Lung 


11.3 


Macrophages LPS 


99.3 


Thymus 


28.3 


HUVEC none 


15.3 


Kidney 


31.9 


HUVEC starved 


34.2 







Al comprehensive panel vl.O Summary: Ag4520 The NOV93 gene is widely 
expressed among the samples on this panel, with highest expression in normal colon adjacent 
to diseased colon (CT=29). This widespread pattern of expression is consistent with 
5 expression in Panels 4D and 4. ID. Please see Panel 4. ID for discussion of utility of this gene 
in inflammation. 

CNS_neurodegeneration_vl.O Summary: Ag2904/Ag4520 The NOV93 gene, an 

IMP dehydrogenase homolog, shows a small but significant (p=0.02) upregulation in the 

postmortem Alzheimer's brain when compared to nondemented controls. IMP dehydrogenase 

10 is involved in purine metabolism, and has been implicated as a drug target for supressing the 
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immune response, inflammation, and cerebral edema. The observed increase in the expression 
of this gene is in concordance with the evidence for the role of neuroinflammation in 
Alzheimer's disease. Therefore, the inhibition of this molecule may be of therapeutic benefit in 
Alzheimer's disease, head or spinal cord trauma, stroke, cerebral edema, or viral infections of 
the CNS. 

References: 

Hall IH, Wyrick SD. Cytoxicity of [(5,6-dichloro-9a-n-propyl-2,3,9,9a-tetrahydro-3- 
oxo-lH fluoren-7-yl)oxy]acetic acid, an agent known to reduce brain edema. Biomed 
Pharmacother 1996;50(1): 19-23 

A known agent, [(5,6-dichloro-9a-n-propyl-2,3,9,9a-tetrahydro-3-oxo-lH fluoren-7- 
yl)oxy]acetic acid, which blocks brain edema, was also shown to be a potent cytotoxic agent in 
leukemia cells. The major site of action of the agents appears to be in the de novo purine 
synthetic pathway in L1210 leukemic cells. Both PRPP amido transferase and IMP 
dehydrogenase activities were suppressed by the agent. The inhibition of both regulatory 
enzymes of the pathway along with the reduction of dihydrofolate reductase activity would 
account for the observed suppression of DNA and RNA syntheses and subsequent cancer cell 
death. 

Senda M, Natsumeda Y. Tissue-differential expression of two distinct genes for human 
IMP dehydrogenase (E.C.I. 1.1. 205). Life Sci 1994;54(24): 191 7-26 

Human IMP dehydrogenase (E.C. 1 .1 .1 .205) is recently regarded as a potent targeting 
enzyme for immunosuppressive drugs. Tissue differential expressions of human type I and 
type II IMP dehydrogenase were investigated in sixteen human adult organs (heart, brain, 
placenta, lung, liver, skeletal muscle, kidney, pancreas, spleen, thymus, prostate, testis, ovary, 
small intestine, colon, peripheral blood leukocytes) and five human fetal organs (heart, brain, 
lung, liver, kidney) using Northern blot analysis. In all tissues examined in this study, the sizes 
of mRNAs of each isoform were identical, respectively. The 2.3 kb type II mRNA was shown 
predominantly, and the 3.5 kb type I mRNA level was lower than type II in most human 
tissues examined. In contrast, type I IMPDH gene expressed higher than type II in peripheral 
blood leukocytes, uniquely. We also demonstrated that both type I and type II IMPDH genes 
are widely distributed among various species by Southern blot analysis. Interestingly, type I 
IMPDH gene may have multiple gene families in primates, [dstone, 17-Jan-02] 

General jscreening_panel_vl. 4 Summary: Ag4250 Two experiments with the same 
probe and primer set produce results that are in excellent agreement, with highest expression 
of the NOV93 gene in a gastric cancer cell line and fetal lung tissue. In addition, there appears 
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to be substantial expression associated with breast cancer cell lines, lung cancer cell lines and 
renal cancer cell lines. Thus, the expression of this gene could be used to distinguish NCI-N87 
and fetal lung tissue from the other samples in the panel. Moreover, therapeutic modulation of 
this gene, through the use of small molecule drugs, antibodies or protein therapeutics might be 
5 beneficial in the treatment of breast, lung or kidney cancer. 

Among tissues with metabolic function, this gene has low-to-moderate levels of 
expression in adipose, liver, heart, skeletal muscle, adrenal, pituitary, thyroid and pancreas. 
Thus, this gene product may be a small molecule target for the treatment of metabolic and 
endocrine diseases, including obesity and Type 2 diabetes. This encodes a putative IMP 

10 dehydrogenase, which is involved in purine metabolism and has been implicated as a target for 
suppressing the immune response. Thus, this gene product may also be a treatment for Type 1 
diabetes, in which insulin-secreting beta cells are destroyed by the autoimmune response 
against them. In addition, this gene appears to be differentially expressed in fetal (CT values = 
30) vs adult liver (CT value = 33) and in fetal (CT values =27-28) vs. adult lung (CTs = 32- 

15 33), and may be useful for the differentiation between the two sources of these tissues. 

This molecule is also expressed at moderate to low levels in all CNS regions examined. 
Please see panel CNS Neurodegeneration for a discussion of utility of this gene in the central 
nervous system. 

Panel 1.3D Summary: Ag2904 Expression of the NOV93 gene is higher overall in 
20 normal tissues, with highest expression in fetal skeletal muscle. Furthermore, this gene is 

expressed at higher levels in fetal skeletal muscle (CT=29) when compared to expression in 
adult skeletal muscle (CT=32). Thus, expression of this gene could be used to differentiate 
between fetal skeletal muscle and other samples on this panel and between fetal and adult 
skeletal muscle. 

25 Expression in the CNS is consistent with expression in previous panels. Please see 

CNS_neurodegeneration for discussion of utility of this gene in the central nervous system. 

Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal muscle, heart 
and liver. This widespread expression among these tissues suggests that this gene product may 

30 be useful for the diagnosis and/or treatment of metabolic disease, including obesity and 
diabetes. 

Panel 2D Summary: Ag2904 The expression of the NOV93 gene appears to be 
highest in a sample derived from normal lung tissue. In addition, there appears to be 
substantial expression in most of the samples in the panel. Of note is the expression associated 
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with normal lung tissue when compared to adjacent lung cancer tissue. Thus, the expression of 
this gene could be used to distinguish this sample of normal lung tissue from other samples in 
the panel. In addition, the expression of this gene could be used to distinguish normal lung 
tissue adjacent to cancer tissue. Moreover, therapeutic modulation of this gene, through the 
5 use of small molecule drugs, antibodies or protein therapeutics might be beneficial in the 
treatment of lung cancer. 

Panel 4D/4.1D Summary: Ag2904/Ag4520 The NOV93 gene, a novel IMP 
dehydrogenase-like protein, is differentially expressed, as displayed in Panels 4. ID and 4D, in 
activated T cells, activated B cells, activated monocytes, activated macrophages, and activated 

10 dendritic cells. Small molecule antagonists of the previously characterized IMP dehydrogenase 
have been found to be useful in the treatment of several immunopathological states (See 
Allison and Eugui, 2001). Therefore, small molecule antagonists of the NOV93 gene product 
may reduce or eliminate the symptoms of autoimmune and inflammatory diseases, including 
Crohn's disease, ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary disease, 

15 asthma, emphysema, rheumatoid arthritis, lupus erythematosus, or psoriasis. 
References: 

Allison AC, Eugui EM. Mycophenolate mofetil and its mechanisms of action. 
Immunopharmacology 2000 May ;47(2-3): 85-1 18 

Mycophenolate mofetil (MMF, CellCept(R)) is a prodrug of mycophenolic acid 

20 (MPA), an inhibitor of inosine monophosphate dehydrogenase (IMPDH). This is the rate- 
limiting enzyme in de novo synthesis of guanosine nucleotides. T- and B-lymphocytes are 
more dependent on this pathway than other cell types are. Moreover, MPA is a fivefold more 
potent inhibitor of the type II isoform of IMPDH, which is expressed in activated 
lymphocytes, than of the type I isoform of IMPDH, which is expressed in most cell types. 

25 MPA has therefore a more potent cytostatic effect on lymphocytes than on other cell types. 

This is the principal mechanism by which MPA exerts immunosuppressive effects/Three other 
mechanisms may also contribute to the efficacy of MPA in preventing allograft rejection and 
other applications. First, MPA can induce apoptosis of activated T-lymphocytes, which may 
eliminate clones of cells responding to antigenic stimulation. Second, by depleting guanosine 

30 nucleotides, MPA suppresses glycosylation and the expression of some adhesion molecules, 
thereby decreasing the recruitment of lymphocytes and monocytes into sites of inflammation 
and graft rejection. Third, by depleting guanosine nucleotides MPA also depletes 
tetrahydrobiopterin, a co-factor for the inducible form of nitric oxide synthase (iNOS). MPA 
therefore suppresses the production by iNOS of NO, and consequent tissue damage mediated 
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by peroxynitrite. CellCept(R) suppresses T-lymphocytic responses to allogeneic cells and 
other antigens. The drug also suppresses primary, but not secondary, antibody responses. The 
efficacy of regimes including CellCept(R) in preventing allograft rejection, and in the 
treatment of rejection, is now firmly established. CellCept(R) is also efficacious in several 
5 experimental animal models of chronic rejection, and it is hoped that the drug will have the 
same effect in humans. 

NOV94 

Expression of gene NOV94 was assessed using the primer-probe set Ag2905, 
described in Table CIA. Results of the RTQ-PCR runs are shown in Tables CIB, CIC and 
10 CID. 

Table CIA. Probe Name Ag2905 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forwardjs • -gcagaataccacgatgacttct-3 ' 


22 


1468 


1366 


Probe | TET " 5 ' -agtcagcttacgtcgctgcctctgag-3 ■ - 

ro e Jtamra 


26 


1496 


1367 


Reversej5 ' -gttcctggtgctgtaatgca- 3 ' 


20 


1523 


1368 



Table CIB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag2905, 
Run 161374149 


Tissue Name 


Rel. Exp.(%) Ag2905, 
Run 161374149 


Liver adenocarcinoma 


20.4 


Kidney (fetal) 


1.4 


Pancreas 


0.1 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAP AN 
2 


4.9 


Renal ca. A498 


7.4 


Adrenal gland 


0.2 


Renal ca. RXF 393 


9.0 


Thyroid 


0.9 


Renal ca. ACHN 


2.4 


Salivary gland 


0.6 


Renal ca. UO-31 


5.5 


Pituitary gland 


0.4 


Renal ca. TK-10 


13.7 


Brain (fetal) 


0.4 


Liver 


0.3 


Brain (whole) 


0.2 


Liver (fetal) 


0.8 


Brain (amygdala) 


1.2 


Liver ca. 
(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.2 


Lung 


0.6 


Brain (hippocampus) 


1.0 


Lung (fetal) 


0.9 


Brain (substantia nigra) 


0.3 


Lung ca. (small cell) 
LX-1 


15.9 


Brain (thalamus) 


1.0 


Lung ca. (small cell) 
NCI-H69 


1.2 
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C^prphra 1 Cortex 


2.3 


Lung ca. (s.cell var.) 
SHP-77 


27.7 


Slninal cord 


1.5 


Lung ca. (large 
cell)NCI-H460 


2.0 


plio/astro IJ87-MG 


1 1.7 


Lung ca. (non-sm. i 
cell) A549 


8.0 


clio/astro I J-1 1 8-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


2.8 


astrocytoma SW 1783 


0.2 


Lung ca. (non-s.cell) 
HOP-62 


1.9 


neuro*; met SK-N-AS 


5.8 


Lung ca. (non-s.cl) 
NCI-H522 


5.4 


astrocytoma SF-539 


5.3 


Lung ca. (squam.) 
SW 900 


2.6 


astrocytoma SNB-75 


1.7 


Lung ca. (squam.) 
NCI-H596 


1.6 


glioma SNB-19 


0.1 


Mammary gland 


2.2 


glioma U251 


0.0 


Breast ca.* (pl.ef) 
MCF-7 


40.6 


glioma SF-295 


0.3 


Breast ca.* (pl.ef) 
MDA-MB-231 


6.3 


Heart (fetal) 


0.2 


Breast ca.* (pl.ef) 
T47D 


0.0 


Heart 


10.2 


Breast ca. BT-549 


1.2 


Skeletal muscle (fetal) 


3.4 


Breast ca. MDA-N 


10.1 


Skeletal muscle 


0.7 


Ovary 


1.9 


Ronp marrow 


1.0 


Ovarian ca. OVCAR- 
3 


1.6 


nrhvmus 

1 1 IV HlUo 


20.2 


Ovarian ca. OVCAR- 
4 


0.3 


Spleen 


0.8 


Ovarian ca. OVCAR- 


7.8 


Lymph node 


0.2 


Ovarian ca. OVCAR- 


10.2 


Colorectal 


1.0 


Ovarian ca. IGROV- 
1 


1.5 


Stomach 


1.9 


SK-OV-3 


9.4 


Small intestine 


2.3 


Uterus 


0.1 


Colon ca. SW480 


6.0 


Placenta 


1.6 


Colon ca.* 
SW620(SW480 met) 


9.2 


Prostate 


3.5 


Colon ca. HT29 


30.8 


Prostate ca.* (bone 
met)PC-3 


9.3 


Colon ca. HCT-116 


11.2 


Testis 


100.0 
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Colon ca. CaCo-2 


11.3 


Melanoma 
Hs688(A).T 


1.2 


Colon ca. 
tissue (ULNJJ oooj 


2.4 


Melanoma* (met) 

raSOoo(Jt> ). 1 


2.0 


Colon ca. HCC-2998 


7.2 


Melanoma UACC-62 


1.5 


Gastric ca.* (liver met) 
NCI-N87 


40.6 


Melanoma M14 


1.0 


Bladder 


2.0 


Melanoma LOX 
IMVI 


0.6 


Trachea 


4.0 


Melanoma* (met) 
SK-MEL-5 


1.0 


Kidney 


1.0 


Adipose 


0.4 



Table CIC. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2905, Run 
161374481 


Tissue Name 


Rel. Exp.(%) 
Ag2905, Run 
161374481 


Normal Colon 


15.3 


Kidney Margin 
8120608 


1.1 


CC Well to Mod Diff 
(OD03866) 


5.4 


Kidney Cancer 
8120613 


0.5 


CC Margin (OD03866) 


1.1 


Kidney Margin 
8120614 


0.0 


CC Gr.2 rectosigmoid 
(OD03868) 


3.1 


Kidney Cancer 
9010320 


42.3 


CC Margin (OD03868) 


1.1 


Kidney Margin 
9010321 


2.0 


CC Mod Diff(ODO3920) 


26.6 


Normal Uterus 


1.8 


CC Margin (ODO3920) 


3.6 


Uterus Cancer 064011 


8.2 


CC Gr.2 ascend colon 
(OD03921) 


14.2 


Normal Thyroid 


9.2 


CC Margin (OD03921) 


6.2 


Thyroid Cancer 
064010 


16.0 


CC from Partial 
Hepatectomy (ODO4309) 
Mets 


44.8 


Thyroid Cancer 
A302152 


10.2 


Liver Margin (ODO4309) 


8.4 


Thyroid Margin 
A302153 


15.1 


Colon mets to lung 
(OD04451-01) 


11.0 


Normal Breast 


9.7 


Lung Margin (OD04451- 
02) 


0.9 


Breast Cancer 
(OD04566) 


1.2 


Normal Prostate 6546-1 


10.1 


Breast Cancer 
(OD04590-01) 


3.5 


Prostate Cancer 


52.9 


Breast Cancer Mets 


2.0 
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(OD04410) 




(OD04590-03) 




Prostate Margin 


23.0 


Breast Cancer 

Metastasis 
(OD04655-05) 


54.7 


Prostate Cancer 
(OD04720-01) 


18.7 


Breast Cancer 064006 


8.1 


Prostate Margin 
(OD04720-02) 


33 2 


Breast Cancer 1024 


26.1 


Normal Lung 061010 


12.9 


Breast Cancer 
9100266 


30.6 


Lung Met to Muscle 


29.3 


Breast Margin 
91 0026S 

7 1 \J\J Z.\J _J 


17.4 


lviuscie iviargm 
(OD04286) 


2.7 


A209073 


62.0 


i^ung iviangnani cancer : 
(OD03126) 


3.4 


OICuol lYldl gill 

A2090734 


20.7 


LAing iviargin [kjljvj izoj 


/ .D 


JNwllIlCll L/IVtl 


11.7 


Lung Cancer (OD04404) 


35.6 


Liver Cancer 064003 


6.7 


Lung Margin (OD04404) 


4.2 


Liver Cancer 1025 


3.4 


Lung Cancer (OD04565) 


0.7 


Liver Cancer 1026 


o o 

8.2 


Lung Margin (OD04565) 


2.2 


Liver Cancer 6004-T 


3.3 


Lung Cancer (OD04237- 
01) 


51.8 


Liver Tissue 6004-N 


15.3 


Lung Margin (OD04237- 
02) 


0.4 


Liver Cancer 6005-T 


10.4 


Ocular Mel Met to Liver 
(ODO4310) 


41.8 


Liver Tissue 6005-N 


0.2 


Liver Margin (OD043 1 0) 


4.4 


Normal Bladder 


16.7 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


6.3 


Lung Margin (OD04321) 


4.7 


Bladder Cancer 
A302173 


40.6 


Normal Kidney 


7.1 


Bladder Cancer 
(OD047 18-01) 


1.0 


Kidney Ca, Nuclear grade 
2 (OD04338) 


16.5 


Bladder Normal 
Adjacent (OD04718- 
03) 


5.7 


Kidney Margin 
(OD04338) 


7.4 


Normal Ovary 


1.3 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


12.9 


Ovarian Cancer 
064008 


100.0 


Kidney Margin 
(OD04339) 


4.6 


Ovarian Cancer 
(OD04768-07) 


18.3 


Kidney Ca, Clear cell 
type (OD04340) 


5.8 


Ovary Margin 
(OD04768-08) 


0.7 



"!!. ill ! 
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Kidney Margin 
(OD04340) 


3.5 


Normal Stomach 


5.8 


Kidney Ca, Nuclear grade 
3 (OD04348) 


4.0 


Gastric Cancer 
9060358 


0.9 


Kidney Margin 
(OD04348) 


4.4 


Stomach Margin 
9060359 


4.8 


Kidney Cancer 
(OD04622-01) 


0.7 


Gastric Cancer 
9060395 


13.2 


Kidney Margin 
(OD04622-03) 


1.3 


Stomach Margin 
9060394 


8.3 


Kidney Cancer 
(OD04450-01) 


11.9 


Gastric Cancer 
9060397 


14.7 


Kidney Margin 
(OD04450-03) 


8.8 


Stomach Margin 
9060396 


2.1 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


17.1 



Table CID. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2905, Run 
159772697 


Tissue Name 


Rel. Exp.(%) 
Ag2905, Run 
159772697 


Secondary Thl act 


12.8 


HUVEC IL-lbeta 


8.2 


Secondary Th2 act 


6.5 


HUVEC IFN gamma 


9.1 


Secondary Trl act 


8.7 


HUVEC TNF alpha + IFN 
gamma 


11.6 


Secondary Thl rest 


4.7 


HUVEC TNF alpha + IL4 ; 


12.2 


Secondary Th2 rest 


6.0 


HUVEC IL-11 


9.0 


Secondary Trl rest 


6.7 


Lung Microvascular EC 
none 


1.4 


Primary Thl act 


12.5 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


3.1 


Primary Th2 act 


10.6 


Microvascular Dermal EC 
none 


24.3 


Primary Trl act 


16.6 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


13.7 


Primary Thl rest 


24.5 


Bronchial epithelium 
TNFalpha + ILlbeta 


1.0 


Primary Th2 rest 


10.2 


Small airway epithelium 
none 


6.7 


Primary Trl rest 


17.4 


Small airway epithelium 
TNFalpha + IL-lbeta 


29.1 


CD45RA CD4 
lymphocyte act 


3.8 


Coronery artery SMC rest 


9.9 


CD45RO CD4 
lymphocyte act 


8.9 


Coronery artery SMC 
TNFalpha + IL-lbeta 


6.0 
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CD8 lymphocyte act 


5.1 


Astrocytes rest 


9.1 


Secondary CD8 
lymphocyte rest 


8.2 


Astrocytes TNFalpha + 
IL-lbeta 


5.4 


Secondary CD8 
lymphocyte act 


7.6 


KU-812 (Basophil) rest 


10.7 


CD4 lymphocyte none | 


1.0 


KU-812 (Basophil) 
PMA/ionomycin 


21.6 


2ry Thl/Th2/Trl anti- 
CD95CH11 


8.9 


CCDl 106 (Keratinocytes) 
none 


30.4 


LAK cells rest 


4.7 


CCDl 106 (Keratinocytes) 
TNFalnha 4- 11 -1 beta 


1.1 


J_//\rV CCllo 1L~Z 




T i \/pr r*irrHriQic 

L/l Vtl l^- 11 1 1 IVJZM o 


1.1 


T AK reIR TT -9+TT -1? 


1 1 9 


I iinii*s IcidtiPV 


1.0 


LAK cells IL-2+IFN j 

irci m in ci 


14.7 


NCI-H292 none 


35.4 




1 1 7 
11./ 


1MPT-H90? TT -4 


?Q 7 


LAK cells 

r lVl/\/ lUIlOIIiyL'lIl 


2.8 


NCI-H292 IL-9 


49.0 




s o 

J ,u 


NC1-H99? II -1 3 


1 7 9 


i wo w ay i vi iv j *Jd.y 


J .J 


>JCT-H?Q? TFN ctnmma 


JZr.O 


Two Way MLR 5 day 


5.8 


HPAEC none 


4.8 


Two Way MLR 7 day 


3.4 


HPAEC TNF alpha + IL-1 
beta 


5.0 


PBMC rest 


0.8 


Lung fibroblast none 


7.3 


PBMC PWM 


14.4 


Lung fibroblast TNF alpha 
+ IL- 1 beta 


3.9 


PBMC PHA-L 


9.0 


Lung fibroblast IL-4 


8.4 


Ramos (B cell) none 


17.4 


Lung fibroblast IL-9 


10.9 


Ramos (B cell) 
ionomycin 


100.0 


Lung fibroblast IL-1 3 


8.8 


B lymphocytes PWM 


31.2 


Lung fibroblast IFN 
gamma 


17.1 


B lymphocytes CD40L 
and IL-4 


15.5 


Dermal fibroblast 
CCD 1070 rest 


31.9 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCDIO/U IJNr aTpna 


39.2 


CAr i JL/, a 1V/TD 
rAJl>- I GDC/vlVlr 

PM A/ionomy c in 


0.0 


i^ermai iiDroDiasi 
CCDl 070 IL-1 beta 


12.2 


Dendritic cells none 


4.9 


Dermal fibroblast IFN 
gamma 


6.4 


Dendritic cells LPS 


3.0 


Dermal fibroblast IL-4 


18.2 


Dendritic cells anti- 
CD40 


4.9 


IBD Colitis 2 


0.7 


Monocytes rest 


2.0 


IBD Crohn's 


0.0 


Monocytes LPS 


2.0 


Colon 


5.4 
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Macrophages rest j 


3-6 


Lung 


. .. . « 


Macrophages LPS J 


1.8 


Thymus 


3.0 


HUVEC none 


19.2 


Kidney 


20.0 


HUVEC starved 


24.0 







Panel 1.3D Summary: Ag2905 The expression of the NOV94 gene appears to be 
highest in a sample derived from normal testis tissue (CT=28.9). In addition, there is 
substantial expression associated with samples derived from colon cancer cell lines, lung 
5 cancer cell lines and breast cancer cell lines. Thus, the expression of this gene could be used to 
distinguish normal testis tissue from other samples in the panel. Moreover, therapeutic 
modulation of this gene, through the use of small molecule drugs, protein therapeutics or 
antibodies could be beneficial for the treatment of colon, lung or breast cancer. 

In addition, this gene appears to be differentially expressed in fetal (CT value = 37) vs 
10 adult heart (CT value = 32), and may be useful for the differentiation between the two sources 
of heart tissue. 

Panel 2D Summary: Ag2905 The expression of the NOV94 gene appears to be 
highest in a sample derived from an ovarian cancer (CT=30.5). In addition, there appears to be 
substantial expression associated with breast cancers, lung cancers, gastric cancers, prostate 

1 5 cancers and colon cancers. Thus, the expression of this gene could be used to distinguish this 
ovarian cancer sample from others in the panel. Moreover, therapeutic modulation of this 
gene, through the use of small molecule drugs, protein therapeutics or antibodies might be 
beneficial in the treatment of ovarian, breast, lung, gastric, prostate or colon cancer. 

Panel 4D Summary: Ag2905 Low but significant expression of the NOV94 transcript 

20 is found predominantly in activated B cell lymphoma cell line (Ramos) and activated B cells 
(CTs=32-34). It is also found in HUVEC, keratinocytes, lung fibroblasts and the muco- 
epidermoid cell line H292. Therefore, targeting of this gene product with a small molecule 
drug therapeutic may modulate the functions of B cells and lead to the improvement of 
symptoms in autoimmune diseases such as lupus erythematosus, rheumatoid arthritis, 

25 hyperglobulinemia and other B cell disorders. 

NOV95 

Expression of gene NOV95 was assessed using the primer-probe set Ag3060, 
described in Table CJA. Results of the RTQ-PCR runs are shown in Tables CJB and CJC. 
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Table CJA. Probe Name Ag3060 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward; 


5 1 -caaagattgcagcaatcgatag-3 1 


22 


189 


1369 


Probe 


TET-5 * -agtatacacgaggctttggccatcca-3 ' - 
TAMRA 


26 


219 


1370 


Reverse 


5 ' -aggacagagctttcacaagtga-3 1 


22 


245 


1371 



Table CJB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3060, 
Run 168016485 


Tissue Name 


Rel. Exp.(%) Ag3060, 
Run 168016485 


Liver adenocarcinoma 


0.2 


Kidney (fetal) 


0.3 


Pancreas 


0.2 


Renal ca. 786-0 


0.1 


Pancreatic ca. CAPAN 

2 


0.0 


Renal ca. A498 


0.1 


Adrenal gland 


0.1 


Renal ca. RXF 393 


0.1 


Thyroid 


0.0 


Renal ca. ACHN 


0.1 


Salivary gland 


0.0 


Renal ca. UO-31 


0.1 


Pituitary gland 


0.1 


Renal ca. TK-10 


0.1 


Brain (fetal) 


0.1 


Liver 


0.0 


Brain (whole) 


0.1 


Liver (fetal) 


0.0 


Brain (amygdala) 


0.1 


Liver ca. 
(hepatoblast) HepG2 


0.1 


Brain (cerebellum) 


0.1 


Lung 


0.0 






T lino ^"ff*tnt^ 


0 1 


Brain (substantia nigra) 


0.1 


1 Jina ca Ismail celH 

J^Ulltl vU« Willi CI 1 1 Vwll / 

LX-l 


0.1 


Brain (thalamus) 


0.1 


Lung ca. (small cell) 
NCI-H69 


0.1 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.2 


Spinal cord 


0.1 


Lung ca. (large 
cell)NCI-H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.3 


glio/astro U-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.1 


astrocytoma SW 1783 


0.2 


Lung ca. (non-s.cell) 
HOP-62 


0.1 


neuro*;met SK-N-AS 


0.1 


Lung ca. (non-s.cl) 
NCI-H522 


0.2 


astrocytoma SF-539 


0.1 


Lung ca. (squam.) 
SW 900 


0.1 


astrocytoma SNB-75 . 


0.2 


, Lung ca. (squam.) 


0.1 



1450 








NC1-H596 




glioma SNB-19 


0.] 


Mammary gland 


A 1 
0.1 


glioma U251 


0.3 


Breast ca. (pl.el) 
MCF-7 


0.1 


glioma SF-295 


0.2 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.1 


Heart (fetal) 


3.3 


Breast ca.* (pl.er) 
J47D 


100.0 


Heart 


0.0 


T"fc fc a. A fit 1 r /in 

Breast ca. B 1 -549 


0.0 


Skeletal muscle (fetal) 


0.0 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Rone marrow 


0.0 


Ovarian ca. OVCAR- 
3 


0.2 


Thvmns 

A 11 YlllUkJ 


0.1 


Ovarian ca. OVCAR- 
4 


0.2 


Spleen 


0.1 


Ovarian ca. OVCAR- 
5 


0.6 


Lymph node 


0.1 


Ovarian ca. OVCAR- 
8 


0.1 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


0.2 


Small intestine 


0.0 


Uterus 


O.l 


Colon ca. SW480 


0.1 


Placenta 


0.0 


Colon ca.* 
S W620(SW480 met) 


0.2 


Prostate 


0.0 


Colon ca. HT29 


0.1 


Prostate ca.* (bone 
metjr'C-J 


O.l 


Colon ca. HCT-116 


0.1 


Testis 


3.1 


Colon ca. CaCo-2 


0.1 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 

tlSSUe^VJL'Lo oOO ) 


0.0 


Melanoma* (met) 

riSOoo^D ). i 


O.l 


Colon ca. HCC-2998 


0.2 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


0.1 


Melanoma Ml 4 


O.l 


Bladder 


0.1 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.1 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


0.1 


Adipose 


O.l 
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Table CJC. Pane! 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3060, Run 
164317425 


Tissue Name 


Rel. Exp.(%) 
Ag3060, Run 
164317425 


Secondary Th 1 act 


22.7 


HUVEC IL-lbeta 


7.2 


Secondary Th2 act 


25.7 


HUVEC IFN gamma 


15.1 


Secondary Trl act 


40.3 


HUVEC TNF alpha + IFN 
gamma 


12.3 


Secondary Thl rest 


6.1 


HUVEC TNF alpha + IL4 


10.4 


Secondary Th2 rest 


9.3 


HUVEC IL-11 


4.8 


Secondary Trl rest 


11.7 


Lung Microvascular EC 
none 


6.0 


Primary Thl act 


29.5 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


9.7 


Primary Th2 act 


24.1 


Microvascular Dermal EC 
none 


11.8 


Primary Trl act 


33.4 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


9.6 


Primary I hi rest 


52.9 


Bronchial epithelium 
TNFalpha + IL1 beta 


20.0 


Primary Th2 rest 


26.2 


Small airway epithelium 
none 


5.0 


Primary l rl rest 


1 *7 A 

1 7.4 


Small airway epithelium 
TNFalpha + IL-lbeta 


"> O A 

38.4 


CD45RA CD4 
lymphocyte act 




Coronery artery SMC rest 


1 O 1 

Iz. 1 


CD45RO CD4 
lymphocyte act 


23.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


O. / 


CD8 lymphocyte act 


26.4 


Astrocytes rest 


9.0 


Secondary CD8 
lymphocyte rest 


""> ^ o 


Astrocytes TNFalpha + 
IL-lbeta 


J.O 


Secondary CD8 
lymphocyte act 


15.1 


KU-8 12 (Basophil) rest 


22.7 


CD4 lvmnhocvte none 

' 1 IT 1111711 W Y IV llV/llv 


7.1 


-KU-812 (Basophil) 
PM A/i onomy c in 


67.8 


2ry Thl/Th2/Trl anti- 
CD95 CHI 1 


13.4 


CCD1 106 (Keratinocytes) 
none 


10.3 


LAK cells rest 


15.0 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


5.8 


LAK cells IL-2 


18.7 


Liver cirrhosis 


2.6 


LAK cells IL-2+IL-12 


14.5 


Lupus kidney 


1.0 


LAK cells IL-2+IFN 
gamma 


24.1 


NCI-H292 none 


22.1 


LAK cells IL-2+ IL-18 


19.5 


NCI-H292 IL-4 


21.8 


LAK cells 


11.7 


NCI-H292 IL-9 


28.7 
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PIVf A/ionom vein 

1 LYM.1%./ Ivllvlll J vill 








NK Cells IL-2 rest 

l y i ^ uiio x i — / ^ i vol 


14.2 


NC1-H292 1L-13 


15.5 


Twn Wav KIT R ^ Hav 


14.0 


NCI-H292 1FN gamma 


27.4 


Two Way MLR 5 day 


14.5 


HPAEC none 


7.1 


Two Way MLR 7 day ; 


13.7 


HrAhL lJNr alpha + 1 L- 1 
beta 


11.5 


PBMC rest 


4.2 


Lung fibroblast none 




PBMC PWM 


38.2 


Lung fibroblast TNF alpha 
+ beta 


10.8 


PBMC PHA-L 


21.3 


Lung fibroblast IL-4 


14.6 


Ramos (B cell) none 


14.4 


Lung fibroblast IL-9 


13.9 


Ramos (B cell) 
ionomycin 


67.8 


Lung fibroblast IL-13 


11.0 


R Ivmnhocvtes PWM 

1—9 IV 111 U11UV J UVJ X » T 1T1 


100.0 


Lung fibroblast IFN 
gamma 


20.9 


B lymphocytes CD40L 
and IL-4 


27.2 


Dermal fibroblast 
CCD 1070 rest 


16.3 


EOL-1 dbcAMP 


13.4 


Dermal fibroblast 
CCD 1070 INr alpha 


32.3 


EOL-1 dbcAMP 

PN/t A /innnmvriTi 
r ivi/-\f ixjiiKjinys^m 


14.8 


Dermal fibroblast 
CCD1070 IL-1 beta 

V-^ V-' IS 1 V ( w X 1 — J 1 !_/ \s LCI 


7.3 


Dendritic cells none 


11.4 


Dprmal fihrobla^t IPTsJ 

L/vllllUl 11L/1 UUlAol 11 IN 

gamma 


10.2 


Dendritic cells LPS 


15.9 


Dermal fibroblast IL-4 


18.2 


Dendritic cells anti- 
CD40 


10.8 


IBD Colitis 2 


0.5 


Monocytes rest 


10.6 


IBD Crohn's 


1.2 


Monocytes LPS 


7.0 


Colon 


6.8 


Macrophages rest 


12.2 


Lung 


9.5 


Macrophages LPS 


10.9 


Thymus 


14.3 


HUVEC none 


7.0 


Kidney 


28.9 


HUVEC starved 


17.7 







Panel 1.3D Summary: Ag3060 Results from one experiment with this gene are not 
included. The amp plot indicates that there were experimental difficulties with this run (data 
not shown). 

5 Panel 4D Summary: Ag3060 High expression of the NOV95 transcript (CTs= 26.3- 

26.9) is found in activated B cells and B cell lymphoma (Ramos). B cells generate antibody 
response and lead to activation of T cell mediated response as antigen presenting cells and are 
central to the function of the immune response. Therefore, targeting of this gene product with 
a small molecule drug therapeutic may modulate the functions of B cells and lead to the 
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improvement of symptoms of autoimmune diseases such as lupus erythematosus, rheumatoid 
arthritis, hyperglobulinemia and other B cell disorders. 

In addition, moderate expression of this gene is also found in a wide range of cell types 
of significance in the immune response in health and diseases. This suggests the broader 
5 involvement of the protein encoded by this gene in many inflammatory and autoimmune 
diseases. 

NOV96a, NOV96b, and NOV96c 

Expression of gene NOV96a and full length clones NOV96b and NOV96c was 
assessed using the primer-probe set Ag4532, described in Table CKA. Results of the RTQ- 
1 0 PCR runs are shown in Table CKB. 



Table CKA. Probe Name Ag4532 



Primers] Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward]5 ' - actcacctctctcctccatcat-3 1 


22 


626 


1372 


Probe J TET " 5 1 -cgttacactgttgccctcaccctgat-3 * - : 
r ° S JTAMRA 


26 


660 


1373 


Reverse js 1 - agggaatgaagtagccagtgtt - 3 1 


22 


687 


1374 



Table CKB. General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) Ag4532, 
Run 222735297 


Tissue Name 


Rel. Exp.(%) Ag4532, 
Run 222735297 


Adipose 


2.0 


Renal ca. TK-10 


31.9 


Melanoma* 
Hs688(A).T 


3.6 


Bladder 


3.7 


Melanoma* 


5.0 


Gastric ca. (liver met.) 


26.6 


Hs688(B).T 


NCI-N87 


Melanoma* M14 


18.2 


Gastric ca. KATO III 


33.7 


Melanoma* 
LOXIMVI 


5.4 


Colon ca. SW-948 


15.0 


Melanoma* SK- 
MEL-5 


2.7 


Colon ca. SW480 


31.2 


Squamous cell 


6.3 


Colon ca.* (SW480 


18.4 


carcinoma SCC-4 


met) SW620 


Testis Pool 


1.0 


Colon ca. HT29 


4.2 


Prostate ca.* (bone 
met) PC-3 


7.3 


Colon ca. HCT-116 


20.6 


Prostate Pool 


1.5 


Colon ca. CaCo-2 


17.1 


Placenta 


9.0 


Colon cancer tissue 


20.4 


Uterus Pool 


0.8 


Colon ca. SW1116 


6.0 
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Ovarian ca. 
OVCAR-3 


10.7 


Colon ca. Colo-205 


12.3 


Ovarian ca. SK-OV- 
3 


2.5 


Colon ca. SW-48 


13.1 


Ovarian ca. 


9.0 


Colon Pool 


3.4 


Ovarian ca. 

UVLAK-J 


29.5 


Small Intestine Pool 


2.7 


uvanan ca. iukuv- 
1 


4.0 


Stomach Pool 


1.1 


Ovarian ca 
OVCAR-8 


14.6 


Bone Marrow Pool 


1.3 


Ovary 


1.7 


Fetal Heart 


1.7 


Breast ca MCF-7 


17.1 


Heart Pool 


2.1 


Breast ca. MDA- 
MB-231 


20.0 


Lymph Node Pool 


2.7 


Breast ca. BT 549 


32.3 


Fetal Skeletal Muscle 


0.7 


tsreast ca. 1 4 /JJ 


/TOO 

Oo.o 


Skeletal Muscle Fool 


1 .2 


Breast ca. MDA-N 


13.5 


Spleen Pool 


5.2 


Breast Pool 


2.3 


Thymus Pool 


4.0 


Trachea 


7.3 


CNS cancer (glio/astro) 
U87-MG 


47.3 


Lung 


0.2 


CNS cancer (glio/astro) 
U-118-MG 


4.0 


Fetal Lung 


8.4 


CNS cancer 
(neuro;met) SK-N-AS 


1.4 


Lung ca.NCI-N417 


1.0 


CNS cancer (astro) SF- 


13.6 


Lung ca. LX-1 


21.2 


t^rNo cancer ^astroj 
SNB-75 


16.5 


Lung ca. NCI-H146 


0.2 


CNS cancer (glio) 
SNB-19 


4.4 


Lung ca. SHP-77 


13.8 


CNS cancer (glio) SF- 
295 


9.3 


Lung ca. A549 


44.4 


Brain (Amygdala) Pool 


0.5 


Lung ca. NCI-H526 I 


3.1 


Brain (cerebellum) 


4.2 


Lung ca. NCI-H23 


19.5 


Brain (fetal) 


3.8 


Lung ca. NCI-H460 


7.5 


Brain (Hippocampus) 
Pool 


1.1 


Lung ca. HOP-62 


17.4 


Cerebral Cortex Pool 


0.5 


Lung ca. NCI-H522 


26.6 


Brain (Substantia nigra) 
Pool 


1.1 


Liver 


23.3 


Brain (Thalamus) Pool 


1.3 


Fetal Liver 


34.2 • 


Brain (whole) 


1.3 


Liver ca. HepG2 


32.8 


Spinal Cord Pool 


1.3 
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Kidney Pool 


4.6 


Adrenal Gland 


2.8 


Fetal Kidney 


3.8 


Pituitary gland Pool 


0.6 


Renal ca. 786-0 j 


9.3 


Salivary Gland 


6.3 


Renal ca. A498 j 


10.2 


Thyroid (female) 


6.0 


Renal ca. ACHN j 


100.0 


Pancreatic ca. CAPAN2 


32.1 


Renal ca. UO-3 1 j 


22.2 


Pancreas Pool 


2.4 



General_screening _jpanel_vl.4 Summary: Ag4532 The expression of the NOV96a 
gene appears to be highest in a sample derived from a renal cancer cell line 
(ACHN)(CT=26.4). In addition, there is substantial expression associated with other renal 
5 cancer cell lines as well as gastric cancer cell lines, colon cancer cell lines, lung cancer cell 
lines, and breast cancer cell lines. Thus, the expression of this gene could be used to 
distinguish ACHN cells from other samples in this panel. Moreover, therapeutic modulation of 
this gene, through the use of small molecule drugs, protein therapeutics or antibodies might be 
of benefit in the treatment of kidney, gastric, colon, lung or breast cancer. 

10 Among metabolic tissues, this gene has low-to-moderate levels of expression in 

adipose, liver, heart, skeletal muscle, adrenal, pituitary, thyroid, and pancreas. Thus, this gene 
product may be a small molecule target for the treatment of endocrine and metabolic diseases, 
including obesity and Types 1 and 2 diabetes. The direction of therapeutic modulation for this 
gene product would, of necessity, be tissue- or organ-specific. The consequences of altered 

1 5 lactate/monocarboxylate/ketone body transport would differ dramatically between tissues. 

In addition, this gene, a monocarboxylate transporter homolog, is expressed at low to 
moderate levels in all CNS regions examined. The monocarboxylate transporters have been 
implicated in post-ischemic neuronal loss in stroke, such that blockade of these transporters 
increase stroke-related damage. Thus, this gene is an excellent drug target, such that increasing 

20 its activity may decrease postischemic damage in stroke/cerebral infarct. 
References: 

Schurr A, Payne RS, Miller JJ, Tseng MT, Rigor BM. Blockade of lactate transport 
exacerbates delayed neuronal damage in a rat model of cerebral ischemia. Brain Res 2001 Mar 
23;895(l-2):268-72 

25 Studies over the past decade have demonstrated that lactate is produced aerobically 

during brain activation and it has been suggested to be an obligatory aerobic energy substrate 
postischemia. It has been also hypothesized, based on in vitro studies, that lactate, produced by 
glia in large amounts during activation and/or ischemia/hypoxia, is transported via specific 
glial and neuronal monocarboxylate transporters into neurons for aerobic utilization. To test 
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the role of lactate as an aerobic energy substrate postischemia in vivo, we employed the 
cardiac-arrest-induced transient global cerebral ischemia (TGI) rat model and the 
monocarboxylate transporter inhibitor alpha-cyano-4-hydroxycinnamate (4-CIN). Once 4-CIN 
was establish to cross the blood— brain barrier, rats were treated with the inhibitor 60 min prior 
5 to a 5-min TGI. These rats exhibited a significantly greater degree of delayed neuronal damage 
in the hippocampus than control, untreated rats, as measured 7 days post-TGI. We concluded 
that intra-ischemically-accumulated lactate is utilized aerobically as the main energy substrate 
immediately postischemia. Blockade of lactate transport into neurons prevents its utilization 
and, consequently, exacerbates delayed ischemic neuronal damage. 

1 0 NOV97c and NO V97d 

Expression of gene NOV97c and variant NOV97d was assessed using the primer-probe 
set Ag3697, described in Table CLA. Results of the RTQ-PCR runs are shown in Table CLB. 

Table CLA. Probe Name Ag3697 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -catctggattgacactggaatt-3 ' 


22 


585 


1375 


Probe 


TET-5 ' -actcccgggagtggatcacccat-3 ■ - 
TAMRA 


23 


608 


1376 


Reverse 


5 1 -aatcttattggcagtccagatg-3 * 


22 


639 


1377 



1 5 Table CLB. General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) Ag3697, 
Run 218253566 


Tissue Name 


Rel. Exp.(%) Ag3697, 
Run 218253566 


Adipose 


0.0 


Renal ca. TK-10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.0 


Melanoma* 
Hs688(B).T 


0.3 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* M14 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


1.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


8.2 


Squamous cell 
carcinoma SCC-4 


0.9 


Colon ca.* (SW480 
met) SW620 


6.9 


Testis Pool 


100.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.3 


Colon ca. HCT-116 


0.8 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.3 



1457 



Placenta 


7.8 


| Colon cancer tissue 


0.0 ! 


Uterus Pool 


0.0 


J ColoiTca^^ 


0.0 


Ovarian ca. 
OVCAR-3 


0.6 


Colon ca. Colo-205 

1 


0.0 


Ovarian ca. SK-OV- 

3 


1.3 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVf AR-4 


4.1 


] Colon Pool 


0.7 


Ovarian ca. 
OVfAR-S 


0.0 


Small Intestine Pool 


0.0 


Ovarian ra ir^ROV 
1 


0.8 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


2.0 


Bone Marrow Pool 


0.0 


Ovary 


0.2 


Fetal Heart 


0.0 


Breast ca. MCF-7 


0.5 


Heart Pool 


0.7 


Breast ca. MDA- 
MB-231 


1.2 


Lymph Node Pool 


0.2 


Breast ca. BT 549 


0.4 


Fetal Skeletal Muscle 


4.5 


r>rca^i ca. it/u 


u.u 


Skeletal Muscle Pool 


fi 1 
U. / 


Breast ca. MDA-N 


0.0 


Spleen Pool 


1.0 


Breast Pool 


0.3 


Thymus Pool 


1.8 


Trachea 


1.4 


CNS cancer (glio/astro) 
U87-MG 


1.1 


Lung 


0.9 


CNS cancer (glio/astro) 
U-l 18-MG 


4.7 


Fetal Lung 


1.2 


CNS cancer 
(neuro;met) SK-N-AS 


0.2 


Lung ca.NCI-N417 


0.0 


CNS cancer (astro) SF- 


0.0 


Lung ca. LX-1 


2.5 


SNB-75 


1.7 


Lung ca. NCI-H146 


1.4 


PMC rnnrpr frr1ir\^ 

SNB-19 


0.0 


Lung ca. SHP-77 


28.9 


CNS cancer (glio) SF- 
295 


0.9 


Lung ca. A549 


0.5 


Brain (Amygdala) Pool 


0.0 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


0.2 


Lung ca. NCI-H23 : 


0.8 


Brain (fetal) 


0.0 


Lung ca. NCI-H460 


0.7 


Brain (Hippocampus) 
Pool 


0.0 


Lung ca. HOP-62 ! 


1.4 


Cerebral Cortex Pool 


0.5 


Lung ca. NCI-H522 


0.0 


Brain (Substantia nigra) 
Pool 


0.3 


Liver 


0.0 


Brain (Thalamus) Pool 


1.6 
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Fetal Liver 


0.5 


Brain (whole) 


0.7 


Liver ca. HepG2 j 


0.3 


Spinal Cord Pool 


0.3 


Kidney Pool 


0.0 


Adrenal Gland 


0.7 


Fetal Kidney j 


0.0 


Pituitary gland Pool 


1.1 


Renal ca. 786-0 j 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 [ 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN j 


0.3 


Pancreatic ca. CAPAN2 


0.4 


Renal ca. UO-31 j 


4.1 


Pancreas Pool 


2.5 



CNS_neurodegeneration_vl.O Summary: Ag3697 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General_screening_panel_vl.4 Summary: Ag3697 Expression of this gene is highest 
5 in and almost exclusive to testis (CT = 30.7). Therefore, expression of this gene could be used 
to distinguish testis from the other samples on this panel. Moreover, therapeutic modulation of 
the activity of this gene or its protein product using protein therapeutics, antibodies or small 
molecule drugs could be of benefit in the treatment of infertility. 

Panel 4.1D Summary: Ag3697 Expression of this gene is low/undetectable (CTs > 
10 35) across all of the samples on this panel (data not shown). 

NOV98: AGRIN 

Expression of gene NOV98 was assessed using the primer-probe set Ag3974, 
described in Table CMA. Results of the RTQ-PCR runs are shown in Tables CMB, CMC and 
CMD. 

1 5 Table CMA. Probe Name Ag3974 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -gacaccaggatcttctttgtga-3 1 


22 


274 


1378 


Probe 


TET- 5 1 -catacctgtggccagcccacaag- 3 ' - 
TAMRA 


- 


308 


1379 


Reverse 


5 * -gagttgagcatcagctcgtt- 3 ' 


20 


331 


1380 



Table CMB. CNS_neurodegeneration_vl.O 



Tissue Name 


ReL Exp.(%) Ag3974, 
Run 212348647 


Tissue Name 


ReL Exp.(%) Ag3974, 
Run 212348647 


AD 1 Hippo 


28.7 


Control (Path) 3 
Temporal Ctx 


15.4 


AD 2 Hippo 


36.3 


Control (Path) 4 | 
Temporal Ctx 


46.0 
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AD 3 Hippo 


19.2 


AD 1 Occipital Ctx 


28.9 


AD 4 Hippo 


21.8 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


80.7 


AD 3 Occipital Ctx 


19.5 


AU o rlippo 




ad 4 occipital ctx 


1 1 n 


Control 2 Hippo 


42.3 


AD 5 Occipital Ctx 


47.6 


Control 4 Hippo 


34.9 


AD 6 Occipital Ctx 


14.7 


Control (Path) 3 
Hippo 


12.4 


Control 1 Occipital 
Ctx 


20.4 


AD 1 Temporal Ctx 


32.1 


Control 2 Occipital 
Ctx 


55.1 


AD 2 Temporal Ctx 


31.2 


Control 3 Occipital 
Ctx 


22.5 


AD 3 Temporal Ctx 


20.2 


Control 4 Occipital 
Ctx 


22.2 


AD 4 Temporal Ctx 


24.0 


Control (Path) 1 
Occipital Ctx 


71.2 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


17.4 


AD 5 Sup Temporal 
Ctx 


58.6 


Control (Path) 3 
Occipital Ctx 


15.4 


AD 6 Inf Temporal 
Ctx 


40.6 


Control (Path) 4 
Occipital Ctx 


38.7 


AD 6 Sup Temporal 
Ctx 


35.1 


Control 1 Parietal 
Ctx 


18.2 


Control 1 Temporal 

Ctx 


19.8 


Control 2 Parietal 1 
Ctx 


67.4 


Control 2 Temporal 
Ctx 


48.3 


Control 3 Parietal 
Ctx 


21.2 


Control 3 Temporal j 
Ctx 


17.8 


Control (Path) 1 j 
Parietal Ctx j 


51.4 


Control 3 Temporal 
Ctx 


25.0 


Control (Path) 2 \ 
Parietal Ctx 


32.3 


Control (Path) 1 i 
Temporal Ctx 


63.7 


Control (Path) 3 \ 
Parietal Ctx 


11.9 


Control (Path) 2 
Temporal Ctx 


43.5 


Control (Path) 4 
Parietal Ctx 


58.6 



Table CMC. General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) Ag3974, 
Run 217508632 


Tissue Name 


Rel. Exp.(%) Ag3974, 
Run 217508632 


Adipose 


1.5 


Renal ca. TK-10 


16.4 


Melanoma* 
Hs688(A).T 


3.2 


Bladder 


9.0 
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Melanoma* 


4.2 


Gastric ca. (liver met.) 

IN V/l-INo / 


80.7 


Melanoma* M14 


6.4 


Gastric ca. KATO III 


17.7 


Melanoma* 
T oyiMVl 


4.0 


Colon ca. SW-948 


7.8 


iVlCIclIlUIIlcl 

MEL- 5 


4.2 


Colon ca. SW480 


32.3 


Squamous cell 
carcinoma SCC-4 


8.4 


Colon ca.* (SW480 
met) SW620 


4.6 


Testis Pool 


1.1 


Colon ca. HT29 


30.6 


r rosiaie ca. ^oone 
met) PC-3 


24.8 


Colon ca. HCT-116 


5.8 


r rosiaie rooi 


A O 
U.O 


i^oion ca. v^a\^o-z 




Placenta 


1.3 


Colon cancer tissue 


10.0 


Uterus Pool 


0.4 


Colon ca. SW1116 


3.6 


Ovarian ca. 
OVCAR-3 


66.9 


Colon ca. Colo-205 


15 


Ovarian ca. SK-OV- 

3 


36.3 


Colon ca. SW-48 


0.7 


Ovarian ca. 

KJ V 


12.7 


Colon Pool 


1.3 


Ovarian ca. 
nvfAp s 

\J V \^/\ rv-O 


44.4 


Small Intestine Pool 


1.0 


ovarian ca. ivjinaj v - 
1 


27.7 


Stomach Pool 


1.2 


Ovarian ca. 
OVCAR-8 


14.9 


Bone Marrow Pool 


0.5 


Ovary 


1.9 


Fetal Heart 


1.0 


Breast ca. MCF-7 


9.7 


Heart Pool 


0.8 


Breast ca. MDA- 
MB-231 


31.2 


Lymph Node Pool 


2.0 


Breast ca. BT 549 \ 


10.1 


Fetal Skeletal Muscle 


0.5 


Breast ca T47D 


1 00 0 






Breast ca. MDA-N 


4.2 


Spleen Pool 


0.7 


Breast Pool 


1 .6 


Thymus Pool 


2.2 


Trachea 


2.6 


CNS cancer (glio/astro) 
U87-MG 


6.0 


Lung 


0.1 


CNS cancer (glio/astro) 
U-118-MG 


11.2 


Fetal Lung 


8.3 


CNS cancer 
(neuro;met) SK-N-AS 


0.9 


Lung ca. NCI-N417 


0.7 


CNS cancer (astro) SF- 
539 


5.0 


Lung ca. LX-1 


11.0 


CNS cancer (astro) 
SNB-75 


32.3 
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Lung ca. NCI-H146 


0.1 


SNB-19 


20.2 


Lung ca. SHP-77 


0.8 


CNS cancer (glio) SF- 
295 


38.2 


Lung ca. A549 


10.4 


Brain (Amygdala) Pool 


1.3 


Lung ca. NCI-H526 


1.6 


Brain (cerebellum) 


1.0 


Lung ca. NCI-H23 


20.6 


Brain (fetal) 


2.8 


Lung ca. NCI-H460 


9.3 


Brain (Hippocampus) 
Pool 


0.9 


Lung ca. HOP-62 


23.0 


Cerebral Cortex Pool 


0.9 


Lung ca. JNd-riDZZ 


1.5 


Brain (Substantia nigra) 
Pool 


1 .7 


Liver 


0.6 


Brain (Thalamus) Pool 


1.6 


Fetal Liver 


1.4 


Brain (whole) 


1.1 


Liver ca. HepG2 


12.6 


Spinal Cord Pool 


1.4 


Kidney Pool 


2.5 


Adrenal Gland 


0.4 


Fetal Kidney 


4.6 


Pituitary gland Pool 


0.2 


Renal ca. 786-0 


39.5 


Salivary Gland 


1.3 


Renal ca. A498 


7.9 


Thyroid (female) 


3.7 


Renal ca. ACHN \ 


15.9 


Pancreatic ca. CAPAN2 


27.7 


Renal ca. UO-31 


38.7 


Pancreas Pool 


4.1 



Table CMD. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag3974, Run 
170739806 


Tissue Name 


Rel. Exp.(%) 
Ag3974, Run 
170739806 


Secondary Th 1 act 


1.2 


HUVEC IL-lbeta 


18.9 


Secondary Th2 act 


8.0 


HUVEC IFN gamma 


16.7 


Secondary Trl act 


3.5 


HUVEC TNF alpha + IFN 
gamma 


34.9 


Secondary Thl rest 


0.7 


HUVEC TNF alpha + 1L4 


31.4 


Secondary Th2 rest 


0.2 


HUVEC IL-11 


13.9 


Secondary Trl rest 


1.2 


Lung Microvascular EC 
none 


100.0 


Primary Thl act 


3.2 


Lung Microvascular EC j 
TNFalpha + IL-lbeta 


97.9 


Primary Th2 act 


2.0 


Microvascular Dermal EC 
none 


48.3 


Primary Trl act 


2.9 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


47.0 


Primary Th 1 rest 


0.4 


Bronchial epithelium 
TNFalpha + ILlbeta 


90.1 


Primary Th2 rest 


0.2 


Small airway epithelium 


32.5 
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none 




Primary Trl rest 


0.3 


Small airway epithelium 
TNFalpha + IL-lbeta 


93.3 


CD45RA CD4 
lymphocyte act 


22.7 


Coronery artery SMC rest 


28.5 


CD45RO CD4 
lymphocyte act 


5.5 


Coronery artery SMC 
TNFalpha + IL-lbeta 


28.7 


CD8 lymphocyte act 


3.3 


Astrocytes rest 


55.1 


Secondary CD8 
lymphocyte rest 


3.3 


Astrocytes TNFalpha + 
IL-lbeta 


66.4 


Secondary CD8 
lymphocyte act 


3.5 


KU-812 (Basophil) rest 


1.9 


CD4 lymphocyte none 


0.1 


KU-8 12 (Basophil) 
PMA/ionomycin 


2.8 


zry 1 nl/1 ni/l rl anti- 
CD95CH11 


0.4 


CCD1 106 (Keratinocytes) 
none 


82.4 


LAK cells rest 


6.4 


CCD1 106 (Keratinocytes) 
TNFalnha + TT -1 beta 


72.7 


LAK cells IL-2 


1.7 


T ivpr nirrhosi's 


14.4 


LAK cells IL-2+IL-12 


1.8 


NCI-H292 none 


54.0 


1 A I/" TT O l TT7X T 

LAK cells IL-Z-HriN 


1.1 


NCI-H292 IL-4 


78.5 


LAK cells IL-2+ IL-18 


1.6 


NCI-H292 IL-9 


79.6 


lar cells 
PMA/ionomycin 


4.6 


NCI-H292 IL-13 


59.9 


NK Cells IL-2 rest 


1.9 


NCI-H292 I FN eamma 


71.7 


Two Way MLR 3 day 


12.4 


HPAEC none 


21.3 


Two Way MLR 5 day 


5.3 


HPAEC TNF alpha + IL-1 
beta 


45.4 


Two Way MLR 7 day 


4.0 


Lung fibroblast none 


29.3 


PBMC rest 


0.6 


Lung fibroblast TNF alpha 
+ IL-1 beta 


87.7 


PRlVyTf^ PWlVf 
rolvlv^ r W1V1 


A Q 


L-ung noroDiasx ii^-^f 


Zj.j 


PBMC PHA-L 


3.4 


Lung fibroblast IL-9 


30.4 


Ramos (B cell) none 


0.4 


Lung fibroblast IL- 1 3 


36.6 


Ramos (B cell) 
ionomycin 


0.2 


Lung fibroblast IFN 
gamma 


29.7 


B lymphocytes PWM 


3.0 


Dermal fibroblast 
CCD 1070 rest 


27.2 


B lymphocytes CD40L 
and IL-4 


3.7 


Dermal fibroblast 
CCD 1070 TNF alpha 


20.6 


EOL-1 dbcAMP 


3.1 


Dermal fibroblast 
CCD 1070 IL-1 beta 


22.4 


EOL-1 dbcAMP 
PM A/ionomyc in 


8.0 


Dermal fibroblast IFN 
gamma 


10.3 
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Dendritic cells none 


9.0 


Dermal fibroblast IL-4 


8.0 


Dendritic cells LPS 


32.8 


Dermal Fibroblasts rest 


6.3 


Dendritic cells anti- 
CD40 


8.8 


Neutrophils TNFa+LPS 


0.9 


Monocytes rest 


1.4 


Neutrophils rest 


1.0 


Monocytes LPS 


81.2 


Colon 


5.8 


Macrophages rest 


9.7 


Lung 


23.3 


Macrophages LPS 


43.8 


Thymus 


7.3 


HUVEC none 


12.6 


Kidney 


33.2 


HUVEC starved 


25.2 







CNS_neurodegeneration_vl.O Summary: Ag3974 This panel does not show 
differential expression of the NOV98 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion of 
5 utility of this gene in the central nervous system. 

General_screening_panel_vl.4 Summary: Ag3974 The expression of the NOV98 
gene appears to be highest in a sample derived from a breast cancer cell line (T47D) 
(CT=22.5). In addition, there appears to be substantial expression in other samples derived 
from breast cancer cell lines, ovarian cancer cell lines, kidney cancer cell lines, lung cancer 
10 cell lines, colon cancer cell lines and brain cancer cell lines. Thus, the expression of this gene 
could be used to distinguish T47D cells from other samples in the panel. Moreover, 
therapeutic modulation of this gene, through the use of small molecule drugs, protein 
therapeutics, or antibodies could be of benefit in the treatment of breast, ovarian, kidney, lung, 
colon or brain cancer. 

15 Among metabolic tissues, this gene has low-to-moderate levels of expression in 

adrenal, pituitary, adult and fetal heart, adult and fetal liver, adult and fetal skeletal muscle, 
and adipose. This gene product has high levels of expression (CT values = 27) in pancreas and 
thyroid. Thus, this gene product may be important for the pathogenesis, diagnosis, and/or 
treatment of metabolic and endocrine diseases, including obesity, Types 1 and 2 diabetes and 

20 thyroidopathies. It has recently been reported that an agrin minigene rescued dystrophic 

symptoms in a mouse model of muscular dystrophy. Therefore, this gene product may also be 
used as a treatment or cure for congenital muscular dystrophies. 

This gene is also expressed at moderate to high levels in all regions of the CNS. This 
molecule is a homolog of agrin, which has been implicated in the formation of senile plaques 

25 in Alzheimer's disease and in the acetylcholine synapse/neuromuscular junction. This gene is 
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therefore an excellent drug target in AD or in any disease involving the neuromuscular 
junction or the acetylchpoline system. 
References: 

Moll J, Barzaghi P, Lin S, Bezakova G, Lochmuller H, Engvall E, Muller U, Ruegg 
5 MA. An agrin minigene rescues dystrophic symptoms in a mouse model for congenital 
muscular dystrophy. Nature. 2001 Sep 20;413(6853):302-7. 

Congenital muscular dystrophy is a heterogeneous and severe, progressive muscle- 
wasting disease that frequently leads to death in early childhood. Most cases of congenital 
muscular dystrophy are caused by mutations in LAMA2, the gene encoding the alpha2 chain 
10 of the main laminin isoforms expressed by muscle fibres. Muscle fibre deterioration in this 
disease is thought to be caused by the failure to form the primary laminin scaffold, which is 
necessary for basement membrane structure, and the missing interaction between muscle 
basement membrane and the dystrophin-glycoprotein complex (DGC) or the integrins. With 
the aim to restore muscle function in a mouse model for this disease, we have designed a 
15 minigene of agrin, a protein known for its role in the formation of the neuromuscular junction. 
Here we show that this mini-agrin-which binds to basement membrane and to alpha- 
dystroglycan, a member of the DGC-amends muscle pathology by a mechanism that includes 
agrin-mediated stabilization of alpha-dystroglycan and the laminin alphaS chain. Our data 
provides in vivo evidence that a non-homologous protein in combination with rational protein 
20 design can be used to devise therapeutic tools that may restore muscle function in human 
muscular dystrophies. 

PMID: 11565031 

Liyanage Y, Hoch W, Beeson D, Vincent A. The agrin/muscle-specific kinase 
pathway: New targets for autoimmune and genetic disorders at the neuromuscular junction. 

25 Muscle Nerve 2002 Jan;25(l):4-16 

The increasing understanding of the structural complexity of the neuromuscular 
junction (NMJ), and the processes that are important in its development, suggests many 
possible new disease targets. Here, we summarize briefly the genetic and autoimmune 
disorders that affect neuromuscular transmission, and the identified targets, including new 

30 evidence that antibodies to muscle-specific receptor tyrosine kinase (MuSK) are involved in 
the pathogenesis of acetylcholine receptor (AChR) antibody-negative myasthenia gravis. We 
then review the development of the NMJ, focusing on the important roles of nerve-derived 
agrin and MuSK in clustering of AChRs and other essential components of the NMJ. 
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van Horssen J, Otte-Holler I, David G, Maat-Schieman ML, van den Heuvel LP, 
Wesseling P, de Waal RM, Verbeek MM, Heparan sulfate proteoglycan expression in 
cerebrovascular amyloid beta deposits in Alzheimer's disease and hereditary cerebral 
hemorrhage with amyloidosis (Dutch) brains. Acta Neuropathol (Berl) 2001 Dec;102(6):604- 
5 14 

Cerebrovascular deposition of amyloid beta protein (A beta) is a characteristic lesion 
of Alzheimer's disease (AD) and hereditary cerebral hemorrhage with amyloidosis of the 
Dutch type (HCHWA-D). Besides A beta, several other proteins and proteoglycans 
accumulate in cerebral amyloid angiopathy (CAA). We have now analyzed the expression of 

10 the heparan sulfate proteoglycan (HSPG) subtypes agrin, perlecan, glypican-1, syndecans 1-3 
and HS glycosaminoglycan (GAG) side chains in CAA in brains of patients with AD and 
HCHWA-D. Hereto, specific well-characterized antibodies directed against the core protein of 
these HSPGs and against the GAG side chains were used for immunostaining. Glypican-1 was 
abundantly expressed in CAA both in AD and HCHWA-D brains, whereas perlecan and 

15 syndecans- 1 and -3 were absent in both. Colocalization of agrin with vascular A beta was 
clearly observed in CAA in HCHWA-D brains, but only in a minority of the AD cases. 
Conversely, syndecan-2 was frequently associated with vascular A beta in AD, but did not 
colocalize with vascular A beta deposits in HCHWA-D. The three different syndecans, agrin, 
glypican-1 and HS GAG, but not perlecan, were associated with the majority of senile plaques 

20 (SPs) in all brains. Our results suggest a role for agrin in the formation of SPs and of CAA in 
HCHWA-D, but not in the pathogenesis of CAA in AD. Both syndecan-2 and glypican, but 
not perlecan, may be involved in the formation of CAA. We conclude that specific HSPG 
species may be involved in the pathogenesis of CAA in both AD and HCHWA-D, and that the 
pathogenesis of CAA and SPs may differ with regard to the involvement of HSPG species. 

25 Panel 4.1D Summary: Ag3974 The NOV98 gene is expressed at moderate levels 

(CT=29-32) in a wide range of cell types of significance in the immune response in health and 
disease. Therefore, targeting of this gene product with a small molecule drug or antibody 
therapeutic may modulate the functions of cells of the immune system as well as resident 
tissue cells and lead to improvement of the symptoms of patients suffering from autoimmune 

30 and inflammatory diseases such as COPD, emphysema, asthma, allergies, inflammatory bowel 
disease, lupus erythematosus, and arthritis, including osteoarthritis and rheumatoid arthritis. 
Based on its homology to agrin, this gene product may also be beneficial to the treatment of 
multiple sclerosis as suggested by the referene below. 
References: 
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Liyanage Y, Hoch W, Beeson D, Vincent A. The agrin/muscle-specific kinase 
pathway: New targets for autoimmune and genetic disorders at the neuromuscular junction. 
Muscle Nerve 2002 Jan;25(l):4-16 

The increasing understanding of the structural complexity of the neuromuscular 
5 junction (NMJ), and the processes that are important in its development, suggests many 
possible new disease targets. Here, we summarize briefly the genetic and autoimmune 
disorders that affect neuromuscular transmission, and the identified targets, including new 
evidence that antibodies to muscle-specific receptor tyrosine kinase (MuSK) are involved in 
the pathogenesis of acetylcholine receptor (AChR) antibody-negative myasthenia gravis. We 
10 then review the development of the NMJ, focusing on the important roles of nerve-derived 
agrin and MuSK in clustering of AChRs and other essential components of the NMJ. 



1 5 representing multiple tissue types, normal and diseased states, physiological states, and 

developmental states from different donors. Samples were obtained as whole tissue, cell lines, 
primary cells or tissue cultured primary cells and cell lines. Cells and cell lines may have been 
treated with biological or chemical agents that regulate gene expression for example, growth 
factors, chemokines, steroids. The cDNA thus derived was then sequenced using CuraGen's 

20 proprietary SeqCalling technology. Sequence traces were evaluated manually and edited for 

corrections if appropriate. cDNA sequences from all samples were assembled with themselves 
and with public ESTs using bioinformatics programs to generate CuraGen's human SeqCalling 
database of SeqCalling assemblies. Each assembly contains one or more overlapping cDNA 
sequences derived from one or more human samples. Fragments and ESTs were included as 

25 components for an assembly when the extent of identity with another component of the 

assembly was at least 95% over 50 bp. Each assembly can represent a gene and/or its variants 
such as splice forms and/or single nucleotide polymorphisms (SNPs) and their combinations. 

Variant sequences are included in this application. A variant sequence can include a 
single nucleotide polymorphism (SNP). A SNP can, in some instances, be referred to as a 

30 "cSNP" to denote that the nucleotide sequence containing the SNP originates as a cDNA. A 
SNP can arise in several ways. For example, a SNP may be due to a substitution of one 
nucleotide for another at the polymorphic site. Such a substitution can be either a transition or 
a transversion. A SNP can also arise from a deletion of a nucleotide or an insertion of a 



Example 3. SNP analysis of NOVX clones 



SeqCallingTM Technology: cDNA was derived from various human samples 
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nucleotide, relative to a reference allele. In this case, the polymorphic site is a site at which 
one allele bears a gap with respect to a particular nucleotide in another allele. SNPs occurring 
within genes may result in an alteration of the amino acid encoded by the gene at the position 
of the SNP. Intragenic SNPs may also be silent, however, in the case that a codon including a 
5 SNP encodes the same amino acid as a result of the redundancy of the genetic code. SNPs 

occurring outside the region of a gene, or in an intron within a gene, do not result in changes in 
any amino acid sequence of a protein but may result in altered regulation of the expression 
pattern for example, alteration in temporal expression, physiological response regulation, cell 
type expression regulation, intensity of expression, stability of transcribed message. 

10 Method of novel SNP Identification: SNPs are identified by analyzing sequence 

assemblies using CuraGen's proprietary SNPTool algorithm. SNPTool identifies variation in 
assemblies with the following criteria: SNPs are not analyzed within 10 base pairs on both 
ends of an alignment; Window size (number of bases in a view) is 10; The allowed number of 
mismatches in a window is 2; Minimum SNP base quality (PHRED score) is 23; Minimum 

1 5 number of changes to score an SNP is 2/assembly position. SNPTool analyzes the assembly 

and displays SNP positions, associated individual variant sequences in the assembly, the depth 
of the assembly at that given position, the putative assembly allele frequency, and the SNP 
sequence variation. Sequence traces are then selected and brought into view for manual 
validation. The consensus assembly sequence is imported into CuraTools along with variant 

20 sequence changes to identify potential amino acid changes resulting from the SNP sequence 
variation. Comprehensive SNP data analysis is then exported into the SNPCalling database. 

Method of novel SNP Confirmation: SNPs are confirmed employing a validated 
method know as Pyrosequencing (Pyrosequencing, Westborough, MA). Detailed protocols for 
Pyrosequencing can be found in: Alderborn et al. Determination of Single Nucleotide 

25 Polymorphisms by Real-time Pyrophosphate DNA Sequencing. (2000). Genome Research. 10, 
Issue 8, August. 1249-1265. In brief, Pyrosequencing is a real time primer extension process 
of genotyping. This protocol takes double-stranded, biotinylated PCR products from genomic 
DNA samples and binds them to streptavidin beads. These beads are then denatured producing 
single stranded bound DNA. SNPs are characterized utilizing a technique based on an indirect 

30 bioluminometric assay of pyrophosphate (PPi) that is released from each dNTP upon DNA 

chain elongation. Following Klenow polymerase-mediated base incorporation, PPi is released 
and used as a substrate, together with adenosine 5'-phosphosulfate (APS), for ATP sulfurylase, 
which results in the formation of ATP. Subsequently, the ATP accomplishes the conversion of 
luciferin to its oxi-derivative by the action of luciferase. The ensuing light output becomes 
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proportional to the number of added bases, up to about four bases. To allow processivity of the 
method dNTP excess is degraded by apyrase, which is also present in the starting reaction 
mixture, so that only dNTPs are added to the template during the sequencing. The process has 
been fully automated and adapted to a 96-well format, which allows rapid screening of large 
5 SNP panels. The DNA and protein sequences for the novel single nucleotide polymorphic 

variants are reported. Variants are reported individually but any combination of all or a select 
subset of variants are also included. In addition, the positions of the variant bases and the 
variant amino acid residues are underlined. 

Results 

10 Variants are reported individually but any combination of all or a select subset of 

variants are also included as contemplated NOVX embodiments of the invention. 

NOVla SNP data: 

NOV la has two SNP variants, whose variant positions for its nucleotide and amino 
15 acid sequences is numbered according to SEQ ID NOs:l and 2, respectively. The nucleotide 
sequence of the NOVla variant differs as shown in Table 101. 



Table 101. cSNP and Coding Variants for NOVla 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


393 


T 


c 


130 


S->P 


420 


T 


c 


139 


W->R 


431 


T 


c 


142 


No change 


501 


C 


T 


166 


L->F 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


420 


T 


c 


20 


0.100 



NO VI b SNP data: 

20 NOV lb has four SNP variants, whose variant positions for its nucleotide and amino 

acid sequences is numbered according to SEQ ID NOs:3 and 4, respectively. The nucleotide 
sequence of the NOVlb variant differs as shown in Table 102. 



Table 102. cSNP and Coding Variants for NOVlb 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


393 


T 


c 


130 


S->P 


420 


T 


c 


139 


W->R 
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NOV3a SNP data: 

NOV3a has four SNP variants, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:l 1 and 12, respectively. The nucleotide 
sequence of the NOV3a variant differs as shown in Table 103. 



Table 103. cSNP and Coding Variants for NOV3a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


212 


T 


c 


54 


C->R 


439 


A 


G 


149 


No Change 



NOV4a SNP data: 

NOV4a has four SNP variants, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:17 and 18, respectively. The nucleotide 
sequence of the NOV4a variant differs as shown in Table 104. 



Table 104. cSNP and Coding Variants for NOV4a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


229 


G 


A 


73 


No change 


390 


G 


A 


127 


W->End 


631 


G 


C 


207 


Q->H 



NOV5a SNP data: 

NOV5a has four SNP variants, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:23 and 24, respectively. The nucleotide 
sequence of the NOV5a variant differs as shown in Table 1 05. 



Table 105. cSNP and Coding Variants for NOV5a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


79 


c 


T 


19 


A->V 


204 


T 


C 


61 


No change 


658 


A 


T 


212 


Q->L 


884 


C 


G 


287 


No change 


1149 


G 


T 


376 


D->Y 



NOV6 SNP data: 



1470 



* 



NOV6 has four SNP variants, whose variant positions for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:33 and 34, respectively. The nucleotide 
sequence of the NOV 6 variant differs as shown in Table 106. 



Table 106. cSNP and Coding Variants for NOV6 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


543 


T 


c 


156 


V->A 


549 


T 


c 


158 


I->T 


660 


c 


T 


195 


A->V 


734 


A 


G 


220 


T->A 


782 


G 


A 


236 


A->T 



NOV7a SNP data: 

NOV7a has four SNP variants, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:35 and 36, respectively. The nucleotide 
sequence of the NOV7a variant differs as shown in Table 107. 



Table 107. cSNP and Coding Variants for NOV7a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
Position 


Amino Acid 
Change 


168 


T 


c 


24 


V->G 


459 


c 


T 


121 


A->V 


815 


T 


C 


240 


S->P 


896 


A 


G 


N/A 


No change 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


428 


G 


A 


8 


0.250 



NOV7c SNP data: 

NOV7c has four SNP variants, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:39 and 40, respectively. The nucleotide 
15 sequence of the NOV7c variant differs as shown in Table 108. 



Table 108. cSNP and Coding Variants for NOV7c 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


383 


c 


T 


5 


0.400 



NOV7d SNP data: 



1471 



if n i 
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NOV7d has four SNP variants, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:41 and 42, respectively. The nucleotide 
sequence of the NOV7d variant differs as shown in Table 109. 



Table 109. cSNP and Coding Variants for NOV7d 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
Position 


Amino Acid 
Change 


260 


G 


A 


86 


I->E 



5 

NOV7e SNP data: 

NOV7e has four SNP variants, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:323 and 324, respectively. The 
nucleotide sequence of the NOV7d variant differs as shown in Table 110. 
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Table 110. cSNP and Coding Variants for NOV7e 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
Position 


Amino Acid 
Change 


304 


A 


G 


102 


T->A 



NOV9a SNP data: 

NOV9a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:45 and 46, respectively. The nucleotide 
1 5 sequence of the NOV9a variant differs as shown in Table 111. 



Table 111. cSNP and Coding Variants for NOV9a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


264 


G 


A 


N/A 


No change 


391 


C 


A 


8 


P->T 


438 


G 


A 


23 


No change 


550 


T 


G 


61 


F->V 


672 


C 


T 


101 


No change 


1286 


T 


C 


306 


L->S 


1338 


G 


A 


323 


No change 



NOV10 SNP data: 

NOV 10 has one SNP variant, whose variant position for its nucleotide and amino acid 
20 sequences is numbered according to SEQ ID NOs:49 and 50, respectively. The nucleotide 
sequence of the NOV10 variant differs as shown in Table 1 12. 



1472 




Table 112. cSNP and Coding Variants for NOV10 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


335 


G 


A 


10 


0.400 



NOV13b SNP data: 

NOV 13b has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:57 and 58, respectively. The nucleotide 
5 sequence of the NOV 13b variant differs as shown in Table 113. 



Table 113. cSNP and Coding Variants for NOV13b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


362 


G 


A 


11 


0.455 



NOV15b SNP data: 

NOV 15b has one SNP variant, whose variant position for its nucleotide and amino acid 
10 sequences is numbered according to SEQ ID NOs:63 and 64, respectively. The nucleotide 
sequence of the NOV 15b variant differs as shown in Table 1 14. 



Table 114. cSNP and Coding Variants for NOV15b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


388 


T 


G 


17 


0.294 



NO VI 6b SNP data: 

15 NOV 16b has one SNP variant, whose variant position for its nucleotide and amino acid 

sequences is numbered according to SEQ ID NOs:67 and 68, respectively. The nucleotide 
sequence of the NOV 16b variant differs as shown in Table 115. 



Table 115. cSNP and Coding Variants for NOV16b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


463 


A 


T 


16 


0.125 


465 


c 


T 


16 


0.125 


535 


T 


c 


15 


0.133 


735 


C 


T 


12 


0.167 



1473 




814 



G 



11 



0.182 



NOV21a SNP data: 

NOV21a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:87 and 88, respectively. The nucleotide 
sequence of the NOV21a variant differs as shown in Table 1 16. 



Table 116. cSNP and Coding Variants for NOV21a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


121 


T 


C 


41 


W->R 


170 


T 


C 


57 


V->A 


364 


C 


T 


122 


No change 


415 


G 


A 


139 


E->K 


535 


G 


A 


182 


R->H 


630 


G 


A 


210 


No change 
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NOV21bSNP data: 

NOV21b has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:89 and 90, respectively. The nucleotide 
sequence of the NOV2 1 b variant differs as shown in Table 1 1 7. 



Table 117. cSNP and Coding Variants for NOV21b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


485 


G 


A 


65 


0.246 


616 


G 


A 


44 


0.136 


714 


G 


A 


36 


0.083 



NOV22a SNP data: 

15 NOV22a has one SNP variant, whose variant position for its nucleotide and amino acid 

sequences is numbered according to SEQ ID NOs:91 and 92, respectively. The nucleotide 
sequence of the NOV22a variant differs as shown in Table 118. 



Table 118: cSNP and Coding Variants for NOV22a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


669 


A 


G 


223 


No change 


725 


c 


T 


242 


T->M 



1474 
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NOV22c SNP data: 

NOV22c has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:95 and 96, respectively. The nucleotide 
sequence of the NOV22c variant differs as shown in Table 1 19. 



Table 119. cSNP and Coding Variants for NOV22c 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


85 


A 


G 


30 


0.067 


288 


T 


C 


31 


0.065 


484 


A 


G 


37 


0.054 


540 


C 


T 


29 


0.241 



NOV24a SNP data: 

NOV24a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:103 and 104, respectively. The nucleotide 
10 sequence of the NOV24a variant differs as shown in Table 120. 



Table 120. cSNP and Coding Variants for NOV24a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


539 


c 


T 


511 


No change 



NOV24b SNP data: 

NOV24b has one SNP variant, whose variant position for its nucleotide and amino acid 
15 sequences is numbered according to SEQ ID NOs:105 and 106, respectively. The nucleotide 
sequence of the NOV24b variant differs as shown in Table 121. 



Table 121. cSNP and Coding Variants for NOV24b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


437 


A 


G 


143 


N->S 


664 


T 


G 


219 


F->V 


1150 


G 


T 


381 


A->S 


1210 


G 


T 


401 


E->End 


1770 


C 


T 


587 


No change 


2011 


A 


G 


N/A 


No change 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


329 


c 


T 


11 


0.364 



1475 



491 



13 



0.154 



NOV25 SNP data: 

NOV25 has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs: 1 07 and 1 08, respectively. The nucleotide 
sequence of the NOV25 variant differs as shown in Table 122. 



Table 122. cSNP and Coding Variants for NOV25 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


221 


G 


A 


54 


No change 


462 


C 


T 


135 


L->F 



NOV26a SNP data: 

NOV26a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs: 109 and 1 10, respectively. The nucleotide 
sequence of the NOV26a variant differs as shown in Table 123. 



Table 123. cSNP and Coding Variants for NOV26a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


67 


A 


G 


i 


M->V 


98 


C 


T 


n 


P->L 


128 


A 


G 


21 


E->G 


176 


A 


G 


37 


A->T 


233 


L A 


G 


56 


Q->R 


243 


T 


C 


59 


No change 


252 


A 


G 


62 


No change 


260 


A 


G 


65 


D->G 


296 


A 


G 


77 


K->R 


316 


A 


G 


84 


. N->D 


369 


G 


A 


101 


M->I 


395 


A 


G 


110 


Q->R 


465 


G 


A 


N/A 


No change 



NOV26b SNP data: 

NOV26b has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs: 1 1 1 and 1 12, respectively. The nucleotide 
sequence of the NOV26b variant differs as shown in Table 124. 



1476 




Table 124. cSNP and Coding Variants for NOV26b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


133 


A - 


G 


41 


0.049 


268 


A 


G 


41 


0.049 


324 


A 


G 


41 


0.049 


372 


A 


G 


41 


0.049 


376 


A 


* 


41 


0.049 


456 


T 


C 


40 


0.050 


488 


A 


G 


32 


0.344 



NOV27a SNP data: 

NOV27a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs: 113 and 1 14, respectively. The nucleotide 
5 sequence of the NOV27a variant differs as shown in Table 125. 



Table 125. cSNP and Coding Variants for NOV27a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


158 


A 


T 


36 


No change 


491 


c 


T 


147 


No change 


562 


T 


C 


171 


L->P 


858 


C 


T 


270 


No change 


1750 


C 


T 


567 


P->L 



NOV27b SNP data: 

NOV27b has one SNP variant, whose variant position for its nucleotide and amino acid 
10 sequences is numbered according to SEQ ID NOs:l 15 and 1 16, respectively. The nucleotide 
sequence of the NOV27b variant differs as shown in Table 126. 



Table 126. cSNP and Coding Variants for NOV27b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


131 


G 


A 


43 


0.233 


1072 


G 


A 


14 


0.214 


1368 


A 


G 


24 


0.125 


1439 


G 


A 


42 


0.071 


1733 


G 


A 


43 


0.047 


1772 


T 


A 


43 


0.442 


1787 


G 


A 


42 


0.286 
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NOV29c SNP data: 

NOV29c has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:125 and 126, respectively. The nucleotide 
sequence of the NOV29c variant differs as shown in Table 127. 



Table 127. cSNP and Coding Variants for NOV29c 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


760 


A 


G 


254 


T->A 


923 


T 


C 


308 


G->D 



NOV30 SNP data: 

NOV30 has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:127 and 128, respectively. The nucleotide 
sequence of the NOV30 variant differs as shown in Table 128. 



Table 128. cSNP and Coding Variants for NOV30 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


103 


A 


G 


28 


I->V 


207 


T 


C 


62 


No change 


225 


C 


T 


68 


No change 


233 


A 


G 


71 


D->G 


267 


T 


C 


82 


No change 


318 


A 


G 


99 


No change 


392 


T 


C 


124 


L->P 


431 


T 


C 


137 


M->T 


464 


A 


G 


148 


E->G 


479 


T 


A 


153 


V->E 



NOV33 SNP data: 

NOV33 has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:133 and 134, respectively. The nucleotide 
sequence of the NOV33 variant differs as shown in Table 129. 



Table 129. cSNP and Coding Variants for NOV33 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


5097 


T 


c 


1699 


No change 


6012 


C 


T 


2004 


No change 
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NOV36a SNP data: 

NOV36a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:139 and 140, respectively. The nucleotide 
sequence of the NOV36a variant differs as shown in Table 1 30. 



Table 130. cSNP and Coding Variants for NOV36a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


351 


T 


C 


102 


No change 


737 


A 


G 


231 


D->G 



NOV38 SNP data: 

NOV38 has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:145 and 146, respectively. The nucleotide 
10 sequence of the NOV38 variant differs as shown in Table 131. 



Table 131. cSNP and Coding Variants for NOV38 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


566 


C 


T 


188 


H->R 


658 


A 


G 


219 


H->R 


844 


G 


A 


281 


C->Y 


892 


C 


T 


297 


A->V 


910 


T 


C 


303 


V->A 


1009 


G 


A 


336 


S->N 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


95 


T 


c 


14 


N/A 



NOV39a SNP data: 

NOV39a has one SNP variant, whose variant position for its nucleotide and amino acid 
15 sequences is numbered according to SEQ ID NOs:147 and 148, respectively. The nucleotide 
sequence of the NOV39a variant differs as shown in Table 132. 



Table 132. cSNP and Coding Variants for NOV39a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


1095 


T 


c 


11 


N/A 



NOV39b SNP data: 



1479 




NOV39b has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:149 and 150, respectively. The nucleotide 
sequence of the NOV39b variant differs as shown in Table 133. 



Table 133. cSNP and Coding Variants for NOV39b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


933 


c 


T 


9 


0.222 



NOV42c SNP data: 

NOV42c has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:161 and 162, respectively. The nucleotide 
sequence of the NOV42c variant differs as shown in Table 134. 
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Table 134. cSNP and Coding Variants for NOV42c 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


330 


G 


A 


103 


R->Q 


783 


A 


G 


254 


D->G 


903 


G 


A 


294 


R->H 


1389 


A 


T 


456 


E->V 


1389 


A 


G 


456 


E->G 


1394 


G 


A 


458 


A->T 


1642 


C 


T 


540 


No change 


1656 


T 


C 


545 


V->A 


1658 


G 


A 


546 


A->T 



NOV43 SNP data: 

NOV43 has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:165 and 166, respectively. The nucleotide 
15 sequence of the NOV43 variant differs as shown in Table 135. 



Table 135. cSNP and Coding Variants for NOV43 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


378 


G 


A 


121 


No change 


496 


G 


A 


162 


R->Q 



NOV46b SNP data: 



1480 



li 11 



NOV46b has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:173 and 174, respectively. The nucleotide 
sequence of the NOV46b variant differs as shown in Table 136. 



Table 136. cSNP and Coding Variants for NOV46b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


500 


c 


T 


163 


No change 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


486 


c 


T 


16 


0.125 



NOV48a SNP data: 

NOV48a has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:181 and 182, respectively. The nucleotide 
sequence of the NOV48a variant differs as shown in Table 137. 
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Table 137. cSNP and Coding Variants for NOV48a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


370 


A 


G 


1 


M->V 


436 


c 


T 


23 


L->F 


539 


A 


G 


57 


D->G 


650 


A 


G 


94 


E->G 


1012 


C 


T 


215 


Q->End 


1922 


A 


G 


518 


K->R 


2057 


A 


G 


563 


Q->R 


2066 


C 


T 


566 


A->V 


2198 


C 


T 


610 


P->L 


2618 


A 


G 


750 


D->G 


2656 


G 


A 


N/A 


No change 



NOVSOb SNP data: 

NOV50b has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:189 and 190, respectively. The nucleotide 
15 sequence of the NOV50b variant differs as shown in Table 138. 



Table 138. cSNP and Coding Variants for NOVSOb 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


797 


A 


G 


265 


Q->R 



1481 
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NOV52 SNP data: 

NOV52 has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:193 and 194, respectively. The nucleotide 
5 sequence of the NOV52 variant differs as shown in Table 139. 



Table 139. cSNP and Coding Variants for NOV52 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


318 


T 


c 


48 


No change 


351 


C 


T 


59 


R->C 


1961 


G 


T 


595 


M->I 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


70 


c 


T 


54 


0.056 



NOV56a SNP data: 

NOV56a has one SNP variant, whose variant position for its nucleotide and amino acid 
10 sequences is numbered according to SEQ ID NOs:203 and 204, respectively. The nucleotide 
sequence of the NOV56a variant differs as shown in Table 140. 



Table 140. cSNP and Coding Variants for NOV56a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


118 


A 


c 


39 


No change 


435 


c 


T 


135 


P->L 


439 


T 


C 


146 


No change 


473 


A 


G 


158 


T->A 


588 


T 


C 


196 


V->A 


596 


G 


A 


199 


G->R 


614 


A 


G 


205 


M->V 


631 


T 


C 


210 


No change 


637 


A 


G 


212 


No change 


642 


T 


C 


214 


M->T 


732 


G 


T 


244 


W->L 


902 


A 


T 


301 


M->L 



NOV57 SNP data: 

15 NOV57 has one SNP variant, whose variant position for its nucleotide and amino acid 

sequences is numbered according to SEQ ID NOs:207 and 208, respectively. The nucleotide 
sequence of the NOV57 variant differs as shown in Table 141 . 
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Table 141. cSNP and Coding Variants for NOV57 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Amino Acid 
position 


Amino Acid 
Change 


939 


T 


A 


N/A 


No change 



NOV58b SNP data: 

NOV58b has one SNP variant, whose variant position for its nucleotide and amino acid 
5 sequences is numbered according to SEQ ID NOs:21 1 and 212, respectively. The nucleotide 
sequence of the NOV58b variant differs as shown in Table 142. 



Table 142. cSNP and Coding Variants for NOV58b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


88 


T 


c 


11 


0.273 


377 


A 


G 


18 


0.111 


500 


T 


C 


18 


0.111 


509 


A 


G 


18 


0.111 


570 


T 


C 


17 


0.118 


647 


C 


T 


9 


0.222 



NOV60a SNP data: 

10 NOV60a has one SNP variant, whose variant position for its nucleotide and amino acid 

sequences is numbered according to SEQ ID NOs:215 and 216, respectively. The nucleotide 
sequence of the NOV60a variant differs as shown in Table 143. 



Table 143. cSNP and Coding Variants for NOV60a 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


341 


T 


C 


52 




401 


G 


C 


79 




411 


T 


c 


79 




444 


C 


T 


79 




644 


C 


T 


59 




653 


T 


A 


59 




670 


T 


C 


59 




707 


T 


C 


33 





15 NOV60b SNP data: 
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NOV60b has one SNP variant, whose variant position for its nucleotide and amino acid 
sequences is numbered according to SEQ ID NOs:217 and 218, respectively. The nucleotide 
sequence of the NOV60b variant differs as shown in Table 144. 



Table 144. cSNP and Coding Variants for NOV60b 


NT Position 
of cSNP 


Wild Type 
NT 


Variant NT 


Depth 


Putative 

Allele 
Frequency 


162 


T 


G 


54 


0.259 


192 


A 


G 


54 


0.056 


229 


A 


G 


81 


0.025 


246 


A 


T 


80 


0.025 


255 


G 


A 


79 


0.038 


263 


A 


G 


77 


0.039 


342 


G 


A 


103 


0.019 


389 


A 


G 


105 


0.019 



Example 4. In-frame Cloning 
NOV7c 

For NOV7c the cDNA coding for the DOMAIN of NOV7c from residues 1 to 230 was 
targeted for "in-frame" cloning by PCR. The PCR template was based on the previously 
10 identified plasmid, when available, or on human cDNA(s). 



Table 145. Oligonucleotide primers used to clone the target cDNA sequence: 



Primers 


Sequences 


Fl 


5 ' - AGATCTCCCACC ATGG AACTTC AGG AC CTGGAAC TGC -3' {SEQ ID NO: 13 82) 


Rl 


5'- CTCGAG TCCACTTACAATTTCCCGTCTGATTTCC -3' (SEQ ID NO: 13 85) 


SF1 


5 * - TCCTCCTGGAGAAAGCTCAGAATCTGTTTT -3' (SEQ ID NO:1387) 


SF2 


5'- CTCCAGATTTGGAAAGTTCTGAGGAA -3* (SEQ ID NO: 13 88) 


SRI 


5'- ATTTCTCCAAGTCCCAGGCCC -3' (SEQ ID NO: 13 89) 


SR2 


5 • - GAGCCTGTTCTAGAAGGAGCTGTTG - 3 1 ( SEQ ID NO : 13 9 0 ) 



For downstream cloning purposes, the forward primer includes an in-frame Hind III 

restriction site and the reverse primer contains an in-frame Xho I restriction site. 

1 5 Two parallel PCR reactions were set up using a total of 0.5-1 .0 ng human pooled 

cDNAs as template for each reaction. The pool is composed of 5 micrograms of each of the 

following human tissue cDNAs: adrenal gland, whole brain, amygdala, cerebellum, thalamus, 

bone marrow, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, liver, lymphoma, 

Burkitt's Raji cell line, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 

20 gland, skeletal muscle, small Intestine, spleen, stomach, thyroid, trachea, uterus. 

When the tissue of expression is known and available, the second PCR was performed 

using the above primers and 0.5ng-1.0 ng of one of the following human tissue cDNAs: 
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skeleton muscle, testis, mammary gland, adrenal gland, ovary, colon, normal 
cerebellum, normal adipose, normal skin, bone marrow, brain amygdala, brain hippocampus, 
brain substantia nigra, brain thalamus, thyroid, fetal lung, fetal liver, fetal brain, kidney, heart, 
spleen, uterus, pituitary gland, lymph node, salivary gland, small intestine, prostate, placenta, 
5 spinal cord, peripheral blood, trachea, stomach, pancreas, hypothalamus. 

The reaction mixtures contained 2 microliters of each of the primers (original 
concentration: 5 pmol/ul), 1 microliter of lOmM dNTP (Clontech Laboratories, Palo Alto CA) 
and 1 microliter of 50xAdvantage-HF 2 polymerase (Clontech Laboratories) in 50 microliter- 
reaction volume. The following reaction conditions were used: 
10 PCR condition 1: 



a) 
b) 
c) 
d) 



96°C 3 minutes 

96°C 30 seconds denaturation 

60°C 30 seconds, primer annealing 

72°C 6 minutes extension 



15 



Repeat steps b-d 1 5 times 

e) 96°C 1 5 seconds denaturation 

f) 60°C 30 seconds, primer annealing 

g) 72°C 6 minutes extension 



20 



Repeat steps e-g 29 times 

e) 72°C 10 minutes final extension 



9 



25 



PCR condition 2: 

a) 96°C 3 minutes 

b) 96°C 1 5 seconds denaturation 

c) 76°C 30 seconds, primer annealing, reducing the temperature by 1 °C per 



cycle 

d) 72°C 4 minutes extension 



30 



Repeat steps b-d 34 times 

e) 72°C 10 minutes final extension 



35 
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Example 5: SAGE Analysis 

Hs 181638 : ESTs, Weakly similar to SSR1_HUMAN SOMATOSTATIN RECEPTOR TYPE 1 [H.sapiens] 

SAGE library data and reliable tag summary 
Reliable tags found in SAGE libraries: 



TTG TC C A T A T 



mm 



SAGE Chen LNCaP 


16 




1 


62267 


SAGE Chen Normal Pr 


30 




2 


66193 


SAGE Chen Tumor Pr 


14 




1 


68384 


SAGE CAPAN1 


26 




1 


37926 


SAGE Panel 


80 




2 


24879 


SAGE Duke H54 EGFRvIII 


34 




2 


57164 


SAGE CPDR LNCaP-T 


22 




1 


44122 


SAGE 293-IND 


40 




1 


24481 


SAGE PR3 17 normal prostate 


16 




1 


59419 


SAGE PR317 prostate tumor 


15 




1 


65109 


SAGE BB542 whitematter 


10 




1 


94806 


SAGE Pane 96-6252 


27 




1 


35745 


SAGE SciencePark MCF7 
Control Oh 


16 




1 


61079 


SAGE SciencePark MCF7 
estradiol lOh 


16 




1 


60435 


SAGE Duke H566 


15 




1 


65728 


SAGE OVT-6 


23 




1 


42336 


SAGE mammary epithelium 


20 




1 


49167 


SAGE ML10-10 


35 




2 


56943 


SAGE Duke HI 04 3 


13 




1 


76673 



5 
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OTHER EMBODIMENTS 



Although particular embodiments have been disclosed herein in detail, this has been 
done by way of example for purposes of illustration only, and is not intended to be limiting 
with respect to the scope of the appended claims, which follow. In particular, it is 
contemplated by the inventors that various substitutions, alterations, and modifications may be 
made to the invention without departing from the spirit and scope of the invention as defined 
by the claims. The choice of nucleic acid starting material, clone of interest, or library type is 
believed to be a matter of routine for a person of ordinary skill in the art with knowledge of the 
embodiments described herein. Other aspects, advantages, and modifications considered to be 
within the scope of the following claims. 
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